Data Engineer - II (Biometrics)
External · Full-time · On-site · 2w ago
Airflow, AWS, Computer Vision, Data Modeling, ETL, Machine Learning
Responsibilities
- Design, build, and maintain scalable and reliable data pipelines for dataset creation, transformation, and benchmarking
- Own and optimize Airflow pipelines on AWS for data processing, orchestration, and evaluation workflows
- Write efficient, production-grade SQL and Python code for large-scale data processing and analysis
- Partner closely with ML engineers to enable model training, evaluation, and benchmarking pipelines
- Improve pipeline performance, reliability, and observability, ensuring high data quality in production
- Build and maintain systems to support model performance tracking and data drift monitoring
- Troubleshoot and resolve data issues across pipelines, ensuring minimal impact on ML workflows
- Contribute to data architecture decisions and best practices across the platform
- Collaborate cross-functionally with ML, platform, and data teams to support scalable ML infrastructure
Requirements
- 3-5 years of experience in Data Engineering, Data Platforms, or related roles
- Strong proficiency in Python and SQL with experience in production systems
- Hands-on experience with AWS services (S3, EC2, SageMaker or similar)
- Solid experience building and managing Airflow (or similar orchestration tools)
- Strong understanding of data engineering fundamentals (ETL/ELT, data modeling, pipeline design)
- Experience working with large-scale datasets and distributed data systems
- Experience supporting ML workflows, datasets, or evaluation pipelines
- Strong problem-solving skills and ability to work independently in a fast-paced environment
- Experience with ML infrastructure, MLOps, or model evaluation workflows
- Exposure to biometric systems or computer vision datasets
- Familiarity with data quality frameworks, monitoring, and observability tools
- Experience working in SaaS or high-scale production environments
Jumio Values
IDEAL: Integrity, Diversity, Empowerment, Accountability, Leading Innovation
Equal Opportunities
Jumio is a collaboration of people with different ideas, strengths, interests and cultures. We welcome applications and colleagues from all backgrounds and of all statuses.
About Jumio
Jumio is the leading provider of online identity verification, eKYC and AML solutions. With a global footprint, we're expanding the team to meet strong client demand across a range of industries including Financial Services, Travel, Sharing Economy, Fintech, Gaming, and others.
Applicant Data Privacy
We will only use your personal information in connection with Jumio's application, recruitment, and hiring processes, as described in Jumio's Applicant Privacy Notice. If you have any questions or comments, please send an email to privacy@jumio.com.
Benefits
Vision insurance
Additional Information
Role Purpose
At Jumio, we're building trusted identity solutions powered by machine learning and biometrics. As a Data Engineer II, you will take ownership of designing, building, and scaling data pipelines that power our ML systems. You'll work closely with ML engineers, platform teams, and product stakeholders to enable robust dataset creation, benchmarking, and production-grade data workflows on AWS.