ML Ops / Data Engineer - Robotics
About the role
Heavy machinery, light years ahead. sensmore automates the world's largest machines with unprecedented intelligence. Our proprietary Physical AI enables heavy machines such as wheel loaders to instantly adapt to dynamic environments and execute new tasks without prior training. We integrate cutting-edge robotics into a platform powering intelligence and automation products - transforming productivity and safety for customers in mining, construction, and adjacent industries today. We are proudly backed by Point Nine and other Tier 1 investors.
Responsibilities
- Build & operate data pipelines: Ingest, process, and transform multi-sensor telemetry (radar point-clouds, video frames, log streams) into analytics-ready and ML-ready formats.
- Design scalable storage: Architect high-throughput, low-latency data lakes and warehouses (e.g., S3, Delta Lake, Redshift/Snowflake).
- Enable ML Ops workflows: Integrate DVC or MLflow, automate model training/retraining triggers, track data/model lineage.
- Ensure data quality: Implement validation, monitoring, and alerting to catch anomalies and schema changes early.
- Collaborate cross-functionally: Partner with Embedded Systems, Robotics, and Software teams to align on data schemas, APIs, and real-time requirements.
- Optimize performance: Tune distributed processing, queries, and storage layouts for cost-efficiency and throughput.
- Document & evangelize: Maintain clear documentation for data schemas, pipeline architectures, and ML Ops practices to uplift the whole team.
Required Qualifications
- 3+ years of hands-on experience building production data pipelines in the cloud (AWS, GCP, or Azure).
- Proficiency in Python, SQL, and at least one big-data framework.
- Familiarity with ML Ops tooling: DVC, MLflow, Kubeflow, or similar.
- Experience designing and operating data warehouses/data lakes (e.g., Redshift, Snowflake, BigQuery, Delta Lake).
- Strong understanding of distributed systems, data serialization (Parquet, Avro), and batch vs. streaming paradigms.
- Excellent problem-solving skills and the ability to work in ambiguous, fast-paced environments.
Preferred Skills
- Background in robotics or sensor data (radar, LiDAR, camera pipelines).
- Knowledge of real-time data processing and edge-computing constraints.
- Experience with infrastructure as code (Terraform, CloudFormation) and CI/CD for data workflows.
- Familiarity with Kubernetes and containerized deployments.
- Exposure to vision-language or action-planning ML models.
Additional Information
Join us and play a pivotal role in transforming the automation landscape in heavy industries.
Role Overview
As our Data Engineer, you will design, build, and maintain the data infrastructure that powers sensmore's embodied AI and Vision-Language-Action Models (VLAMs). You'll collaborate with Robotics, ML, and Software engineers to ensure clean, reliable data flows from our sensor arrays (radar, LiDAR, cameras, IMUs) into training and inference pipelines. This role blends classic data engineering (ETL/ELT, warehouse design, monitoring) with ML Ops best practices: model versioning, data drift detection, and automated retraining.