Skip to main content
Back to jobs

Senior Data Platform Engineer

External
stackav logoStackav · Pittsburgh, PA
Full-timeRemote6d ago
ApacheCI/CDFeature EngineeringKafkaMachine LearningMove
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

In the Compute Platform team, our mission is to provide the foundational compute platform that powers large-scale autonomous systems development. The team is responsible for enabling engineers and researchers to efficiently run compute and data intensive workloads on Stack AV infrastructure. The Data Platform team is responsible for designing, implementing and maintaining the Stack AV on-premises data platform. The team supports large scale OLAP/OLTP and feature engineering workloads for multiple Product Development groups across the company. You will work at the intersection of infrastructure, distributed systems, and developer experience-ensuring that our critical services and pipelines are reliable, efficient, and easy to run. As a Senior Data Platform Engineer, you will design and operate high scale data systems that power engineers across the company.

Responsibilities

  • Design and operate distributed storage systems for scheduling and executing large-scale batch workloads.
  • Build and maintain an open source, modern data platform.
  • Optimize utilization of storage resourcesImprove reliability and fault tolerance of large-scale storage systems and data platform components.
  • Collaborate with teams across the company to understand workload requirements and improve platform capabilities.
  • Contribute to platform tooling, automation, and CI/CD workflows.

Requirements

  • 7+ years of experience building and operating distributed storage systems or modern data platforms.
  • Experience operating streaming platforms such as Kafka or Pulsar.
  • Fluent in Python, and SQL, with experience writing and maintaining highly available data applications using Trino and Apache Spark.
  • Knowledge of table formats (Iceberg, Delta Lake, Hudi, Xtable).
  • Experience operating and optimizing at least one RDBMS (Postgres, MySQL).
  • Strong debugging and problem-solving skills in complex distributed systems.
  • Ability to collaborate across teams and communicate technical concepts clearly.
  • We are proud to be an equal opportunity workplace. We believe that diverse teams produce the best ideas and outcomes. We are committed to building a culture of inclusion, entrepreneurship, and innovation across gender, race, age, sexual orientation, religion, disability, and identity.
  • Check out our Privacy Policy.

Additional Information

About Stack: Stack is developing revolutionary AI and advanced autonomous systems designed to enhance safety, reliability, and efficiency of modern operations. Stack's autonomous technology incorporates cutting-edge advancements in artificial intelligence, robotics, machine learning, and cloud technologies, empowering us to create innovative solutions that address the needs and challenges of the dynamic trucking transportation industry. With decades of experience creating and deploying real world systems for demanding environments, the Stack team is dedicated to developing an autonomous solution ecosystem tailored to the trucking industry's unique demands.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at stackav? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect