Lead Software Engineer - Data Engineering
Responsibilities
- Leadership & Delivery
- Lead and mentor a team of data engineers and platform developers
- Drive Agile execution and ensure predictable, high-quality delivery
- Establish engineering best practices, code quality, and CI/CD standards
- Data Platform & Architecture
- Architect scalable and secure data platforms on AWS
- Design robust data ingestion frameworks for batch and near real-time pipelines
- Define best practices in data modeling, governance, and metadata management
- Data Engineering & Ingestion
- Lead design and development of scalable ingestion pipelines (structured and unstructured data)
- Build and optimize Snowflake-based data platforms for performance and cost
- Enable ingestion of diverse sources (databases, APIs, files, streaming data)
- Cloud & Platform Engineering
- Leverage AWS services (S3, Glue, Lambda, EMR, Redshift, etc.) for end-to-end pipelines
- Implement CI/CD pipelines using Azure DevOps / Jenkins
- Ensure system scalability, resiliency, and operational readiness
- Software Engineering Excellence
- Enforce software engineering principles (modular design, code quality, testing, version control)
- Drive automation and continuous improvement
- Promote reusable frameworks for ingestion and transformation
- Stakeholder Collaboration
- Partner with product managers, SMEs, and business stakeholders
- Translate business needs into scalable data solutions
- What You Have:
- 10+ years of experience in Data Engineering / Data Platform roles
- Strong experience in AWS data ecosystem (S3, Glue, Lambda, EMR, Redshift)
- Deep expertise in Snowflake (architecture, optimization, data modeling)
- Strong programming skills in Python and SQL
- Extensive experience with data ingestion pipelines and ETL/ELT frameworks
- Exposure to real-time streaming (Kafka, Spark Streaming)
- Experience with CI/CD tools (GitHub, Jenkins, AWS CloudFormation, etc.)
- Solid understanding of distributed systems and scalable architectures
- Strong foundation in software engineering principles (Git, testing, design patterns)
- Experienced in working with Agile teams
- Collaborate with Data Science and AI teams to operationalize ML models and analytics workflows.
- Promote integration of AI capabilities into data engineering pipelines (e.g., GenAI, MCP, ATA).
- Support real-time analytics and edge AI use cases in manufacturing environments.
- Use AI extensively in building and testing data ingestion and data pipelines
- This position requires the candidate to work a 5-day-a-week schedule in the office
- Nice-to-Have Skills
- Experience with Graph Databases (Neo4j, Neptune)
- Experience with Vector Databases (Milvus, OpenSearch)
- Knowledge of NVIDIA ecosystem and RAPIDS (cuDF, cuML, cuGraph)
- Experience integrating AI/ML pipelines or GenAI workflows
Requirements
- Bachelor's or Master's degree in Computer Science / Engineering
- Proven track
Additional Information
Career Area: Technology, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do - but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here - we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
What does it mean to live the Caterpillar Experience? It means you play a part in building a better world. You contribute to a winning culture - a spirit of accountability - that has driven change on every continent for more than 90 years. To work with us is a commitment to improve sustainability, invent for the digital age and innovate in order to bring solutions to millions of people. Sure, you work for Caterpillar. But more importantly, you work for the good of your career, your family, your community, and your world.
Job Summary
We are seeking a highly skilled Lead Software Engineer - Data Engineering to lead the development of Caterpillar's next-generation Digital Manufacturing Data Platform. This platform enables large-scale data ingestion, transformation, and analytics across manufacturing, supply chain, and engineering ecosystems.
The ideal candidate will bring deep expertise in data engineering, large-scale data ingestion, AWS-based architectures, and Snowflake, along with strong leadership and software engineering discipline. The preferred location for this role is Chennai - Brigade World Trade Center.