Senior Data Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Data Infrastructure Orchestration - Build and maintain cloud native infrastructure for Data Platform (AWS, Terraform)
- Data Pipelines - Create new pipelines and improve/maintain existing pipelines using Spark (Python, Pyspark, SQL)
- Data Modeling - Partner with analytic consumers to design logical and physical schemas, improve existing data models and build new ones
- Cross-functional Collaboration - Interface with Product, Engineering, Data Science, Analytics/BI, and Operations to understand their data needs, providing both consultative and data engineering solutions for consumers
- Build data expertise and own data quality across various business domains including healthcare claims and member experience
- Manage the Business Intelligence development lifecycle, from semantic model development and version control to user administration, ensuring high data quality and consistency from the pipeline through to the visualization layer.
- Enable fellow developers to "self-service" their data needs
- Leverage best in industry practices to build the next generation data ecosystem to collect, move, store and analyze data
- To be successful in this role, you'll need:
- BS degree in Computer Science or related technical field, or equivalent practical experience
- 4+ years proven work experience as a data engineer, working with at least one programming language (e.g. Scala, Python/PySpark) plus SQL expertise
- 4+ years experience with schema design, dimensional data modeling, and large-scale data warehousing architecture
- Expertise in building data pipelines through efficient ETL design, implementation and maintenance
- Background working with distributed data systems such as Spark, Presto, Hive, and Redshift. Experience with BI platform administration and/or schedulers/workflow management tools (e.g. Airflow) a plus
- Excellent communication skills to collaborate with stakeholders in Engineering, Product, Data Science, Analytics/BI, and Operations
- Pay Transparency Statement
- This is a hybrid position based out of our Plano office, with the expectation of being in office at least three weekdays per week. #LI-hybrid
- Plano, TX Pay Range
- $135,080 - $168,850 USD
- Why Join Us?
- Mission-driven culture that values innovation, collaboration, and a commitment to excellence in healthcare
- Impactful projects that shape the future of our organization
- Opportunities for professional development through internal mobility opportunities, mentorship programs, and courses tailored to your interests
- Flexible work arrangements and a supportive work-life balance
- Privacy Notice
- For more information about why we need your data and how we use it, please see our privacy policy: https://collectivehealth.com/privacy-policy/ .
Benefits
Additional Information
At Collective Health, we're transforming how employers and their people engage with their health benefits by seamlessly integrating cutting-edge technology, compassionate service, and world-class user experience design. We deliver a connected healthcare experience for over a quarter million members and 60+ companies across the nation who want the best for their employees. We've got a ton of interesting problems to solve around data pipeline design and implementation, data architecture and modeling, distributed systems, and more. If you're passionate about tackling hard problems while making a real difference in the world, we'd love to talk!
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at collectivehealth? Share your experience