Data Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
Clasp is a venture-backed, mission-driven startup transforming access to education and career pathways. We are revolutionizing the way employers attract and retain critical talent, while simultaneously tackling the student debt crisis. (Yep, we think BIG.) Our innovative platform meaningfully connects employers, educational institutions, and diverse talent to drive mutual benefit-using accessible education financing as the thread. We like to think of ourselves as more than a fintech; we're a catalyst for economic mobility. A Forbes Fintech 50 company, portfolio company of SHRM (Society of Human Resource Management - the largest HR organization out there!) and recipient of "Startup of the Year" by StartUp Boston, Clasp is driven by our commitment to social impact and innovation. We are reshaping the future of the workforce one opportunity at a time. Join us on our journey to give power to learners and unlock fulfilling careers that drive positive change in their communities and beyond. The Role - Data Engineer We are seeking a Data Engineer to build & operate reliable data pipelines that ingest and make accessible to our team and partners the data on hundreds of millions of dollars worth of student loans, educational enrollment statuses and employment data. The ideal candidate has both strong programming experience in python and is passionate about both high performance data pipelines as well as the analytics that they fuel. At Clasp we run a DevOps culture where the engineers have full ownership of the code they write & the infrastructure on which it runs. Candidates should be enthused about making substantial contributions to the architecture driving the product roadmap and the Stride business, and achieving tremendous personal growth with us along the way! Our modern Data Technology Stack consists primarily of Airflow, dbt, Postgresql, Superset, BigQuery, Python and SQL
Responsibilities
- Data Pipeline Development & Reliability
- Build and maintain scalable data pipelines that ingest and transform financial, education, and employment data
- Ensure data is reliable, timely, and accessible across internal teams and external partners
- Proactively identify and resolve data quality issues, including upstream dependencies
- Partner with stakeholders in finance, operations and business development departments to ensure that their data sets are reliably ingested into our data warehouse and their questions or reports can be answered programmatically
- Ensure via automated testing and edge case handling that we can detect any errors with upstream data or data processing and that all reports contain the data as expected
- Develop processes to anonymize and protect sensitive data across environments
- Cloud Infrastructure & Data Platform Operations
- Support the configuration and optimization of data warehouse, storage, and compute resources
- Partner in managing infrastructure as code (e.g., Terraform) to ensure reproducibility and scalability
- Monitor performance and cost efficiency of data workloads, identifying opportunities for optimization
- Contribute to improving the reliability, scalability, and observability of our data platform
Requirements
- 3+ years of experience as a data engineer, or equivalent experience building data pipelines
- 2+ years of experience with elements of the following technologies:
- Data Analytics: Airflow, DBT
- Business Intelligence Systems: Tableau, Looker, Sisense, Apache Superset, etc.
- Languages: Python, SQL
- Databases: SQL; NoSQL a bonus
- Bonus - Data Warehousing: Snowflake, BigQuery, etc.
- Bonus -- Cloud Infrastructure: AWS or Google Cloud Experience
- Bonus -- DevOps: Experience with modern cloud and container tooling such as Docker, Kubernetes, Terraform, etc.
- Strong communication and collaboration skills, with the ability to effectively communicate the complexities of technical programs to both technical and nontechnical stakeholders
- Desire to mentor and collaborate with other members of the team
- Willingness to roll your sleeves up to rapidly acquire competencies in a wide range of technical disciplines
- Bachelor's degree in Computer Science, Software Engineering, Information Systems, or equivalent experience
- Why This Role Is Compelling
- Modern tech stack with the ability to have impact to many different user personas - recruiters, students and more
- High autonomy working in a highly collaborative team
- Can grow into a more formal people management role - expectation though is to be very hands on
- Step into being a 10x engineer and ride the AI wave with a team doubling down on how this technology allows us to focus on harder problems and solve for our customer's needs without compromising on quality
- Remote Policy
- To ensure smooth collaboration with the Boston-based team, we are limiting to Eastern/Central timezones with the expectation that it will be Boston hours. Additionally, they must be within 1
Benefits
Additional Information
Data Engineer Location: Boston, MA Open to Remote in EST or CST Locations
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Clasp? Share your experience