Data Engineer (Databricks) - Remote USA

ICF · Reston, VA

Contract · Remote · Today

Agile · Apache · Azure · CI/CD · Confluence · ETL

Responsibilities

Enable secure, scalable, and efficient data exchange between the federal client and external data-sharing partners using Databricks Delta Sharing.
Support the design and development of data pipelines and ETL routines in an Azure cloud environment for many source system types, including RDBMS, API, and unstructured data, using CDC, incremental, and batch loading techniques.
Conduct data profiling, transformation, and quality assurance on structured, semi-structured, and unstructured data.
Identify underlying issues and translate them into technical requirements.
Assist in building and optimizing data lakes, feature stores, and data warehouse structures to support analytics and machine learning.
Prepare, structure, and validate data for data science and MLOps workflows, ensuring it meets the quality and format requirements for modeling.
Help monitor and maintain the flow of data across BI dashboards, analytics environments, and machine learning pipelines.
Engage directly with clients and stakeholders to understand data needs and translate them into scalable solutions.
Collaborate with UX designers, business analysts, developers, and end users to define data and reporting requirements.
Work with external data partners to determine their data product needs, and work within the Databricks platform to enable rapid prototyping and extensible use cases.
Meet with government employees at executive levels, platform stakeholders, and vendor partners.
Work within Agile teams to support iterative development, backlog grooming, and sprint-based delivery.
Provide mentorship to junior resources.
Technologies you'll use:
Databricks on Azure for data engineering and ML pipeline support.
SQL, Python, Spark, Tableau.
Git and CI/CD tools (e.g., Jenkins, CodeBuild).
Jira, Confluence, SharePoint.
Mural, Miro, or other collaboration/whiteboarding tools.
Minimum Requirements:
Bachelor's degree.
5+ years in data engineering, data security practices, data platforms, and analytics.
U.S. Citizenship required due to federal contract requirements.
Ability to obtain and maintain a federal public trust clearance or equivalent client-required background investigation.
Candidate must reside in the U.S., be authorized to work in the U.S., and all work must be performed in the U.S.
Candidate must have lived in the U.S. for three (3) full years out of the last five (5) years.
What we would like you to bring:
3+ years of Databricks platform expertise at SME-level proficiency, including Databricks, Delta Lake, and Delta Sharing
Deep experience with distributed computing using Apache Spark
Knowledge of Spark runtime internals and optimization
Ability to design and deploy performant end-to-end data architectures
4+ years of ETL Pipeline Development building robust, scalable data pipelines
Databricks certifications - Professional or specialty certifications
Hands-on experience with Azure services such as Synapse, Data Factory, or Databricks.
Familiarity with data visualization tools such as Tableau, Power BI, or similar.
Machine Learning and Analytical Skills including:
MLOps - Working knowledge of ML deployment and operations
Data Science Methodologies - Statistical analysis, modeling, and interpretation
Big Data Technologies - Experience beyond Spark with distributed systems
Experience with deployment pipelines, including Git-based version control, CI/CD pipelines, and DevOps practices using Terraform for IaC.
Emergency management domain knowledge a plus
Advanced proficiency in data engineering and analytics using Python; expert-level SQL skills for data manipulation and analysis; experience with Scala preferred but not required (Pyt

Benefits

Remote work options

Additional Information

Our Digital Modernization and Experience (DMX) Group is growing, and we are looking for a motivated, experienced Senior Databricks SME who is passionate about turning complex data into actionable solutions that improve public systems and services. This role supports an enterprise initiative focused on platform infrastructure and analytics modernization for a federal customer. You'll be joining a cross-functional team of full stack developers, data engineers, and data analysts working within a modular, cloud-native platform supporting the emergency management sector. Your work will help ensure disaster management and mitigation decision-makers have access to accurate, timely, and meaningful data and data products to drive effective service delivery and measurable mission outcomes. If you thrive in a collaborative environment and enjoy working independently to solve real-world challenges through data, we want to hear from you.

Job Location: This position is fully remote with up to 10% travel to the DC Metropolitan area for client meetings. This position requires that the job be performed in the United States. If you accept this position, you should note that ICF monitors employee work locations, blocks access from foreign locations and foreign IP addresses, and prohibits personal VPN connections.
