Data Lake Engineer

External
Sosi1 · Doral, FL
Full-time · On-site · 3mo ago
Apache · AWS · Azure · CI/CD · Cloud Security · Compliance

About the role

SOSi is seeking a Data Lake Engineer to support mission requirements for a structured approach to further develop, integrate, and sustain a scalable, federated data ecosystem that enhances interoperability, governance, and mission-driven analytics for a DoD customer. The primary objective of the program is to bridge the operational gaps between DoD, IC, interagency, and non-traditional international partners to enable real-time information sharing, dynamic data integration, and mission-tailored analytical capabilities.

Essential Job Duties:

The contractor shall design, implement, and maintain scalable Data Lake architectures to support structured and unstructured data ingestion, ensuring efficient data access and retrieval.

The contractor shall configure and manage the integration interface between the Data Lake and the knowledge graph platform (Stardog), including SPARQL endpoint access, metadata federation, and catalog alignment.

The contractor shall follow access control policies and usage scope defined by the Government and other coordinated Work Orders.

The contractor shall confirm compliance with access policies on a quarterly basis and document the results in the Data Governance & Compliance Report.

The contractor shall optimize ETL pipelines for high-volume data transformation, ensuring compliance with DoD IL-4/IL-5 security standards.

The contractor shall implement storage tiering strategies and access controls, ensuring data is properly classified, retained, and accessed per DoD governance requirements.

The contractor shall submit the Data Lake Performance & Optimization Report, detailing ingestion efficiency, access control improvements, and storage utilization metrics.

Active TS/SCI Clearance.

Master's degree or higher (e.g., Ph.D.) in Computer Science, Information Technology, Systems Engineering, Data Science, Business Administration, Engineering Management, or a closely related field, or a minimum of eleven (11) years of experience managing complex technical projects in enterprise data architecture, Databricks administration, and cloud-based data platforms.

Knowledge and capability to support Data Lake platform administration and enterprise data architecture for DoD data-driven projects.

Skilled in Data Lake platform administration, including workspace management and configuration, cluster optimization and performance tuning, cloud integration, and Unity Catalog integration for secure data governance.

Proficient in ETL/ELT pipeline development, Delta Lake architecture and optimization, AI/ML workflow integration, and Data Lakehouse optimization for DoD analytics and mission-critical data workflows.

Experienced in SysEngOps, DevSecOps, version control systems (Git), and CI/CD pipelines to streamline Data Lake development and deployment.

Knowledgeable in identity and access management (IAM), role-based access control (RBAC), and cloud security best practices across AWS, Azure, and GCP.

Hands-on expertise in Python, SQL/NoSQL, Apache Spark, Databricks SQL, Terraform, and cloud-native data services for large-scale data processing and analytics.

Work Environment: Normal office conditions.

Working at SOSi: All interested individuals will receive consideration and will not be discriminated against for any reason.

Interested in this role?

Apply on the company's website.