Fde - Data
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Requirements
- Required skills: AWS Glue, Lake Formation, S3, Athena, Python/PySpark, IaC (Terraform/CDK), SQL
- Great Expectations, SHIP-HATS CI/CD, data cataloging, prior public sector / Government Agency experience
Additional Information
Develop 20 data table ingestion pipelines end-to-end (source extraction → S3 landing → Glue ETL → Lake Formation curated zone) Implement ETL transformations per data mapping specifications Build data quality validation rules (Great Expectations / custom Glue checks) - completeness, schema conformance, referential integrity Configure error handling and dead-letter patterns for failed ingestion records Register all datasets in AWS Glue Data Catalog with standardised metadata tags (owner, classification, freshness SLA) Write and maintain IaC (CDK/Terraform) for pipeline resources - Glue jobs, crawlers, S3 buckets, IAM roles Execute unit testing (per-transform logic) and integration testing (end-to-end flow with sample data) Support UAT with Agency A data owners - validate output tables match expected schema and row counts Document pipeline configurations, runbooks, and data flow diagrams for handover Participate in daily stand-ups, sprint demos, and code reviews
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at ONEBYZERO PTE. LTD.? Share your experience