Data Engineer (Python, Data Systems & AI Enablement

External

Key Connect Recruitment · Singapore

S$96K–S$174K/yrContractUnknownToday

Information Technology

Cover Letter Connect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role

Responsibilities

Build and maintain scalable data pipelines using Python
Write production-grade Python code specifically for data processing, transformation, and ETL workflows
Perform data cleaning, preprocessing, and feature preparation for analytics and AI use cases
Use data analysis and manipulation tools to handle large datasets efficiently
Develop reusable Python modules for data ingestion and pipeline automation
Perform exploratory data analysis (EDA) to understand data patterns and quality issues
Optimize data workflows for performance, scalability, and reliability
Support data requirements for AI/ML and Generative AI systems
Build data services and APIs to support downstream AI applications
Ensure data quality, consistency, and observability across pipelines
Required Python & Data Libraries (Hands-on Experience Mandatory)
Candidates must have strong practical experience with:
pandas - data manipulation, transformation, and analysis
NumPy - numerical operations and array-based processing
Matplotlib - data visualization and reporting
scikit-learn - basic ML workflows and model evaluation
PyTorch - deep learning and AI model experimentation
AI / Generative AI Enablement
Prepare and structure datasets for ML and LLM-based systems
Support integration of AI models into data pipelines and applications
Enable workflows for Generative AI use cases (RAG systems, agent workflows)
Work with multiple AI model providers:
OpenAI
Anthropic
LLaMA
Mistral
Exposure to AI orchestration frameworks such as LangChain, AutoGen, and CrewAI
Core Requirements
Strong hands-on Python coding expertise focused on data systems (critical requirement)
Ability to write clean, efficient, production-grade Python code
Strong understanding of data structures, ETL pipelines, and data workflows
Experience working with large-scale structured and unstructured data
Strong SQL skills for data extraction and manipulation
Understanding of data modeling and analytics workflows
Ability to support end-to-end data-to-AI pipelines
Preferred / Good to Have
Experience with big data or distributed processing systems
Understanding of vector databases and embedding-based retrieval systems
Experience building APIs or services for data/AI systems
Familiarity with cloud platforms (AWS, Azure, GCP)
Exposure to production monitoring and data observability tools
What Success Looks Like
High-quality Python code powering scalable data pipelines
Reliable, clean, and well-structured datasets for AI systems
Efficient ETL workflows with minimal manual intervention
Seamless support for ML and GenAI applications in production

Additional Information

Job Title: Data Engineer (Python, Data Systems & AI Enablement) Role Overview Python-focused Data Engineer with strong hands-on coding skills in data-intensive systems. The role focuses on building scalable data pipelines, processing large datasets, and enabling AI/Generative AI applications through well-structured data infrastructure.

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at KEY CONNECT RECRUITMENT PTE. LTD.? Share your experience

Interested in this role?

Apply on the company's website.

Cover Letter Connect