Platform Engineer, CloudOps Infrastructure
Responsibilities
- Build and maintain standardized, reproducible, secured deployment templates.
- Develop and operate the orchestration layer (workflow orchestration, e.g. Prefect) and the GPU-backed compute paths that run training and inference, both on a schedule and on demand.
- Own infrastructure-as-code, CI/CD pipelines, and the tooling that makes standing up, updating, and tearing down an environment routine and auditable.
- Build observability (metrics, logging, alerting) that gives a clear picture of system health across environments.
- Run and improve the system day to day (DevOps/CloudOps): drive operational practices that emphasize stability, predictability, and low overhead, and partner with the infosec function on infrastructure security posture.
- Adapt and extend the platform as ML researchers introduce new workflows and models, turning new requirements into supported, repeatable capabilities.
- Evaluate and integrate orchestration and deployment technologies; prototype, then harden the patterns that work.
- Participate in design discussions, code reviews, and documentation.
Requirements
- Strong Terraform/OpenTofu engineering skills and hands-on AWS (or comparable cloud) experience.
- Production experience with containerization, container orchestration (e.g. ECS), and CI/CD pipelines.
- Experience with infrastructure-as-code and reproducible environments.
- Solid understanding of distributed systems fundamentals.
- Strong operational instincts: observability, debuggability, and maintainability.
- Experience operating multi-tenant or large-fleet platforms preferred.
- Experience with workflow orchestration (Prefect, Airflow, Dagster, or similar) and/or GPU compute platforms preferred.
- Familiarity with GPU-backed environments and ML training/inference pipelines preferred.
- Awareness of infrastructure security posture and compliance frameworks (e.g. ISO/IEC 27001 or similar) preferred.
- Strong written communication and the ability to work effectively in a distributed team.
Tech Stack & Environment
- Infrastructure-as-code (Terraform/OpenTofu & Terragrunt)
- AWS (ECS and related services)
- Automated CI/CD with containerized services
- PostgreSQL and object storage (S3)
- Python with modern tooling (e.g. uv, pixi)
- Workflow orchestration (e.g. Prefect); GPU-backed compute for ML workloads
- FastAPI with a clean service-layer architecture (business logic isolated from transport)
- Agentic coding tools (e.g. Claude Code) as part of day-to-day development
Job Summary
Iambic is building a secure, cloud-based platform for running our ML-driven drug-discovery workflows. You'll build and own the platform that lets us run our drug-discovery ML workloads reliably and reproducibly: the deployment templates that make standing up a new environment routine, the orchestration layer that runs model training and inference, and the tooling, CI/CD, and observability that keep everything healthy. It's a mix of building new platform capabilities and keeping scientists' workloads running dependably and on time.
This is a hands-on DevOps/CloudOps role as much as a build role: you'll operate and improve the running system day to day, and adapt the platform as our ML researchers introduce new workflows and models. It suits an engineer who treats infrastructure as a product. This position is based out of our new Ireland office.