Skip to main content
Back to jobs

Platform Delivery Associate Lead

External
trustbank logoTrustbank · Singapore
Full-timeOn-site1mo ago
AWSChaos EngineeringCI/CDForecastingHelmIAM
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


Responsibilities

  • Own the observability platform. Logs, metrics, traces, events - the full picture. Make signals actionable, reduce alert fatigue, and give every team the tools to understand their systems in production.
  • Build agentic SRE systems. Design and ship AI agents that investigate incidents, execute runbooks, automate remediation, and close the loop on production issues. Bring context and harness engineering discipline to agent design: grounded context, well-scoped tools, evals, and guardrails.
  • Automate the platform. Turn manual processes into self-service. Every recurring operational task is a candidate for elimination.
  • Drive resiliency automation. Lead initiatives such as chaos engineering, cost and performance optimization, and capacity forecasting.
  • Lead through influence. Set technical direction, mentor engineers, help unblock them when they hit issues, and write the design docs that get referenced for years. Raise the bar on what "operational excellence" means here.
  • Be the calm in the storm. Jump in to diagnose and remediate critical infrastructure issues. Support the team during critical deployments - you're expected to be hands-on when it matters most.

Requirements

  • 8+ years in SRE, platform, or infrastructure engineering, with deep production ownership of a non-trivial cloud platform.
  • Strong system design chops. You can reason end-to-end about distributed systems and event-driven architectures, including failure modes and the tradeoffs between consistency, availability, latency, and cost.
  • AWS and Kubernetes experience. Comfortable with EKS internals, networking, IAM/IRSA, service mesh (Istio), and the control-plane/data-plane split.
  • Hands-on with the rest of our stack - Kafka, Aurora PostgreSQL / RDS PostgreSQL, S3, ELB - or equivalent managed data and networking primitives.
  • Proven track record owning resiliency in a high-velocity environment: incident command, postmortems that actually change things, SLO-driven engineering, progressive delivery.
  • Experience with context engineering and harness engineering - building LLM and agent systems where the prompt, retrieved context, tool surface, and eval harness are first-class engineering artifacts.
  • A clear vision for what zero-downtime, zero-touch, zero-trust looks like in practice, and a track record of moving real systems toward it.
  • Fluent in IaC (Terraform, Helm, or similar) and CI/CD. You believe platforms are products and treat them as such.
  • FinOps mindset. Comfortable analyzing cloud cost reports, identifying inefficiencies, and driving optimization initiatives that balance cost with performance and reliability.
  • Our ideal candidate has
  • Financial services or other regulated-environment experience.
  • Experience integrating AI/LLM systems into production operational workflows - not just experiments.
  • Expertise in OpenTelemetry and the broader observability ecosystem.
  • Experience with Sumologic and PagerDuty (or equivalent observability and incident management tooling).
  • A continuous learning mindset - you stay current and bring new ideas to the table.
  • Strong stakeholder communication and management skills - you can translate technical concepts for non-technical audiences and build alignment across teams.
  • If you apply for a job with Trust or submit any personal information in connection with a possible job opportunity, you agree to our privacy notice for job applicants.
  • Come as you are! Trust is an inclusive and open-minded workplace. If you are good at what you do and care about doing a good job, that's what we focus and want from you. So come as you are. 😊

Benefits

Vision insuranceParental leave

Additional Information

Trust is the first of a new breed of banks in Singapore - digitally native and focused on delivering a delightful customer experience. You will work in a fast-paced and collaborative environment to solve new and interesting challenges each day. Together with our Trust team, you will help shape the future of our bank. We're a fast-growing cloud-native digital bank looking for a Lead SRE / Platform Engineer to drive initiatives to improve the reliability, resilience, and efficiency of the platform that powers our products. This is a hands-on role with leadership responsibilities: you'll help set the technical direction for how we operate, scale, and secure our infrastructure, and you'll build the systems that let us move quickly without breaking things. You'll be part of a core team that is trusted to keep it running smoothly, make it faster, and turn toil into code.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at trustbank? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect