Senior DevOps Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
Who are we? So you might ask, who's CreditorWatch? Well, we are a leading Australian data and technology company that provides businesses with access to unique data and innovative products. By using our platform, our customers can confidently manage their commercial relationships, improve productivity and reduce financial risk. As a commercial credit reporting bureau, we offer a complete suite of credit reporting products and data insights covering the entire customer lifecycle, from customer onboarding and credit decision automation to credit risk management and automated collections. We were established in 2010 and most recently were named as one of AFR's Top 10 Best Places to Work as well as certified by Great Place to Work consecutively across 2022-2025. We saw significant growth in 2025 and that's not about to change. We are on track to break records in 2026, scaling at pace, making this the perfect time to join CreditorWatch. Our Purpose โ Empower Australian businesses to trade confidently with their customers. Our Mission ๐ We aim to be number one in our industry by delivering unique data insights and innovative products. Your Role & Team We're hiring a Senior DevOps Engineer to be a senior individual contributor across our platform agenda. This is not a ticket-queue role. You'll own meaningful slices of the platform end-to-end; driving cloud cost efficiency, making reliability measurable and enforceable, building self-service developer tooling and API infrastructure, and using agentic AI harnesses (including Claude Code and MCP) as a genuine force-multiplier in everyday operations. You'll set technical direction in your areas and raise the bar across a team that punches well above its size. We run a substantial production platform: dozens of containerised services on AWS ECS Fargate, a fleet of Aurora databases, a large multi-repo GitHub estate, and a Datadog-based observability stack. We're a deliberately lean, high-leverage DevOps team - we scale our impact through automation, strong self-service tooling, and AI rather than through headcount. You'll report directly to the Head of DevOps in this role . Please note, it's a full-time opportunity offering hybrid working conditions out of our Sydney CBD Office . Some of your responsibilities include and are not limited to: Drive cost efficiency across a multi-account AWS environment: cost dashboards, budgets, right-sizing of compute and databases, log-volume reduction, storage lifecycle policies, and Savings Plan coverage. Eliminate platform risk by retiring end-of-life runtimes, simplifying over-engineered infrastructure, and consolidating accounts with consistent guardrails (SCPs) and tagging. Bring AI/Bedrock spend under governance with attribution and budgets. Extend SLO coverage across production services and manage SLOs and monitors as code in Terraform. Make error budgets actionable, cut alert noise, measure MTTD/MTTR, and integrate alerting cleanly into the incident management workflow. Establish and run disaster-recovery drills with documented recovery objectives. Extend our reusable CI/CD workflows and golden-path service templates so new services reach production in hours, not days. Build out an internal developer platform (self-service deployments, ephemeral environments) and a production API Gateway for external APIs. Move secrets to runtime injection via AWS Secrets Manager, harden the container build pipeline, and consolidate infrastructure-as-code onto a single approach. Apply agentic AI to incident root-cause analysis and IaC authoring, with sensible human-in-the-loop guardrails. Our ideal candidate 5+ years in DevOps/ Platform/ SRE/ Cloud Engineering, with senior ownership of production systems. Deep AWS expertise - ECS/ Fargate, RDS/Aurora (MySQL & PostgreSQL), Lambda, IAM, VPC, S3, CloudWatch, API Gateway - and comfort operating a multi-account AWS Organisation. Terraform at production scale: modules, workspaces, multi-environment, and code-review discipline. CI/CD with GitHub Actions - reusable workflows, composite actions, and OIDC-based cloud authentication. Containers & Docker - building, securing, and operating containerised workloads. Observability - hands-on with Datadog (or Prometheus/Grafana/New Relic equivalents): APM, SLOs, monitors-as-code, and log/metric/trace pipelines. Cloud cost management (FinOps) - right-sizing, Savings Plans/RIs, and cost attribution through tagging. Reliability practice - SLOs, error budgets, on-call, blameless RCA, and MTTR/MTTD measurement. Strong scripting (Python and/or Bash) and the ability to work across a polyglot codebase (PHP, TypeScript, Python). Security-conscious by default: least-privilege IAM, secrets management, and WAF/GuardDuty/Security Hub-style controls. AI-augmented engineering - fluent with agentic coding harnesses (e.g. Claude Code) and MCP, building reusable skills and prompts to automate operational toil. You treat AI as a core part of your workflow, not an afterthought.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at CreditorWatch? Share your experience