AI Operations Engineer (Python)
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Keep existing agentic and multiagent AI solutions reliable, secure, and performant in production as usage and requirements evolve.
- Act as the point of contact for AI system issues, triaging, diagnosing, and resolving incidents while keeping stakeholders informed throughout.
- Own monitoring, observability, and quality across the AI estate, going beyond uptime to track health signals specific to agents, such as output quality, model and prompt regressions, and guardrail or tool call failures.
- Drive FinOps for AI workloads by tracking, attributing, and optimising the cost of LLM usage, compute, and cloud infrastructure to keep solutions efficient at scale.
- Manage and improve the underlying infrastructure across a stack including Python, AWS, Postgres, and Cloud Foundry and/or Kubernetes, applying sound operational and DevOps practices.
- Identify and remediate security vulnerabilities across the AI estate, patching dependencies and resolving scan findings so systems stay compliant with Sky's security standards and agentic AI governance.
- Support and maintain low code automation built in Copilot Studio and Power Automate, ensuring these flows stay reliable, governed, and integrated cleanly with the wider AI platform.
- As this is a production operations role, on-call cover may be required in the future to keep critical AI systems running reliably outside core hours.
- Essential criteria:
- Solid software engineering experience in Python, with a good understanding of Agile delivery in large scale enterprise environments.
- Experience supporting or operating production systems, ideally AI driven or data intensive, with strong monitoring, observability, and distributed systems diagnosis skills.
- Practical experience with cloud (particularly AWS), relational databases such as Postgres, and familiarity with container orchestration or PaaS such as Kubernetes and Cloud Foundry.
- Direct experience with low code automation in Copilot Studio and Power Automate, including configuring, troubleshooting, and supporting flows and their integrations.
- S trong communication and stakeholder management skills, comfortable being the calm point of contact during incidents and translating technical issues for business audiences.
- Practical experience keeping systems secure, managing dependencies, remediating vulnerabilities, and working to enterprise security standards.
- Desirable skills and experience:
- Exposure to LLM or agentic AI systems, FinOps cost management, or AI governance and responsible AI controls.
- Benefits and perks
- There's one thing people can't stop talking about when it comes to life at Sky: the perks . Here's a taster:
- Free Sky TV or NOW package , including Sky Sports and Sky Cinema
- Pension package with up to 9% employer contribution
- Private healthcare with mental health support
- Aviva Digital GP and dental insurance
- Discounts on Sky products, including Sky Mobile, Sky Broadband, Sky Glass and Sky Protect
- Sharesave and Tech schemes
- A range of Sky VIP rewards and experiences
- How you'll work
- We've adopted a hybrid working approach to give more flexibility on where and how we work. The hybrid working expectations for this role are 2 days in the office per week. The role is available in our Osterley or Livingston office.
- Your office base
- Osterley
- Our Osterley Campus is just a 10-minute walk from Syon Lane train station, or you can get on
Benefits
Additional Information
We don't just believe in better. We make it happen. Better content. Better products. And better careers. Working in Tech, Product or Data at Sky is about building the next and the new. From broadband to broadcast, streaming to mobile, Sky Stream to Sky Glass, we never stand still. We optimise and innovate. We turn big ideas into the products, content and services millions of people love. And we do it all right here at Sky. Role/Team overview We are seeking an AI Operations Engineer to join our Group AI Engineering team and play a key role in keeping our autonomous AI systems running reliably across Sky. You'll work with cutting-edge AI technologies , from large language models to multi-agent architectures , ensuring the agentic solutions already in production stay secure, performant, observable, and cost-effective. This is a hands-on engineering role within a forward-thinking team driving AI adoption at enterprise scale. Where the build teams take solutions from prototype to launch, you'll own what happens next: maintaining live agentic systems, acting as the frontline support and point of contact for the stakeholders who depend on them, and continuously improving the reliability, security, and efficiency of the platform in production.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at sky? Share your experience