Build and deliver a new platform on Microsoft Azure; continuously improve it through automation and standardization.
Own day-to-day operations, monitoring, and support of the Azure environment (compute, storage, networking, security).
Partner with development teams to ensure safe, stable releases with strong monitoring and tested rollback readiness.
Implement Infrastructure as Code (Terraform) to provision and manage resources consistently across environments.
Build and maintain automation (Python preferred) to reduce manual work, improve repeatability, and increase reliability.
Participate in a rotating on-call schedule; troubleshoot and resolve issues across availability, performance, networking, and security; drive follow-ups to prevent recurrence.
Support Kubernetes/AKS operations and troubleshooting (pods/logs, probes, scaling, resource limits, node issues).
Improve observability by reducing alert noise, building dashboards, and adding synthetic checks for key user workflows.
Perform application and OS lifecycle tasks (patching, upgrades, maintenance, vulnerability remediation, and operational readiness).
Document designs, configurations, runbooks, and operational procedures to enable consistent on-call response.
Required Qualifications
5+ years in Cloud Engineering, SRE, DevOps, or a similar production operations role.
Strong hands-on experience with Microsoft Azure in production environments.
Hands-on experience with Terraform.
Strong scripting/automation skills (Python preferred) with real operational automation examples.
Experience with Azure Monitor, Log Analytics, Application Insights, KQL; Prometheus/Grafana is a plus.
Synthetic monitoring experience for login/API/workflow validation with alerting.
AI-assisted ops experience for troubleshooting/automation (with strong validation habits).
Azure certifications are a plus (e.g., Azure Administrator, Azure DevOps Engineer).
Behaviours
Problem-solver: Works to find root cause and prevent repeat issues, not just apply temporary manual fixes.
Proactive: Spots toil and automates it to improve reliability and speed.
Collaborative: Communicates clearly during incidents and supports teammates.
Adaptable: Stays calm in fast-changing environments and handles priority shifts.
Detail-oriented: Validates changes, avoids risky shortcuts, and documents outcomes.
Customer-focused: Understands impact and communicates clearly to restore service quickly.
How We Work
On-call: Rotating 24×7 coverage; lead or assist incident response and keep communication clear.
Collaboration: Work closely with dev/product/security; prioritize automation and permanent fixes over repeat manual work.
Growth & Culture: Ownership-driven environment where AI-assisted learning and experimentation are encouraged: move faster, but validate results and share learnings with the team.
Benefits
Min - Max: $85,276.80 - $127,915.20 (CAD)
Health insurance
Vision insurance
Additional Information
Meet your radiology, cardiology, and enterprise imaging needs - on your own terms - with Merge by Merative imaging solutions. Trusted by 6 of the 10 largest U.S. health systems, Merge empowers healthcare organizations with advanced medical imaging solutions to enhance workflows, optimize care delivery, and help improve patient outcomes. Merge's Imaging Suite, built on a cloud-native foundation, provides solutions for Vendor-Neutral Archive (VNA), PACS, Enterprise Image Viewing, and Workflow Orchestration. Merge's portfolio also includes Best in KLAS Cardiology and Hemodynamic Monitoring, and Digital Pathology.