Manager, Site Reliability Operations

External
Canadian Tire Corporation · Toronto, ON, Canada
$79K–$131K/yr · Full-time · On-site · 2w ago
Azure · Business Analysis · Confluence · Documentation · Java · JavaScript

Responsibilities

  • Directly support the ongoing development of core Supply Chain applications, with a focus on long-term stability, performance, scalability, security, and effective operations
  • Participate in ongoing monitoring and triage of present issues, as well as in software, application, and system development activities alongside multiple teams
  • Promote ideas, standards, and practices that drive platform stability across various streams of work
  • Provide feedback during product design planning to ensure stability, scalability, security, and operational considerations are highlighted early in the product life cycle
  • Drive proactive development of stability-focused improvements
  • Demonstrate strong leadership capabilities and actively drive interactions and communications between supporting development, platform, and product teams
  • Provide hands-on application support for core business applications, including service, data, and process integrity and optimization
  • Contribute to root cause analysis and resolution of incidents reported by monitoring systems and end users
  • Contribute to analysis of existing monitoring and telemetry systems, and assist in the definition and design of telemetry during product design
  • Manage and work with development teams and analysts to define, prioritize, and deliver stability, scalability, security, and operational improvement tickets
  • Collaborate with IT units and business stakeholders to understand the business processes and requirements supported by the application and identify process improvements
  • Maintain documentation, provide knowledge transfer for peers, prepare and facilitate training sessions, and address gaps in process and operational support documentation
  • Provide application expertise to identify business and technical requirements for project activities, raise risks, communicate support requirements, and lead transition of support activities from project to production
  • Communicate with business stakeholders in business terms at all levels of the organization
  • Define and support release management processes and pipelines for new and existing applications

What you bring

  • 5+ years in a retail/Supply Chain environment and 5-10 years in an IT environment
  • Diploma or degree from an accredited technical institution in computer technologies
  • Strong understanding of Site Reliability Engineering aspects and practices.
  • Strong knowledge of Retail/Supply Chain applications and architecture from end to end, as well as the associated business processes
  • Strong knowledge of Azure cloud applications, standards, and frameworks
  • Ability to read, write, and understand multiple coding languages and frameworks (Java, JavaScript, Node.js, React, Terraform)
  • Strong understanding of database environments (SQL, Cosmos DB) with the ability to read, write, and execute SQL statements/scripts
  • Understanding of, and ability to operate in, a Linux/Unix shell environment, with knowledge of common basic commands and Linux operating system fundamentals
  • Excellent analytical skills and understanding of common application and business analysis practices
  • Strong commitment to creating and maintaining documentation
  • Ability to perform gap analysis to identify areas for improvement in application performance, functionality, and processes
  • Builds trust and credibility by consistently adhering to the organization's business principles and values. Is seen as direct, truthful, and trustworthy by co-workers, vendors, and customers.
  • Recovers quickly after change, disruptions, or mistakes and can remain productive and focused. Is adaptable and can apply lessons learned in one situation to another situation.
  • Familiarity with cloud platforms is an asset.
  • Experience with Collaboration & Change Management tools: Jira, Confluence, ServiceNow is an asset.
  • Familiarity with microservices architecture & system integrations is an asset.
  • Familiarity with DevOps Practices is an asset.
  • Knowledge of Retail and Supply Chain Business is an asset.
  • Good understanding of SAFe methodology and application is an asset.

We're always looking for great talent! In addition to competitive pay, we offer:

  • Comprehensive benefits and retirement programs
  • Performance incentives and continuing education programs
  • Other perks to support your well-being
  • Career growth opportunities and product discounts
  • Broadband Salary Range: $79,000 - $131,000.
  • Our typical hiring range is between $79,000 and $115,000. Salary decisions are also dependent on other factors such as

Benefits

Vision insurance

Additional Information

Reporting to the AVP, Supply Chain Technology, SRE Operations, the Chapter Manager, SRE Development & Reliability, will be responsible for ensuring Supply Chain systems are operational and monitored. This role is also an active participant in all aspects of Production Operational Excellence, including technical vision, telemetry and observation decisions, automation strategy, framework development, solution delivery, incident and problem management.

