Skip to main content
Back to jobs

Senior Production Support Engineer

External
spgi logoSpgi · Mumbai, India
Full-timeOn-site2d ago
AWSCapacity PlanningDatadogIncident ResponseLeadershipMentoring
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Grade Level (for internal use): 10 The Team- The IT Operations team at S&P Dow Jones Indices (S&P DJI) is tasked with owning and maintaining the Production IT systems that underpin S&P DJI's index platforms and applications, ensuring their high availability. The team prioritizes service availability, service request management, and continuous improvement of support processes through collaborative engagement with business stakeholders, operations, infrastructure, and development teams. Additionally, the team is involved in critical activities such as incident management, emergency response, change management, problem management, and capacity planning to support the robustness of S&P DJI's index platforms. Key Responsibilities- - Support and maintain highly available, scalable IT systems and infrastructure hosting S&P DJI's critical index platforms and applications - Act as a working lead, providing technical leadership while remaining hands-on and contributing as an individual contributor based on operational demands, project requirements, and incident response needs - Lead incident response efforts, conducting root cause analysis and implementing preventive measures to minimize system downtime and improve reliability - Develop and maintain automation frameworks for deployment, monitoring, and infrastructure management to reduce manual intervention and increase operational efficiency - Collaborate with development teams to implement SRE best practices, including service level objectives (SLOs), error budgets, and reliability engineering principles - Monitor system performance, capacity planning, and resource optimization to ensure optimal performance of production environments - Drive continuous improvement initiatives by analysing system metrics, identifying bottlenecks, and implementing solutions that enhance overall system reliability

Requirements

  • Bachelor's degree in Computer Science, Information Systems or Engineering is required, or in lieu, a demonstrated equivalence in work experience.
  • 6+ years of experience in Technical operations or Application/Data support roles with focus on high availability systems.
  • Experience with cloud platforms such as AWS (including ECS, managed container orchestration services , S3, CloudFront) or equivalent cloud technologies.
  • Experience with monitoring and observability platforms such as Datadog and its key modules (APM, DBM, logging, and Infrastructure monitoring), or similar tools like Dynatrace, metrics and visualization platforms , or equivalent tools.
  • Proficiency in database technologies including PostgreSQL/Oracle PL/SQL, stored procedures, and non-relational databases.
  • Advanced PostgreSQL experience including performance tuning and optimization
  • Strong programming skills for automation using scripting languages such as Shell, Python, or similar.
  • Knowledge of networking protocols including TCP/IP, Unicast, Multicast, Sockets and IP addressing
  • Experience working with large datasets in Equity, Commodities, Forex, Futures and Options asset classes.
  • Familiarity with ITSM processes & tools such as ServiceNow, PagerDuty, or similar incident management platforms.
  • Excellent communication skills with strong verbal and writing proficiencies.
  • Additional Preferred Qualifications:
  • Datadog (APM, Logs, or Fundamentals) or equivalent SRE certifications with any other tools.
  • Understanding of observability best practices, log correlation, and distributed tracing methodologies.
  • Knowledge in Financial Services with experience in Index/Benchmarks, Asset Management, or Portfolio Investment domains.
  • Experience mentoring junior team members or L1/L2 support staff in technical and operational practices.
  • Ability to prioritize and manage multiple critical incidents with business impact in mind.
  • About S&P Global Dow Jones Indic e s
  • S&P Dow Jones Indices is a division of S&P Global (NYSE: SPGI). S&P Global is the world's foremost provider of credit ratings, benchmarks, analytics and workflow solutions in the global capital, commodity and automotive markets. With every one of our offerings, we help many of the world's lea

Benefits

Vision insuranceEquity / stock options

Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at spgi? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect