Infrastructure Operations Engineer (Cloud & Monitoring)
ExternalS$48K–S$78K/yrContractUnknownToday
Information TechnologyRisk Management
Prepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Manage and maintain monitoring and observability platforms across applications, databases, infrastructure, and network environments (on-premise and cloud).
- Monitor system health through logs, metrics, and alerts to identify issues, perform incident triage, and coordinate timely resolution with relevant team.
- Develop and maintain dashboards and reports to monitor service availability, performance trends, and operational insights.
- Support cloud cost monitoring and governance initiatives, including cost tracking, tagging strategies, and optimization opportunities.
- Drive continuous improvements in monitoring coverage, automation, and operational processes to enhance service reliability and efficiency.
- Participate in incident management, maintain documentation, and provide after-hours support when required.
Requirements
- Degree/Diploma in Computer Science, Information Technology, Engineering, or related discipline.
- Minimum 3-5 years of experience in IT operations, infrastructure support, service assurance, NOC, or cloud environments.
- Hands-on experience with monitoring and observability platforms such as CloudWatch, Grafana, Prometheus, Splunk, ELK Stack , or similar tools.
- Experience working in hybrid environments ( on-premise and AWS cloud ).
- Strong understanding of infrastructure, system, network, and application monitoring concepts.
- Familiarity with AWS Cost Explorer, cloud budgeting, tagging strategies, and cost optimization practices .
- Knowledge of ITIL processes including Incident, Problem, and Change Management.
- Exposure to SRE practices and service reliability principles is advantageous.
- AWS Associate Certification and/or AWS FinOps Certified Practitioner preferred.
- Strong analytical and troubleshooting skills with the ability to correlate events across complex systems.
- Strong communication and stakeholder management abilities.
- Self-driven, proactive, and able to work independently in a fast-paced environment.
- Willing to provide after-hours support when required.
- To apply please click on the APPLY NOW button or email your resume for faster processing:
- We regret only shortlisted candidates will be notified. By submitting any application or résumé to us, you will be deemed to have agreed and consented to us collecting, using, retaining and disclosing your personal information to prospective employers for their consideration.
- Yoong Poh Feng
- EA License | 14C7092
- EA Registration Number | R1105076
Additional Information
Monitoring & Service Assurance Cloud FinOps & Governance Incident & Service Operations
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at SEARCH INDEX PTE. LTD.? Share your experience