Bachelor's Degree in Computer Science, Information Technology, Engineering, or related field.
Minimum 3-6 years experience in cloud infrastructure or systems administration roles.
Hands-on experience managing production cloud environments supporting high-availability applications.
Strong understanding of: cloud networking and system architecture principles
Linux system administration
distributed system reliability fundamentals
scripting or automation practices
Experience with scripting languages such as Python or Bash.
Strong troubleshooting, analytical, and problem-solving skills.
Ability to work effectively under operational pressure.
Experience working in Payment Gateway, FinTech, Banking, or high-transaction environments.
Familiarity with container platforms such as Docker or Kubernet
Benefits
Health insuranceVision insurance
Additional Information
Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work , offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.
Job Responsibilities :
We are seeking an experienced Cloud Systems Administrator to manage and continuously enhance highly available, secure, and scalable cloud infrastructure supporting mission-critical payment processing platforms.
This role is responsible for ensuring operational reliability, performance optimization, security compliance, and automation maturity across multi-cloud environments that power real-time transaction processing, merchant integrations, settlement workflows, and financial reporting systems. The position also contributes to modernization initiatives including intelligent automation and AI-assisted operational capabilities to improve system resilience and engineering productivity. Key Responsibilities:
Cloud Infrastructure Operations & Reliability
Manage and maintain cloud infrastructure environments across major providers such as AWS, Google Cloud, or Microsoft Azure.
Ensure high availability and performance of systems supporting transaction routing, payment processing, and backend platform services.
Implement capacity planning, scaling strategies, and infrastructure performance tuning.
Support multi-zone and disaster recovery architectures aligned with strict SLA and uptime targets.
Other duties as assigned
Monitoring, Incident Response & Operational Excellence
Monitor infrastructure health, application dependencies, and transaction performance indicators.
Troubleshoot production incidents and perform root cause analysis with preventive improvement actions.
Improve operational readiness through automation of routine maintenance and infrastructure recovery workflows.
Collaborate with DevOps and engineering teams to improve deployment stability and environment consistency.
Automation & Infrastructure Engineering
Develop scripts and tooling to automate provisioning, configuration management, patching, and environment validation.
Support Infrastructure as Code practices using tools such as Terraform or cloud-native automation frameworks.
Improve operational efficiency by standardizing environment builds and configuration baselines.
Maintain accurate system documentation, architecture diagrams, and operational runbooks.
Security, Compliance & Governance
Enforce cloud security best practices including identity access management, encryption, network segmentation, and secure configuration standards.
Support compliance initiatives aligned with PCI DSS, ISO 27001, and financial regulatory expectations.
Assist internal and external audit activities by maintaining traceability of infrastructure changes and system configurations.
Participate in vulnerability remediation and security hardening programs.
Cost Optimization & Resource Efficiency
Monitor cloud resource utilization and identify opportunities for performance optimization and cost efficiency.
Implement governance controls such as tagging standards, lifecycle policies, and rightsizing strategies.
Provide operational insights into infrastructure usage trends and resource consumption patterns.
AI-Assisted Cloud Operations & Innovation
Leverage AI-powered operational tooling to enhance monitoring insights, anomaly detection, and incident diagnostics.
Contribute to AI use cases such as: intelligent log correlation and automated incident summarization
predictive workload scaling recommendations
infrastructure configuration optimization suggestions
automated runbook generation and operational guidance
cloud cost anomaly detection and efficiency insights
Collaborate with DevOps and R&D teams to prototype intelligent automation workflows that reduce manual operational effort.
Promote responsible adoption of AI tooling with consideration for security, compliance, and operational validation.