Platform Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
We are looking for a Platform Engineer with 5 to 10 years of experience focused on the reliable operation of our Kubernetes-based platform. In this role, you will be responsible for day-to-day platform operations, environment provisioning, and maintaining platform services that support our customers and engineering teams.You will work closely with senior platform engineers and application teams to ensure that our infrastructure and platform services remain stable, secure, and scalable. This role is ideal for engineers who enjoy operational problem solving, troubleshooting production systems, and improving reliability through automation.We value engineers who take a customer-focused and result-oriented approach understanding that the platform exists to serve customers, development teams and ultimately deliver value to end users. Our goal is to build a platform that enables teams to move faster while maintaining high reliability and operational excellence.
Responsibilities
- Platform Operations :
- Provision and maintain Kubernetes environments for development, staging, and production
- Perform day-2 operations for platform services (databases, message queues, caching systems)
- Manage cluster upgrades, patching, and routine maintenance
- Monitor platform health and respond to alerts
- Environment Provisioning
- Set up new environments and namespaces for application teams
- Configure networking, ingress, secrets, and storage
- Maintain environment consistency across clusters
- Platform Service Operations
- Operate and maintain shared platform services such as:
- PostgreSQL / MySQL
- Redis
- Kafka / RabbitMQ
- Object storage
- CI/CD infrastructure
- Responsibilities include:
- Provisioning new instances
- Monitoring and capacity management
- Backup verification
- Incident troubleshooting
- Reliability & Incident Response
- Investigate and resolve platform incidents
- Participate in on-call rotation
- Perform root cause analysis and implement improvements
- Automation & Improvements
- Automate operational tasks where possible
- Improve monitoring, alerting, and operational runbooks
- Contribute to infrastructure-as-code repositories
- Required Skills & Experience
- Experience operating Kubernetes clusters in production
- Strong Linux and container fundamentals
- Experience with Kubernetes troubleshooting and debugging
- Understanding of:
- Kubernetes deployments, services, and networking
- Resource management and scaling
- Persistent storage in Kubernetes
- Experience operating at least some of the following:
- Databases (PostgreSQL, MySQL)
- Message queues (Kafka, RabbitMQ)
- Caching systems (Redis)
- Familiarity with infrastructure automation tools (Terraform, Helm, or similar)
- Experience with monitoring and logging systems
- Strong troubleshooting and incident response skills
Requirements
- Experience with CI/CD platforms
- Familiarity with GitOps workflows
- Cloud provider experience (AWS, GCP, Azure)
- Experience with observability platforms (Prometheus, Grafana, etc.)
- Experience supporting multi-team Kubernetes environments
- What We Value
- Customer-Focused Mindset
- The platform serves developers and internal teams. Understanding their needs and helping them succeed is a key part of the role.
- Result-Oriented Approach
- We value engineers who focus on outcomes - solving real problems, improving reliability, and delivering practical solutions.
- Operational Excellence
- You keep critical infrastructure reliable and well-maintained.
- Ownership
- You take responsibility for issues and follow them through to resolution.
- Continuous Improvement
- You automate repetitive tasks and improve operational processes.
- Collaboration
- You work closely with developers and platform engineers to keep systems running smoothly.
- What Success Looks Like
- Platform environments are reliable and well-maintained
- Engineers can request and receive environments quickly and consistently
- Platform services run without operational surprises
- Incidents are handled efficiently and professionally
- Operational tasks are gradually automated and standardized
- Internal teams experience the platform as stable, predictable, and easy to use.
Benefits
Additional Information
It's fun to work in a company where people truly BELIEVE in what they're doing! We're committed to bringing passion and customer focus to the business. Job Summary
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at optiva? Share your experience