Middle DevOps Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Responsibilities
- Platform & IaC Ownership: Analyze and implement infrastructure designs for services and shared components, managing them as Infrastructure as Code (IaC) using tools like Terraform and Helm within our cloud environment (AWS).
- Delivery Lifecycle Management: Design and implement robust CI/CD pipelines and own the full delivery lifecycle of infrastructure tools, services, and components from development testing through to production rollout.
- Developer Enablement: Actively participate in regular support cadences to provide hands-on technical assistance and expertise to development teams regarding platform adoption and usage.
- Reliability Integration: Integrate and maintain monitoring, logging, and alerting components for platform services, and participate in the team's on-call rotation for immediate incident mitigation within the platform ownership scope.
- Security & Compliance: Collaborate closely with the Security team to embed DevSecOps best practices and guardrails, ensuring the security and compliance of the platform and delivery process.
- Process Improvement: Drive continuous improvements in platform tooling usability, deployment efficiency, and environment stability.
- Required Skills and Experience
- Domain Knowledge: Solid understanding of Observability, DevSecOps, and basic FinOps (cloud cost estimation) practices.
Benefits
Additional Information
We are seeking a proactive and self-sufficient DevOps Engineer to join our team. You will be instrumental in the design, construction, and maintenance of our core infrastructure. Your focus will be on enhancing the entire developer experience by owning the shared infrastructure tools, services, and environments that enable speed, reliability, and security across the organization. Your Mission for 2026: You will play a key role in evolving our infrastructure into a more autonomous, self-service platform. Over the first 6 - 12 months, your impact will be focused on: Infrastructure Evolution: Driving the IaC 2.0 initiative - modernizing our Terragrunt-based repositories, eliminating configuration drift, and ensuring 100% IaC coverage. You will explore and implement AI-driven tools for configuration reviews and impact analysis, reducing human error and support burden. GitOps Evaluation: Leading the evaluation of GitOps for three strategic use-cases: simplifying K8s cluster configurations, managing Staging environment states, and potentially replacing current CD implementations for our products. Platform Reliability: Enhancing EKS control plane observability and experimenting with nftables adoption to improve network performance and stability. About Our Environment Our product supports over 67k B2B customers, managing business-critical document workflows that demand top availability and performance even under high load. Our highly resilient infrastructure runs on AWS and Kubernetes (EKS). We operate a service-oriented architecture comprising hundreds of microservices written primarily in Python and Java. Services interact using synchronous protocols (NATS, gRPC) and asynchronous, event-driven operations (RabbitMQ, Kafka, Debezium, and Flink). We manage hundreds of databases, predominantly PostgreSQL, with exposure to MySQL, MongoDB, OpenSearch, and Redis, handling many TBs of data. Due to the scale and complexity of this environment, all infrastructure management and service delivery are strictly enforced through Infrastructure as Code (IaC) and modern CI/CD practices to maintain a high availability target of 99.99%.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at pandadoc? Share your experience