Senior Platform Engineer
About the role
Come make an impact on millions of Brazilians!

At RecargaPay, we're on a mission to deliver the best payment experience for Brazilian consumers and small businesses by building a powerful digital ecosystem where the banked and unbanked connect, and where consumers and merchants have a one-stop shop for all their financial needs. We serve over 10 million users and process more than USD 4 billion annually. We've been profitable since 2022 and operate our own credit business. We are an AI-first, 100% remote team, scaling in the rapidly changing Brazilian financial market.

Our goal? Deliver the best payment experience in Brazil for people and small businesses alike. We value autonomy, ownership, and a bias for action. We're looking for people who are curious, hands-on, and driven by impact - who want to solve real problems, work with strong teams, and rethink what's possible. If you're ready to do your best work, at scale, with purpose - this is your place.

Position Overview
As a Senior Platform Engineer, your primary mandate is to manage our platforms to strengthen business continuity, promote best practices, and improve the developer experience end-to-end: eliminating friction, accelerating onboarding to first contribution, and lifting productivity across the entire software delivery lifecycle.

You will drive modernization and platform initiatives and steward high-leverage practices (Engineering, DevEx, SRE, DevSecOps, and AI-assisted engineering). You will operate both hands-on and strategically, partnering with executive leadership and guiding Staff/Senior engineers to deliver scalable, reliable, compliant, and cost-efficient solutions on AWS.

Crucially, you will translate day-to-day developer needs into Golden Paths, opinionated tooling, and policy-backed workflows, enabling adoption through clear documentation, targeted training, and automated self-service capabilities.
Key Responsibilities
- Architectural Strategy: Define and steer medium- and long-term architectural strategies for the platform, ensuring efficiency, scalability, and reliability across all AWS environments.
- Platform as a Product: Codify architecture into practice by delivering reference implementations and Golden Path templates for high-performance builds and standardized scaffolding.
- GitOps & Continuous Delivery: Engineer safe delivery pipelines and reusable GitHub Actions. Institutionalize GitOps using ArgoCD/Flux and Flagger for progressive rollouts (canary/blue-green) with automated, SLO-gated rollbacks.
- Cloud-Native Orchestration: Manage and optimize Kubernetes operations using Karpenter for intelligent node provisioning, as well as HPA and KEDA for event-driven autoscaling.
- Observability & SRE: Make observability the default through OpenTelemetry and New Relic. Define SLIs/SLOs, establish error-budget policies, and automate tracking of post-incident actions.
- Security & Service Mesh: Raise the bar on cloud and service security; operate and propose improvements to OAuth2/JWT and secrets lifecycle management (Secrets Manager/KMS). Drive Policy-as-Code adoption across CI/CD and Kubernetes admission controllers.
- Event-Driven Excellence: Design and operate event topologies covering Kafka partitioning, compaction, retention policies, and schema evolution, with attention to performance and reliability.
- AI-Assisted Engineering: Promote the adoption of AI-assisted practices to enhance code quality, automated refactoring, automated operations, and technical documentation.
- FinOps & Efficiency: Drive cost-aware architecture through cloud computing best practices, scalability, and platform/service efficiency, rather than focusing only on savings plans or reservations.
- Technical Governance: Implement governance and controls that ensure the correct management, security, and compliance of our platforms and technologies, contributing to a culture of excellence.
Software Engineering & Architecture
- Academic Foundation: Background in Computer Science, Engineering, or related disciplines.
- Expert Development: Extensive hands-on experience in software engineering, with solid proficiency in Java (Spring Boot) and TypeScript/Python.
- Distributed Systems: Mastery of Domain-Driven Design (DDD) and microservices architecture, designing for high performance and high availability.
- Build: Deep knowledge of Bazel build systems.

Cloud & Infrastructure (AWS Expertise)
- Expert-Level AWS: Deep, practical experience with core services: EKS, Lambda, API Gateway, DynamoDB, S3, IAM, Networking, and Security.
- Infrastructure as Code: Advanced experience with Terraform (modules/policy enforcement) and Pulumi (multi-language stacks).
- Kubernetes Internals: Deep knowledge of EKS, including Karpenter for scaling, Istio for service mesh, and Argo/Flux for GitOps.

Security, Reliability & Observability
- SRE & Resilience: Mastery of reliability patterns (circuit breakers, retries, back-pressure, stress tests, application architecture) and SLO-driven operations.
- Security Hardening: Expert