Sr. Software Engineer - Kubernetes
About the role
Armada is the hyperscaler for the edge, delivering modular AI infrastructure from first deployment to AI factory with speed, scale and sovereignty. Named one of Fast Company's Most Innovative Companies and to the CNBC Disruptor 50, Armada's solutions are deployed in over 60 countries globally for organizations ranging from energy to defense. With nearly half a billion dollars in funding, Armada is backed by top investors such as Microsoft (M12), Founders Fund, and BlackRock, and has collaborations and partnerships including NVIDIA, Palantir and Dell Technologies.
We are looking for the most brilliant minds in the world to join us. Working at Armada means taking ownership, driving autonomy, and delivering impact. You'll tackle challenges that haven't been solved before and help build something transformative from the ground up. What you do here will not only define your career but help further Armada's mission to bridge the digital divide for customers around the world.
We are looking for a highly experienced Senior Platform Engineer to join our Edge Platform team. This role combines systems engineering, software development, and cloud-native infrastructure. You will be responsible for designing, building, and operating the software and platform capabilities that power our distributed edge computing infrastructure across Galleon mobile data centers and Commander cloud services. You will work extensively with Linux, Kubernetes, distributed systems, networking, and automation while developing platform components and operational tooling in Go and Python. The ideal candidate is equally comfortable debugging kernel-level performance issues, writing production-quality software, and designing resilient cloud-native platforms. This is a hands-on engineering role focused on building platform capabilities rather than solely operating infrastructure.
Responsibilities
- Platform Engineering & Software Development
  - Design, develop, and maintain platform services, controllers, operators, and automation frameworks using Go and Python.
  - Build internal tools and APIs that simplify provisioning, lifecycle management, observability, and operations of edge infrastructure.
  - Develop software to automate infrastructure workflows, hardware lifecycle management, cluster operations, and fleet-wide orchestration.
  - Contribute to platform architecture, technical standards, and engineering best practices.
  - Design and implement self-healing and autonomous operational capabilities.
- Linux Systems Engineering
  - Debug and optimize Linux systems across compute, storage, networking, and container workloads.
  - Investigate complex performance issues involving CPU scheduling, memory management, I/O subsystems, networking, filesystems, and containers.
  - Analyze kernel and userspace behavior using tools such as perf, eBPF, strace, tcpdump, bpftrace, and system profiling utilities.
  - Drive platform reliability through deep understanding of Linux internals and operating system behavior.
  - Participate in root cause analysis of production incidents involving operating systems, containers, and distributed systems.
- Kubernetes & Cloud Platform
  - Architect, deploy, and manage highly available Kubernetes environments across edge and cloud infrastructure.
  - Build and maintain Kubernetes operators, controllers, CRDs, admission webhooks, and platform extensions.
  - Implement scalable networking, storage, security, and observability solutions for Kubernetes.
  - Optimize cluster performance, resource utilization, and workload scheduling in resource-constrained edge environments.
  - Drive Infrastructure-as-Code initiatives using Terraform, Ansible, Helm, and GitOps methodologies.
- Reliability & Operations
  - Design and maintain observability platforms using Prometheus, Grafana, OpenTelemetry, and centralized logging solutions.
  - Establish operational excellence through automation, monitoring, incident response, and postmortem processes.
  - Collaborate with software, infrastructure, security, and product teams to deliver reliable platform capabilities.
  - Participate in on-call rotations and continuously improve operational maturity through automation.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
- 7+ years of experience in systems engineering, platform engineering, infrastructure software, or related fields.
- Strong software development experience in Go and/or Python, including building production services and automation systems.
- Deep understanding of Linux operating systems and systems internals.
- Strong understanding of:
  - Linux process management, scheduling, memory management, filesystems, and networking
  - TCP/IP, DNS, routing, VPNs, load balancing, and distributed systems fundamentals
  - Containers and container runtimes
- Deep understanding of Kubernetes architecture and operations.
- Hands-on experience building and operating production Kubernetes environments.
- Experience de