Lead Systems Engineer (Kafka) - CA - 2026

Nubank · Toronto, Canada
Full-time · On-site · Today
Apache · AWS · Kafka · Kubernetes · Observability · SAFe


About the role

Nu is one of the largest digital financial platforms in the world, with more than 122 million customers across Brazil, Mexico, and Colombia. Guided by our mission to fight complexity and empower people, we are redefining financial services in Latin America, and this is still just the beginning of the purple future we're building. Listed on the New York Stock Exchange (NYSE: NU), we combine proprietary technology, data intelligence, and an efficient operating model to deliver financial products that are simple, accessible, and human. Our impact has been recognized by global rankings such as Time 100 Companies, Fast Company's Most Innovative Companies, and Forbes World's Best Bank. Visit our institutional page: https://international.nubank.com.br/careers/

We are looking for an experienced software engineer to help evolve and operate Nubank's messaging platform and the infrastructure that supports asynchronous communication at scale. This role sits in a team responsible for highly critical platform capabilities that support a wide range of internal systems across multiple business domains and countries. The platform operates in a large and complex environment, with hundreds of clusters, thousands of brokers, hundreds of thousands of topics, and very large daily data volumes across multiple AWS accounts.

At the Lead level, we are looking for someone who can independently own important technical problems, improve reliability and operability, and drive engineering decisions in partnership with the team. Kafka experience is desirable, but not required. Strong knowledge of distributed systems infrastructure, especially Kubernetes, networking, and AWS, is essential.

What You'll Be Responsible For

  • Operate and improve large-scale messaging and platform infrastructure based on Kafka used by critical systems across Nubank.
  • Contribute to the reliability, scalability, and performance of asynchronous communication platforms.
  • Help design and implement solutions for high-throughput, low-latency, and fault-tolerant systems.
  • Improve observability, automation, and operational excellence across the platform.
  • Support incident analysis, troubleshooting, and root cause remediation in production environments.
  • Optimize infrastructure usage and help drive efficiency and cost awareness across AWS-based environments.
  • Work on platform capabilities that enable safe growth in message volume, topic count, and cluster footprint.
  • Partner with other engineers and teams to evolve platform standards, tooling, and best practices.
  • Contribute to architectural discussions involving messaging, traffic patterns, service communication, and platform reliability.

We Are Looking for a Person Who Has

Requirements

  • Strong software engineering fundamentals and experience working with distributed systems in production.
  • Solid experience with Kubernetes, networking, and AWS in large-scale or business-critical environments.
  • Experience operating infrastructure-heavy platforms with high reliability and availability requirements.
  • Ability to troubleshoot complex production issues across application, infrastructure, and network layers.
  • Experience improving observability, automation, and operational tooling.
  • Good understanding of scalability, resilience, performance, and failure isolation patterns.
  • Ability to work autonomously on ambiguous technical problems and drive them to execution.
  • Strong collaboration skills and ability to work across team boundaries.
  • Experience with Apache Kafka or other messaging and streaming technologies.
  • Experience with platform engineering, SRE, or infrastructure-focused backend engineering.
  • Familiarity with multi-account AWS environments and large-scale cloud operations.
  • Experience with high-throughput event-driven architectures.
  • Experience balancing reliability, performance, and cost in production systems.

Benefits

Total compensation includes base salary, RSUs, and benefits. Base salary range: $210,000 - $252,000

  • Health Insurance
  • Life Insurance
  • Pension Plan
  • Extended maternity and paternity leaves
  • Nucleo - Our learning platform of courses
  • NuLanguage - Our language learning program
  • NuCare - Our mental health and wellness assistance program
  • Vacations
  • Paid time off
  • Remote work options
  • Equity / stock options

Work Model for this Role

Transparency in the use of AI

Our recruitment process may involve the use of artificial intelligence-enabled tools, such as automated interview transcription and analysis, to support the evaluation process. Artificial intelligence is not used to make final hiring decisions.
