Distributed Systems Engineer - Data Platform (Delivery, Database, Retrieval)
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
About the role
At Cloudflare, we are on a mission to help build a better Internet. Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine's Top Company Cultures list and ranked among the World's Most Innovative Companies by Fast Company. At Cloudflare, we're not looking for people who wait for a polished roadmap; we're looking for the builders who see the cracks in the Internet that everyone else has simply learned to live with. We value candidates who have the instinct to spot a "normalized" problem and the AI-native curiosity to create a solution using the latest tools. Our culture is built on iteration, leveraging AI to ship faster today to make it better tomorrow, while ensuring that every improvement, no matter how small, is shared across the team to lift everyone up. If you're the type of person who values curiosity over bureaucracy, and that AI is a partner in solving tough problems to keep the Internet moving forward, you'll fit right in. Locations Available: Austin (US), Atlanta (US), Denver (US), Toronto (Canada) We are looking for experienced and highly motivated engineers to join our DATA Org and help build the future of data at Cloudflare. Our organisation is responsible for the entire data lifecycle - from ingestion and processing to storage and retrieval - powering the critical logs and analytics that provide our customers with real-time visibility into the health and performance of their online properties. Our mission is to empower customers to leverage their data to drive better outcomes for their business. We build and maintain a suite of high-performance, scalable systems that handle more than a billion events in a second. As an engineer in our organisation, you will have the opportunity to work on complex distributed systems challenges across different parts of our data stack. Our Data Org is composed of several key teams, and you could contribute to any of the following areas: Data Delivery: You will build and operate our distributed data delivery pipeline, a high-throughput, low-latency system (primarily written in Go) responsible for ingesting, processing, and routing massive volumes of data from across Cloudflare's global network to multi-core destination. Analytical Database Platform: Contribute to our core analytical platform powered by ClickHouse. This team builds and maintains a high-performance, scalable database platform optimised for the immense analytical workloads generated by our products and services. Data Retrieval: Be responsible for building the customer-facing products that make data accessible and actionable. This includes developing our public GraphQL API, building robust log delivery solutions and integrations with customer destinations, and contributing to our alerting products, which empower users to configure and receive near real-time alerts based on the logs and metrics observed by our data platform.
Responsibilities
- As a Software Engineer in our Data Organisation depending on the team you join, you will focus on a subset of the following areas:
- Design, develop, and maintain scalable and reliable distributed systems across the entire data lifecycle.
- Build and optimise key components of our high-throughput data delivery platform to ensure data integrity and low-latency delivery.
- Develop new and improve existing components for the Cloudflare Analytical Platform to extend functionality and performance.
- Scale, monitor, and maintain the performance of our large-scale database clusters to accommodate the growing volume of data.
- Develop and enhance our customer-facing GraphQL APIs, log delivery, and alerting solutions, focusing on performance, reliability, and user experience.
- Work to identify and remove bottlenecks across our data platforms, from streamlining data ingestion processes to optimizing query performance.
- Collaborate with other teams across Cloudflare to understand their data needs and build solutions that empower them to make data-driven decisions.
- Collaborate with the ClickHouse open-source community to add new features and contribute to the upstream codebase.
- Participate in the development of the next generation of our data platforms, including researching and evaluating new technologies and approaches.
- Key Qualifications
- 3+ years of experience working in software development covering distributed systems and databases.
- Strong programming skills (Go
Benefits
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Cloudflare? Share your experience