Distributed Systems Engineer - Data Platform (Delivery, Database, Retrieval)
Requirements:
3+ years of experience working in software development covering distributed systems and databases
Strong programming skills (Golang preferred) and a deep understanding of software development best practices and principles
Hands-on experience with modern observability stacks such as Prometheus and Grafana, and a strong understanding of handling high-cardinality metrics at scale
Strong knowledge of SQL and database internals, including experience with database design, optimization, and performance tuning
A solid foundation in computer science, including algorithms, data structures, distributed systems, and concurrency
Strong analytical and problem-solving skills, with a willingness to debug, troubleshoot, and learn about complex problems at high scale
Ability to work collaboratively in a team environment and communicate effectively with other teams across Cloudflare
What You'll Be Doing:
Design, develop, and maintain scalable and reliable distributed systems across the entire data lifecycle
Build and optimize key components of the high-throughput data delivery platform to ensure data integrity and low-latency delivery
Develop new and improve existing components for the Cloudflare Analytical Platform to extend functionality and performance
Scale, monitor, and maintain the performance of large-scale database clusters to accommodate the growing volume of data
Develop and enhance customer-facing GraphQL APIs, log delivery, and alerting solutions, focusing on performance, reliability, and user experience
Work to identify and remove bottlenecks across data platforms, from streamlining data ingestion processes to optimizing query performance
Collaborate with other teams across Cloudflare to understand their data needs and build solutions that empower them to make data-driven decisions
Collaborate with the ClickHouse open-source community to add new features and contribute to the upstream codebase
Participate in the development of the next generation of data platforms, including researching and evaluating new technologies and approaches
Nice to Haves:
Experience with ClickHouse
Experience with data streaming technologies (e.g., Kafka, Flink)
Experience developing and scaling APIs, particularly GraphQL
Experience with Infrastructure as Code tools like Salt or Terraform
Experience with Linux container technologies, such as Docker and Kubernetes
Perks and Benefits:
An opportunity to work on cutting-edge technologies
A world-class team of engineers
A chance to contribute to building a better internet
Flexible work environment
Equal employment opportunity for all
Reasonable accommodations for qualified individuals with disabilities