Staff Software Engineer - Databases SRE | Germany | Remote

Requirements:

  • 8+ years engineering experience, 4+ in SRE/CRE/production engineering. Strong preference for those with formal customer reliability engineering experience.
  • Strong Kubernetes experience in AWS, GCP, or Azure, and familiarity with infrastructure-as-code tooling (Helm, Terraform, Jsonnet, etc.).
  • Strong experience with technical leadership: leading a team through projects, mentoring other engineers on the team, and serving as a force multiplier
  • Experience operating multi-tenant systems in production
  • Strong experience designing and implementing SLOs
  • Experience with one or more programming languages (e.g. Go, Python, Java)
  • Experience with Linux operating system internals, and some knowledge of networking, cloud storage, and scaling.
  • Excellent problem-solving and troubleshooting skills.
  • Experience participating calmly and actively in blame-free Incident Response, following up on actions, and writing high-quality PIRs (Post Incident Reviews, a.k.a. post-mortem documents)
  • Ability to reason about performance, scaling, and failure modes
  • Comfortable working within an engineering team where individuals are encouraged to have a strong sense of autonomy and self-direction.
  • Ability to partner deeply with product engineering teams
  • We highly value those who are intellectually curious, who default to transparency, possess a high bias towards action, and who are also kind (this is important!)

What you'll be doing:

  • Regular 1:1s with your manager and colleagues
  • Reviewing and creating SLOs, proactively investigating ways to reduce budget burn for those SLOs
  • Improving observability of customers within their environments
  • Designing and implementing solutions for reliability and scalability
  • Developing fault-tolerant design patterns
  • Collaborating with Engineering Leaders to define product strategy, roadmaps, and technical designs
  • Participating in PR review and collaborating with other engineers
  • Teaching others about Site Reliability Engineering and communicating best practices
  • Participating in Incident Response from investigation through resolution, including PIRs and communication with customers

Perks and Benefits:

  • 100% Remote, Global Culture
  • Scaling Organization
  • Transparent Communication
  • Innovation-Driven
  • Open Source Roots
  • Empowered Teams
  • Career Growth Pathways
  • Approachable Leadership
  • Passionate People
  • In-Person Onboarding
  • Balance is Key
  • Equal Opportunity Employer

Grafana Labs

Remote - Germany (Remote)

Experience: Staff
Posted: June 29, 2026
Last seen: an hour ago
Tags: AWS, Azure, GCP, Golang, Java, Kubernetes, Python, Terraform, Site Reliability

Why we track Grafana Labs

Grafana Labs is fully remote and the company behind Grafana, Loki, and Mimir. If you've used their open-source tools, you know how good they are. The engineering work is focused on observability infrastructure, and they're one of the most developer-loved companies in the space.
