Company Logo
Software Engineer

Netflix - 1d ago

Company Logo
Senior Software Engineer

Reddit - 4d ago

Staff Software Engineer - Grafana Cloud k6 | Ireland | Remote

Requirements:

  • Strong experience with DevOps/SRE practices, including operating and evolving production systems at scale
  • Strong programming background in a modern language (Python and Go are primary, but prior experience is not required)
  • Experience designing, building, and operating large-scale distributed systems
  • Strong understanding of reliability engineering concepts (e.g. incident management, observability, and failure modes)
  • Experience with test automation, including performance and functional testing
  • Ability to influence engineering practices through clear technical communication, reviews, and collaboration
  • Strong interpersonal skills and ability to work effectively across teams
  • Familiarity with modern software engineering processes and delivery practices
  • Self-driven and comfortable operating with a high degree of autonomy and ambiguity

Bonus Points For:

  • Experience with containerized and cloud-native systems (Docker, Kubernetes, AWS)
  • Familiarity with observability tooling and platforms (e.g. the Grafana stack)
  • Experience working with Python, Go, JavaScript and/or Jsonnet
  • Experience building or operating event-driven or asynchronous systems
  • Experience defining or applying SLIs/SLOs, error budgets, or reliability metrics
  • Interest in, or experience with, building testing frameworks or developer tooling

What will you be doing?

  • Build and scale a strong culture of operational excellence by defining standards and coaching teams to own reliability and availability.
  • Drive mature DevOps/SRE practices, including incident response and PIRs, on-call readiness, runbooks, alerting, observability, and release/change management.
  • Establish reliability frameworks such as SLIs/SLOs and error budgets, and use them to guide prioritization and engineering trade-offs.
  • Provide visibility into system health through clear operational metrics and reliability reporting.
  • Guide teams in the design, development, evolution, and operation of large-scale, distributed cloud systems.
  • Influence product and system direction through design reviews, architectural discussions, and cross-team collaboration.
  • Share knowledge through clear, high-quality documentation and technical communication—internally and, where appropriate, externally—to help teams build and operate systems more effectively.
  • As the reliability foundation matures, grow into broader application and product development leadership, contributing architectural and technical depth beyond operations.

Perks and benefits:

  • 100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
  • Scaling Organization – Tackle meaningful work in a high-growth, ever-evolving environment.
  • Transparent Communication – Expect open decision-making and regular company-wide updates.
  • Innovation-Driven – Autonomy and support to ship great work and try new things.
  • Open Source Roots – Built on community-driven values that shape how we work.
  • Empowered Teams – High trust, low ego culture that values outcomes over optics.
  • Career Growth Pathways – Defined opportunities to grow and develop your career.
  • Approachable Leadership – Transparent execs who are involved, visible, and human.
  • Passionate People – Join a team of smart, supportive folks who care deeply about what they do.
  • In-Person onboarding - We want you to thrive from day 1 with your fellow new ‘Grafanistas’ to learn all about what we do and how we do it.
  • Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect.
  • Equal Opportunity Employer: We will recruit, train, compensate and promote regardless of race, religion, color, national origin, gender, disability, age, veteran status, and all the other fascinating characteristics that make us different and unique.
AI Summary ✨
Grafana Labs logo

Grafana Labs

Remote - Ireland (Remote)

Remote
Experience: Staff
Posted: May 14, 2026
Last seen: an hour ago
sitereliability

Why we track Grafana Labs

Grafana Labs is fully remote and the company behind Grafana, Loki, and Mimir. If you've used their open-source tools, you know how good they are. The engineering work is focused on observability infrastructure, and they're one of the most developer-loved companies in the space.

Similar jobs

  • 16 days ago
  • 17 days ago
  • kraken logo

    Senior Site Reliability Engineer - Payward Services

    UK, Canada, Portugal, Spain, Poland, Ireland, United Arab Emirates, Brazil, Romania, Czech Republic, Cyprus, Lithuania, Switzerland, Mexico

    20 days ago
    Remote
  • See all jobs in Ireland