Company Logo

Software Engineer

Netflix - 1d ago

Company Logo

Senior Software Engineer

Reddit - 4d ago

Sr Cloud Reliability Engineer, Platform Engineering

AI Summary ✨

Requirements

  • 5+ years of experience in cloud incident management, SRE, or operations
  • Expertise in multi-cloud environments
  • Experience with incident detection, response, and RCA processes
  • Strong analytical and problem-solving skills
  • Excellent communication and stakeholder management skills

Nice to Haves

  • Certifications in cloud platforms
  • Hands-on experience with incident escalation procedures and service recovery plans
  • Experience with automated logging and forensic analysis tools
  • Familiarity with SLAs, compliance, and audit processes
  • Prior experience working in a highly scalable global organization

What You'll Be Doing

  • Incident Management & Response: Lead cloud incident management efforts, ensuring rapid detection, triage, and resolution across all cloud platforms
  • Root Cause Analysis & SLA Compliance: Evolve key processes to ensure cloud incident RCAs are completed within the agreed Service Level Agreements
  • Monitoring & Automation: Unify automated monitoring, alerting mechanisms, and centralized incident logging
  • Reporting & Insights: Develop targeted reporting to provide directly relevant cloud reliability insights
  • Continuous Improvement: Identify patterns in incidents and optimize response playbooks

Perks and Benefits

  • Dedicated to building a future where everyone and everything can move independently
  • Opportunity for growth and collaboration
  • Potential accommodation available based on religious and/or medical conditions
Apply here
Uber logo

Uber

Amsterdam, Netherlands

Experience: Senior
Posted: March 5, 2025
sitereliability

Similar jobs

  • a month ago
    Still looking
  • 2 months ago
    Still looking
    Remote
  • 2 months ago
    Still looking
    Remote
  • See all jobs in Netherlands