Company Logo

Software Engineer

Netflix - 1d ago

Company Logo

Senior Software Engineer

Reddit - 4d ago

Senior Software Engineer, SRE, Cloud Incident Response

AI Summary ✨

Requirements:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with software development in one or more programming languages.
  • 5 years of experience with data structures or algorithms.
  • 3 years of experience in designing, analyzing, and troubleshooting distributed systems, and 2 years of experience leading projects and providing technical leadership.
  • Experience in SRE or incident management/response environments.

Nice to haves:

  • Experience working in computing, distributed systems, storage, or networking.
  • Experience in telemetry systems, incident and risk management.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Excellent problem-solving approach, with verbal and written communication skills.

What you'll be doing:

  • Ensure Google Cloud Platform (GCP) stability and reliability through critical incident support, while driving high-quality customer outcomes and continuous cross-GCP team collaboration.
  • Create training, end-to-end processes for incident management life-cycle and partnering with Cloud Support leadership team.
  • Build systems and tooling to support Incident Response team improve visibility into state of Cloud, detection of large-scale issues, communications to customers, stakeholders and customer facing teams.
  • Define and escalate risks in Cloud, reduce Major incident probabilities with tactical/pragmatic approaches as needed.
  • Ensure the scalability and reliability of systems throughout their life-cycle by proactively supporting pre-launch activities like system design consulting, developing platforms and frameworks, and capacity planning, while also driving continuous improvement through automation and changes that enhance reliability and velocity.

Perks and benefits:

  • Opportunity to work on challenging and meaningful projects.
  • Culture of intellectual curiosity, problem-solving, and openness.
  • Support and mentorship for learning and growth.
  • Equal opportunity and affirmative action employer.
Apply here
Google logo

Google

London, UK

Experience: Senior
Posted: March 12, 2025
Gcp
Nodejs
sitereliability

Similar jobs

  • 14 hours ago
    New
  • 2 days ago
    New
  • 8 days ago
  • See all jobs in UK