Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
5 years of experience with software development in one or more programming languages.
5 years of experience with data structures or algorithms.
3 years of experience in designing, analyzing, and troubleshooting distributed systems, and 2 years of experience leading projects and providing technical leadership.
Experience in SRE or incident management/response environments.
Nice to haves
Experience working in computing, distributed systems, storage, or networking.
Experience with telemetry systems and with incident and risk management.
Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
Ability to debug and optimize code and to automate routine tasks.
Excellent problem-solving skills, with strong verbal and written communication skills.
What you'll be doing
Ensure Google Cloud Platform (GCP) stability and reliability through critical incident support, while driving high-quality customer outcomes and continuous cross-GCP team collaboration.
Create training and end-to-end processes for the incident management life-cycle, partnering with the Cloud Support leadership team.
Build systems and tooling that help the Incident Response team improve visibility into the state of Cloud, detection of large-scale issues, and communications to customers, stakeholders, and customer-facing teams.
Define and escalate risks in Cloud, and reduce the probability of major incidents with tactical, pragmatic approaches as needed.
Ensure the scalability and reliability of systems throughout their life-cycle by proactively supporting pre-launch activities like system design consulting, developing platforms and frameworks, and capacity planning, while also driving continuous improvement through automation and changes that enhance reliability and velocity.
Perks and benefits
A culture of intellectual curiosity, problem solving, and openness.
Collaborative work environment with diverse backgrounds, experiences, and perspectives.
Opportunity for self-direction, meaningful projects, support, and mentorship.