Software Engineer, Site Reliability Engineering, YouTube Data
AI Summary ✨
Requirements:
Bachelor's degree in Computer Science or a related technical field, or equivalent practical experience.
2 years of experience with data structures/algorithms and software development in one or more programming languages.
Nice to haves:
Master's degree in Computer Science or Engineering.
2 years of experience in designing, analyzing, and troubleshooting distributed systems.
What you'll be doing:
Be on-call for real-time and batch data processing systems.
Scale systems through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity.
Manage support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
Provide guidance to other team members on managing availability and performance of mission services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health, and lead to a sustainable incident response.
Perks and benefits:
Opportunity to manage complex challenges unique to Google Cloud.
Culture of intellectual curiosity, problem solving, and openness.
Collaborative and supportive environment with mentorship for learning and growth.
Equal opportunity and affirmative action employer.