Software Engineer, Site Reliability Engineering, Early Career 2025
Requirements:
Bachelor's degree in Computer Science or related technical field, or equivalent practical experience.
Experience with algorithms, data structures, complexity analysis, and software design.
Programming experience in one or more of the following languages: C, C++, Java, Python, Go.
Nice to haves:
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
Networking knowledge and understanding of network theory (e.g., protocols such as TCP/IP, UDP, and ICMP; MAC addresses; IP packets; DNS; OSI layers; and load balancing).
Understanding of Unix/Linux systems from kernel to shell, including system libraries, file systems, and client-server protocols.
Familiarity with running web services at scale, and understanding of Unix system internals and networking.
Systematic problem-solving approach, coupled with a sense of ownership and drive.
What you'll be doing:
Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services.
Solve problems relating to mission-critical services and build automation to prevent their recurrence, with the goal of automating the response to all non-exceptional service conditions.
Influence and create new designs, architectures, standards, and methods for large-scale distributed systems.
Engage in service capacity planning and demand forecasting, software performance analysis, and system tuning.
Conduct periodic on-call duties using a follow-the-sun model.
Perks and Benefits:
Opportunity to work on optimizing existing systems, building infrastructure, and eliminating work through automation.
Intellectual curiosity, problem-solving, and openness are valued in the SRE culture.
Collaboration with a diverse team in an environment that supports learning and growth.