Principal Software Engineer, Google Compute Engine Control Plane
This job is offline
AI Summary ✨
Requirements:
15 years of experience with large scale distributed systems and architectures.
Experience in technical leadership, leading global projects and setting technical direction for teams.
Experience with customer focused, iterative product and feature delivery.
Experience in networking, compute infrastructure, and architecting, developing, or maintaining cloud solutions.
Nice to haves:
Experience working on or with hyperscale cloud technologies.
Deep understanding of AI/ML-related infrastructure technologies (e.g., GPUs, TPUs, LLMs, foundational models) and use cases (e.g., training, inference, tuning etc.).
What you'll be doing:
Develop new easy-to-use AI/ML related offerings leveraging Google’s software stack.
Design capacity-aware scheduling capabilities to automatically move workloads between zones and regions.
Drive key architectural decisions to ensure reliability, security, performance, and scalability.
Drive key implementation decisions to maximize code reuse, leveraging existing frameworks and minimizing accumulation of technical debt.
Ensure that APIs and semantics are modular, future proof, and compatible with other parts of GCE and GCP to ensure a consistent user experience.