Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Experience in HPC or Machine Learning
Nice to Haves:
Master's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Familiarity with Machine Learning, AI Infrastructure, Operating Systems fundamentals and virtualization technologies, Accelerators
Experience on Distributed Systems, High Performance Computing / Machine Learning middleware, Co-Designing Hardware-Software, Profiling and Performance Analysis Tools
What You'll Be Doing:
Analyzes functionality, integration, and performance issues at various levels of the hardware/software stack on current and future generations of AI training platforms
Designs and codes solutions that improve functional correctness, stability and performance of AI training oriented VM offerings and related services. When appropriate, drives internal partner teams or industry partners to implement such solutions
Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI). Applies metrics to drive the quality and stability of code, as well as appropriate coding patterns and best practices
Holds accountability as a Designated Responsible Individual (DRI), and collaborates with other engineers across products/solutions, working as on-call to monitor system/product/service for degradation, downtime, or interruptions
Develops a playbook for the team to resolve issues
Maintains communication with key partners across the Microsoft ecosystem of engineers
Your mission will be to help ensure Azure platform is consistent on performance, can scale on-demand, and engineered to withstand the unparalleled computing demand from the customer workloads. You will help build a test-driven engineering culture to reduce regressions and bugs in production and will set a higher bar for infrastructure quality
Perks and Benefits:
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work