Requirements
- Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling, or data engineering work
- OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work
- OR equivalent experience
- Experience with HPC (High-performance computing) and/or parallel programming
- Experience in the area of pretraining
- Experience working with GPU clusters
What You'll Be Doing
- Design and develop Python and CUDA/HIP C++ code that enable distributed training of multimodal LLMs ingesting text, audio, images, or video data
- Build and maintain cutting-edge infrastructure that can store and process petabytes of data needed to power models
- Partner with pretraining and post-training teams to improve data recipes through experimentation
- Collaborate with the product team and other engineers and researchers to identify gaps in the current generation of models
Perks and Benefits
Benefits and perks may vary depending on the nature of your employment with Microsoft and the country where you work.