Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling, or data engineering work
OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work
OR equivalent experience
Experience working with specific frameworks or libraries for model pre-training, such as TensorFlow, PyTorch, or Hugging Face Transformers
What You'll Be Doing:
Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a data-driven approach
Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on an in-house large-scale distributed stack
Collaborate closely with teams on infrastructure, data, post-training, and multimodality
Embody our culture and values
Perks and Benefits:
Benefits/perks may vary depending on the nature of your employment with Microsoft and the country where you work