Member of Technical Staff, AI Pretraining Platform
AI Summary ✨
Requirements
Design and develop Python and CUDA/HIP C++ code enabling distributed training of multimodal LLMs ingesting text, audio, images, or video data
Build and maintain cutting-edge infrastructure capable of storing and processing petabytes of data
Partner with pretraining and post-training teams to enhance data recipe through experimentation
Collaborate with product team and other engineers to identify gaps in current generation of models
Nice to Haves
Experience with HPC or parallel programming
Experience in pretraining
Experience working with GPU clusters
What You'll Be Doing
Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling, or data engineering work
OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work