Senior HPC AI Cluster Engineer

Requirements:

  • A degree in Computer Science, Engineering, or a related field and 8+ years of experience
  • Knowledge of HPC and AI solution technologies, from CPUs and GPUs to high-speed interconnects and supporting software
  • Experience with job scheduling and workload orchestration tools such as Slurm and Kubernetes (K8s)
  • Excellent knowledge of Windows and Linux (Red Hat/CentOS and Ubuntu) networking and internals
  • Experience with multiple storage solutions and familiarity with newer technologies
  • Python programming and bash scripting experience
  • Comfortable with automation and configuration management tools
  • Deep knowledge of networking protocols such as InfiniBand and Ethernet
  • Deep understanding and experience with virtual systems
  • Familiarity with cloud computing platforms

Nice to Haves:

  • Knowledge of CPU and/or GPU architecture
  • Knowledge of Kubernetes, container-related microservice technologies
  • Experience with GPU-focused hardware/software (DGX, CUDA)
  • Experience with RDMA (InfiniBand or RoCE) fabrics

What you will be doing:

  • Design, implement, and maintain large-scale HPC/AI clusters with monitoring, logging, and alerting
  • Manage Linux job/workload schedulers and orchestration tools
  • Develop and maintain continuous integration and delivery pipelines
  • Develop tooling to automate deployment and management of large-scale infrastructure environments
  • Deploy monitoring solutions for servers, network, and storage
  • Perform troubleshooting from bare metal to application level
  • Develop, re-define, and document standard methodologies
  • Support Research & Development activities and engage in POCs/POVs for future improvements

Perks and Benefits:

  • Equal opportunity employer
  • Value diversity at the company
  • Accommodations for individuals with disabilities

NVIDIA

UK, Sweden, Switzerland, Netherlands, France

Remote
Experience: Senior
Posted: May 22, 2026
Last seen: an hour ago
AWS
Azure
GCP
Jenkins
Kubernetes
Python
backend

Why we track NVIDIA

NVIDIA has become one of the most important companies in tech thanks to AI and GPU computing. They have European roles across several countries. If you're interested in hardware, CUDA, or ML infrastructure, they're hard to beat.
