Senior Systems Engineer, Artificial Intelligence Operations

AI Summary ✨

Requirements:

  • Bachelor of Science or equivalent experience
  • 12+ years of networking experience in enterprise or service provider environments, with strong hands-on expertise in routing and switching
  • Proficient in scripting and automation using Python or similar languages, with strong Linux expertise
  • Proven experience working directly with customers to resolve issues and ensure success in Systems Engineer or SRE roles
  • Exceptional oral, written, and presentation skills for clearly communicating complex technical topics
  • Demonstrated ability to collaborate effectively across teams, partnering with operations, engineering, and product development

Nice to Haves:

  • Experience with data center infrastructure and cloud architectures
  • Background in network performance monitoring or observability
  • Previous experience working at a technology start-up

What you'll be doing:

  • Gather and understand internal and external customer requirements to improve AI cluster resiliency, and design AIOps-based solutions that address these needs
  • Develop automated workflows for issue detection and root cause analysis, and collaborate closely with operators to debug sophisticated, full-stack AI cluster problems. The findings will feed into product improvements
  • Deliver compelling technical presentations and lead hands-on demos or training. Handle evaluation deployments (POC/POV) and ensure smooth, reliable installations by staying engaged and encouraging throughout the customer journey

NVIDIA

Remote - UK (Remote)

Experience: Senior
Posted: October 27, 2025
Python
sitereliability
