Strong understanding of the Linux operating system and the TCP/IP suite of networking protocols
Ability to design, author, and release code in languages like Go or Python
Hands-on experience managing large numbers of diverse systems with configuration management or software delivery platforms (such as Puppet, Chef, or Ansible)
Familiarity with microservices architecture and container orchestration with Kubernetes
Nice to Haves
Experience with bare-metal management and with deploying, supporting, and monitoring new and existing services, platforms, and application stacks
Acute drive to automate manual operations and to improve them through repeated iteration
Experience with scale testing, disaster recovery, and capacity planning
Experience managing and scaling distributed systems in a public, private, or hybrid cloud environment
Experience with the Prometheus ecosystem and a good understanding of infrastructure observability principles
What You'll Be Doing
Solving complex problems using data, teamwork, and expertise
Owning the full infrastructure stack from device driver performance debugging to content delivery network traffic management
Running a mix of open-source, vendor-licensed, and internally developed tools for system configuration management, provisioning, software deployment, logging, and monitoring
Collaborating with development teams to deliver the best results for Apple
Thinking critically and balancing the best solution with the need to get things done