AIML - Site Reliability Engineer (SRE), Siri Knowledge Platforms
Requirements
A strong sense of ownership and integrity demonstrated through clear communication and collaboration.
Sophisticated knowledge of one or more of the following: Kubernetes, containerisation systems, and/or public cloud infrastructure (AWS, GCP).
Proficiency in Go, Python, or a similar language for automating tasks.
Hands-on experience handling large numbers of diverse systems with configuration management or software delivery platforms (such as Puppet, Chef, Ansible, and Spinnaker).
Nice to Haves
Working knowledge of multi-tier applications and their dependencies, including load balancing, TCP/IP networking, web services, LDAP, and DNS.
Proficiency with web server administration, including Apache and Nginx.
Knowledge of database design, support, and administration, including Postgres, MySQL, and HBase.
Network administration and troubleshooting.
Good interpersonal skills shown through previous projects or assignments.
What You'll Be Doing
Taking direct responsibility for the infrastructure that powers Siri, search, and other high-impact user-facing solutions running on millions of Apple devices worldwide.
Improving the stability, security, efficiency, and scalability of a 24/7 global service.
Participating in on-call rotations and working in geographically distributed SRE teams for follow-the-sun support.
Isolating and resolving issues through investigative analysis.
Building and maintaining accurate, up-to-date documentation that reflects the current configuration.
Providing code reviews and mentoring new team members.
Perks and Benefits
Play a meaningful role in revolutionising how people use their computers and mobile devices.
Work with teams building groundbreaking technology for algorithmic search, machine learning, natural language processing, and artificial intelligence.
Work with the most scalable big-data systems in existence.