Prior demonstrated experience in a Site Reliability Engineering (SRE), DevOps, or an Infrastructure-focused role.
Proficiency in one or more programming languages (eg. Java, Python).
Support of internet-facing production services and distributed systems via deployments, On-Call and Incident Management.
Deep understanding of basic security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, PKI, X509 certificates and PGP.
Nice to Haves
Proficiency in implementing and coordinating telemetry using monitoring and observability tools like Splunk, Grafana, and Prometheus, or similar.
Firsthand experience in performance tuning of applications and databases.
Knowledge of WebMethods Integration server or a middleware platform will be a plus.
What You'll Be Doing
Implement and maintain best-in-class devops practices.
Work on complex technical challenges related to scalability, reliability, and performance of Apple B2B systems.
Manage the lifecycle of machine learning models in production and non-production environments.
Continuously assess and improve system processes, detect anomalies, identify areas of optimization, and implement solutions to enhance system reliability and performance.
Perks and Benefits
Join a dynamic team with a work culture fueled by machine learning, anomaly detection, and threat detection.
Collaborate with a highly motivated team of professionals who push boundaries and deliver exceptional results.
Exciting opportunity to build your career in a supportive environment prioritizing continuous learning and professional development.