## Requirements
- 8 years plus engineering management experience
- 5 years experience managing SRE teams, managing mission critical production services, with progressively larger charters
- Demonstrated success leading SRE teams, and managing infrastructure development engineers
- Understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts
- Proficient in at least one of Python, Golang, Java, or Rust. Experience working in a standard SDLC
- Understanding of key Infrastructure Security concepts and principles
## Preferred Qualifications
- Proven experience with large scale, highly available, distributed, and fault tolerant systems
- Excellent understanding of operating systems concepts including multi-threading, memory management, networking and storage, performance and scale
- Experience with Kubernetes, Docker, and containerization (CNCF Kubernetes Administrator or equivalent)
- Deep knowledge of Linux security primitives, systems, packaging, container security and SELinux
- Understanding of MacOS security primitives
- BS/MS in Computer Science or Equivalent (5+ years of software development or production operations experience in a large-scale environment)
- Prior experience in security related fields (or equivalent experience) Certs like OSCP, OSCE, OSEE, etc. helpful but not vital
## What You'll Be Doing
- Lead the SRE teams responsible for reliability and performance of critical security infrastructure services
- Improve the reliability, observability, and manageability of the services
- Collaborate with multi-functional teams to design, implement, and maintain security measures, incident response protocols, and automation tools to strengthen security posture
## Perks and Benefits
- Not available