Lead a team of ~20 reliability engineers, fostering a culture of operational excellence, continuous learning, and customer obsession
Attract, develop, and retain top talent; build career paths that keep engineers engaged and growing
What You’ll Do
Define and drive HubSpot's reliability roadmap, balancing proactive resilience investments with reactive incident reduction
Partner with Infrastructure leadership to prioritize reliability initiatives alongside cost, performance, and platform evolution
Set and evolve SLO standards that align engineering effort with customer experience
What You’ll Bring
Required Qualifications
10+ years of experience in software engineering, SRE, or infrastructure, with 5+ years leading teams
Track record of building and scaling reliability functions at companies with significant operational complexity
Deep technical fluency-you can dive into architecture discussions, incident analysis, and system design with credibility
Curiosity and vision for how AI/ML can transform operations; experience with or strong interest in AIOps, agentic automation, or ML-driven observability is a plus
Proven ability to drive cultural and process change across a large engineering organization without top-down mandates
Strong executive communication skills; comfortable leading incident bridges, presenting to leadership, and representing reliability externally
Experience with modern cloud infrastructure (AWS preferred), observability tooling, and incident management practices
A philosophy that balances reliability with velocity-you understand that the goal is sustainable speed, not gates