Site reliability/Devops/Platform Engineer, Managed Operations
AI Summary ✨
Requirements
Able to troubleshoot at all levels, from network to operating systems to software applications and experience supporting cloud systems or other services. Proficient troubleshooting and anticipating problems that affect the performance, reliability, or availability of software systems
Familiarity with Linux, using the command line and basic administration, and computer networking fundamentals
Knowledge of coding languages such as Java, Typescript, Python, or Ruby
Nice to Haves
Experience working cross-organizationally and leading strategic team efforts requiring work from multiple team members
Experience performance tuning software applications and optimizing fleet utilization
Experience with Infrastructure as Code, (such as CDK, CloudFormation, Puppet, Chef, Ansible, or similar)
What You'll Be Doing
Operating and improving one of the largest software systems
Reviewing the operational health of the services in your team's care
Executing changes following a change management process to production systems
Resolving your team's backlog of operational issues
Participating in an "on-call" rotation to resolve incidents out-of-hours
Perks and Benefits
Flexible work hours and arrangements for work-life balance
Employee-led and company-sponsored affinity groups promoting inclusion
Mentorship and career growth resources for professional development