Site Reliability Engineer - Adobe Experience Platform
AI Summary ✨
Requirements
At least 5 years of commercial software development and technical operations experience, and experience managing large-scale cloud-based applications.
BS/MS in Computer Science or equivalent experience.
Able to build and deliver infrastructure solutions for scalability, reliability, high availability, performance, security, software maintainability, and operational excellence.
Experience with Linux-based open source software.
Experience with AWS technologies and Kubernetes, Terraform.
Expertise with config management tools (Ansible/ Salt-stack/ Puppet), NoSQL (Snowflake/Cassandra/MongoDB) and with monitoring and logging solutions (preferably Prometheus, Splunk, Grafana).
Expertise with at least one programming language (Java/ Scala or Python)
Excellent communication skills (verbal and written) are critical to the role.
Able to work efficiently across various time zones to coordinate with colleagues in different regions.
What you'll be doing
Extend our product services and production environment using traditional software engineering guidelines.
Contribute to the technical direction of our hybrid private/public cloud enterprise solution.
Collaborate with various internal teams to provide a high-quality customer experience.
Contribute service metrics and measurement.
Deliver automation to prevent problem recurrence and automate responses to all non-exceptional service conditions.
Establish credibility with the quality of your technical execution.
Participate in a cross-regional on-call rotation.
Continually evaluate and adopt the latest industry technologies to optimize costs and increase efficiency.
Participate fully in a culture that supports innovation and creativity while delivering high output in a predictable and reliable way.