Netflix - 1d ago
Reddit - 4d ago
7 to 10+ years production level experience with distributed applications at scale in public and/or private cloud
Experience architecting and implementing large-scale Observability platforms
Experience with internally hosted logging systems like Splunk, ClickHouse, Loki, Elastic, assisting clients and improving environment performance and stability
Demonstrated ability to drive ingestion cost optimization through data-driven analysis, pipeline guardrails, and direct engagement with customer engineering teams to reduce unnecessary log volume
Experience with OpenTelemetry — including collector configuration, pipelines, and instrumentation — as a core requirement given Adobe's OTel-native observability strategy
AI agent development and experience integrating AI workflows into large-scale deployments; ability to build AI-assisted workflows to surface actionable insights from large log datasets and automate routine user interactions
Experience architecting distributed environments with thousands of users
Programming experience with languages like Go, Python; experience building integrations and applications to large-scale Observability environments
Experience designing and implementing systems for fault tolerance, scalability and stability
Experience developing, deploying and running distributed applications on cloud platforms; experience with container and orchestration technologies (Docker, Kubernetes)
Comfortable owning on-call coverage across a multi-tool observability stack, with the ability to triage and resolve issues across platforms beyond primary area of expertise
Ensure the highest level of up-time and Quality of Service (QoS) to Adobe's customers through operational excellence
Knowledge in defining service level objectives (SLOs) and service level indicators (SLIs) to represent and measure service quality
Knowledge of (public and/or private) cloud deployments
Collaborate with SRE and Engineering/Product teams in driving critical initiatives
Experience in designing and maintaining production monitoring systems
Experience in solving performance and stability issues using a wide variety of tools
Excellent communicator in and across teams, driving projects to completion
Impacts the organization through contribution to technical direction and strategic decisions
Experience evaluating and prototyping alternative storage/processing backends (e.g., ClickHouse, Loki)
Experience with other Observability tooling like Grafana, Cortex, and Tempo
Promote the DevOps/SRE approach
Bucharest, Romania
Why we track Adobe
Adobe has had EU engineering teams for a long time, including a significant presence in Romania. The creative tools are used by millions, and they're increasingly investing in AI-powered features. More technically interesting than people give them credit for.
Bucharest, Romania
Senior Infrastructure Engineer – Cloud Development Environments
Bucharest, Romania

Principal Software Engineer, Site Reliability
Bucharest, Romania