As a Senior Technical Program Manager in DC Critical Environments, you will:
- Drive a program that covers end-to-end monitoring, and processing of critical environment (CE) infrastructure telemetry for all leased sites, to bring those sites on par with owned datacenter sites.
- Design and implement telemetry data ingestion and data processing systems for leased sites.
- Prototype, pilot, and deploy multi-signal anomaly detection and prevention systems leveraging machine learning and statistical analysis for DC leased sites
- Define and drive an operationalization plan for the telemetry pipeline for leased sites.
- Ensure interoperability of detection methods, systems, and workflows by defining conceptual, logical, and physical data models.
- Understand the signals coming from the EPMS and BAS systems for leased sites.
- Ensure high percent coverage and mapping of leased site signals including thermal, power, and other environmental conditions and data.
- Define a set of reusable primitives for mapping logical and physical topology of data centers leased sites.
- Ensure there is a high-frequency, high-volume, low-latency streaming and micro-batching capable pipeline to process DC CE telemetry from leased sites.
- Architect a staging model to ensure the onboarding of leased sites CE telemetry (thermal, power, and other environmental subjects).
Qualifications:
- Subject matter expertise level in supervised and unsupervised machine learning models for anomaly detection.
- Demonstrated subject matter expertise in utilizing data lakes within Lakehouse architectures to process, aggregate, and manage real-time data streams from cloud-based services.
- Demonstrated subject matter expertise in managing and processing large-scale data formats with a focus on real-time serialization and deserialization to ensure low-latency during data handling. This includes advanced proficiency in Kusto query language (KQL) with experience, and proficiency in coding with Python, GoLang, or Spark.
- Experience with generative AI or Copilots for troubleshooting data center environments.
- Expertise in processing data frames from networking layers and protocols, including BGP, TCP/IP, and GPRS tunneling protocol.
- Proven experience on building applications using artificial intelligence (AI) techniques, including machine learning (ML) and data science, to enhance and automate various IT operations (AIOPS).
- Bachelor's or master’s degree in computer science, data engineering, or a related field.
- Excellent problem-solving skills and attention to detail.
- Ability to work collaboratively with cross-functional teams.
- Strong written and verbal communication skills.
Preferred Qualifications:
- Indepth experience in designing and implementing telemetry systems for data center networks.
- Familiarity with HVAC, CRAC, AHU, Chillers, and other critical environment equipment.
- Knowledge of incident management and data center operations.
- Certifiable knowledge in cloud computing.
About Us: We are committed to maintaining the highest standards of operational excellence in our data centers. Join us in our mission to enhance our telemetry capabilities and ensure the reliability and efficiency of our critical environments.
Background Check Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to, the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
#COICareers
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.