Principal Forensic Engineer
Redmond, WA 
Share
Posted 10 days ago
Job Description
OverviewMicrosoft Cloud Infrastructure and Operations (CO+I) is the engine that powers Microsoft's cloud services. The group is responsible for designing, building, and operating Microsoft's global datacenters; managing the programmatic delivery of our critical infrastructure design, equipment procurement, construction delivery, infrastructure innovation, demand planning and capacity utilization of our unified infrastructure; and responsible for all operations needed to run the physical infrastructure. We focus on smart growth with an emphasis on automation, data-driven engineering, cost-effectiveness, and environmental sustainability. We deliver the core infrastructure and foundational technologies for Microsoft's 200+ online businesses including Azure, Office 365, Bing, Xbox Live, Skype, and OneDrive. Our portfolio is built and managed by a team of subject matter experts working 24x7x365 to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide. Within CO+I, the Forensic Engineering Team is responsible for performing Root Cause Analysis (RCA) on systemic issues and investigating when critical components fail. Within Forensic Engineering, we are seeking a motivated and experienced Principal Forensic Engineer to join our team. If you are a strategic thinker with a passion for driving business success, we encourage you to apply for this exciting opportunity. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day and to empower billions!
ResponsibilitiesLead and track forensic analysis of events which have occurred within the data center infrastructure.Serve as a functional specialist by being able to speak to all aspects of data center functions and failure modes in critical environments.Develop methodologies to validate data center performance, system control parameters and operational efficiency against design intent and determine quantifiable deviations.Perform troubleshooting and root cause analysis associated with equipment failure.Review equipment and system performance data to identify issues through trend analysis.Assists in the troubleshooting of issues in the field, remotely or in person.Review compliance with existing corrective and preventative maintenance program to enhance operational readiness.Analyze full time employee and vendor staffing to include training, procedures, and site requirements as part of root cause analysis.Foster and promote our proactive implementation of lesson learned from analysis across multiple design, construction, and operational organizations.Develop solutions for defects identified through trends and data analysis.Drive global standardization and consistency of processes, procedures, and reports with Operations teams for Quarterly Business Reviews.Work with Site Operations Engineers to establish visual standards, process improvement and error proofing systems to drive efficiency within the business.Identify and monitor the need for use of new tools to improve the quality of data and analytics.Embody our culture and values.

 

Job Summary
Company
Start Date
As soon as possible
Employment Term and Type
Regular, Full Time
Required Experience
Open
Email this Job to Yourself or a Friend
Indicates required fields