Microsoft is seeking a highly motivated senior systems engineer to work in the Azure Cloud Hardware and Infrastructure Engineering (CHIE) team. This role involves collaborating across disciplines to create cutting-edge systems and modules for deployment in Microsoft's Azure Cloud.
As a Senior Systems Engineer, you will:
- Work directly with engineers across cross-functional teams to deliver hardware designs from concept to data center deployment.
- Interact with Microsoft's services teams and cross-discipline design teams, focusing on functional interfaces, developing test cases, and qualifying designs.
- Participate in architectural discussions and evaluate power, thermal, and cooling solutions for AI/ML workloads.
- Analyze new interfaces and subsystems, develop integration plans, analyze power efficiency, debug integration issues, and provide recommendations.
- Define system behavior and concept of operations for the platform to ensure compatibility with Microsoft Azure datacenter software, serviceability, telemetry, and customer expectations.
- Perform NUDD (new, unique, different, and difficult) technology and feature analysis, providing risk assessment and mitigations.
- Drive technical requirements and ensure solutions are flexible and scalable across the full (HW/FW/SW) stack.
- Enable platform and solution level discussions, influencing product architecture, and delivering on quality, reliability, and performance goals.
- Collaborate with internal, external, and open-source partners to onboard innovative technologies seamlessly.
Required Qualifications:
- 7+ years of technical engineering experience OR
- Bachelor's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 5+ years of technical engineering experience OR
- Master's degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 3+ years of technical engineering experience OR
- Doctorate degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field
- 5+ years of experience in developing power and thermal solutions for accelerator-based or compute-based hardware systems
- 5+ years of integrating and deploying new hardware technology
Preferred Qualifications:
- Hands-on experience developing GPU, FPGA-based accelerator platforms for AI/ML use cases
- Knowledge of high-volume silicon (SoCs, GPUs, or FPGAs), compute, storage, and/or networking design, manufacturing, and deployment
- Experience in power, thermal, and cooling systems
- Experience with high-speed interfaces such as PCIe, DDR, and Ethernet
- In-depth experience characterizing Silicon and system-level power consumption based on workloads
- Knowledge about datacenters & operations at scale
This role offers an exciting opportunity to work on state-of-the-art accelerator hardware solutions and contribute to Microsoft's cloud infrastructure. Join us in shaping the future of cloud computing and AI/ML technologies!