Software Engineer 2

Microsoft is a leading technology company that develops and sells software, hardware, and cloud services.
Backend
Mid-Level Software Engineer
Hybrid
4+ years of experience
AI · Enterprise SaaS · Cloud

Description For Software Engineer 2

The Azure AI Infrastructure team is looking for passionate engineers to build the largest deep-learning infrastructure service at Microsoft. In this role, you will build new components that bring the latest innovations in AI infrastructure onto the Azure AI Platform. You will partner with top engineering talent within Azure AI Infrastructure and across Azure to work on cluster orchestration, job scheduling, storage, networking, containerization, and operating system integration.

Your work will enable various AI languages and runtimes on Azure AI Infrastructure to bring distributed deep learning training and inferencing to life. You will build the infrastructure components required to build, deploy, monitor, and service highly available and scalable Microsoft Service Fabric and Kubernetes clusters. You will lead development and customer support from the front line and establish architecture, service excellence guidelines, and a high quality bar.

We are engineers on Azure AI Infrastructure. We believe that building a planet-scale AI supercomputer from the ground up, one that addresses the fundamental pain points of data scientists and AI practitioners and takes AI to unprecedented scale, is the opportunity of a lifetime.

Azure AI Infrastructure is a globally distributed, multi-tenant service that provides robust, cost-effective, and competitive AI infrastructure (compute, networking, and storage) for AI training and inferencing. By abstracting workloads from the underlying infrastructure, it creates a shared pool of resources that can be dynamically provisioned to fully utilize expensive GPU compute. This lets data scientists productively build, scale, experiment with, and iterate on their models on a performant, scalable distributed infrastructure built for AI.

Responsibilities include:

  1. Deliver a robust container orchestration platform for Azure AI Infrastructure
  2. Design and build the scheduling sub-system for AI training and inferencing workloads
  3. Design and build a storage and caching system for efficient DNN training and inferencing
  4. Design and build control plane APIs for creation and management of training jobs and inference model metadata
  5. Deliver node management, fault detection, and node repair as a service
  6. Deliver world-class monitoring systems and telemetry pipelines
  7. Codify security and compliance requirements
  8. Leverage performance and profiling tools to identify hot spots and bottlenecks across hardware and software boundaries

Join us in building the future of AI infrastructure at Microsoft!

Last updated 2 months ago

Requirements For Software Engineer 2

Kubernetes · Linux · Rust · Go

  • 4+ years of coding experience in one of C#, C, C++, Rust, or Go
  • Experience working with the Linux operating system and Kubernetes cluster orchestration
  • Experience with improving service operations or engineering fundamentals
  • Excellent collaboration skills
  • A master's or bachelor's degree in computer science or a related field
  • At least 3 years of experience building and shipping production software or services
  • Ability to meet Microsoft, customer and/or government security screening requirements

Benefits For Software Engineer 2

Medical Insurance · Education Budget · Parental Leave

  • Industry-leading healthcare
  • Educational resources
  • Discounts on products and services
  • Savings and investments
  • Maternity and paternity leave
  • Generous time away
  • Giving programs
  • Opportunities to network and connect
