Google is seeking a Software Engineer III to join their XBorg team within the Google Cloud division. XBorg is a crucial component of the Borg Control Plane, responsible for orchestrating and scheduling throughput-oriented workloads across clusters, with a particular focus on Machine Learning training and inference workloads. The role is part of the ML, Systems, & Cloud AI (MSCA) organization, which is responsible for designing, implementing, and managing hardware, software, machine learning, and systems infrastructure for all Google services and Google Cloud.
The position offers an opportunity to work on cutting-edge technology that impacts billions of users worldwide. You'll be involved in developing features that enhance resource occupancy and efficiency for ML workloads across major Alphabet products, implementing concepts such as weighted fair queuing and seamless opportunistic access to unused resources.
As a Software Engineer III, you'll be working on projects critical to Google's needs, with the flexibility to switch teams and projects as both you and the business evolve. The role requires versatility and leadership qualities, as you'll be tackling problems across the full-stack while pushing technology forward. You'll be part of a team that prioritizes security, efficiency, and reliability in everything from developing latest TPUs to running a global network.
The position offers exposure to Google's advanced ML infrastructure and the opportunity to work with technologies like Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers. You'll be contributing to systems that handle information at massive scale, extending well beyond web search into areas such as distributed computing, large-scale system design, artificial intelligence, and natural language processing.
This role is perfect for someone who is passionate about machine learning infrastructure, has strong software development skills, and wants to make a significant impact on how ML workloads are managed and executed at scale. You'll be working in Warsaw, Poland, collaborating with global teams to shape the future of hyperscale computing while ensuring Google's services remain efficient and reliable for users worldwide.