The Oracle Cloud Infrastructure (OCI) Cluster Networking team is seeking a Principal Software Engineer to join its innovative AI infrastructure development team. This role focuses on building ultra-high-performance networking systems that support large-scale AI workloads, enabling customers to scale from tens to thousands of GPUs without compromising performance.
The position offers an exciting opportunity to be at the forefront of the AI revolution, working with a young and rapidly growing team on ambitious initiatives. As a Principal Engineer, you'll be responsible for designing, developing, and operating the network stack required for distributed AI workloads across massive GPU clusters.
Oracle, a world leader in cloud solutions with over 40 years of experience, offers a comprehensive benefits package that includes medical, dental, and vision insurance, a 401(k) with company match, flexible vacation, and various other perks. The company maintains a strong commitment to work-life balance and promotes diverse insights and perspectives.
The ideal candidate should be both a rock-solid developer and a distributed systems generalist, capable of diving deep into any part of the stack, including low-level systems. You'll need 7+ years of systems/application development experience, strong programming skills in languages like Python, Java, or Go, and extensive knowledge of distributed systems concepts.
This role presents an exceptional opportunity to work on cutting-edge technology while solving complex challenges in AI infrastructure. You'll be joining a company that values innovation, integrity, and inclusive workforce development, making it an ideal place for those who seek to make a significant impact in the cloud computing industry.