Principal Software Engineer - Cluster Networks (JoinOCI-SDE)

World leader in cloud solutions using tomorrow's technology to tackle today's problems, with 40+ years of experience.
United States
$94,200 - $223,500
Distributed Systems
Principal Software Engineer
In-Person
7+ years of experience
AI · Enterprise SaaS · Cloud

Description For Principal Software Engineer - Cluster Networks (JoinOCI-SDE)

The Oracle Cloud Infrastructure (OCI) Cluster Networking team is seeking a Principal Software Engineer to join its AI infrastructure development team. This role focuses on building ultra-high-performance networking systems that support large-scale AI workloads, enabling customers to scale from tens to thousands of GPUs without compromising performance.

The position offers an exciting opportunity to be at the forefront of the AI revolution, working with a young and rapidly growing team on ambitious initiatives. As a Principal Engineer, you'll be responsible for designing, developing, and operating the network stack required for distributed AI workloads across massive GPU clusters.

Oracle, a world leader in cloud solutions with over 40 years of experience, offers a comprehensive benefits package including medical, dental, and vision insurance, a 401(k) with company match, flexible vacation, and various other perks. The company maintains a strong commitment to work-life balance and promotes diverse insights and perspectives.

The ideal candidate should be both a rock-solid developer and a distributed systems generalist, capable of diving deep into any part of the stack and low-level systems. You'll need 7+ years of systems/application development experience, strong programming skills in languages like Python, Java, or Go, and extensive knowledge of distributed systems concepts.

This role presents an exceptional opportunity to work on cutting-edge technology while solving complex challenges in AI infrastructure. You'll be joining a company that values innovation, integrity, and inclusive workforce development, making it an ideal place for those who seek to make a significant impact in the cloud computing industry.

Last updated 3 days ago

Responsibilities For Principal Software Engineer - Cluster Networks (JoinOCI-SDE)

  • Design, develop, and operate the network stack for distributed AI workloads
  • Build an ultra-high-performance network to support AI workloads
  • Scale systems from tens to thousands of GPUs without compromising performance
  • Work on broad distributed system interactions
  • Dive deep into any part of the stack and low-level systems

Requirements For Principal Software Engineer - Cluster Networks (JoinOCI-SDE)

  • 7+ years of experience with systems/application development
  • 3+ years of experience with distributed systems OR network programming
  • Proficient at programming in any two of C/C++, Python, Java, Scala, Go
  • Proficient with data structures, algorithms, and operating systems
  • Bachelor's degree in Computer Science and Engineering or a related engineering field
  • Experience with distributed systems: familiarity with the CAP theorem, consensus, messaging, high availability, etc.

Benefits For Principal Software Engineer - Cluster Networks (JoinOCI-SDE)

  • Medical, dental, and vision insurance
  • Short-term and long-term disability
  • Life insurance and AD&D
  • Flexible Spending Accounts
  • Pre-tax commuter and parking benefits
  • 401(k) with company match
  • Flexible Vacation
  • 11 paid holidays
  • 72 hours of paid sick leave
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan
  • Financial planning and group legal
