Anyscale is revolutionizing distributed computing through Ray, a popular open-source project powering scalable machine learning applications. With over $250 million in funding from prestigious investors like Andreessen Horowitz and NEA, Anyscale is trusted by industry giants including OpenAI, Uber, Spotify, Instacart, and Cruise.
The Ray Core team is seeking a talented Software Engineer to contribute to Ray's C++ backend, focusing on the distributed scheduler, language runtime integration, and I/O and memory subsystems. This role is crucial for maintaining Ray's reliability, scalability, and performance while supporting higher-level libraries and use cases.
As a Software Engineer on the Ray Core team, you'll work on optimizing large-scale workloads, developing stability and stress testing infrastructure, and improving fault tolerance. You'll be at the forefront of distributed systems development, contributing to a project that's shaping the future of AI applications.
The ideal candidate brings strong systems software experience, with at least 2 years of relevant work experience and expertise in building scalable, fault-tolerant distributed systems. Knowledge of distributed model training, inference, and GPU programming is highly valued. This is an exceptional opportunity to work on cutting-edge technology while making distributed computing accessible to developers of all skill levels.