Anyscale, backed by prominent investors with more than $250 million in funding, is the company behind Ray, an open-source project for distributed computing. It is seeking a Software Engineer for its Ray Data team to work on the Datasets library, a key component of machine learning pipelines and production use cases at major companies like Amazon and Alibaba.
The role involves developing and maintaining the Ray Datasets library, built on Apache Arrow and Ray Core. You'll work on performance optimization, ML training integration, stability testing, and streaming workload integration. The position requires expertise in distributed systems, data processing, and database internals.
This is an exciting opportunity to join a team that's making distributed computing accessible to developers of all skill levels. You'll contribute to open-source software used by industry leaders like OpenAI, Uber, and Spotify. The role offers competitive compensation ($170,112-$237,000) and comprehensive benefits including equity, healthcare, and education stipends.
The ideal candidate has at least 2 years of experience, a strong algorithmic background, and expertise in scalable systems. The role is based in either San Francisco or Palo Alto and contributes to projects that directly affect the efficiency and accessibility of machine learning applications.
Working at Anyscale means being at the forefront of AI infrastructure development, with the opportunity to shape how distributed computing evolves. The company culture values technical excellence, innovation, and effective communication: engineers are expected to share their knowledge through talks and blog posts.