Anyscale is seeking talented engineers to join their team and contribute to the development of next-generation, high-performance machine learning serving systems. The Platform team is dedicated to creating world-class systems for serving ML models in production, including building and maintaining the open-source Ray Serve library and contributing directly to the Anyscale platform.
As part of this role, you will:
- Develop a highly available service for ML model serving
- Enhance Ray Serve and other libraries to simplify the development of next-generation ML applications in production
- Improve autoscaling capabilities to drive performance enhancements and cost savings
- Optimize latency and throughput for both single- and multi-model serving scenarios
Requirements:
- Solid background in algorithms, data structures, and system design
- Experience working with modern machine learning tooling, including PyTorch, TensorFlow, and JAX
- At least 2+ years of relevant work experience
Bonus points for:
- Experience in building and maintaining open-source projects
- Experience in building and operating machine learning infrastructure in production
- Experience in building highly available serving systems
Anyscale offers competitive compensation and benefits, including:
- Target salary range: $170,112 ~ $237,000
- Stock Options
- Healthcare plans (99% premiums covered)
- 401k Retirement Plan
- Wellness and Education stipends
- Paid Parental Leave
- Flexible Time Off
- Commute reimbursement
- 100% of in-office meals covered
Anyscale is based in San Francisco, CA, with employees required to come to the office 3x a week. The company values diversity and inclusion and encourages individuals from underrepresented groups to apply.