The Oracle Cloud Infrastructure (OCI) team is building a cloud service that supports data scientists, machine learning engineers, and software engineers across the machine learning development and deployment lifecycle. As a Senior Machine Learning Engineer on the OCI Data Science team, you will design and deliver a high-quality cloud service with the capabilities, scalability, and performance that enterprise data science teams need. You'll work on interactive notebooks, distributed machine learning on CPU/GPU supporting a wide variety of LLM (Gen AI) and ML algorithms and libraries, distributed model serving, and robust monitoring and analytics of ML models.
Key Responsibilities:
- Build accelerated data science components and machine learning solutions for a cloud service
- Design and build distributed, scalable, fault-tolerant software systems using Dask, Spark, Horovod, TensorFlow, PyTorch, etc.
- Participate in the entire software lifecycle – development, testing, CI, and production operations
- Develop new features for the data science platform in various classical and deep learning frameworks
- Work with customers and ISVs to troubleshoot data science solutions
- Design, develop, and document robust software components
- Mentor new employees and work with less experienced software developers
- Participate in the on-call rotation for the service
Required Qualifications:
- 6 to 10+ years of professional work experience
- B.Tech/B.E. or M.S. in Computer Science, AI/ML, or a related software field
- Strong programming abilities in Python (intermediate level or above)
- Understanding of machine learning
- Experience with software engineering practices (unit testing, mocking, logging, debugging, git, code review, etc.)
- Strong verbal and written communication skills
Desired Skills:
- DevOps experience: containerization (Dockerizing Python applications), Linux/UNIX shell, package management with Conda
- Cloud experience with AWS, GCP, or Azure
- Experience with distributed ML frameworks such as TensorFlow and PyTorch
- SQL experience
- Java development experience
This role offers unique opportunities to work on cutting-edge infrastructure and solve exciting challenges at the intersection of data science and cloud computing.