Oracle Analytics is seeking a Principal Data Scientist to join their data science team. The role involves working on diverse data science/ML problems using enterprise data, such as invoice payment prediction, churn prediction, demand prediction, and recommender systems. The team utilizes various technologies, including text processing, information retrieval, natural language processing (including large language models), and time series prediction.
Key responsibilities include:
- Understanding background literature and relevant research papers in problem domains
- Data engineering: processing, cleansing, and verification of data
- Performing ad-hoc analysis and data visualization
- Developing machine learning models using PySpark, scikit-learn, Keras, TensorFlow, PyTorch, etc.
- Working on data pipelines for deploying machine learning models (e.g., using OCI Data Flow)
- Conducting post-deployment analysis of ML models
The ideal candidate should have:
- BS in Computer Science, Data Science, Machine Learning, or related technical fields with 5+ years of applied experience
- Strong programming skills in Python/PySpark and industrial experience in data science/machine learning
- Experience in creating/deploying models in Spark environments
- Thorough understanding of CS fundamentals, including data structures, algorithms, and complexity analysis
- Strong software development experience through hands-on coding
- Ability to formulate analytical problems into actionable research and apply advanced machine learning techniques for problem-solving
Oracle offers a comprehensive benefits package, including medical, dental, and vision insurance, disability coverage, life insurance, flexible spending accounts, 401(k) with company match, paid time off, and various other perks. The company is committed to diversity, work-life balance, and creating an inclusive work environment.