Senior AI Ops Engineer

SingleStore is one platform for all data, built so you can engage with insight in every moment. It enables enterprises to adapt to change as it happens, embrace diverse data with ease, and accelerate the pace of innovation.
San Francisco, CA, USAPortland, OR, USASeattle, WA, USA
Machine Learning
Senior Software Engineer
Contact Company
6+ years of experience
AI · Enterprise SaaS

Description For Senior AI Ops Engineer

As a Senior AI Ops Engineer at SingleStore, you will be responsible for integrating AI and machine learning into our infrastructure and operations processes. You'll work closely with Infrastructure, DevOps, and Data Science teams to design, develop, and implement AI-driven automation and monitoring solutions. Your role involves leveraging AI for incident management, predictive maintenance, end-to-end automation, resource optimization, and building self-healing infrastructure. You'll also focus on dynamic anomaly detection, enhanced monitoring & observability, and continuous learning and optimization of AI/ML models.

Key responsibilities include:

  1. AI-Powered Incident Management
  2. Predictive Maintenance
  3. End-to-End Automation
  4. Intelligent Resource Optimization
  5. Self-Healing Infrastructure
  6. Dynamic Anomaly Detection
  7. Enhanced Monitoring & Observability
  8. Collaborative Operations
  9. Continuous Learning and Optimization
  10. Incident Post-Mortem Analysis

Requirements:

  • 6+ years in DevOps, Infrastructure, or SRE roles with automation and monitoring experience
  • Proven experience applying AI/ML techniques to operational problems
  • Familiarity with cloud infrastructure (AWS, GCP, Azure)
  • Strong understanding of machine learning concepts and experience with frameworks like TensorFlow, PyTorch, or Scikit-learn
  • Proficiency in Python and infrastructure automation tools
  • Knowledge of monitoring tools and CI/CD pipelines

SingleStore offers a range of benefits including a technology stipend, monthly cell phone and internet stipend, health and wellness benefits, flexible time off, and stock options. As a global company, benefits may vary by location.

Join SingleStore to define the future with The Database of Now™ and be part of a diverse and inclusive team committed to innovation in the data management space.

Last updated 35 minutes ago

Responsibilities For Senior AI Ops Engineer

  • AI-Powered Incident Management: Leverage AI/ML models to detect anomalies in real time, perform root cause analysis, and accelerate incident resolution
  • Predictive Maintenance: Implement machine learning algorithms to forecast system and hardware failures
  • End-to-End Automation: Design, build, and maintain AI-driven automation pipelines
  • Intelligent Resource Optimization: Utilize AI to analyze system usage patterns and make real-time decisions about resource allocation
  • Self-Healing Infrastructure: Develop and implement systems that automatically detect and correct issues
  • Dynamic Anomaly Detection: Build and refine models that dynamically learn from evolving system behavior
  • Enhanced Monitoring & Observability: Integrate AI-driven analytics into existing monitoring platforms
  • Collaborative Operations: Work closely with DevOps, SRE, and Data Science teams to implement AIOps solutions
  • Continuous Learning and Optimization: Regularly evaluate and improve AI/ML models based on operational feedback
  • Incident Post-Mortem Analysis: Automate the collection of data for incident post-mortem reports

Requirements For Senior AI Ops Engineer

Python
Kubernetes
  • 6+ years in DevOps, Infrastructure, or SRE roles with experience in automation and monitoring
  • Proven experience in applying AI/ML techniques to operational problems
  • Familiarity with cloud infrastructure (AWS, GCP, Azure)
  • Strong understanding of machine learning concepts and experience with frameworks like TensorFlow, PyTorch, or Scikit-learn
  • Proficiency in Python, with experience in developing automation scripts
  • Hands-on experience with infrastructure automation tools (Terraform, Ansible, etc.)
  • Knowledge of monitoring tools such as Prometheus, Grafana, or Datadog, and experience integrating AI for predictive analytics
  • Solid understanding of CI/CD pipelines and integrating AI-driven insights into these processes

Benefits For Senior AI Ops Engineer

Equity
  • Technology Stipend for New Employees
  • Monthly Cell Phone and Internet Stipend
  • Health and Wellness benefit
  • Company and team events
  • Flexible time off
  • Volunteer time off
  • Stock Options

Interested in this job?

Jobs Related To SingleStore Senior AI Ops Engineer

Machine Learning Engineer

Senior Machine Learning Engineer role at DoorDash, building world-class ML platforms and infrastructure for large-scale predictions and feature processing.

Research Scientist/Engineer - AI Safety (Biosecurity)

Join Anthropic's team to research and mitigate extreme risks from future AI models, focusing on biosecurity.

Senior Software Engineer - GenAI and Web Technologies

Senior Software Engineer role at Capco, focusing on GenAI and web technologies for financial services sector.

Senior Machine Learning Platform Engineer

Senior Machine Learning Platform Engineer optimizing large-scale AI training for autonomous vehicles

Machine Learning Engineer

Senior Machine Learning Engineer role at DoorDash, building world-class ML platforms for logistics and delivery optimization.