Senior AI Ops Engineer

SingleStore is one platform for all data, built so you can engage with insight in every moment. It enables enterprises to adapt to change as it happens, embrace diverse data with ease, and accelerate the pace of innovation.
San Francisco, CA, USA · Portland, OR, USA · Seattle, WA, USA
Machine Learning
Senior Software Engineer
6+ years of experience
AI · Enterprise SaaS

Description For Senior AI Ops Engineer

As a Senior AI Ops Engineer at SingleStore, you will be responsible for integrating AI and machine learning into our infrastructure and operations processes. You'll work closely with Infrastructure, DevOps, and Data Science teams to design, develop, and implement AI-driven automation and monitoring solutions. Your role involves leveraging AI for incident management, predictive maintenance, end-to-end automation, resource optimization, and building self-healing infrastructure. You'll also focus on dynamic anomaly detection, enhanced monitoring & observability, and continuous learning and optimization of AI/ML models.

Key responsibilities include:

  1. AI-Powered Incident Management
  2. Predictive Maintenance
  3. End-to-End Automation
  4. Intelligent Resource Optimization
  5. Self-Healing Infrastructure
  6. Dynamic Anomaly Detection
  7. Enhanced Monitoring & Observability
  8. Collaborative Operations
  9. Continuous Learning and Optimization
  10. Incident Post-Mortem Analysis

Requirements:

  • 6+ years in DevOps, Infrastructure, or SRE roles with automation and monitoring experience
  • Proven experience applying AI/ML techniques to operational problems
  • Familiarity with cloud infrastructure (AWS, GCP, Azure)
  • Strong understanding of machine learning concepts and experience with frameworks like TensorFlow, PyTorch, or Scikit-learn
  • Proficiency in Python and infrastructure automation tools
  • Knowledge of monitoring tools and CI/CD pipelines

SingleStore offers a range of benefits including a technology stipend, monthly cell phone and internet stipend, health and wellness benefits, flexible time off, and stock options. As a global company, benefits may vary by location.

Join SingleStore to define the future with The Database of Now™ and be part of a diverse and inclusive team committed to innovation in the data management space.

Last updated a month ago

Responsibilities For Senior AI Ops Engineer

  • AI-Powered Incident Management: Leverage AI/ML models to detect anomalies in real time, perform root cause analysis, and accelerate incident resolution
  • Predictive Maintenance: Implement machine learning algorithms to forecast system and hardware failures
  • End-to-End Automation: Design, build, and maintain AI-driven automation pipelines
  • Intelligent Resource Optimization: Utilize AI to analyze system usage patterns and make real-time decisions about resource allocation
  • Self-Healing Infrastructure: Develop and implement systems that automatically detect and correct issues
  • Dynamic Anomaly Detection: Build and refine models that dynamically learn from evolving system behavior
  • Enhanced Monitoring & Observability: Integrate AI-driven analytics into existing monitoring platforms
  • Collaborative Operations: Work closely with DevOps, SRE, and Data Science teams to implement AIOps solutions
  • Continuous Learning and Optimization: Regularly evaluate and improve AI/ML models based on operational feedback
  • Incident Post-Mortem Analysis: Automate the collection of data for incident post-mortem reports

Requirements For Senior AI Ops Engineer

Python · Kubernetes
  • 6+ years in DevOps, Infrastructure, or SRE roles with experience in automation and monitoring
  • Proven experience in applying AI/ML techniques to operational problems
  • Familiarity with cloud infrastructure (AWS, GCP, Azure)
  • Strong understanding of machine learning concepts and experience with frameworks like TensorFlow, PyTorch, or Scikit-learn
  • Proficiency in Python, with experience in developing automation scripts
  • Hands-on experience with infrastructure automation tools (Terraform, Ansible, etc.)
  • Knowledge of monitoring tools such as Prometheus, Grafana, or Datadog, and experience integrating AI for predictive analytics
  • Solid understanding of CI/CD pipelines and integrating AI-driven insights into these processes

Benefits For Senior AI Ops Engineer

Equity
  • Technology Stipend for New Employees
  • Monthly Cell Phone and Internet Stipend
  • Health and Wellness benefits
  • Company and team events
  • Flexible time off
  • Volunteer time off
  • Stock Options
