Senior AI Ops Engineer

SingleStore is one platform for all data, built so you can engage with insight in every moment. It enables enterprises to adapt to change as it happens, embrace diverse data with ease, and accelerate the pace of innovation.
San Francisco, CA, USAPortland, OR, USASeattle, WA, USA
Machine Learning
Senior Software Engineer
Contact Company
6+ years of experience
AI · Enterprise SaaS

Description For Senior AI Ops Engineer

As a Senior AI Ops Engineer at SingleStore, you will be responsible for integrating AI and machine learning into our infrastructure and operations processes. You'll work closely with Infrastructure, DevOps, and Data Science teams to design, develop, and implement AI-driven automation and monitoring solutions. Your role involves leveraging AI for incident management, predictive maintenance, end-to-end automation, resource optimization, and building self-healing infrastructure. You'll also focus on dynamic anomaly detection, enhanced monitoring & observability, and continuous learning and optimization of AI/ML models.

Key responsibilities include:

  1. AI-Powered Incident Management
  2. Predictive Maintenance
  3. End-to-End Automation
  4. Intelligent Resource Optimization
  5. Self-Healing Infrastructure
  6. Dynamic Anomaly Detection
  7. Enhanced Monitoring & Observability
  8. Collaborative Operations
  9. Continuous Learning and Optimization
  10. Incident Post-Mortem Analysis

Requirements:

  • 6+ years in DevOps, Infrastructure, or SRE roles with automation and monitoring experience
  • Proven experience applying AI/ML techniques to operational problems
  • Familiarity with cloud infrastructure (AWS, GCP, Azure)
  • Strong understanding of machine learning concepts and experience with frameworks like TensorFlow, PyTorch, or Scikit-learn
  • Proficiency in Python and infrastructure automation tools
  • Knowledge of monitoring tools and CI/CD pipelines

SingleStore offers a range of benefits including a technology stipend, monthly cell phone and internet stipend, health and wellness benefits, flexible time off, and stock options. As a global company, benefits may vary by location.

Join SingleStore to define the future with The Database of Now™ and be part of a diverse and inclusive team committed to innovation in the data management space.

Last updated 5 months ago

Responsibilities For Senior AI Ops Engineer

  • AI-Powered Incident Management: Leverage AI/ML models to detect anomalies in real time, perform root cause analysis, and accelerate incident resolution
  • Predictive Maintenance: Implement machine learning algorithms to forecast system and hardware failures
  • End-to-End Automation: Design, build, and maintain AI-driven automation pipelines
  • Intelligent Resource Optimization: Utilize AI to analyze system usage patterns and make real-time decisions about resource allocation
  • Self-Healing Infrastructure: Develop and implement systems that automatically detect and correct issues
  • Dynamic Anomaly Detection: Build and refine models that dynamically learn from evolving system behavior
  • Enhanced Monitoring & Observability: Integrate AI-driven analytics into existing monitoring platforms
  • Collaborative Operations: Work closely with DevOps, SRE, and Data Science teams to implement AIOps solutions
  • Continuous Learning and Optimization: Regularly evaluate and improve AI/ML models based on operational feedback
  • Incident Post-Mortem Analysis: Automate the collection of data for incident post-mortem reports

Requirements For Senior AI Ops Engineer

Python
Kubernetes
  • 6+ years in DevOps, Infrastructure, or SRE roles with experience in automation and monitoring
  • Proven experience in applying AI/ML techniques to operational problems
  • Familiarity with cloud infrastructure (AWS, GCP, Azure)
  • Strong understanding of machine learning concepts and experience with frameworks like TensorFlow, PyTorch, or Scikit-learn
  • Proficiency in Python, with experience in developing automation scripts
  • Hands-on experience with infrastructure automation tools (Terraform, Ansible, etc.)
  • Knowledge of monitoring tools such as Prometheus, Grafana, or Datadog, and experience integrating AI for predictive analytics
  • Solid understanding of CI/CD pipelines and integrating AI-driven insights into these processes

Benefits For Senior AI Ops Engineer

Equity
  • Technology Stipend for New Employees
  • Monthly Cell Phone and Internet Stipend
  • Health and Wellness benefit
  • Company and team events
  • Flexible time off
  • Volunteer time off
  • Stock Options

Interested in this job?

Jobs Related To SingleStore Senior AI Ops Engineer

Senior Software Development Engineer, Ring & Blink AI

Senior Software Engineer position at Amazon's Ring & Blink AI team focusing on computer vision and machine learning software development for smart home devices.

Senior Software Developer, Amazon Games AI

Senior Software Developer position at Amazon Games focusing on implementing ML, RL, and Generative AI techniques for game development.

Product Development Engineer, Annapurna Labs Silicon Operations

Senior Product Development Engineer role at AWS-Annapurna Labs focusing on silicon yield optimization for machine learning accelerator servers.

Sr. ES Product Manager

Lead AI and Agentforce Product Manager role at Salesforce, focusing on Employee Success products and solutions with 5+ years of product management experience required.

Senior Technical Consultant- AI

Senior Technical Consultant role specializing in AI solutions development using Salesforce Einstein, requiring 6+ years of Salesforce experience and strong AI/ML expertise.