Taro Logo

Senior Site Reliability Engineer

Cloud-native operational database company building scalable architecture for maximum performance and real-time data processing.
DevOps
Staff Software Engineer
Remote
101 - 500 Employees
3+ years of experience
Enterprise SaaS

Description For Senior Site Reliability Engineer

SingleStore (formerly MemSQL) is seeking a Senior Site Reliability Engineer to spearhead their Kubernetes product strategy for their managed service offering. This is a pivotal role that will shape the future of the company's cloud infrastructure and service delivery.

As a Senior SRE, you'll be responsible for designing and implementing container orchestration strategies across multiple cloud platforms. The role combines deep technical expertise in Kubernetes with broad infrastructure knowledge, requiring both strategic thinking and hands-on implementation skills.

Key responsibilities include:

  • Leading the development of production container orchestration strategy
  • Building and managing elastic Kubernetes clusters across multiple cloud providers (AWS, Azure, GCP) and on-premises environments
  • Designing highly reliable and scalable systems
  • Managing data center operations, including hardware/software monitoring and maintenance
  • Participating in SLA-driven on-call rotations

The ideal candidate brings:

  • Expert-level Kubernetes knowledge
  • Strong configuration management experience (Ansible, Puppet)
  • Deep understanding of Linux systems and networking
  • Multi-cloud platform experience
  • Strong programming skills in C, Python, and shell scripting
  • Experience troubleshooting complex production systems

SingleStore offers a dynamic environment working with cutting-edge technology, backed by top investors like GV and Accel Partners. The company serves major enterprises including Uber, Akamai, and Samsung, providing opportunities to work on large-scale, impactful systems.

This role offers the flexibility of remote work while being part of a team that's pushing the boundaries of database technology and cloud infrastructure. If you're passionate about building reliable, scalable systems and want to shape the future of cloud-native databases, this position offers an excellent opportunity to make a significant impact.

Last updated 12 days ago

Responsibilities For Senior Site Reliability Engineer

  • Help craft production container orchestration strategy
  • Design, build, and run elastic Kubernetes clusters across multiple environments
  • Design systems for peak reliability, scalability, and performance
  • Operate and monitor data center environment
  • Participate in SLA-driven on-call rotation

Requirements For Senior Site Reliability Engineer

Kubernetes
Linux
Python
Go
  • Expert-level knowledge of Kubernetes and container ecosystem
  • Strong working knowledge of Ansible and Puppet
  • Experience with Unix/Linux systems internals and administration
  • Familiar with AWS, Azure, or Google Cloud
  • Experience debugging complex production software
  • C, Python, POSIX shell programming experience
  • B.S. Degree in Computer Science or related field

Interested in this job?

Jobs Related To SingleStore Senior Site Reliability Engineer