Senior Site Reliability Engineer

The cloud-native, operational database built for speed and scale.
Site Reliability
Senior Software Engineer
Remote
101 - 500 Employees
3+ years of experience

Description For Senior Site Reliability Engineer

MemSQL is seeking a Senior Site Reliability Engineer to help drive our Kubernetes product strategy surrounding our managed service. You will be at the forefront; crafting the design, building out the collaborated vision, and sustaining your envisioned product strategy.

This role will be an integral part of building our managed service product line, and influencing future direction of the organization. As a technical leader in the space you will collaborate with the entire engineering team guiding decisions critical to both team and company success.

Role and Responsibilities:

  • Help MemSQL craft its production container orchestration strategy.
  • Design, build, and run elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.
  • Experience designing systems for peak reliability, scalability, and performance.
  • Efficiently operate within a data center environment; monitoring performance and health of hardware and software, installing new servers, and upgrading as needed
  • Participate in a SLA-driven on-call rotation, which will include after-hours, weekend, and rotating holiday participation.

Required Skills and Experience:

  • Expert-level knowledge of Kubernetes and the container ecosystem.
  • Strong working knowledge of configuration management tools such as Ansible and Puppet.
  • Experience with Unix/Linux operating systems internals and administration (e.g., filesystems, inodes, system calls) and networking (e.g., TCP/IP, routing, network topologies and hardware, SDN) and a keen interest in relational databases.
  • Familiar with at least one of AWS, Azure, or Google Cloud.
  • Experience debugging, diagnosing and troubleshooting complex, production software.
  • C, Python, POSIX shell programming experience required. Experience with C++ / Go are a strong plus.
  • Familiarity with JunOS, routing protocols (BGP), IPSec and Ceph storage a plus.
  • B.S. Degree in Computer Science or related field

About SingleStore: MemSQL is The No-Limits DatabaseTM, powering modern applications and analytical systems with a cloud-native, massively scalable architecture for maximum ingest and query performance at the highest concurrency. MemSQL envisions a world where every business can make decisions in real-time and every experience is optimized through data. Global enterprises use the MemSQL distributed database to easily ingest, process, analyze, and act on data in order to thrive in today's insight-driven economy. MemSQL is optimized to run on any public cloud or on-premises with commodity hardware.

Last updated 6 months ago

Responsibilities For Senior Site Reliability Engineer

  • Help MemSQL craft its production container orchestration strategy
  • Design, build, and run elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments
  • Design systems for peak reliability, scalability, and performance
  • Efficiently operate within a data center environment
  • Participate in a SLA-driven on-call rotation

Requirements For Senior Site Reliability Engineer

Kubernetes
Linux
Python
Go
  • Expert-level knowledge of Kubernetes and the container ecosystem
  • Strong working knowledge of configuration management tools such as Ansible and Puppet
  • Experience with Unix/Linux operating systems internals and administration
  • Familiar with at least one of AWS, Azure, or Google Cloud
  • Experience debugging, diagnosing and troubleshooting complex, production software
  • C, Python, POSIX shell programming experience required
  • B.S. Degree in Computer Science or related field

Interested in this job?

Jobs Related To SingleStore Senior Site Reliability Engineer

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and growth opportunities.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior Site Reliability Engineer position at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering, Data Cloud

Senior Site Reliability Engineer role at Google, focusing on building AI-powered infrastructure and maintaining large-scale distributed systems for Google Cloud Platform.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 5+ years of software development experience.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.