Senior Site Reliability Engineer

Gorgias is the conversational AI platform for ecommerce that drives sales and resolves support inquiries, trusted by over 15,000 ecommerce brands.
Site Reliability
Senior Software Engineer
Hybrid
101 - 500 Employees
5+ years of experience
Enterprise SaaS · E-Commerce

Description For Senior Site Reliability Engineer

Gorgias, a leading conversational AI platform for ecommerce, is seeking a Senior Site Reliability Engineer to join their growing team. The company serves over 15,000 ecommerce brands and is built for Shopify with advanced ecommerce integrations. The SRE team, currently 4 members strong with plans to expand to 6, maintains core infrastructure handling billions of daily queries with sub-millisecond response times.

The role involves managing multi-TB PostgreSQL clusters, operating high-throughput message queues, and maintaining multiple GKE clusters worldwide. You'll work with cutting-edge technologies including Kafka, Kubernetes, and various cloud services. The team has achieved significant milestones like optimizing database performance, implementing sophisticated connection pooling, and securing SOC2 certification.

As a Senior SRE, you'll be responsible for ensuring system reliability, scalability, and performance. The position requires strong expertise in cloud-native systems, infrastructure as code, and DevOps practices. You'll collaborate with product-engineering teams to implement best practices for system reliability, security, and disaster recovery.

The company offers an attractive benefits package including generous vacation time, remote work flexibility, comprehensive health coverage, and substantial professional development resources. Gorgias maintains a strong commitment to diversity and inclusion, encouraging applications from candidates who might not meet every requirement but bring unique valuable perspectives.

Working at Gorgias means joining a company with strong financial backing (recently raised Series C-2 for $29M), excellent workplace ratings, and a culture focused on continuous learning and growth. The role offers an opportunity to work on challenging technical problems while contributing to the success of a rapidly growing ecommerce platform.

Last updated 2 days ago

Responsibilities For Senior Site Reliability Engineer

  • Manage multi-TB PostgreSQL clusters in the public cloud
  • Operate RabbitMQ and Redis with high throughput
  • Manage 10+ full featured GKE clusters worldwide
  • Adopt new stack of: Kafka, Debezium, Apache Flink
  • Facilitate rollout strategies at scale with Gitlab CI and ArgoCD
  • Roll out best practices around Kubernetes/Helm/Operators
  • Automate complex infrastructure pieces with TF, Python/Golang

Requirements For Senior Site Reliability Engineer

Python
Go
PostgreSQL
Kubernetes
Redis
Kafka
Linux
  • Bachelor's degree in Computer Science or equivalent work experience
  • 5+ years experience as a Site Reliability Engineer
  • Proficiency in using Kubernetes for container orchestration
  • 5+ years experience with Cloud Providers (AWS, GCP)
  • Proficient in scripting and programming languages
  • Comfortable and confident in Linux systems
  • Solid understanding of infrastructure as code (IaC)
  • Experience with CI/CD pipelines
  • Excellent problem-solving and troubleshooting skills
  • Strong communication and collaboration skills

Benefits For Senior Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
Education Budget
  • 5-week vacation plus 2 weeks RTT
  • Paid sick leave
  • 6 weeks full remote/year
  • Paid parental leave (16 weeks)
  • 50% of public transportation reimbursed
  • MacBook Pro provided
  • Personal lunch credit card (Swile)
  • Private health insurance (Alan)
  • Up to €700 for home workstation setup
  • Up to €2000 learning and wellness budget per year
  • Quarterly company-wide summits
  • Annual team and company retreats

Interested in this job?

Jobs Related To Gorgias Senior Site Reliability Engineer

Site Reliability Engineer, Health Software

Senior Site Reliability Engineer role at Apple's Health team, focusing on large-scale system maintenance, automation, and healthcare software reliability.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at DriveWealth, managing platform reliability for global financial trading systems with competitive compensation and benefits.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at DriveWealth, managing platform reliability for global financial trading systems with competitive compensation and benefits.

SR. SITE RELIABILITY ENGINEER (STARSHIELD)

Senior Site Reliability Engineer position at SpaceX working on Starshield program, requiring Top Secret clearance and expertise in cloud infrastructure and containerization.

Site Reliability Engineer

Senior Site Reliability Engineer position at sFOX, focusing on maintaining and scaling crypto trading infrastructure using Kubernetes and AWS.