Site Reliability Engineer

Founded in 2013, Pendo helps product managers understand and improve product success, backed by top-tier investors like Battery Ventures and Salesforce Ventures.
$80,000 - $86,250
Site Reliability
Senior Software Engineer
In-Person
501 - 1,000 Employees
5+ years of experience
Enterprise SaaS

Description For Site Reliability Engineer

Pendo, a fast-growing startup founded in 2013, is seeking a Site Reliability Engineer to join their dynamic team. The SRE team is crucial in maintaining and scaling their cloud infrastructure, which processes over 15 billion events daily. Built on Google Kubernetes Engine (GKE), the platform utilizes various Google technologies and other vendor services.

The role combines development and operations, requiring expertise in infrastructure-as-code, cloud technologies, and programming. You'll work closely with developers and product managers to ensure service reliability, performance, and cost-efficiency. The position involves both development support through CI/CD pipelines and production support through on-call duties and incident management.

Key responsibilities include automating infrastructure deployment, ensuring system reliability, collaborating on service design, and maintaining security compliance. The ideal candidate will have strong technical skills in Go or Python, experience with cloud infrastructure tools, and a deep understanding of distributed systems.

Pendo offers a competitive salary range of £64,000 - £69,000 for their Sheffield, UK location, along with the opportunity to work with cutting-edge technologies. The company is backed by prestigious investors like Battery Ventures and Salesforce Ventures, and maintains a culture focused on passion, innovation, and inclusivity.

This role is perfect for experienced SREs who want to impact a rapidly growing platform while working with modern cloud technologies and contributing to a product that improves society's experience with software.

Last updated 2 months ago

Responsibilities For Site Reliability Engineer

  • Write high-quality infrastructure-as-code for provisioning, deployment, scaling, and monitoring
  • Write maintainable code focusing on operations, scale, resiliency, and monitoring
  • Ensure new services are well-designed with proper monitoring and SLOs
  • Debug and mitigate production issues
  • Maintain and automate runbooks
  • Track capacity, quotas, and performance limits
  • Participate in 24x7 on-call rotation

Requirements For Site Reliability Engineer

Go
Python
Kubernetes
  • Bachelor's Degree in Computer Science or related technical field
  • Minimum of 5 years of professional technical experience
  • Experience with cloud infrastructure using Ansible or Terraform
  • Strong programming skills in Go or Python
  • System thinking abilities regarding failure modes and bottlenecks
  • Good understanding of performance analysis and operational metrics
  • Experience as Site Reliability Engineer or DevOps Engineer preferred
  • Experience with distributed systems preferred
  • Experience with Kubernetes in production preferred

Interested in this job?

Jobs Related To Pendo Site Reliability Engineer

Site Reliability Engineer

Senior Site Reliability Engineer role at AION, building and maintaining infrastructure for a decentralized AI cloud platform with focus on automation and reliability.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, SRE, Cloud Incident Response

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.

Senior Software Engineer, Site Reliability Engineering

Senior Site Reliability Engineering role at Google, focusing on building and maintaining large-scale distributed systems for Google Cloud services.