Site Reliability Engineer

Pendo

Founded in 2013, Pendo helps product managers understand and improve product success, backed by top-tier investors like Battery Ventures and Salesforce Ventures.

London, UK

$80,000 - $86,250

Site Reliability

Senior Software Engineer

In-Person

501 - 1,000 Employees

5+ years of experience

Enterprise SaaS

Description For Site Reliability Engineer

Pendo, a fast-growing startup founded in 2013, is seeking a Site Reliability Engineer to join their dynamic team. The SRE team is crucial in maintaining and scaling their cloud infrastructure, which processes over 15 billion events daily. Built on Google Kubernetes Engine (GKE), the platform utilizes various Google technologies and other vendor services.

The role combines development and operations, requiring expertise in infrastructure-as-code, cloud technologies, and programming. You'll work closely with developers and product managers to ensure service reliability, performance, and cost-efficiency. The position involves both development support through CI/CD pipelines and production support through on-call duties and incident management.

Key responsibilities include automating infrastructure deployment, ensuring system reliability, collaborating on service design, and maintaining security compliance. The ideal candidate will have strong technical skills in Go or Python, experience with cloud infrastructure tools, and a deep understanding of distributed systems.

Pendo offers a competitive salary range of £64,000 - £69,000 for their Sheffield, UK location, along with the opportunity to work with cutting-edge technologies. The company is backed by prestigious investors like Battery Ventures and Salesforce Ventures, and maintains a culture focused on passion, innovation, and inclusivity.

This role is perfect for experienced SREs who want to impact a rapidly growing platform while working with modern cloud technologies and contributing to a product that improves society's experience with software.

Last updated 2 months ago

Responsibilities For Site Reliability Engineer

Write high-quality infrastructure-as-code for provisioning, deployment, scaling, and monitoring
Write maintainable code focusing on operations, scale, resiliency, and monitoring
Ensure new services are well-designed with proper monitoring and SLOs
Debug and mitigate production issues
Maintain and automate runbooks
Track capacity, quotas, and performance limits
Participate in 24x7 on-call rotation

Requirements For Site Reliability Engineer

Python

Kubernetes

Bachelor's Degree in Computer Science or related technical field
Minimum of 5 years of professional technical experience
Experience with cloud infrastructure using Ansible or Terraform
Strong programming skills in Go or Python
System thinking abilities regarding failure modes and bottlenecks
Good understanding of performance analysis and operational metrics
Experience as Site Reliability Engineer or DevOps Engineer preferred
Experience with distributed systems preferred
Experience with Kubernetes in production preferred

Pendo

Founded in 2013, Pendo helps product managers understand and improve product success, backed by top-tier investors like Battery Ventures and Salesforce Ventures.

London, UK

$80,000 - $86,250

Site Reliability

Senior Software Engineer

In-Person

501 - 1,000 Employees

5+ years of experience

Enterprise SaaS

Interested in this job?

Jobs Related To Pendo Site Reliability Engineer

Site Reliability Engineer

AION

Senior Site Reliability Engineer role at AION, building and maintaining infrastructure for a decentralized AI cloud platform with focus on automation and reliability.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Google

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Google

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, SRE, Cloud Incident Response

Google

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.

Senior Software Engineer, Site Reliability Engineering

Google

Senior Site Reliability Engineering role at Google, focusing on building and maintaining large-scale distributed systems for Google Cloud services.