Principal Software Engineer, Site Reliability Engineering (SRE)

Global leader in CRM and enterprise cloud solutions, providing software services to businesses worldwide.
Salary: $223,000 - $323,400
Category: Site Reliability
Level: Principal Software Engineer
Work arrangement: In-Person
Experience: 15+ years
Industry: Enterprise SaaS

Description For Principal Software Engineer, Site Reliability Engineering (SRE)

Site Reliability Engineering (SRE) at Salesforce represents a unique fusion of software and systems engineering, focused on building and maintaining large-scale, fault-tolerant systems. As a Principal SRE, you'll be at the forefront of ensuring system reliability and optimal uptime for both internal and external systems. The role involves working with product teams to deploy customer-facing software across a multi-substrate engineering platform that serves tens of millions of users daily.

You'll be tackling complex challenges of scale, leveraging your extensive coding skills, algorithm knowledge, and system design expertise. The position requires deep experience with concurrency, large-scale systems, and data management challenges. You'll be responsible for crafting highly available solutions and optimizing core back-end systems.

The role offers the opportunity to work with cutting-edge technologies including cloud platforms (AWS, GCP, Azure), containerization frameworks (Kubernetes, Docker, Mesos), and various open-source technologies. You'll be contributing to critical infrastructure services, implementing monitoring solutions, and driving proactive operational awareness.

As a Principal Engineer, you'll lead initiatives to improve system availability and resilience, mentor team members, and participate in architectural decisions that impact the entire platform. The position offers competitive compensation, with salaries ranging from $223,000 to $323,400 in California, reflecting the senior nature of the role.

This is an ideal opportunity for experienced engineers who are passionate about system reliability, enjoy solving complex technical challenges, and want to make a significant impact at a company that powers digital transformation for businesses worldwide.

Last updated 20 days ago

Responsibilities For Principal Software Engineer, Site Reliability Engineering (SRE)

  • Embed with delivery teams in a lead capacity, focusing on corrective and proactive availability measures
  • Design, develop, debug, and operate resilient applications across distributed systems
  • Champion resiliency best practices and observability tool integration
  • Develop using object-oriented programming languages (up to 50%)
  • Contribute to open source technology
  • Develop Infrastructure-as-Code using Terraform
  • Build/integrate with APIs and microservices
  • Resolve complex technical issues
  • Participate in on-call rotation
  • Balance live runtime management, feature delivery, and technical debt retirement

Requirements For Principal Software Engineer, Site Reliability Engineering (SRE)

  • Key skills: Java, Python, Kubernetes
  • Related technical degree required (master's preferred)
  • 15+ years of hands-on software development experience
  • 5+ years in a Tech Lead, Principal, or Architect capacity
  • Mastery of object-oriented languages (Java, Go, Apex, Python)
  • Deep experience with core web technologies: HTTP, JSON, REST, XML
  • Proficiency with databases including Oracle or other relational/NoSQL solutions
  • Experience owning and operating multiple instances of critical services
  • Expertise in service ownership best practices and SLO/SLI/SLA definition
  • Thorough knowledge of Agile development methodology
  • Experience with Test-Driven and Behavior-Driven Development (TDD/BDD) practices
