Principal/Architect- Availability Engineering & SRE

Salesforce

A leading cloud-based software company providing customer relationship management and enterprise solutions.

San Francisco, CA, USA • Seattle, WA, USA

$211,500 - $384,100

Site Reliability

Principal Software Engineer

In-Person

5,000+ Employees

15+ years of experience

Enterprise SaaS

Description For Principal/Architect- Availability Engineering & SRE

Site Reliability Engineering (SRE) at Salesforce combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. This principal role will shape the technical strategy for SRE and influence the strategy for the Availability Cloud. The position offers unique challenges of scale specific to Salesforce, requiring expertise in coding, algorithms, complexity analysis, and large-scale system design.

The role involves embedding with product teams, defining availability roadmaps, and delivering against them. You'll be crucial in maturing the SRE practice, mentoring engineers, and scaling the impact of your community. The team culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment.

As a Principal/Architect, you'll work on meaningful projects while having the support and mentorship needed to learn and grow. You'll be responsible for developing full paved-path observability platform integrations, maintaining service health, and scaling systems through automation. The position requires hands-on coding for at least 25% of your time while leading and mentoring others.

The ideal candidate brings 15+ years of software development experience, with deep expertise in distributed systems, service ownership, and technical leadership. You'll work with technologies like Kubernetes, Istio, and public cloud platforms, applying your knowledge of core web technologies and service ownership best practices.

Join Salesforce's SRE team to tackle complex challenges, drive technical strategy, and shape the future of large-scale system reliability while working with a diverse, collaborative team in a supportive environment focused on continuous learning and growth.

Last updated 2 months ago

Responsibilities For Principal/Architect- Availability Engineering & SRE

Spearhead and enable the culture of Service Ownership
Engage in and improve the whole lifecycle of services
Support services before they go live through system design consulting
Develop full paved-path observability platform integrations
Scale systems sustainably through automation
Practice sustainable incident response and blameless postmortems
Hands-on coding for at least 25% of the time
Develop and grow engineering talent

Requirements For Principal/Architect- Availability Engineering & SRE

Java

Python

Kubernetes

15+ years of software development and engineering experience, 5+ years in a technical leadership role
Experience designing, building, and operating large-scale distributed systems
Experience leading initiatives spanning multiple teams
Ability to effectively collaborate across multiple teams
Experience mentoring and developing engineers
Mastery of object-oriented languages (Java, Golang, Python, C++, C)
Experience with Kubernetes, Istio, and public cloud (AWS)
Deep experience with core web technologies: HTTP, JSON, REST, XML
Experience owning and operating critical services
Expertise in service ownership and SLO/SLI/SLA definition
Knowledge of Agile development methodology
Experience in fault modeling, chaos engineering, and load testing
