Principal/Architect- Availability Engineering & SRE

A leading cloud-based software company providing customer relationship management and enterprise solutions.
$223,000 - $372,900
Site Reliability
Principal Software Engineer
In-Person
15+ years of experience
Enterprise SaaS

Description For Principal/Architect- Availability Engineering & SRE

Salesforce is seeking a Principal/Architect for its Site Reliability Engineering (SRE) team to help build and run large-scale, massively distributed, fault-tolerant systems. The role combines software and systems engineering to ensure Salesforce services maintain reliability, capacity, performance, and availability. It comes with challenges of scale specific to Salesforce and focuses on enabling service owners to operate safely at scale through observability frameworks, system optimization, and infrastructure design.

The SRE practice at Salesforce is still evolving, and this role will significantly shape its technical strategy and influence the Availability Cloud strategy. The ideal candidate will embed with product teams, define availability roadmaps, and deliver against them while mentoring and developing other engineers. Success is measured by scaling the impact and delivery of the SRE community. The position offers the chance to work with diverse teams, tackle complex problems, and contribute to a blame-free environment that encourages innovation and growth. It requires extensive experience in distributed systems and technical leadership, and a strong background in service reliability engineering practices.

Last updated 8 days ago

Responsibilities For Principal/Architect- Availability Engineering & SRE

  • Spearhead and enable the culture of Service Ownership
  • Engage in and improve the whole lifecycle of services
  • Support services before they go live through system design consulting
  • Develop full paved-path observability platform integrations
  • Scale systems sustainably through automation
  • Practice sustainable incident response and blameless postmortems
  • Spend at least 25% of the time on hands-on coding
  • Develop and grow engineering talent

Requirements For Principal/Architect- Availability Engineering & SRE

Java, Python, Kubernetes, Go
  • 15+ years of software development and engineering experience, 5+ years in technical leadership
  • Experience designing, building and operating large scale distributed systems
  • Experience leading initiatives spanning multiple teams
  • Ability to effectively collaborate across multiple teams
  • Experience mentoring and developing engineers
  • Mastery of object-oriented languages (Java, Golang, Python, C++, C)
  • Experience with Kubernetes, Istio, and public cloud (AWS)
  • Deep experience with core web technologies: HTTP, JSON, REST, XML
  • Experience owning and operating critical services
  • Expertise in service ownership and SLO/SLI/SLA definition
  • Knowledge of Agile development methodology
  • Experience in fault modeling, chaos engineering, and load testing
