Principal Software Engineering - Availability

Salesforce is a cloud-based software company that provides customer relationship management services and enterprise solutions.
$211,500 - $384,100
Site Reliability
Principal Software Engineer
In-Person
5,000+ Employees
15+ years of experience
Enterprise SaaS

Description For Principal Software Engineering - Availability

Site Reliability Engineering (SRE) at Salesforce combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. This principal role will shape the technical strategy for SRE and influence the Availability Cloud strategy. The position involves embedding with product teams, defining availability roadmaps, and delivering against them while mentoring other engineers.

The role focuses on enabling service owners to operate at scale through observability frameworks, system optimization, and infrastructure design. You'll tackle complex scaling challenges unique to Salesforce while utilizing expertise in coding, algorithms, and large-scale system design. The SRE team values diversity, intellectual curiosity, and problem-solving in a blame-free environment.

As a Principal Engineer, you'll lead technical initiatives, uncover themes, design solutions, and implement improvements to enhance service reliability. The position requires hands-on coding (minimum 25%) and collaboration with cross-functional teams. You'll need to challenge the status quo, communicate effectively, and influence through data-driven insights.

The ideal candidate brings 15+ years of software development experience, deep expertise in distributed systems, and a track record of leading multi-team initiatives. You should be passionate about mentoring others, have mastery of object-oriented programming, and extensive experience with cloud technologies and service ownership practices.

Join Salesforce to shape the future of enterprise reliability while working with cutting-edge technologies and talented engineers in a collaborative, growth-oriented environment.

Last updated 2 days ago

Responsibilities For Principal Software Engineering - Availability

  • Spearhead and enable the culture of Service Ownership
  • Engage in and improve the whole lifecycle of services
  • Support services before they go live through system design consulting
  • Develop full paved path observability platform integrations
  • Scale systems sustainably through automation
  • Practice sustainable incident response and blameless post mortems
  • Hands on coding at least 25%
  • Develop and grow the engineering talent

Requirements For Principal Software Engineering - Availability

Java
Go
Python
Kubernetes
  • 15+ years of software development and engineering experience
  • Experience designing, building and operating large scale distributed systems
  • Experience leading initiatives spanning multiple teams
  • Ability to effectively collaborate across multiple teams
  • Experience mentoring and developing engineers
  • Mastery of object oriented languages (Java, Golang, Python, C++, C)
  • Experience in Kubernetes, Istio, Public Cloud (AWS)
  • Deep experience with core web technologies
  • Experience owning and operating critical services
  • Expertise in Service ownership best practices
  • Knowledge of Agile development methodology
  • Experience in fault modeling and chaos engineering

Interested in this job?

Jobs Related To Salesforce Principal Software Engineering - Availability

Principal/Architect- Software Engineering - Availability

Principal Software Engineer role at Salesforce focusing on Site Reliability Engineering, requiring 15+ years of experience in building large-scale distributed systems.

Principal Engineer, AI, Trust, Security, Site Reliability Engineering

Principal Engineer position at Google focusing on AI, security, and site reliability engineering, leading technical initiatives for cloud platform infrastructure.

Director, Software Engineering, Site Reliability

Lead LinkedIn's Site Reliability Engineering team of 40+ engineers, driving infrastructure reliability and innovation for the world's largest professional network.

Principal Site Reliability Development Engineer

Principal SRE role at Oracle Cloud Infrastructure focusing on sovereign cloud operations and automation for government systems in Singapore.

Principal/Architect- Software Engineering - Availability

Principal Software Engineer role at Salesforce focusing on Site Reliability Engineering, requiring 15+ years of experience in building large-scale distributed systems.