Principal/Architect- Software Engineering - Availability

Global leader in CRM and enterprise cloud solutions, providing software and systems for customer relationship management.
$211,500 - $384,100
Site Reliability
Principal Software Engineer
In-Person
15+ years of experience
Enterprise SaaS

Description For Principal/Architect- Software Engineering - Availability

Salesforce is seeking a Principal/Architect Software Engineer for their Site Reliability Engineering (SRE) team. This role combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. The position focuses on ensuring Salesforce services maintain reliability, capacity, performance, and availability to meet customer needs.

The role involves managing complex challenges unique to Salesforce's scale while utilizing expertise in coding, algorithms, complexity analysis, and large-scale system design. The SRE team promotes a culture of diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll have the opportunity to shape technical strategy for SRE and influence the Availability Cloud's direction.

As a Principal/Architect, you'll embed with product teams, define availability roadmaps, and deliver against them. Key responsibilities include developing observability platforms, scaling systems through automation, and practicing sustainable incident response. The role requires strong leadership skills to mentor and develop other engineers, with success measured by scaling the impact of your community.

The ideal candidate brings 15+ years of software development experience, deep expertise in distributed systems, and a track record of leading cross-team initiatives. You'll need mastery of object-oriented programming, experience with cloud technologies, and a thorough understanding of service ownership best practices. This position offers the chance to work on meaningful projects while receiving support and mentorship to continue learning and growing.

Last updated 7 days ago

Responsibilities For Principal/Architect- Software Engineering - Availability

  • Spearhead and enable the culture of Service Ownership
  • Engage in and improve the whole lifecycle of services
  • Support services before they go live through system design consulting
  • Develop full paved path observability platform integrations
  • Scale systems sustainably through automation
  • Practice sustainable incident response and blameless post mortems
  • Hands on coding at least 25%
  • Develop and grow the engineering talent

Requirements For Principal/Architect- Software Engineering - Availability

Java
Go
Python
Kubernetes
  • 15+ years of software development and engineering experience
  • Experience designing, building and operating large scale distributed systems
  • Experience leading initiatives spanning multiple teams
  • Ability to effectively collaborate across multiple teams
  • Experience mentoring and developing engineers
  • Mastery of object oriented languages (Java, Golang, Python, C++, C)
  • Experience in Kubernetes, Istio, Public Cloud (AWS)
  • Deep experience with core web technologies
  • Experience owning and operating critical services
  • Subject matter expertise on Service ownership best practices
  • Knowledge of Agile development methodology
  • Experience in fault modeling and tolerance, chaos engineering

Interested in this job?

Jobs Related To Salesforce Principal/Architect- Software Engineering - Availability

Systems Engineering Principal

Principal Engineer role leading reliability engineering and post-incident analysis at Salesforce, driving systemic improvements across cloud platforms.

Principal/Architect- Availability Engineering & SRE

Principal/Architect role leading Salesforce's Site Reliability Engineering team, focusing on large-scale distributed systems and technical strategy.

VP, Software Engineering, SRE

Lead Salesforce's SRE organization as VP, driving reliability innovation and cultural transformation while managing a global team of 100+ engineers.

Principal Engineer, Core Networking Site Reliability

Lead technical role responsible for Google's core network infrastructure, combining deep networking expertise with strategic leadership to ensure reliable operation of global systems.

Principal Engineer, AI, Trust, Security, Site Reliability Engineering

Lead technical initiatives in AI, Trust, and Security for Google's Site Reliability Engineering organization, architecting and implementing large-scale distributed systems.