Salesforce is seeking a Principal/Architect Software Engineer for its Site Reliability Engineering (SRE) team. This role combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. The position focuses on ensuring that Salesforce services have the reliability, capacity, performance, and availability to meet customer needs.
The role involves tackling complex challenges unique to Salesforce's scale, drawing on expertise in coding, algorithms, complexity analysis, and large-scale system design. The SRE team promotes a culture of diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll have the opportunity to shape technical strategy for SRE and influence the Availability Cloud's direction.
As a Principal/Architect, you'll embed with product teams, define availability roadmaps, and deliver against them. Key responsibilities include developing observability platforms, scaling systems through automation, and practicing sustainable incident response. The role requires strong leadership skills to mentor and develop other engineers, and success is measured by how well you scale the impact of your community.
The ideal candidate brings 15+ years of software development experience, deep expertise in distributed systems, and a track record of leading cross-team initiatives. You'll need mastery of object-oriented programming, experience with cloud technologies, and a thorough understanding of service ownership best practices. This position offers the chance to work on meaningful projects while receiving support and mentorship to continue learning and growing.