Salesforce is seeking a Site Reliability Engineer for their GovCloud 24x7 team. This role is part of the GovCloud Incident Response (GIR) team, which maintains the current infrastructure with daily alert response, smart hands, and incident management. The ideal candidate must be a U.S. Citizen operating on U.S. Soil with the ability to meet customer and government screening standards.
Key responsibilities include:
The role requires working on a 24/7 team with rotating day and night shifts and participating in an on-call rotation. Candidates should have expertise in TCP/IP technologies, Unix variants (especially Linux and Solaris), monitoring security systems, and incident management. Experience with AWS/C2S infrastructure, scripting languages, and ITIL service operations is essential.
Preferred qualifications include experience with Chef/Puppet, Jenkins/Bamboo/Spinnaker, Java applications, Kubernetes, and certifications in Linux+, RedHat, and AWS. Familiarity with Agile and DevOps processes, as well as experience in resilience engineering and post-incident investigations, is highly valued.
This challenging role offers the opportunity to work with cutting-edge technologies in a dynamic, high-stakes environment, supporting critical government cloud infrastructure. Join Salesforce's GovCloud team to make a significant impact on the reliability and performance of essential services.