Salesforce is seeking a Lead Software Engineer/DevOps to build and manage a multi-substrate Kubernetes and microservices platform powering Core CRM and various applications across Salesforce. This role involves managing one of the largest Kubernetes fleets in the industry, providing Kubernetes platform-as-a-service to other teams, and driving centralized SRE/DevOps functionality to enhance trust, availability, and resiliency of the K8s platform.
Key responsibilities include:
- Ensuring high availability of 1000+ clusters running technologies like Kubernetes, Docker, software load balancers, service mesh, and Istio
- Contributing code to drive improvements
- Implementing automation using Python/Golang/Puppet/Jenkins
- Enhancing CI/CD pipelines with Terraform, Spinnaker, and Argo
- Improving platform visibility through monitoring and metrics implementation (Prometheus, Grafana)
- Developing self-healing mechanisms to reduce manual labor
- Collaborating with various infrastructure teams across Salesforce
- Evaluating new technologies to solve problems
Requirements:
- 7+ years of experience in SRE/DevOps/Systems Engineering roles
- Experience with large-scale distributed systems, especially in cloud environments
- Strong Linux systems administration and troubleshooting skills
- Expertise in Kubernetes, Docker, Container Orchestration, and Service Mesh
- Proficiency in scripting/programming languages like Python and Golang
- Knowledge of networking protocols and components
- Experience with observability tools (Prometheus, Grafana, Splunk, ElasticSearch)
- Familiarity with DevOps tools (Puppet, Chef, Ansible)
- Experience with AWS, Terraform, Spinnaker, ArgoCD
- Excellent problem-solving, analytical, and communication skills
This role offers the opportunity to work with cutting-edge technologies, solve complex problems, and contribute to the development of a critical platform that powers Salesforce's services.