Pure Storage is seeking a Site Reliability Engineer to join their Infrastructure Shared Service (ISS) team in Bengaluru, India. As an SRE, you'll work on improving the reliability and performance of Pure Storage's critical infrastructure applications. You'll be responsible for setting and owning SLO goals for uptime and latency, as well as helping colleagues leverage available features and workflows. The role involves working with backend web servers, load balancers, and database servers to ensure they run smoothly.
Key responsibilities include:
- Engaging in the entire lifecycle of services from design to operation
- Designing, operating, and troubleshooting enterprise systems
- Establishing sustainable incident response and blameless postmortems
- Supporting services pre-launch through system design and capacity planning
- Scaling systems through automation and scripting
- Collaborating with development teams and stakeholders across time zones
- Ensuring hardware design meets business and technical requirements
- Maintaining documentation on system configurations and procedures
- Performing day-to-day server, storage, and network administration
- Deploying infrastructure manually and via automation platforms
- Troubleshooting and resolving hardware, software, and network issues
The ideal candidate should have:
- 5+ years of experience as an SRE, DevOps Engineer, or Infrastructure Engineer
- Strong programming skills in Python or other languages
- Experience with distributed systems, Linux environments, and VMware
- Familiarity with observability platforms like Elastic or DataDog
- Knowledge of Infrastructure as Code tools (Ansible, Terraform)
- Experience with containerization and cloud environments (AWS & Azure)
This role offers the opportunity to work on cutting-edge technology in a fast-paced environment, contributing to the success of a company that's revolutionizing data storage and management. Join Pure Storage to be part of building the future of data infrastructure.