Lead Site Reliability Engineer - High Performance Compute

JPMorganChase is one of the oldest financial institutions offering innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands.
Jersey City, NJ, USA
$152,000 - $215,000
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
5+ years of experience
Finance

Description For Lead Site Reliability Engineer - High Performance Compute

As a Lead Site Reliability Engineer at JPMorgan Chase within the Markets Engineering & Architecture team, you will be responsible for solving complex business problems through innovative solutions. This role combines deep technical expertise with leadership responsibilities, focusing on high-performance computing infrastructure and cloud automation.

You will guide the design and implementation of deployment approaches using automated CI/CD pipelines, while ensuring the highest standards of availability, reliability, and scalability. The position requires expertise in infrastructure as code, cloud technologies (particularly AWS), and modern observability practices.

Working within the Commercial & Investment Bank division, you'll collaborate with global teams to deliver strategic technical solutions. Your responsibilities will include leading incident response efforts, conducting architecture resiliency reviews, and implementing best practices for site reliability engineering.

The ideal candidate brings 5+ years of SRE experience, strong programming skills in languages like Python, Java, or Golang, and extensive knowledge of cloud platforms and container orchestration. You'll work with cutting-edge technologies including Kubernetes, Terraform, and various monitoring tools while contributing to one of the world's largest financial institutions.

JPMorgan Chase offers a comprehensive benefits package including competitive salary, health coverage, retirement plans, and continuous learning opportunities. The role is based in Jersey City, NJ, offering the opportunity to work with enterprise-scale systems in a collaborative, innovation-focused environment.

This position represents an excellent opportunity for an experienced SRE professional to make a significant impact on critical financial technology infrastructure while working with a diverse, global team of experts. You'll be instrumental in shaping the reliability and performance of systems that process millions of transactions daily.

Last updated a month ago

Responsibilities For Lead Site Reliability Engineer - High Performance Compute

  • Guide and assist others in building appropriate level designs and gaining consensus from peers
  • Collaborate with software engineers to implement deployment approaches using automated CI/CD pipelines
  • Design, develop, test, and implement availability, reliability, scalability solutions
  • Implement infrastructure, configuration, and network as code
  • Lead incident response efforts as a subject matter expert
  • Build and maintain standard infrastructure as code modules
  • Participate in architecture resiliency reviews

Requirements For Lead Site Reliability Engineer - High Performance Compute

Python
Java
Go
Kubernetes
Linux
  • 5+ years of applied experience in site reliability engineering
  • Experience in contributing to the reliability of production applications
  • 5+ years experience in Python, Java/Spring Boot, or Golang
  • 5+ years experience in observability practices
  • Experience with CI/CD tools like Jenkins, GitLab, and Spinnaker
  • Experience with container orchestration technologies
  • Experience with AWS and cloud automation tools

Benefits For Lead Site Reliability Engineer - High Performance Compute

Medical Insurance
Dental Insurance
Vision Insurance
401k
Mental Health Assistance
Education Budget
  • Competitive base salary
  • Health care coverage
  • On-site health and wellness centers
  • Retirement savings plan
  • Backup childcare
  • Tuition reimbursement
  • Mental health support
  • Financial coaching

Interested in this job?

Jobs Related To JPMorgan Chase Lead Site Reliability Engineer - High Performance Compute

Lead Site Reliability Engineer- Azure Cloud enablement

Lead Site Reliability Engineer position at JPMorgan Chase focusing on Azure cloud enablement, system reliability, and technical leadership, offering competitive compensation and benefits.

SRE - Lead Software Engineer

Lead Site Reliability Engineer position at JPMorgan Chase focusing on Kubernetes, cloud platforms, and scalable infrastructure automation.

Lead Site Reliability Engineer - Azure SRE/DevOps - Neovest - Athens

Lead Site Reliability Engineer position at JPMorgan Chase in Athens, focusing on Azure DevOps and SRE practices, offering competitive benefits and leadership opportunities.

Lead Site Reliability Engineer

Lead Site Reliability Engineer position at JPMorgan Chase, focusing on system reliability, technical leadership, and infrastructure optimization in financial technology.

Lead Site Reliability Engineer

Lead Site Reliability Engineer position at JPMorgan Chase, focusing on implementing SRE practices and managing critical payment systems infrastructure.