Staff Site Reliability Engineer

Global platform for digital assets and Web3, securing over 20% of world's crypto assets through Ledger Nanos.
Site Reliability
Staff Software Engineer
Hybrid
501 - 1,000 Employees
8+ years of experience
Blockchain · Cybersecurity

Description For Staff Site Reliability Engineer

Ledger, a leading global platform for digital assets and Web3, is seeking a Staff Site Reliability Engineer to join their growing team. With over 700 professionals and offices across multiple countries, Ledger secures 20% of the world's crypto assets through their hardware wallets.

The ideal candidate will bring 8+ years of cloud engineering experience and deep expertise in SRE practices. You'll be instrumental in driving technology transformation, building robust platforms, and implementing automation solutions. Working with cutting-edge technologies like Kubernetes, AWS, and Python, you'll ensure application availability and full-stack observability.

Key responsibilities include building DevOps culture, managing SRE team roadmap, improving system scalability, and implementing best practices for service level objectives. You'll also handle incident management, conduct performance testing, and drive the adoption of self-healing patterns.

The role offers comprehensive benefits including equity participation, flexible hybrid work arrangements, extensive health coverage, and professional development opportunities. Join a company that values pragmatism, audacity, and transparency while working on revolutionary digital asset security solutions.

This position is perfect for an experienced engineer who enjoys solving complex problems at scale and wants to make a significant impact in the cryptocurrency and digital assets space. You'll be part of a team that's shaping the future of digital asset security and Web3 technology.

Last updated a month ago

Responsibilities For Staff Site Reliability Engineer

  • Build DevOps / SRE culture and enable transition to modern infrastructure
  • Build SRE team roadmap and anticipate stakeholder needs
  • Perform integration of platform software components
  • Design and deliver solutions to improve system availability and scalability
  • Create standards & best practices for service level objectives
  • Automate key SRE metrics including SLOs/SLAs and error budgets
  • Provide expert support and troubleshoot priority incidents
  • Conduct performance tests and identify optimization opportunities

Requirements For Staff Site Reliability Engineer

Python
Linux
Kubernetes
  • 8+ years on cloud engineering at scale, on organizations operating SaaS solutions
  • Proficiency in Unix/Linux environments, Git, Python, Terraform, Kubernetes, AWS cloud solutions
  • Strong knowledge on observability practices
  • Experience of cross-functional work
  • Customer focused mindset
  • Creative problem-solving and analysis skills
  • Excellent presentation and written communication
  • Engineering degree

Benefits For Staff Site Reliability Engineer

Equity
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
  • Equity participation through stock options
  • Hybrid work policy
  • Annual company outing and social events
  • Comprehensive health insurance (medical, dental, vision)
  • Personal development and coaching
  • Five weeks paid leave plus holidays and RTT days
  • High performance office equipment including Apple products
  • Transportation reimbursement
  • Employee discount on products

Interested in this job?

Jobs Related To Ledger Staff Site Reliability Engineer

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, developing and maintaining tools for service reliability at scale.

Sr Staff Software Engineer, Reliability Engineering

Senior Staff SRE position at Airbnb focusing on reliability architecture, incident management, and technical leadership, offering competitive compensation and remote work flexibility.

Site Reliability Engineering II

Senior Site Reliability Engineer position at Microsoft focusing on identity and security engineering, requiring 5+ years of experience in identity technologies and security infrastructure.

Site Reliability Manager, Core Enterprise Systems

Lead a team of SRE engineers at Google, managing enterprise services and driving reliability improvements across critical internal systems.

Technical Program Manager III, Site Reliability, Storage

Technical Program Manager III position at Google, leading Storage Site Reliability Engineering initiatives and cross-functional programs.