Lead Site Reliability Engineer

Corporate wellness platform connecting employees to fitness, mindfulness, therapy, nutrition, and sleep partners through a single subscription.
Site Reliability
Staff Software Engineer
Hybrid
1,000 - 5,000 Employees
8+ years of experience
Healthcare · Enterprise SaaS

Description For Lead Site Reliability Engineer

Wellhub (formerly Gympass) is seeking a Lead Site Reliability Engineer to join their Platform team in Brazil. As a global corporate wellness platform connecting employees to various wellness partners, Wellhub is transforming how companies approach employee wellbeing. The role focuses on building and maintaining a robust, secure, and scalable infrastructure using cutting-edge technologies like Kubernetes, AWS, and various cloud-native tools. The ideal candidate will have deep expertise in cloud infrastructure, strong problem-solving abilities, and a passion for automation and efficiency. This position offers the opportunity to work with modern technologies while contributing to a mission-driven company that values work-life balance and employee wellbeing. The role combines technical leadership with hands-on engineering, requiring both technical expertise and strong communication skills. Benefits include comprehensive healthcare, flexible work arrangements, and access to Wellhub's wellness platform. The company's culture emphasizes personal growth, work-life balance, and making a positive impact on global wellbeing.

Last updated 8 days ago

Responsibilities For Lead Site Reliability Engineer

  • Build global, secure, scalable, and cost-effective Cloud platform using Kubernetes in AWS
  • Develop and evolve Kubernetes operators and cloud-native automation
  • Build products and tools for engineering teams to manage cloud resources
  • Ensure security and compliance through DevSecOps integrations
  • Improve observability, reliability, and cost awareness
  • Support engineering teams with products and tools usage
  • Maintain CI/CD tools and services
  • Manage highly available Kubernetes clusters
  • Contribute to product documentation
  • Participate in standards definition and best practices

Requirements For Lead Site Reliability Engineer

Kubernetes
Go
Python
Ruby
  • Proven technical experience with AWS cloud services, Kubernetes, and software engineering
  • Deep knowledge of Kubernetes and its ecosystem
  • Solid knowledge of observability systems
  • Experience with operator-managed Infrastructure as Code
  • Ability to write production software
  • Excellent analytical and problem-solving skills
  • CNCF Kubernetes Certifications
  • AWS Certifications
  • Excellent communication skills in English and Portuguese

Benefits For Lead Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Parental Leave
  • Health, dental, and life insurance
  • Flexible work options (hybrid or remote)
  • Home office stipend
  • Monthly flexible work allowance
  • Flexible schedule
  • Access to wellness platform (Gold plan)
  • Paid time off
  • 100% paid parental leave
  • Career growth opportunities
  • Global collaborative environment

Interested in this job?

Jobs Related To Wellhub Lead Site Reliability Engineer

Lead Site Reliability Engineer

Lead SRE position at Wellhub, focusing on Kubernetes and cloud infrastructure, offering comprehensive benefits and flexible work arrangements.

Lead Site Reliability Engineer

Lead SRE position at Wellhub, focusing on Kubernetes and cloud infrastructure, offering comprehensive benefits and flexible work arrangements.

Staff Site Reliability Engineer

Staff SRE position at Wellhub focusing on building scalable infrastructure with Kubernetes and AWS, offering flexible work and comprehensive benefits.

Sr Staff Software Engineer, Reliability Engineering

Senior Staff SRE position at Airbnb focusing on reliability architecture, incident management, and technical leadership, offering competitive compensation and remote work flexibility.

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, developing and maintaining tools for service reliability at scale.