Site Reliability Engineer

Braze is a leading customer engagement platform that empowers brands to build and maintain engaging relationships with their customers through cross-channel messaging and AI-powered experimentation.
Site Reliability
Mid-Level Software Engineer
Hybrid
3+ years of experience
Enterprise SaaS

Description For Site Reliability Engineer

Braze, a leading customer engagement platform, is seeking a Site Reliability Engineer to join its global team. As an SRE, you'll be responsible for maintaining and improving the infrastructure that supports over 3.3 billion monthly active users and handles hundreds of billions of data points each month. You'll work with a diverse technology stack including Ruby on Rails, MongoDB, Redis, Kafka, and Kubernetes.

The role combines software engineering and systems administration, focusing on applying engineering principles to infrastructure services. You'll collaborate with engineering teams to architect scalable solutions, debug reliability issues, and develop automation frameworks. The position involves being part of an on-call rotation and contributing to incident management and prevention.

Braze offers a collaborative, transparent culture recognized as a Great Place to Work® across multiple countries. The company provides comprehensive benefits, including equity compensation, flexible PTO, and strong professional development support. With offices worldwide and remote work options, Braze emphasizes work-life harmony and inclusive growth opportunities.

The ideal candidate brings 3+ years of relevant experience, strong systems thinking, and expertise in infrastructure technologies. You'll join a passionate team dedicated to solving complex challenges at scale while maintaining high reliability standards. If you're enthusiastic about infrastructure automation and building robust systems that support billions of operations daily, this role offers an exciting opportunity to make a significant impact.


Responsibilities For Site Reliability Engineer

  • Ensure site uptime and maintain internal-facing services and platforms
  • Partner with engineering teams on architecture and debugging
  • Develop infrastructure as code using Chef, Terraform, and Kubernetes
  • Create deployment pipelines using Docker and Kubernetes
  • Manage and respond to availability incidents through PagerDuty rotation
  • Implement monitoring and alerting systems
  • Ensure compliance with enterprise-grade SLAs
  • Provide centralized tooling and automation frameworks

Requirements For Site Reliability Engineer

Technologies: Ruby, MongoDB, Redis, Kafka, Kubernetes, PostgreSQL, Go
  • 3+ years of experience as a Software, DevOps, or Site Reliability Engineer
  • Strong systems thinking capabilities
  • Experience with Linux and Unix Shell
  • Strong programming skills in Ruby and/or Go
  • Experience with Docker, Kubernetes, Terraform, or similar IaC technologies
  • Experience with MongoDB, Redis, Kafka, Postgres, or similar data technologies
  • Ability to collaborate across global remote teams
  • Strong documentation skills
  • Excellent multi-tasking abilities

Benefits For Site Reliability Engineer

Highlights: 401k, Medical Insurance, Dental Insurance, Vision Insurance, Parental Leave, Education Budget, Equity
  • Competitive compensation with equity
  • Retirement and Employee Stock Purchase Plans
  • Flexible paid time off
  • Comprehensive medical, dental, vision, life, and disability benefits
  • Family services including fertility benefits and equal paid parental leave
  • Professional development and tuition reimbursement
  • Community engagement opportunities
  • Employee Resource Groups
