Site Reliability Engineer

A company transforming the future of commerce through cloud-based infrastructure and platform solutions.
Site Reliability
Mid-Level Software Engineer
Hybrid
Enterprise SaaS · E-Commerce

Description For Site Reliability Engineer

commercetools is transforming the future of commerce through their cloud-based infrastructure and platform solutions. As a Site Reliability Engineer, you'll play a crucial role in managing critical infrastructure across AWS, GCP, and Azure, focusing on automation, reliability, and enhancing the developer experience. The position offers a hybrid work environment and the opportunity to work with cutting-edge technologies like Kubernetes, Terraform, and GitOps practices.

The role involves developing infrastructure automation, optimizing multi-cloud Kubernetes environments, and creating self-service platforms. You'll be part of a diverse, international team committed to continuous improvement and operational excellence. The ideal candidate should have experience with major cloud providers, Infrastructure as Code, and strong automation capabilities.

The company offers an attractive benefits package including competitive compensation with stock options, flexible work arrangements including up to 60 days of working from different countries, learning and development opportunities, and a strong commitment to diversity and inclusion. This is an excellent opportunity for someone who wants to grow their platform engineering skills while making a meaningful impact in the e-commerce industry.

Last updated 3 months ago

Responsibilities For Site Reliability Engineer

  • Develop infrastructure automation using Terraform and Crossplane
  • Optimize Kubernetes environments across multiple cloud providers
  • Create self-service platforms and workflows using Spacelift and GitOps practices
  • Participate in on-call rotations for infrastructure and platform services
  • Work closely with product teams to develop platform solutions
  • Develop scalable tools for automation and implement security best practices
  • Engage in pair programming and provide constructive code reviews

Requirements For Site Reliability Engineer

Go
Kubernetes
Python
  • Experience with at least two major cloud providers (AWS, GCP, or Azure)
  • Experience with Infrastructure as Code, particularly Terraform
  • Working knowledge of Kubernetes and its ecosystem
  • Understanding of GitOps practices and CI/CD pipelines
  • Strong automation and scripting capabilities (Python, Bash, Go)
  • Experience with monitoring tools like Prometheus and Grafana
  • Excellent problem-solving abilities and root cause analysis
  • Clear written and verbal communication skills in English

Benefits For Site Reliability Engineer

Education Budget
Equity
  • Competitive salary and stock options
  • Work from anywhere up to 60 days per year
  • Learning & Development Budget
  • Access to Coursera and Babbel training courses
  • Flexible working hours
  • Diverse and international workplace

Interested in this job?

Jobs Related To commercetools Site Reliability Engineer

Site Reliability Engineer II - Global Real Estate Technology - Chief Administrative Office - Bengaluru - India

Site Reliability Engineer II position at JPMorgan Chase focusing on system reliability, infrastructure automation, and service level optimization in the financial technology sector.

Site Reliability Developer 2

Site Reliability Developer position at Oracle focusing on cloud infrastructure, automation, and system reliability with 3-5+ years of experience required.

Site Reliability Engineer II

Site Reliability Engineer II position at Microsoft's Azure Data team, focusing on platform reliability, automation, and system performance with hybrid work options.

Site Reliability Engineer II

Microsoft seeks Site Reliability Engineer II for cybersecurity solutions, offering hybrid work, competitive pay, and comprehensive benefits.

Site Reliability Engineer

Site Reliability Engineer role at PEXA International focusing on platform reliability, incident management, and infrastructure optimization in a remote setting.