Staff Site Reliability Engineer

Acquia empowers the world's most ambitious brands to create digital customer experiences that matter. With open source Drupal at its core, the Acquia Digital Experience Platform (DXP) enables marketers, developers, and IT operations teams at thousands of global organizations to rapidly compose and deploy digital products and services that engage customers, enhance conversions, and help businesses stand out.
San José Province, San José, Costa Rica
DevOps
Staff Software Engineer
Remote
1,000 - 5,000 Employees
8+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer

Acquia is seeking a Staff Site Reliability Engineer to play a key role in designing, implementing, and maintaining CI/CD pipelines, cloud infrastructure, and monitoring solutions. This hands-on position requires expertise in tools like ArgoCD, Kubernetes, and cloud-native architecture to achieve operational excellence at scale. The ideal candidate will work closely with engineering teams to ensure rapid, safe, and reliable deployments.

Key responsibilities include:

  • Mastering CI/CD pipelines using tools like ArgoCD and Jenkins
  • Building and managing scalable infrastructure with Terraform and Kubernetes
  • Architecting cloud environments (AWS, GCP, or Azure) for optimal performance and cost
  • Implementing comprehensive monitoring solutions with Prometheus, Grafana, ELK, and Datadog
  • Championing DevOps culture and best practices across teams
  • Focusing on building resilient systems and implementing Service Level Objectives (SLOs)
  • Collaborating with security teams to implement robust security practices
  • Working closely with product development teams to integrate CI/CD practices

Required skills:

  • BS in Computer Science or equivalent experience
  • Proficiency in languages like Go, Python, Ruby, PHP, Java, or JavaScript
  • Strong Unix/Linux administration skills
  • Expertise in CI/CD tools, Kubernetes, cloud platforms, and Infrastructure as Code
  • Experience with monitoring and observability tools
  • Security-focused mindset and excellent problem-solving abilities

Preferred qualifications:

  • 8-13 years of hands-on DevOps or SRE experience
  • Deep knowledge of ArgoCD or similar tools
  • Strong scripting skills in Python, Go, or Bash
  • Experience with service mesh architectures
  • SRE Certification and Certified Kubernetes Administrator (CKA) are a plus

Join Acquia, a global leader in digital experience platforms, and be part of building the future of digital customer experiences.

Last updated 5 months ago

Responsibilities For Staff Site Reliability Engineer

  • Design, build, and optimize CI/CD pipelines
  • Build and manage scalable infrastructure using IaC tools
  • Architect and manage cloud environments
  • Implement comprehensive monitoring and alerting solutions
  • Champion DevOps culture and best practices
  • Focus on building resilient systems and implementing SLOs
  • Collaborate with security teams on infrastructure security
  • Work closely with product development teams

Requirements For Staff Site Reliability Engineer

Go
Java
JavaScript
Kubernetes
Linux
MongoDB
MySQL
Node.js
PHP
Python
PostgreSQL
Redis
Ruby
  • BS in Computer Science or equivalent experience
  • Proficiency in Go, Python, Ruby, PHP, Java, or JavaScript
  • Strong Unix/Linux administration skills
  • Expertise in CI/CD tools (ArgoCD, Jenkins, etc.)
  • Kubernetes and container orchestration experience
  • Cloud platform proficiency (AWS, GCP, or Azure)
  • Infrastructure as Code (Terraform, Ansible) skills
  • Experience with monitoring tools (Prometheus, Grafana, Datadog, ELK)
  • Security best practices knowledge
  • Excellent troubleshooting and problem-solving skills
  • Strong collaboration and communication abilities

Benefits For Staff Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance

Interested in this job?

Jobs Related To Acquia Staff Site Reliability Engineer

Engineering Project Manager, DevOps/SRE

Senior Engineering Project Manager position at Apple, focusing on DevOps and SRE, offering competitive salary and benefits, requiring 5+ years of experience.

Software Engineer (L5), Tools, Integrations, and Productivity

Staff Software Engineer position at Netflix focusing on developer tools, integrations, and productivity infrastructure.

Technical Program Manager - Optical Sensing

Technical Program Manager position at Apple leading manufacturing processes and factory readiness for optical sensing technology, offering competitive compensation and benefits.

AWS DevOps Engineer

Senior AWS DevOps Engineer position at Oracle, focusing on cloud infrastructure, CI/CD, and containerization with 10+ years of experience required.

Xcode Program, Build, and Release Team Manager

Lead the Xcode and Swift Playgrounds build and release team at Apple, managing software delivery processes and coordinating cross-functional teams while ensuring high-quality releases.