Site Reliability Engineer (Expert-level)

France
Site Reliability
Staff Software Engineer
Remote
Enterprise SaaS

Description For Site Reliability Engineer (Expert-level)

Last updated a month ago

Responsibilities For Site Reliability Engineer (Expert-level)

  • Managing global infrastructure
  • Monitoring KPIs
  • Automating processes
  • Planning for scalability
  • Working with distributed data stores
  • Managing and monitoring system infrastructure

Requirements For Site Reliability Engineer (Expert-level)

PostgreSQL
Cassandra
Kafka
Kubernetes
  • Experience with cloud provider GCP
  • Experience with configuration management tools Terraform and Ansible
  • Practical experience with distributed data stores (PostgreSQL, Cassandra, and Kafka)
  • Hands-on proficiency with modern monitoring tools (Prometheus and Grafana)
  • Ability to manage global infrastructure
  • Experience with monitoring KPIs
  • Experience with process automation
  • Experience with scalability planning

Interested in this job?

Jobs Related To Sinch Site Reliability Engineer (Expert-level)

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, developing and maintaining tools for service reliability at scale.

Sr Staff Software Engineer, Reliability Engineering

Senior Staff SRE position at Airbnb focusing on reliability architecture, incident management, and technical leadership, offering competitive compensation and remote work flexibility.

Senior Site Reliability Engineer

Remote Senior Site Reliability Engineer position at ZayZoon, focusing on AWS infrastructure and production deployments across Canada.

Site Reliability Engineering II

Senior Site Reliability Engineer position at Microsoft focusing on identity and security engineering, requiring 5+ years of experience in identity technologies and security infrastructure.

Site Reliability Manager, Core Enterprise Systems

Lead a team of SRE engineers at Google, managing enterprise services and driving reliability improvements across critical internal systems.