Site Reliability Engineer

A global technology company developing real-time communication platforms and services across 230+ countries.
Site Reliability
Senior Software Engineer
Hybrid
Enterprise SaaS

Description For Site Reliability Engineer

Hyperconnect's Platform Department is seeking a Senior Site Reliability Engineer to join its SRE team. The role focuses on keeping all services stable so that users can enjoy Hyperconnect's unique experiences without interruption. You'll work with AWS, Kubernetes, and a service mesh in a modern computing environment, managing infrastructure for both B2B and B2C products across 230+ countries. The position involves deep engagement with backend engineering, working on high-performance, low-latency systems and handling complex production environments at scale. You'll use tools such as Terraform, Helm, ArgoCD, and Spinnaker for infrastructure management, and monitoring solutions including Zabbix, Prometheus, OpenTelemetry, and Elasticsearch. The role offers opportunities to contribute to core systems optimization, implement new technologies, and work closely with various development teams to improve service reliability and performance. The ideal candidate will have strong technical skills, excellent problem-solving abilities, and a passion for maintaining and improving large-scale distributed systems.

Responsibilities For Site Reliability Engineer

  • Build and operate high-availability system infrastructure in public cloud environments
  • Implement and manage system/application logging, monitoring, and automation
  • Lead incident response and foster a postmortem culture
  • Identify and implement service improvements based on SLOs/SLIs
  • Run proofs of concept (PoCs) for new technologies and roll them out to production

Requirements For Site Reliability Engineer

Kubernetes
Go
Python
Linux
  • Strong understanding of CS fundamentals, especially Linux and Networking
  • Understanding of container technologies
  • Programming ability in Python and Go
  • Practical experience with Linux servers in public cloud environments such as AWS
  • Excellent communication and documentation skills
  • Ability to identify and proactively solve various service problems
  • Enthusiasm for learning new technologies
