Site Reliability Engineer - EP

Southeast Asia's largest digital ecosystem offering technology infrastructure, on-demand services, and financial solutions.
Site Reliability
Staff Software Engineer
Hybrid
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For Site Reliability Engineer - EP

GoTo Group is seeking a Site Reliability Engineer to join their Engineering Platform team in either Bengaluru or Gurugram. This role is crucial for improving and managing Gojek's engineering productivity, reliability, and observability across their platform that powers diverse applications across multiple business lines.

The position offers an opportunity to work with a highly driven team of engineers delivering fundamental functionality that enables multiple product groups at Gojek to handle complex scenarios at scale. As an SRE, you'll be directly responsible for improving engineering quality, productivity, and the experience of engineers driving fundamental business KPIs for the company.

The role involves significant work with cloud infrastructure, Kubernetes administration, automation, and DevOps practices. You'll be working with cutting-edge technology in cloud computing, managing real-time high-throughput systems with a wide range of programming stacks. The platform team focuses on designing abstractions and automations to enhance the productivity of Gojek Product Engineers.

GoTo Group, as the largest digital ecosystem in Indonesia, offers a unique opportunity to work on technology infrastructure that serves millions of users across Southeast Asia. The company's ecosystem includes Gojek's on-demand transportation services, food and grocery delivery, logistics, and GoTo Financial's payment services, making it the first platform in Southeast Asia to host these crucial services in a single ecosystem.

This position requires extensive experience in SRE/DevOps, strong technical skills in cloud platforms, Kubernetes, and infrastructure as code, and the ability to work with complex distributed systems. The hybrid work environment allows for flexibility while maintaining collaborative opportunities with the team.

Last updated 7 hours ago

Responsibilities For Site Reliability Engineer - EP

  • Administer cloud-based infrastructure deployment including resource provisioning, user administration, and monitoring
  • Design and build SRE tooling to automate monitoring, incident response, and alerting
  • Build and improve CI/CD tooling to automate and streamline deployments
  • Design and build GitOps practice for infrastructure management
  • Deploy and manage applications on Kubernetes
  • Manage pod and container lifecycle, service and ingress resource management
  • Handle infrastructure provisioning using Infrastructure as Code (IaC)
  • Build and manage Cloud product features for Enhanced Networking

Requirements For Site Reliability Engineer - EP

Kubernetes
Linux
  • 8+ years of experience in SRE or DevOps space (5+ in large enterprise Cloud)
  • Experience with large-scale applications in AWS or GCP
  • Strong hands-on experience in Kubernetes
  • Deep knowledge of Linux and container technologies
  • Ability to automate tasks and familiarity with scripting languages
  • Strong understanding of infrastructure-as-code principles
  • Solid understanding of networking concepts and protocols
  • Understanding of microservices architecture and event-driven architecture
  • Strong technical troubleshooting and communication skills

Interested in this job?

Jobs Related To GoTo Group Site Reliability Engineer - EP

Site Reliability Engineer – AIOps

Senior Site Reliability Engineer role focusing on AIOps at Oracle, building AI-driven solutions for cloud infrastructure reliability and automation.

Lead Site Reliability Engineer

Lead Site Reliability Engineer position at Bumble Inc., focusing on ensuring system reliability and scalability while working with cutting-edge technologies in a hybrid work environment in London.

Staff Site Reliability Engineer

Staff Site Reliability Engineer position at ClickUp, focusing on maintaining and improving the reliability of their all-in-one work management platform.

Staff Site Reliability Engineer

Staff Site Reliability Engineer position at Perchwell, leading infrastructure and reliability initiatives for a modern real estate technology platform in New York.

Staff Site Reliability Engineer

Staff Site Reliability Engineer position at Assured, focusing on building and maintaining scalable infrastructure for insurance claims processing platform.