Site Reliability Engineer - EP

GoTo Group

Southeast Asia's largest digital ecosystem offering technology infrastructure, on-demand services, and financial solutions.

Bengaluru, Karnataka, India • Gurugram, Haryana, India

Site Reliability

Staff Software Engineer

Hybrid

5,000+ Employees

8+ years of experience

Enterprise SaaS

Description For Site Reliability Engineer - EP

GoTo Group is seeking a Site Reliability Engineer to join their Engineering Platform team in either Bengaluru or Gurugram. This role is crucial for improving and managing Gojek's engineering productivity, reliability, and observability across their platform that powers diverse applications across multiple business lines.

The position offers an opportunity to work with a highly driven team of engineers delivering fundamental functionality that enables multiple product groups at Gojek to handle complex scenarios at scale. As an SRE, you'll be directly responsible for improving engineering quality, productivity, and the experience of engineers driving fundamental business KPIs for the company.

The role involves significant work with cloud infrastructure, Kubernetes administration, automation, and DevOps practices. You'll be working with cutting-edge technology in cloud computing, managing real-time high-throughput systems with a wide range of programming stacks. The platform team focuses on designing abstractions and automations to enhance the productivity of Gojek Product Engineers.

GoTo Group, as the largest digital ecosystem in Indonesia, offers a unique opportunity to work on technology infrastructure that serves millions of users across Southeast Asia. The company's ecosystem includes Gojek's on-demand transportation services, food and grocery delivery, logistics, and GoTo Financial's payment services, making it the first platform in Southeast Asia to host these crucial services in a single ecosystem.

This position requires extensive experience in SRE/DevOps, strong technical skills in cloud platforms, Kubernetes, and infrastructure as code, and the ability to work with complex distributed systems. The hybrid work environment allows for flexibility while maintaining collaborative opportunities with the team.

Last updated 7 hours ago

Responsibilities For Site Reliability Engineer - EP

Administer cloud-based infrastructure deployment including resource provisioning, user administration, and monitoring
Design and build SRE tooling to automate monitoring, incident response, and alerting
Build and improve CI/CD tooling to automate and streamline deployments
Design and build GitOps practice for infrastructure management
Deploy and manage applications on Kubernetes
Manage pod and container lifecycle, service and ingress resource management
Handle infrastructure provisioning using Infrastructure as Code (IaC)
Build and manage Cloud product features for Enhanced Networking

Requirements For Site Reliability Engineer - EP

Kubernetes

Linux

8+ years of experience in SRE or DevOps space (5+ in large enterprise Cloud)
Experience with large-scale applications in AWS or GCP
Strong hands-on experience in Kubernetes
Deep knowledge of Linux and container technologies
Ability to automate tasks and familiarity with scripting languages
Strong understanding of infrastructure-as-code principles
Solid understanding of networking concepts and protocols
Understanding of microservices architecture and event-driven architecture
Strong technical troubleshooting and communication skills

GoTo Group

Southeast Asia's largest digital ecosystem offering technology infrastructure, on-demand services, and financial solutions.

Bengaluru, Karnataka, India • Gurugram, Haryana, India

Site Reliability

Staff Software Engineer

Hybrid

5,000+ Employees

8+ years of experience

Enterprise SaaS

Interested in this job?

Jobs Related To GoTo Group Site Reliability Engineer - EP

Site Reliability Engineer – AIOps

Oracle

Senior Site Reliability Engineer role focusing on AIOps at Oracle, building AI-driven solutions for cloud infrastructure reliability and automation.

Lead Site Reliability Engineer

Bumble Inc.

Lead Site Reliability Engineer position at Bumble Inc., focusing on ensuring system reliability and scalability while working with cutting-edge technologies in a hybrid work environment in London.

Staff Site Reliability Engineer

ClickUp

Staff Site Reliability Engineer position at ClickUp, focusing on maintaining and improving the reliability of their all-in-one work management platform.

Staff Site Reliability Engineer

Perchwell

Staff Site Reliability Engineer position at Perchwell, leading infrastructure and reliability initiatives for a modern real estate technology platform in New York.

Staff Site Reliability Engineer

Assured

Staff Site Reliability Engineer position at Assured, focusing on building and maintaining scalable infrastructure for insurance claims processing platform.