Site Reliability Engineer

Cloud native API platform provider with the fastest, most adopted API gateway in the world, enabling companies to become API-first and securely accelerate AI adoption.
Site Reliability
Senior Software Engineer
Hybrid
5+ years of experience
Enterprise SaaS

Description For Site Reliability Engineer

Kong is seeking a Senior Site Reliability Engineer to join their team in Bangalore, India in a hybrid work arrangement. This role is crucial for developing and operating Kong's Managed Gateways offerings, including both Dedicated Cloud Gateways and Serverless Gateways. The position requires maintaining high reliability (99.99% uptime) for Kong's API Management suite, Konnect.

Kong's platform offers customers the ability to create managed API gateways in any cloud globally, with two main products: Dedicated Cloud Gateways providing private, isolated environments, and Serverless Gateways offering fully managed, elastic solutions. This role combines both development and operations, requiring expertise in cloud platforms, Kubernetes, and distributed systems.

The ideal candidate will have 5+ years of SaaS development experience, strong expertise in networking concepts, and proven experience with high-reliability systems. They'll work in a hybrid model (three days office, two days remote) and will be responsible for both technical leadership and mentoring other engineers.

Kong, with over 300 million downloads of their API gateway, is leading the cloud API technologies space, helping organizations from startups to Fortune 500 enterprises become API-first and accelerate their market presence. The company's mission is to build the nervous system that will safely and reliably connect all of humankind, making this an exciting opportunity for those passionate about scalable, reliable systems.

Last updated 4 hours ago

Responsibilities For Site Reliability Engineer

  • Own the end-to-end technical success of the Managed Gateways Platform
  • Architect and operate software systems to maintain 99.99% uptime SLA
  • Shape technical direction for the Managed Gateways product by driving innovation
  • Collaborate with product leadership to define strategy, roadmap, and objectives
  • Mentor other engineers in the organization

Requirements For Site Reliability Engineer

Go
Kubernetes
PostgreSQL
  • Bachelor's or Master's degree in Computer Science or related field
  • 3+ years of experience in building and operating highly reliable SaaS/PaaS systems
  • Experience with major cloud platforms (AWS, Azure, or GCP)
  • Strong experience with Kubernetes
  • Familiarity with observability tools (Datadog, Prometheus, Grafana, Victoria Metrics, Loki)
  • Expertise in designing and developing highly scalable distributed systems
  • Strong expertise in networking concepts (OSI Layer 4 and 7, DNS, TLS/SSL, HTTP)
  • Experience managing incidents under high-pressure situations
  • Backend development experience (preferably with GoLang)
  • 5+ years of experience in SaaS development with 99.99% reliability
  • Strong verbal and written communication skills

Interested in this job?

Jobs Related To Kong Site Reliability Engineer

Site Reliability Engineer

Senior Site Reliability Engineer position at Kong, working with cloud infrastructure, Kubernetes, and automated deployment systems in a hybrid work environment.

Site Reliability Engineer

Senior Site Reliability Engineer position at Behavox managing high-load distributed systems with 5+ years experience required in DevOps and cloud platforms.

Site Reliability Engineer - Video on Demand/Streaming Event Support

Senior Site Reliability Engineer role at Apple focusing on video streaming operations, offering $157K-$236K salary with comprehensive benefits in Irvine, CA.

Senior Site Reliability Engineer - NZ

Senior Site Reliability Engineer position at Datacom, focusing on maintaining and optimizing cloud infrastructure for the Smartly payroll platform.

Senior Software Developer, Reliability

Senior Software Developer position focusing on reliability engineering at Wealthsimple, working with Ruby, Java, and Kubernetes to ensure system reliability and scalability.