Site Reliability Engineer for Fita

Fita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches, empowering Indonesians to achieve their fitness goals through personalized virtual coaching sessions.
Jakarta, Indonesia
Site Reliability
Senior Software Engineer
In-Person
5+ years of experience
Healthcare

Description For Site Reliability Engineer for Fita

Fita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches. Our mission is to empower Indonesians of all fitness levels to achieve their goals, maintain a healthy lifestyle, and build lasting habits through personalized virtual coaching sessions.

As a Site Reliability Engineer for Fita, you will:

  • Manage infrastructure on GCP
  • Participate in the entire software development process including design, development, delivery, monitoring, and improvement
  • Provide technical assistance to improve system performance, capacity, reliability, scalability, and security
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Closely collaborate with other engineering and product teams to ensure that expected system behavior is understood and monitoring exists to detect anomalies
  • Contribute to high-quality, timely root cause analysis of major incidents and to blameless post-mortems, and help continuously improve both processes

Expected KPIs:

  • Maintain uptime for critical infrastructure components
  • Ensure all infrastructure components (Redis, PostgreSQL, etc.) are monitored, measured by node count
  • Ensure 80% of alerts have a Time To Acknowledge below 30 minutes during office hours
  • Ensure 80% of infrastructure provisioning and configuration requests are handled in under 3 hours on average during office hours
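
For candidates wondering how the Time To Acknowledge KPI above might be measured, here is a minimal sketch in Python. It is an illustration only, not Fita's actual tooling: the office hours (weekdays, 09:00 to 18:00), the input shape (pairs of fired and acknowledged timestamps), and the names in_office_hours and tta_compliance are all assumptions, since the posting does not define them.

```python
from datetime import datetime, time, timedelta

# Assumed office hours; the posting does not say what they are.
OFFICE_START = time(9, 0)
OFFICE_END = time(18, 0)

# Targets taken from the KPI list: 80% of alerts, TTA below 30 minutes.
TTA_TARGET = timedelta(minutes=30)
COMPLIANCE_TARGET = 0.80


def in_office_hours(ts: datetime) -> bool:
    """True if ts falls on a weekday within the assumed office hours."""
    return ts.weekday() < 5 and OFFICE_START <= ts.time() < OFFICE_END


def tta_compliance(alerts):
    """Fraction of office-hours alerts acknowledged in under TTA_TARGET.

    alerts: iterable of (fired_at, acknowledged_at) datetime pairs, where
    acknowledged_at is None for an alert that was never acknowledged.
    Returns None when no alert fired during office hours.
    """
    scoped = [(fired, acked) for fired, acked in alerts if in_office_hours(fired)]
    if not scoped:
        return None
    on_time = sum(
        1 for fired, acked in scoped
        if acked is not None and acked - fired < TTA_TARGET
    )
    return on_time / len(scoped)


# Made-up sample data for one Monday.
alerts = [
    (datetime(2024, 10, 7, 10, 0), datetime(2024, 10, 7, 10, 5)),   # 5 min
    (datetime(2024, 10, 7, 11, 0), datetime(2024, 10, 7, 11, 45)),  # 45 min, too slow
    (datetime(2024, 10, 7, 14, 0), datetime(2024, 10, 7, 14, 20)),  # 20 min
    (datetime(2024, 10, 7, 15, 0), datetime(2024, 10, 7, 15, 10)),  # 10 min
    (datetime(2024, 10, 7, 16, 0), datetime(2024, 10, 7, 16, 29)),  # 29 min
    (datetime(2024, 10, 7, 23, 0), None),  # outside office hours, not counted
]

ratio = tta_compliance(alerts)
print(f"TTA compliance: {ratio:.0%}, target met: {ratio >= COMPLIANCE_TARGET}")
```

In this sample, four of the five office-hours alerts are acknowledged in under 30 minutes, so the 80% target is just met. Alerts that fire outside the assumed office hours are excluded from the ratio rather than counted as misses.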

Requirements:

  • Mid-level to senior, with 3-7 years of working experience, preferably at a tech company or startup
  • Available to start work in October 2024
  • Strong background in Linux/Unix systems and scripting languages (e.g., Python, Bash)
  • Experience with cloud platforms (AWS, GCP, Azure) and containerization (Docker, Kubernetes)
  • Hands-on experience with monitoring tools (Prometheus, Grafana, Splunk, etc.)
  • Understanding of CI/CD pipelines and DevOps practices
  • Ability to communicate effectively and work collaboratively in a team-oriented environment
  • Strong problem-solving skills and a proactive attitude toward operational challenges

If you're ready to take the next step in your career, apply now by emailing people@fita.co.id. Be sure to include "SRE" in the subject line of your email.

Last updated 6 months ago

Responsibilities For Site Reliability Engineer for Fita

  • Manage infrastructure on GCP
  • Participate in the entire software development process
  • Provide technical assistance to improve system performance
  • Scale systems sustainably through automation
  • Collaborate with other engineering and product teams
  • Participate in incident root cause analysis and post-mortem activities

Requirements For Site Reliability Engineer for Fita

  • Key technologies: Linux, Python, Kubernetes, Redis, PostgreSQL
  • 3-7 years of working experience
  • Strong background in Linux/Unix systems and scripting languages
  • Experience with cloud platforms and containerization
  • Hands-on experience with monitoring tools
  • Understanding of CI/CD pipelines and DevOps practices
  • Effective communication and collaboration skills
  • Strong problem-solving skills
