Site Reliability Engineer for Fita

PT Telkomsel Ekosistem Digital

Fita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches, empowering Indonesians to achieve their fitness goals through personalized virtual coaching sessions.

Jakarta, Indonesia

Site Reliability

Senior Software Engineer

In-Person

5+ years of experience

Healthcare

Description For Site Reliability Engineer for Fita

Fita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches. Our mission is to empower Indonesians of all fitness levels to achieve their goals, maintain a healthy lifestyle, and build lasting habits through personalized virtual coaching sessions.

As a Site Reliability Engineer for Fita, you will:

Manage infrastructure on GCP
Participate in the entire software development process including design, development, delivery, monitoring, and improvement
Provide technical assistance to improve system performance, capacity, reliability, scalability, and security
Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
Closely collaborate with other engineering and product teams to ensure that expected system behavior is understood and monitoring exists to detect anomalies
Carry out timely, high-quality root cause analysis for major incidents and blameless post-mortems, and help continuously improve both processes

Expected KPIs:

Maintain uptime for critical infrastructure components
Ensure all infrastructure components (Redis, PostgreSQL, etc.) are monitored, measured by node count
Ensure 80% of alerts have a Time To Acknowledge under 30 minutes during office hours
Ensure 80% of infrastructure provisioning and configuration requests are handled in under 3 hours on average during office hours
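
As an illustration of how the Time To Acknowledge KPI might be checked, here is a minimal Python sketch. It assumes alerts are available as (fired, acknowledged) timestamp pairs that have already been filtered to office hours; the function name `tta_compliance` and the data layout are hypothetical and not part of the posting.

```python
from datetime import datetime, timedelta

# Thresholds taken from the KPI list; everything else here is an assumption.
TTA_TARGET = timedelta(minutes=30)   # acknowledge in under 30 minutes
COMPLIANCE_TARGET = 0.80             # for at least 80% of alerts


def tta_compliance(alerts, target=TTA_TARGET):
    """Return the fraction of alerts acknowledged in under `target`.

    `alerts` is a list of (fired_at, acked_at) datetime pairs, assumed to be
    pre-filtered to office hours. An alert that was never acknowledged has
    acked_at set to None and counts as a miss.
    """
    if not alerts:
        return 1.0  # no alerts, so nothing missed the target
    within = sum(
        1
        for fired_at, acked_at in alerts
        if acked_at is not None and acked_at - fired_at < target
    )
    return within / len(alerts)


if __name__ == "__main__":
    # Hypothetical sample data for one day.
    day = datetime(2024, 10, 1)
    sample = [
        (day.replace(hour=9, minute=0), day.replace(hour=9, minute=5)),
        (day.replace(hour=10, minute=0), day.replace(hour=10, minute=29)),
        (day.replace(hour=11, minute=0), day.replace(hour=11, minute=45)),
        (day.replace(hour=13, minute=0), None),
    ]
    ratio = tta_compliance(sample)
    print(f"TTA compliance: {ratio:.0%}, target met: {ratio >= COMPLIANCE_TARGET}")
```

The same pattern would apply to the provisioning-request KPI by swapping in request open and close timestamps and a 3-hour threshold.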

Requirements:

Mid-level to senior, with 3-7 years of work experience, preferably at a tech company or startup
Available to start in October 2024
Strong background in Linux/Unix systems and scripting languages (e.g., Python, Bash)
Experience with cloud platforms (AWS, GCP, Azure) and containerization (Docker, Kubernetes)
Hands-on experience with monitoring tools (Prometheus, Grafana, Splunk, etc.)
Understanding of CI/CD pipelines and DevOps practices
Ability to communicate effectively and work collaboratively in a team-oriented environment
Strong problem-solving skills and a proactive attitude toward operational challenges

If you're ready to take the next step in your career, apply now at people@fita.co.id. Be sure to include "SRE" in the subject line of your email.

Last updated 10 months ago

Responsibilities For Site Reliability Engineer for Fita

Manage infrastructure on GCP
Participate in the entire software development process
Provide technical assistance to improve system performance
Scale systems sustainably through automation
Collaborate with other engineering and product teams
Participate in incident root cause analysis and post-mortem activities

Requirements For Site Reliability Engineer for Fita

Linux

Python

Kubernetes

Redis

PostgreSQL

3-7 years of working experience
Strong background in Linux/Unix systems and scripting languages
Experience with cloud platforms and containerization
Hands-on experience with monitoring tools
Understanding of CI/CD pipelines and DevOps practices
Effective communication and collaboration skills
Strong problem-solving skills