Site Reliability Engineer for Fita

Fita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches, empowering Indonesians to achieve their fitness goals through personalized virtual coaching sessions.
Jakarta, Indonesia
Site Reliability
Senior Software Engineer
In-Person
5+ years of experience
Healthcare
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior Site Reliability Engineer position at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 5+ years of software development experience.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 5+ years of software development experience.

Senior Software Engineer, Site Reliability Engineering, Data Cloud

Senior Site Reliability Engineer role at Google, focusing on building AI-powered infrastructure and maintaining large-scale distributed systems for Google Cloud Platform.

Senior Software Engineer, Site Reliability Engineering

Senior SRE position at Google focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Description For Site Reliability Engineer for Fita

Fita is a health-tech platform that brings together a community of fitness enthusiasts and expert coaches. Our mission is to empower Indonesians of all fitness levels to achieve their goals, maintain a healthy lifestyle, and build lasting habits through personalized virtual coaching sessions.

As a Site Reliability Engineer for Fita, you will:

  • Manage infrastructure on GCP
  • Participate in the entire software development process including design, development, delivery, monitoring, and improvement
  • Provide technical assistance to improve system performance, capacity, reliability, scalability, and security
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity
  • Closely collaborate with other engineering and product teams to ensure that expected system behavior is understood and monitoring exists to detect anomalies
  • Participate in continuous improvement and execution of quality and timely major incident root cause analysis and blameless post-mortem activities

Expected KPIs:

  • Maintain uptime for critical infrastructure components
  • Ensure all infrastructure components are monitored (Redis, PgSQL, etc) - by node count
  • Ensure 80% of alerts have Time To Acknowledge below 30m in office hours
  • Ensure 80% of infra provisioning / configuration requests are handled below 3h on average in office hours

Requirements:

  • Medior to Senior Level with 3-7 years of working experience at a tech company or startup (preferably)
  • Ready to work in October 2024
  • Strong background in Linux/Unix systems and scripting languages (e.g., Python, Bash)
  • Experience with cloud platforms (AWS, GCP, Azure) and containerization (Docker, Kubernetes)
  • Hands-on experience with monitoring tools (Prometheus, Grafana, Splunk, etc.)
  • Understanding of CI/CD pipelines and DevOps practices
  • Ability to communicate effectively and work collaboratively in a team-oriented environment
  • Strong problem-solving skills and a proactive attitude toward operational challenges

If you're ready to take the next step in your career, apply now at people@fita.co.id. Be sure to include "SRE" in the subject line of your email.

Last updated 4 months ago

Responsibilities For Site Reliability Engineer for Fita

  • Manage infrastructure on GCP
  • Participate in the entire software development process
  • Provide technical assistance to improve system performance
  • Scale systems sustainably through automation
  • Collaborate with other engineering and product teams
  • Participate in incident root cause analysis and post-mortem activities

Requirements For Site Reliability Engineer for Fita

Linux
Python
Kubernetes
Redis
PostgreSQL
  • 3-7 years of working experience
  • Strong background in Linux/Unix systems and scripting languages
  • Experience with cloud platforms and containerization
  • Hands-on experience with monitoring tools
  • Understanding of CI/CD pipelines and DevOps practices
  • Effective communication and collaboration skills
  • Strong problem-solving skills

Interested in this job?