Site Reliability Engineer

Montevideo, Montevideo Department, Uruguay
Site Reliability
Senior Software Engineer
In-Person
5+ years of experience
Enterprise SaaS
This job posting may no longer be active.

Description For Site Reliability Engineer

XperiencOps Inc. is seeking a Senior Site Reliability Engineer (SRE) to ensure system reliability, availability, and performance. The role works with customer engineering, support, and DevRel teams to close reliability gaps, implement automation, and improve system scalability. It requires expertise in AWS cloud technologies and serverless architectures, particularly AWS Lambda.

The SRE designs and manages infrastructure solutions, develops automation scripts, monitors system health, and responds to incidents. The ideal candidate has strong experience in cloud services, observability systems, and infrastructure as code, along with excellent troubleshooting and communication skills.

The company offers a competitive benefits package and opportunities for professional growth in an innovative environment. The role combines technical expertise with operational excellence to improve customer experience through reliable system performance.

Last updated 6 months ago

Responsibilities For Site Reliability Engineer

  • Design, implement, and manage scalable and reliable infrastructure solutions
  • Develop automation scripts to support operations and streamline processes
  • Set up and monitor system alerts, metrics, and dashboards
  • Respond to incidents and outages, performing root cause analysis
  • Collaborate with development and operations teams
  • Continuously improve the architecture and deployment processes
  • Participate in on-call rotations
  • Document and communicate system architecture and procedures

Requirements For Site Reliability Engineer

Python
Go
Kubernetes
Linux
  • Bachelor's degree in Computer Science or related discipline
  • 5+ years of experience in Site Reliability Engineering, DevOps, or similar role
  • 3+ years of experience in cloud services, particularly AWS
  • Experience building observability systems with New Relic, CloudWatch, or similar tools
  • Experience implementing rate-limiting, API gateways, and load balancing
  • Exposure to security best practices and compliance frameworks
  • Proficiency in infrastructure as code (IaC)
  • Hands-on experience with scripting and programming languages
  • Strong troubleshooting and debugging skills
  • Excellent communication and collaboration skills
  • Experience with incident management and post-mortem practices

Benefits For Site Reliability Engineer

  • Competitive salary and comprehensive benefits package
  • Opportunity to be part of a cutting-edge technology company
  • Professional growth and development opportunities
  • Supportive and collaborative work environment
