Site Reliability Engineer

Renmoney

Renmoney is a financial technology company providing innovative solutions in Nigeria.

Lagos, Nigeria

DevOps

Mid-Level Software Engineer

Hybrid

3+ years of experience

This job posting may no longer be active.

Description For Site Reliability Engineer

Renmoney is seeking a Site Reliability Engineer to join their IT department in a hybrid work environment. This role focuses on ensuring the availability and reliability of UAT and production applications, as well as improving the entire lifecycle of services. The ideal candidate will have experience with databases, configuration management tools, containerization, and monitoring systems. They will be responsible for troubleshooting complex issues, implementing security measures, and scaling systems through automation. This position offers the opportunity to work with cutting-edge technologies and solve real-world challenges in a dynamic fintech environment. The successful candidate will join a team of amazing people in a flat organizational structure, contributing to the growth and success of Renmoney's digital infrastructure.

Last updated a year ago

Responsibilities For Site Reliability Engineer

Ensure availability of UAT and production applications and support capacity planning for production infrastructure
Monitor existing systems and applications using monitoring tools
Engage in and improve the whole lifecycle of services, from inception and design through deployment and operations
Troubleshoot problems that span systems, databases, storage, networks, and code
Suggest and implement security measures to protect systems, networks, and information
Scale systems sustainably through mechanisms like automation
Evolve systems by pushing for changes that improve reliability and velocity
Minimize and mitigate the risk of reliability-related failures pertaining to systems availability, performance, and correctness
Investigate warnings and alerts from monitoring systems
Respond to incidents, diagnose system outages, and follow up on them
Document processes and maintain procedure manuals

Requirements For Site Reliability Engineer

Kubernetes

Kafka

Redis

Python

Ruby

Linux

Working knowledge of databases and SQL
Minimum of 3 years' work experience
Comfortable with open-source configuration management and orchestration tools (Chef, Puppet, Ansible, Terraform, etc.)
Knowledge of Docker, Docker Swarm, Fargate, and Kubernetes
Experience with caching and messaging systems such as Redis and Kafka
Working experience building monitoring tools and defining measurement metrics
Proficiency with shell and a programming language used in an SRE/Operations engineering context (Python, Go, Ruby, etc.)
Experience with operating in a high availability environment
Excellent communication skills with a high level of emotional intelligence
Experience in working with remote teams
Server administration skills (Red Hat, Windows, CentOS, Ubuntu)

Benefits For Site Reliability Engineer

Competitive compensation
Work with amazing people
Work in a beautiful environment
Flat structure
Solve complex, real-world challenges