Site Reliability Engineer

Renmoney is a financial technology company providing innovative solutions in Nigeria.
Lagos, Nigeria
DevOps
Mid-Level Software Engineer
Hybrid
3+ years of experience

Description For Site Reliability Engineer

Renmoney is seeking a Site Reliability Engineer to join their IT department in a hybrid work environment. This role focuses on ensuring the availability and reliability of UAT and production applications, as well as improving the entire lifecycle of services. The ideal candidate will have experience with databases, configuration management tools, containerization, and monitoring systems. They will be responsible for troubleshooting complex issues, implementing security measures, and scaling systems through automation. This position offers the opportunity to work with cutting-edge technologies and solve real-world challenges in a dynamic fintech environment. The successful candidate will join a team of amazing people in a flat organizational structure, contributing to the growth and success of Renmoney's digital infrastructure.

Last updated 5 months ago

Responsibilities For Site Reliability Engineer

  • Ensuring availability of UAT and production applications and foster capacity planning for production infrastructures
  • Monitoring of existing systems/applications using monitoring tools
  • Engage in and improve the whole lifecycle of services from inception and design, through deployment, operations
  • Troubleshooting problems that span systems, databases, storage, network, and codes
  • Suggesting/implementing security measures for the protection of systems, networks, and information
  • Scale systems sustainably through mechanisms like automation
  • Evolve systems by pushing for changes that improve reliability and velocity
  • Minimize and mitigate the risk of reliability-related failures pertaining to systems availability, performance, and correctness
  • Ensuring investigation into warnings and alerts from monitoring systems
  • Incident response, diagnosis, and follow-up on system outages
  • Documentation of process and procedure manuals

Requirements For Site Reliability Engineer

Kubernetes
Kafka
Redis
Python
Go
Ruby
Linux
  • Working knowledge of databases and SQL
  • Minimum of 3 years work experience
  • Comfortable with Open-Source configuration management and orchestration tools (chef, Puppet, Ansible, Terraform, etc.)
  • Knowledge of Docker, Docker swamp, Fargate, and Kubernetes
  • Experience with caching systems such as Kafka and Redis
  • Working experience with building monitoring tools and setting measurement metrics
  • Proficiency with shell and a programming language used in an SRE/Operations engineering context (Python, Go, Ruby, etc.)
  • Experience with operating in a high availability environment
  • Excellent communication skills with a high level of emotional intelligence
  • Experience in working with remote teams
  • Server Administration skills (Redhat, Windows, CentOs, Ubuntu)

Benefits For Site Reliability Engineer

  • Competitive compensation
  • Work with amazing people
  • Work in a beautiful environment
  • Flat structure
  • Solve complex, real-world challenges

Interested in this job?

Jobs Related To Renmoney Site Reliability Engineer

Linux Desktop Support Engineer

Remote Linux Desktop Support Engineer position at Canonical, focusing on technical support and system administration for Ubuntu and open source products.

Program Manager, Construction Area Environmental Health and Safety

Lead EHS programs for Google's construction projects, ensuring safety compliance and risk management while working with cross-functional teams in data center development.

Data Center Operations Manager, Global Server Operations

Lead data center operations teams at Google, managing infrastructure, hardware installation, and technical projects while ensuring operational excellence.

Data Center Server Operations Manager

Lead data center operations team at Google, managing server infrastructure and technical implementations with competitive compensation and benefits.

Data Center Server Operations Manager

Lead Google's data center operations team, managing infrastructure and technical staff while ensuring optimal performance of server hardware and software systems.