Site Reliability Engineer (SRE)

Cairo, Cairo Governorate, EgyptAlexandria, Alexandria Governorate, EgyptRiyadh Saudi Arabia
Site Reliability
Mid-Level Software Engineer
Remote
3+ years of experience
Enterprise SaaS

Description For Site Reliability Engineer (SRE)

Lucidya is seeking a Site Reliability Engineer (SRE) to join their Cloud Engineering team. This role focuses on enhancing the reliability, scalability, and automation of cloud-based infrastructure. The ideal candidate will work with cloud environments, containerized workloads, and monitoring systems. Key responsibilities include managing high availability infrastructure, cloud operations across major providers, Kubernetes cluster management, and implementing monitoring solutions. The role requires experience with Infrastructure as Code, scripting languages, and modern DevOps practices. The position offers the flexibility of remote work with opportunities to work with cutting-edge cloud technologies and contribute to establishing best practices for infrastructure reliability. The successful candidate will be part of a collaborative team environment, participating in on-call rotations and driving continuous improvement in system reliability and performance. This role is perfect for someone who combines technical expertise with strong problem-solving abilities and excellent communication skills.

Last updated a minute ago

Responsibilities For Site Reliability Engineer (SRE)

  • Ensure high availability and scalability of critical infrastructure components
  • Proactively identify and eliminate single points of failure across the cloud environment
  • Handle infrastructure management tasks for Linux-based systems
  • Manage and optimize cloud-based workloads across AWS, GCP, or Azure
  • Manage Kubernetes clusters operations including deployment, scaling, upgrades, and troubleshooting
  • Implement and standardize monitoring solutions
  • Participate in on-call rotations and troubleshoot incidents
  • Develop and maintain automation scripts for routine operational tasks
  • Work closely with DevOps and Engineering teams

Requirements For Site Reliability Engineer (SRE)

Python
Kubernetes
Redis
MongoDB
  • 3 years of experience in SRE, DevOps, or Infrastructure Engineer role
  • Strong experience with at least one major cloud provider (AWS, GCP, or Azure)
  • Hands-on experience with Kubernetes and containerization
  • Proficient in scripting languages such as Python, Bash
  • Familiarity with Infrastructure as Code tools
  • Strong understanding of load balancers, networking, and HA architecture
  • Experience with CI/CD tools
  • Experience with modern monitoring and observability tools
  • Strong analytical skills and ability to resolve complex technical issues
  • Excellent communication and collaboration skills

Interested in this job?

Jobs Related To Lucidya Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

Remote Site Reliability Engineer position at Lucidya, focusing on cloud infrastructure, Kubernetes, and automation with 3 years of experience required.

Site Reliability Engineer (SRE)

Remote Site Reliability Engineer position at Lucidya, focusing on cloud infrastructure, Kubernetes, and automation with 3 years of experience required.

Site Reliability Engineer

Site Reliability Engineer role at commercetools focusing on multi-cloud infrastructure, Kubernetes, and automation with hybrid work model.

Site Reliability Engineer

Join PalUp as a Site Reliability Engineer to build and maintain robust infrastructure supporting millions of users on an AI-driven social platform.

Site Reliability Engineer II

Microsoft seeks Site Reliability Engineer II for M365 Core Security team to protect digital infrastructure using AI/ML and security expertise.