Site Reliability Engineer

3E is a leading technology and SaaS company providing digital solutions and expert services that maximise the performance of renewable energy assets.
Brussels, Belgium
Site Reliability
Senior Software Engineer
Hybrid
51 - 100 Employees
7+ years of experience
AI · Enterprise SaaS · Renewable Energy

Description For Site Reliability Engineer

3E, a leading SaaS company in renewable energy asset management, is seeking a Site Reliability Engineer to join their team in Brussels. With 20+ years of experience across 100 countries, 3E offers SynaptiQ, a cutting-edge platform for renewable energy portfolio management. The ideal candidate will have 7+ years of software engineering experience, with 3+ years in SRE roles. You'll work on defining SLOs, managing incidents, optimizing system performance, and building robust monitoring solutions using tools like Grafana and Prometheus. This role offers the opportunity to shape the resilience of 3E's platform while working in an international environment with flexible work arrangements and comprehensive benefits.

Last updated 17 days ago

Responsibilities For Site Reliability Engineer

  • Define and Monitor SLOs: Collaborate with teams to set Service Level Objectives (SLOs) that focus on the user journey, and ensure appropriate alerting, monitoring, and reporting on key performance indicators.
  • Incident Management: Troubleshoot and perform root cause analysis for system-wide incidents, working closely with our platform, operations, and software engineering teams.
  • Proactive Alerting: Implement effective monitoring and alerting routines to prevent incidents, ensuring the on-call team is only engaged in truly exceptional circumstances.
  • Continuous Improvement: Drive the improvement of incident response times by applying insights from postmortems and implementing lessons learned.
  • Optimize System Performance: Identify and resolve service performance bottlenecks across application, runtime, and operating systems, addressing issues related to CPU, memory, databases, networks, and more.
  • Build Monitoring Solutions: Set up and maintain a robust monitoring stack with insightful dashboards and visualizations using Grafana, Loki and Prometheus.

Requirements For Site Reliability Engineer

Linux
  • At least 3 years of professional experience in a similar SRE or infrastructure-related role.
  • Minimum of 7 years of experience as a software engineer, with a deep understanding of building scalable, resilient service-oriented architectures at a medium scale.
  • 3 years hands-on experience selecting, setting up, and optimizing monitoring stacks, including building dashboards that provide deep and pertinent insights into system health, and delivering comprehensive monthly reports to demonstrate SLA compliance to customers.
  • Hands-on experience in troubleshooting performance issues at all levels: services, runtime (JVM), containers (Docker), operating system (Linux), and networking.
  • Team player with excellent leadership, communication and collaboration skills.
  • Strong analytical skills and a security-first mindset.
  • Fluency in English.

Benefits For Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Commuter Benefits
  • Attractive salary package including group insurance, hospital insurance, meal vouchers (8 euros), eco vouchers, representation allowance, company mobile phone + subscription
  • 32 days of vacation
  • Flex income plan
  • 100% reimbursement of public transport fare
  • Bicycle allowance
  • Home working (2 days per week)
  • International environment: projects in over 100 countries worldwide, colleagues of 20 nationalities

Interested in this job?

Jobs Related To 3E Site Reliability Engineer

Site Reliability Engineer- SRE

Senior Site Reliability Engineer position at Apple, focusing on platform engineering and cloud infrastructure for hardware engineering tools and data analytics.

Senior Site Reliability Engineer - Observability and Telemetry Platform

Senior SRE position at NVIDIA focusing on observability and telemetry platforms, offering competitive salary and opportunity to work with cutting-edge cloud technologies.

Senior Production SRE Engineer - Storage

Senior Production SRE Engineer position at NVIDIA focusing on storage systems, requiring 5+ years experience and expertise in large-scale system reliability and automation.

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Disney, focusing on performance optimization and system reliability across Disney's digital platforms using cloud technologies.

Site Reliability Engineer

Senior Site Reliability Engineer role at Adobe focusing on cloud infrastructure, automation, and service reliability for the Experience Cloud platform.