Service Reliability and Observability Engineer/Architect (Evening/Night Shift)

A world leader in cloud solutions, providing tomorrow's technology to tackle today's challenges for over 40 years.
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS · Cloud

Description For Service Reliability and Observability Engineer/Architect (Evening/Night Shift)

Oracle's Communications SRE/Ops team is seeking a Service Reliability and Observability Engineer/Architect to join their Enterprise Communications Platform team. This role is critical in ensuring the reliability and performance of Oracle's large-scale Communications Cloud solutions. The position involves working evening/night shifts and requires participation in 24x7x365 on-call rotations.

The role combines hands-on operational support with strategic engineering work, focusing on designing and implementing reliability, availability, and observability solutions. You'll be working with cutting-edge cloud technologies, including containerization, microservices, and modern monitoring tools. The position offers the opportunity to work with enterprise-scale systems while implementing DevOps and SRE best practices.

As part of Oracle's Communications Global Industry Unit, you'll be responsible for maintaining and improving critical communication services operating in the Oracle Cloud environment. The role requires a blend of operational expertise and software engineering skills, with opportunities to work on automation, observability, and service reliability improvements.

The ideal candidate will have strong experience with cloud platforms, DevOps practices, and modern observability tools. You'll need to be comfortable with both operational support and software development, as the role involves both maintaining existing systems and developing new solutions to improve service reliability.

This position offers the opportunity to work with a global team, tackle complex technical challenges, and contribute to the evolution of Oracle's cloud communications infrastructure. The role comes with competitive benefits, opportunities for professional growth, and the chance to work with cutting-edge technologies in a leading cloud company.

Responsibilities For Service Reliability and Observability Engineer/Architect (Evening/Night Shift)

  • Lead and author strategies for operational, reliability, availability, and resiliency capabilities
  • Perform 24x7x365 production operations and maintenance
  • Author technical content for incident response and root cause analysis
  • Develop automation and orchestration solutions
  • Collaborate with cross-functional teams on technical topics
  • Support large-scale production communications cloud services

Requirements For Service Reliability and Observability Engineer/Architect (Evening/Night Shift)

Kubernetes
Python
Java
JavaScript
Kafka
Redis
PostgreSQL
  • Strong experience with DevOps and DevSecOps methodologies
  • Experience with telemetry and observability tools (ELK, Kibana, Grafana, Prometheus, Splunk)
  • Experience with microservice architecture and containerization technologies
  • Strong experience with CI/CD tools and pipeline development
  • Programming skills in Python, Java, Golang, or JavaScript
  • Experience with cloud platforms (OCI, Azure, GCP, AWS)
  • Four-year degree in Computer Science, or equivalent experience
  • Excellent communication skills in English
  • Availability for 24x7x365 on-call shift rotation

Benefits For Service Reliability and Observability Engineer/Architect (Evening/Night Shift)

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
  • Competitive benefits package
  • Flexible medical, life insurance, and retirement options
  • Work-life balance
  • Volunteer programs
