Principal Site Reliability Engineer

A world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's problems, operating with integrity for 40+ years.
Ireland
Site Reliability
Principal Software Engineer
In-Person
6+ years of experience
Enterprise SaaS · Cloud

Description For Principal Site Reliability Engineer

Oracle Cloud Infrastructure (OCI) Incident Response team seeks a Principal Site Reliability Engineer to join their globally distributed team. This role is crucial in maintaining the high availability of Oracle's cloud services by detecting, triaging, and mitigating service-impacting events. The position involves working with state-of-the-art cloud technology and responding to critical issues within minutes to ensure minimal customer impact. As a Principal SRE, you'll interact with leaders across Oracle, driving programs to improve service availability. The role requires deep technical expertise in cloud computing, incident management, and system architecture. You'll be part of an agile team making significant impact on OCI's reliability and performance. Oracle offers a competitive benefits package and promotes an inclusive workplace that values diverse perspectives. The company's 40+ year track record of innovation and integrity makes this an excellent opportunity for experienced SRE professionals looking to work with cutting-edge cloud technology at scale.

Last updated a day ago

Responsibilities For Principal Site Reliability Engineer

  • Solve complex problems related to infrastructure cloud services and automate common tasks
  • Command and coordinate SMEs and Service leaders during Major Incidents
  • Utilize cloud computing design patterns to mitigate complex Major Incidents
  • Troubleshoot large, complex, interconnected systems
  • Document incident information and create Knowledge Base
  • Monitor and evaluate service and infrastructure dashboards
  • Design and deliver mission critical stack
  • Partner with development teams in defining operational requirements
  • Act as ultimate escalation point for complex issues

Requirements For Principal Site Reliability Engineer

Kubernetes
Linux
  • Bachelor's degree in Computer Science or relevant work experience
  • 5+ years experience in Site Reliability Engineering, DevOps or System Engineering
  • Public cloud operations experience (AWS, Azure, GCP, OCI)
  • Experience with Major Incident Management in cloud environment
  • Experience with modern object-oriented programming
  • Experience with Agile methodologies
  • Familiarity with infrastructure automation tools
  • Expertise with IaaS, CI/CD systems, Docker, RESTful APIs
  • Strong leadership and communication skills
  • Experience with distributed service-oriented architectures

Benefits For Principal Site Reliability Engineer

Medical Insurance
Vision Insurance
Dental Insurance
  • Medical Insurance
  • Life Insurance
  • Retirement Benefits
  • Volunteer Programs
  • Work-life Balance

Interested in this job?

Jobs Related To Oracle Principal Site Reliability Engineer

Principal Site Reliability Developer

Principal Site Reliability Developer position at Oracle focusing on cloud infrastructure, security, and scalability with 5+ years experience required.

Principal Site Reliability Developer

Principal Site Reliability Developer role at Oracle's Health Data Intelligence team, focusing on cloud infrastructure and healthcare platform development.

Principal Site Reliability Developer

Principal Site Reliability Developer role at Oracle, focusing on developing and supporting SRE frameworks and automation for database systems and cloud services.

Principal Site Reliability Developer

Principal Site Reliability Developer role at Oracle focusing on cloud infrastructure and system reliability.

Principal Network Reliability Engineer

Principal Network Reliability Engineer role at Oracle focusing on cloud infrastructure reliability, automation, and service architecture improvements.