Principal Site Reliability Developer

World leader in cloud solutions using tomorrow's technology to tackle today's problems, with 40+ years of experience.
Site Reliability
Principal Software Engineer
In-Person
5,000+ Employees
10+ years of experience
Enterprise SaaS · Cloud

Description For Principal Site Reliability Developer

Oracle, a global leader in cloud solutions with over 40 years of industry experience, is seeking a Principal Site Reliability Developer to join its team in Zapopan, Mexico. This role requires a seasoned professional with 10+ years of experience who will be instrumental in designing and delivering mission-critical infrastructure.

The ideal candidate will possess deep expertise in Exadata, Oracle database systems, and Linux fundamentals, combined with strong automation capabilities using Python, Perl, and Ansible. You'll be working with cutting-edge cloud technologies and be responsible for the end-to-end performance and operability of critical systems.

As a Principal SRE, you'll collaborate with global teams to enhance Oracle's cloud service portfolio, focusing on security, resiliency, scale, and performance. This role offers the opportunity to work with state-of-the-art technology while solving complex challenges in a distributed systems environment.

Oracle offers a comprehensive benefits package including medical, life insurance, and retirement options. The company promotes a diverse and inclusive workplace where innovation thrives through various perspectives and backgrounds. This role provides an excellent opportunity for professional growth within a leading technology company that continues to shape the future of cloud computing.

The position requires both English and Spanish language proficiency and offers the chance to work with modern tools and technologies including Docker, GitHub, and various virtualization infrastructures. Join Oracle to be part of a team that's pushing the boundaries of cloud technology and making a significant impact on global enterprise solutions.

Last updated 9 days ago

Responsibilities For Principal Site Reliability Developer

  • Work with the Site Reliability Engineering (SRE) team on shared full-stack ownership
  • Understand the end-to-end configuration, technical dependencies, and behavioral characteristics of production services
  • Design and deliver a mission-critical stack focusing on security, resiliency, scale, and performance
  • Partner with development teams in defining and implementing service architecture improvements
  • Guide development teams to engineer and add capabilities to the Oracle Cloud service portfolio
  • Act as the ultimate escalation point for complex or critical issues
  • Troubleshoot issues and define mitigations using an understanding of service topology
  • Understand and explain the impact of product architecture decisions on distributed systems

Requirements For Principal Site Reliability Developer

  • Strong knowledge of Exadata, Real Application Clusters, Oracle database, Storage, and Linux fundamentals
  • Oracle Exadata Database Machine and Oracle Cloud Infrastructure (OCI) Certifications (Preferred)
  • Knowledge of network fundamentals (VCN, Ethernet, RoCE, TCP/IP, routing, DHCP)
  • Experience automating the management of Linux-based infrastructure
  • Knowledge of infrastructure network security
  • Ability to automate tasks using Python, Perl, Bash, and Ansible
  • Good verbal and written communication skills
  • Excellent written and spoken English communication skills
  • Spanish language proficiency
  • 10+ years of experience
  • Experience with containerization technologies (Docker)
  • Experience with tools like GitHub, Jira, TeamCity, and Bitbucket
  • Experience with virtualization infrastructures (KVM and Xen)

Benefits For Principal Site Reliability Developer

Medical Insurance
Vision Insurance
Dental Insurance
  • Competitive suite of employee benefits
  • Medical insurance
  • Life insurance
  • Retirement options
  • Volunteer programs

