Principal Site Reliability Developer

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's problems. True innovation starts with diverse perspectives and various abilities and backgrounds.
United States
$97,400 - $199,500
Site Reliability
Principal Software Engineer
Remote
5,000+ Employees
6+ years of experience
AI · Enterprise SaaS · Cloud

Description For Principal Site Reliability Developer

Are you interested in the exciting challenges of building and operating large-scale distributed infrastructure for the cloud? Oracle's Cloud Infrastructure is building its next generation of cloud technologies that operate in a broadly distributed, highly available, highly scalable, multi-tenant environment. Our mission is to provide our customers with an enterprise level cloud infrastructure platform that delivers unmatched reliability, scalability and performance for critically important databases, applications, and workloads.

We are building and expanding the next generation Platform as a Service (PaaS) cloud and the next generation cloud support experience to go with it. As our cloud service grows, we are expanding our team of energetic, customer-focused site reliability engineers (SREs). Our team performs an operational role in supporting Oracle's Exadata platform in the Oracle Cloud Infrastructure (OCI). Oracle Exadata is a full-stack solution that improves the performance, scale, security, and availability of an enterprise's Oracle databases. It incorporates more than 60 unique features, such as Smart Scan SQL offload, that are engineered with Oracle Database to accelerate OLTP, analytics and machine-learning applications. Exadata also reduces capital costs and management expenses by enabling IT departments to consolidate hundreds of databases onto a single system. Blending traditional roles of system administration, database engineering, and cloud fields, you'll be part of a team that supports this amazing machine. As part of the broader engineering organization, you will act as the voice of the customer to influence product features and plans to improve customer experience. This role is integral to the success of our customer relationships and is critical to the success of the platform.

Last updated 17 days ago

Responsibilities For Principal Site Reliability Developer

  • Design, write and deliver software to improve the availability, scalability, latency, and efficiency of Oracle DBaaS service.
  • Drive and actively participate in the resolution of complex technical issues spanning DBaaS (ExaCS/Autonomous) services and work towards ensuring highly scalable database service under strict SLAs by developing solutions to complex problems and incidents
  • Act as a trusted technical advisor to customers and solve complex infrastructure and DevOps challenges.
  • Create and deliver standard methodologies recommendations, sample code, technical documents.
  • Contribute to making our infrastructure simple, reliable, and easy to operate.
  • Participate in the design and architecture of large scale Distributed DBaaS Service features.
  • Participate in research and prototyping (proof of concept) different aspect of Oracle DBaaS service.
  • Define and develop monitoring infrastructure criteria (SLIs, SLOs) for Oracle DBaaS service.
  • Solve complex and difficult problems and build automation to prevent problem recurrence.
  • Participate in DBaaS service capacity planning and demand forecasting, software performance analysis and system tuning.
  • Conduct periodic on call duties

Requirements For Principal Site Reliability Developer

Linux
Python
Java
  • At least a bachelor's degree, in Computer Science, MIS or another technical field, or equivalent work experience.
  • Advanced Scripting/coding skills (Shell, Perl and Python). Knowing C/C++ and Java is nice to have.
  • Experience in Building and managing Linux OS systems with good understanding how the Linux kernel works (IO, Network,…etc).
  • Experience in Building and managing Virtualized systems (KVM, OVM, Containers/Docker) and ability to read and understand source code.
  • Experience troubleshooting complex software and/or networking issues.
  • Strong understanding of distributed computing, cloud concepts and platforms.
  • Very strong analytical skills to identify problems root cause
  • Experience with Oracle databases including RAC, Data Guard, Oracle GI (clusterware, ASM), RMAN preferred.
  • Experience with Oracle Exadata strongly preferred.
  • Experience in cloud technical support, operations, NOC or similar is preferred, but not required.
  • Proven ability to quickly learn new technical domains and then train others.
  • Great verbal and written communication skills.

Benefits For Principal Site Reliability Developer

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Equity
  • Medical, dental, and vision insurance, including expert medical opinion
  • Short term disability and long term disability
  • Life insurance and AD&D
  • Supplemental life insurance (Employee/Spouse/Child)
  • Health care and dependent care Flexible Spending Accounts
  • Pre-tax commuter and parking benefits
  • 401(k) Savings and Investment Plan with company match
  • Paid time off: Flexible Vacation
  • 11 paid holidays
  • Paid sick leave
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan
  • Financial planning and group legal
  • Voluntary benefits including auto, homeowner and pet insurance

Interested in this job?

Jobs Related To Oracle Principal Site Reliability Developer

Principal Site Reliability Engineer (DBA)

Principal SRE position at Oracle focusing on database reliability and cloud infrastructure, requiring 7+ years of experience in DBA and SRE roles.

Principal Site Reliability Developer

Principal Site Reliability Developer role at Oracle, focusing on cloud infrastructure, system reliability, and automation with 10+ years of experience required.

Principal Site Reliability Engineer

Oracle seeks Principal SRE for Health Apps & Infrastructure, focusing on database lifecycle management, cloud architecture, and automation.

Principal Site Reliability Developer

Principal Site Reliability Developer at Oracle, focusing on SaaS Engineering and Oracle Cloud infrastructure.

Principal Site Reliability Engineer

Principal Site Reliability Engineer at Oracle to lead automation and infrastructure design for cloud applications.