Staff Site Reliability Engineer

Global leader in data-first contract lifecycle management (CLM) software, providing a flexible Data-first Agreement Platform for managing contract processes.
United States
$150,000 - $220,000
Site Reliability
Staff Software Engineer
Remote
501 - 1,000 Employees
8+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer

Agiloft, the trusted leader in contract lifecycle management (CLM) software, is seeking a Staff Site Reliability Engineer to join its team. As a pioneer in data-first contract management, Agiloft has earned recognition from top analysts like Gartner, Forrester, and IDC. The company reports that nearly 100% of new customers are satisfied with their initial implementations, and it has a 97% annual renewal rate.

The Staff SRE role offers an opportunity to work with cutting-edge technology in a company that values diversity, inclusion, and work-life balance. You'll be responsible for developing and implementing highly reliable and scalable systems, working closely with cross-functional teams. The position requires expertise in cloud operations, monitoring tools, and security practices, with opportunities to lead complex projects and mentor team members.

Agiloft's culture emphasizes the philosophy that "EX = CX": excellent employee experience leads to excellent customer experience. The company supports multiple Employee Resource Groups and offers benefits like floating holidays and quarterly wellness days. It is committed to building a diverse workplace where individuals from all backgrounds can thrive and bring their authentic selves to work.

This role is perfect for an experienced SRE professional who wants to make a significant impact in a growing, successful company that's at the forefront of the CLM market. You'll have the opportunity to shape the reliability and scalability of systems that are becoming increasingly critical for organizations worldwide.

Last updated a month ago

Responsibilities For Staff Site Reliability Engineer

  • Define and enforce SRE best practices and standards
  • Architect and implement highly reliable and scalable systems
  • Lead complex post-incident reviews and implement systemic improvements
  • Collaborate with product and engineering teams to set reliability targets
  • Manage high-impact incidents and coordinate incident response
  • Contribute to budget planning and resource allocation
  • Lead efforts to establish disaster recovery strategies
  • Provide technical leadership and mentorship to the SRE team
  • Continuously track and improve metrics to optimize software delivery and operational performance
  • Participate in on-call rotation

Requirements For Staff Site Reliability Engineer

Python
Linux
Kubernetes
  • 8-10 years of experience in a similar or related role
  • Bachelor's degree in Computer Science, Information Technology, or related field
  • In-depth knowledge of Cloud Ops technologies including AWS and Terraform
  • Advanced knowledge of Linux operating systems
  • Expertise in setting up and managing monitoring tools
  • In-depth understanding of monitoring and alerting systems and networking principles
  • Strong understanding of incident management
  • Advanced experience with security measures and practices
  • Strong analytical and problem-solving skills
  • Strong understanding of programming/scripting languages
  • Excellent communication and teamwork skills

Benefits For Staff Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
  • Floating holidays
  • Quarterly wellness day
  • Employee Resource Groups (ERGs)
  • Healthy work/life balance
