Technical Duty Officer (Lead/Senior Site Reliability Engineer)

A platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive.
$120,000 - $175,000
Site Reliability
Staff Software Engineer
Hybrid
5+ years of experience
Enterprise SaaS

Description For Technical Duty Officer (Lead/Senior Site Reliability Engineer)

Xero, a platform revolutionizing small business accounting and bookkeeping, is seeking a Technical Duty Officer to join their Site Reliability Engineering (SRE) team. This role combines technical leadership with incident management expertise, focusing on maintaining and improving system reliability across Xero's platform.

As part of the SRE organization's Incident and Problem Management team, you'll be responsible for building and maintaining robust processes around incident management. The position requires a seasoned SRE professional who can lead technical responses to high-severity cloud issues while driving best practices across the organization.

The ideal candidate will bring 5+ years of SRE experience, strong AWS knowledge, and excellent communication skills. You'll lead critical outage responses, implement scalable processes, and foster a culture of continuous learning and technical excellence. The role offers competitive compensation ($120,000-$175,000) and comprehensive benefits including generous paid leave, healthcare, 401k matching, and 26 weeks of parental leave.

This is an excellent opportunity for an experienced SRE leader who wants to make a significant impact on a platform that serves millions of small businesses worldwide. You'll work in a human-first culture that values diversity, respect, and inclusion, with the flexibility of hybrid work arrangements and numerous opportunities for professional growth.

Last updated 3 days ago

Responsibilities For Technical Duty Officer (Lead/Senior Site Reliability Engineer)

  • Own the incident management process and ensure it drives enduring reliability across all products and services
  • Provide expert leadership during critical outages, coordinating multiple teams
  • Lead and advocate for the transformation to a world-leading SRE organization
  • Develop and implement scalable process frameworks and observability strategies
  • Collaborate with product teams to analyze failures and improve service reliability
  • Provide ongoing training across the business for incident management
  • Investigate incident causes and work proactively to prevent future incidents
  • Build playbooks and automated response to Business continuity and DR situations

Requirements For Technical Duty Officer (Lead/Senior Site Reliability Engineer)

Python
  • 5+ years of experience as a Site Reliability Engineer
  • Experience troubleshooting AWS hosted services
  • Networking knowledge (TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP)
  • Coding experience (preferably Python) for tools, scripting, or automation
  • Strong communication skills (oral & written)

Benefits For Technical Duty Officer (Lead/Senior Site Reliability Engineer)

401k
Dental Insurance
Medical Insurance
Mental Health Assistance
Parental Leave
Vision Insurance
  • Generous paid leave
  • Employee Assistance Program
  • Mental health care for you and family
  • Wellbeing programming and allowances
  • Medical, dental, vision, and disability insurance
  • Fertility and family forming financial support
  • 401k contribution matching
  • 26 weeks paid parental leave for primary caregivers
  • Employee Share Plan
  • Office with snacks and break areas
  • Flexible working
  • Career development

Interested in this job?

Jobs Related To Xero Technical Duty Officer (Lead/Senior Site Reliability Engineer)

Sr Staff Software Engineer, Reliability Engineering

Senior Staff SRE position at Airbnb focusing on reliability architecture, incident management, and technical leadership, offering competitive compensation and remote work flexibility.

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, developing and maintaining tools for service reliability at scale.

Technical Program Manager, Site Reliability Engineering

Technical Program Manager position at Google leading SRE initiatives, requiring 5+ years of program management experience and strong technical expertise.

Software Engineering Manager II, Site Reliability Engineering

Lead Google's Site Reliability Engineering team in building and maintaining large-scale distributed systems, managing technical projects, and ensuring service reliability.

Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Lead Site Reliability Engineering team at Google Cloud, managing distributed systems and ensuring service reliability at global scale.