Sr. Cloud Site Reliability Engineer

Serve Robotics

Serve Robotics develops sidewalk delivery robots to revolutionize urban logistics, making deliveries more efficient and accessible while reducing street congestion.

$150,000 - $200,000

Site Reliability

Senior Software Engineer

Remote

5+ years of experience

Robotics

Description For Sr. Cloud Site Reliability Engineer

Serve Robotics is revolutionizing urban delivery through innovative sidewalk robots, currently operating successfully in Los Angeles. As a Sr. Cloud Site Reliability Engineer, you'll join a team of industry veterans focused on scaling robotic deliveries from novelty to ubiquity. This senior-level position combines hands-on technical work with leadership responsibilities, focusing on building and maintaining critical SRE infrastructure.

You'll be instrumental in developing and implementing monitoring solutions, managing service reliability, and leading incident response processes. The role requires expertise in cloud platforms, containerization, and observability tools, while also demanding strong leadership skills to mentor team members and advocate for SRE best practices.

The position offers an exciting opportunity to work with cutting-edge robotics technology while solving real-world problems using machine learning and computer vision. You'll be part of an agile, diverse team that values collaborative problem-solving and respectful communication. The company culture emphasizes continuous learning and operational excellence.

Key technical areas include cloud infrastructure management, containerization, observability tools, and automation. You'll work across engineering, product, and operations teams to ensure system reliability while aligning with business objectives. The role combines technical depth with strategic thinking, requiring both hands-on engineering skills and the ability to guide architectural decisions.

This is an excellent opportunity for an experienced SRE professional who wants to make a significant impact in the robotics and autonomous delivery space while working with a team that's pushing the boundaries of what's possible in urban logistics.

Last updated 2 months ago

Responsibilities For Sr. Cloud Site Reliability Engineer

Develop and refine monitoring and observability tools for system availability and performance
Implement best practices for instrumentation using tools like Prometheus, Grafana, Datadog
Lead the definition and management of SLIs and SLOs
Perform capacity planning, load testing, and performance tuning
Own the incident response process including on-call rotation
Conduct and facilitate postmortems
Create reporting dashboards connecting reliability data with KPIs
Mentor junior and mid-level engineers
Conduct training sessions and share knowledge

Requirements For Sr. Cloud Site Reliability Engineer

Kubernetes

Python

Linux

5+ years of experience in Site Reliability Engineering, DevOps, or similar role
Experience with major cloud providers (Google Cloud, AWS, Azure)
Proficiency in Docker, Kubernetes, or similar containerization platforms
Hands-on experience with logging, metrics, and tracing tools
Familiarity with Infrastructure-as-Code and scripting
Experience with modern CI/CD pipelines
Bachelor's degree in Computer Science, Engineering, or related field
Strong leadership and communication skills
Strong analytical and problem-solving abilities

Benefits For Sr. Cloud Site Reliability Engineer

Equity

Competitive salary between $150K and $200K
Equity compensation

Serve Robotics

Serve Robotics develops sidewalk delivery robots to revolutionize urban logistics, making deliveries more efficient and accessible while reducing street congestion.

$150,000 - $200,000

Site Reliability

Senior Software Engineer

Remote

5+ years of experience

Robotics

Interested in this job?

Jobs Related To Serve Robotics Sr. Cloud Site Reliability Engineer

Senior Site Reliability Engineer

Oracle

Senior Site Reliability Engineer position at Oracle, focusing on cloud infrastructure and systems reliability with 3-5+ years of experience required.

Site Reliability Engineer

AION

Senior Site Reliability Engineer role at AION, building and maintaining infrastructure for a decentralized AI cloud platform with focus on automation and reliability.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Google

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Google

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, SRE, Cloud Incident Response

Google

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.