Sr. Cloud Site Reliability Engineer

Serve Robotics develops sidewalk delivery robots to revolutionize urban logistics, making deliveries more efficient and accessible while reducing street congestion.
$150,000 - $200,000
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Robotics

Description For Sr. Cloud Site Reliability Engineer

Serve Robotics is revolutionizing urban delivery through innovative sidewalk robots, currently operating successfully in Los Angeles. As a Sr. Cloud Site Reliability Engineer, you'll join a team of industry veterans focused on scaling robotic deliveries from novelty to ubiquity. This senior-level position combines hands-on technical work with leadership responsibilities, focusing on building and maintaining critical SRE infrastructure.

You'll be instrumental in developing and implementing monitoring solutions, managing service reliability, and leading incident response processes. The role requires expertise in cloud platforms, containerization, and observability tools, while also demanding strong leadership skills to mentor team members and advocate for SRE best practices.

The position offers an exciting opportunity to work with cutting-edge robotics technology while solving real-world problems using machine learning and computer vision. You'll be part of an agile, diverse team that values collaborative problem-solving and respectful communication. The company culture emphasizes continuous learning and operational excellence.

Key technical areas include cloud infrastructure management, containerization, observability tools, and automation. You'll work across engineering, product, and operations teams to ensure system reliability while aligning with business objectives. The role combines technical depth with strategic thinking, requiring both hands-on engineering skills and the ability to guide architectural decisions.

This is an excellent opportunity for an experienced SRE professional who wants to make a significant impact in the robotics and autonomous delivery space while working with a team that's pushing the boundaries of what's possible in urban logistics.

Last updated 2 months ago

Responsibilities For Sr. Cloud Site Reliability Engineer

  • Develop and refine monitoring and observability tools for system availability and performance
  • Implement best practices for instrumentation using tools like Prometheus, Grafana, Datadog
  • Lead the definition and management of SLIs and SLOs
  • Perform capacity planning, load testing, and performance tuning
  • Own the incident response process including on-call rotation
  • Conduct and facilitate postmortems
  • Create reporting dashboards connecting reliability data with KPIs
  • Mentor junior and mid-level engineers
  • Conduct training sessions and share knowledge

Requirements For Sr. Cloud Site Reliability Engineer

Kubernetes
Python
Go
Linux
  • 5+ years of experience in Site Reliability Engineering, DevOps, or similar role
  • Experience with major cloud providers (Google Cloud, AWS, Azure)
  • Proficiency in Docker, Kubernetes, or similar containerization platforms
  • Hands-on experience with logging, metrics, and tracing tools
  • Familiarity with Infrastructure-as-Code and scripting
  • Experience with modern CI/CD pipelines
  • Bachelor's degree in Computer Science, Engineering, or related field
  • Strong leadership and communication skills
  • Strong analytical and problem-solving abilities

Benefits For Sr. Cloud Site Reliability Engineer

Equity
  • Competitive salary between $150K and $200K
  • Equity compensation

Interested in this job?

Jobs Related To Serve Robotics Sr. Cloud Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Oracle, focusing on cloud infrastructure and systems reliability with 3-5+ years of experience required.

Site Reliability Engineer

Senior Site Reliability Engineer role at AION, building and maintaining infrastructure for a decentralized AI cloud platform with focus on automation and reliability.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, SRE, Cloud Incident Response

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.