Sr. Cloud Site Reliability Engineer

Serve Robotics develops sidewalk delivery robots to revolutionize urban logistics, making deliveries more efficient and accessible while reducing street congestion.
$150,000 - $200,000
Site Reliability
Senior Software Engineer
Remote
5+ years of experience
Robotics

Description For Sr. Cloud Site Reliability Engineer

Serve Robotics is revolutionizing urban delivery through innovative sidewalk robots, currently operating successfully in Los Angeles. As a Sr. Cloud Site Reliability Engineer, you'll join a team of industry veterans focused on scaling robotic deliveries from novelty to ubiquity. This senior-level position combines hands-on technical work with leadership responsibilities, focusing on building and maintaining critical SRE infrastructure.

You'll be instrumental in developing and implementing monitoring solutions, managing service reliability, and leading incident response processes. The role requires expertise in cloud platforms, containerization, and observability tools, while also demanding strong leadership skills to mentor team members and advocate for SRE best practices.

The position offers an exciting opportunity to work with cutting-edge robotics technology while solving real-world problems using machine learning and computer vision. You'll be part of an agile, diverse team that values collaborative problem-solving and respectful communication. The company culture emphasizes continuous learning and operational excellence.

Key technical areas include cloud infrastructure management, containerization, observability tools, and automation. You'll work across engineering, product, and operations teams to ensure system reliability while aligning with business objectives. The role combines technical depth with strategic thinking, requiring both hands-on engineering skills and the ability to guide architectural decisions.

This is an excellent opportunity for an experienced SRE professional who wants to make a significant impact in the robotics and autonomous delivery space while working with a team that's pushing the boundaries of what's possible in urban logistics.

Last updated 2 minutes ago

Responsibilities For Sr. Cloud Site Reliability Engineer

  • Develop and refine monitoring and observability tools for system availability and performance
  • Implement best practices for instrumentation using tools like Prometheus, Grafana, Datadog
  • Lead the definition and management of SLIs and SLOs
  • Perform capacity planning, load testing, and performance tuning
  • Own the incident response process including on-call rotation
  • Conduct and facilitate postmortems
  • Create reporting dashboards connecting reliability data with KPIs
  • Mentor junior and mid-level engineers
  • Conduct training sessions and share knowledge

Requirements For Sr. Cloud Site Reliability Engineer

Kubernetes
Python
Go
Linux
  • 5+ years of experience in Site Reliability Engineering, DevOps, or similar role
  • Experience with major cloud providers (Google Cloud, AWS, Azure)
  • Proficiency in Docker, Kubernetes, or similar containerization platforms
  • Hands-on experience with logging, metrics, and tracing tools
  • Familiarity with Infrastructure-as-Code and scripting
  • Experience with modern CI/CD pipelines
  • Bachelor's degree in Computer Science, Engineering, or related field
  • Strong leadership and communication skills
  • Strong analytical and problem-solving abilities

Benefits For Sr. Cloud Site Reliability Engineer

Equity
  • Competitive salary between $150K and $200K
  • Equity compensation

Interested in this job?

Jobs Related To Serve Robotics Sr. Cloud Site Reliability Engineer

Site Reliability Engineer

Senior Site Reliability Engineer role at Baseten, building and maintaining scalable ML infrastructure with competitive compensation and benefits.

ASE Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Apple Services Engineering team, managing infrastructure for App Store and other Apple services.

Site Reliability Engineer 3

Senior Site Reliability Engineer role at Oracle Health, focusing on modernizing healthcare systems through AI and advanced technology.

Site Reliability Engineer

Senior Site Reliability Engineer role at One, focusing on ensuring service reliability and availability for a mission-driven fintech company.

Senior Site Reliability Engineer (GCP)

Senior Site Reliability Engineer position at Rackspace Technology focusing on GCP infrastructure, requiring 8+ years of experience in DevOps and cloud technologies.