xAI is seeking an experienced Site Reliability Engineer (SRE) to join its London team. The role focuses on improving observability, building dashboards and alerts, managing on-call rotations, and enhancing deployment processes. The ideal candidate is an expert in a language such as Rust, C++, or Go and has deep knowledge of monitoring technologies, deployment tools, and Kubernetes. The position offers a dynamic startup environment and the chance to work on large-scale distributed systems, including the Grok production stack. Benefits include competitive compensation, equity, and health insurance. The role is based in the London office and involves occasional late meetings and business trips to California. Join xAI to tackle complex technical challenges and contribute to cutting-edge AI infrastructure.