Anthropic, a pioneering AI research company, is seeking a Senior Reliability Engineer to join their mission of creating safe and beneficial AI systems. This role is crucial for defining and achieving reliability metrics for both internal and external products and services.
The position offers an exciting opportunity to work at the intersection of Site Reliability Engineering and AI systems, focusing on maintaining and improving the infrastructure that powers large language models. You'll be responsible for developing Service Level Objectives, implementing monitoring systems, and managing high-availability infrastructure capable of serving millions of customers.
The ideal candidate brings extensive experience in distributed systems observability, understanding of AI infrastructure challenges, and proven expertise in implementing SLO/SLA frameworks. Strong candidates may have additional experience with large-scale model training infrastructure (>1000 GPUs), ML hardware accelerators, and AI-specific observability tools.
Anthropic offers a competitive compensation package ranging from $320,000 to $485,000 USD, along with benefits including equity options, visa sponsorship, generous vacation time, and flexible working hours. The position is hybrid-based in San Francisco, requiring at least 25% office presence.
The company operates as a public benefit corporation and values diversity and inclusion, encouraging applications from candidates of all backgrounds. They work as a cohesive team on large-scale research efforts, prioritizing impact and collaborative research discussions. This role presents an opportunity to contribute to groundbreaking AI technologies while ensuring their safe and reliable deployment for the benefit of humanity.
Working at Anthropic means joining a team that views AI research as an empirical science, combining elements of physics, biology, and computer science. The company's research builds upon significant work in areas like GPT-3, Circuit-Based Interpretability, and AI Safety, making this an ideal position for those passionate about advancing the field of AI reliability while maintaining high standards of safety and ethics.