Senior Site Reliability Engineer

Leading provider of cloud-based AI solutions for content understanding, search, and generation, trusted by hundreds of large organizations.
$160,000 - $250,000
Site Reliability
Senior Software Engineer
In-Person
101 - 500 Employees
3+ years of experience
AI · Enterprise SaaS

Description For Senior Site Reliability Engineer

Hive, a leading AI solutions provider with over $120M in funding, is seeking a Senior Site Reliability Engineer to join its DevOps and Systems team. The company operates its own data centers, focused on distributed high-performance computing with GPUs, and maintains a hybrid infrastructure with public clouds.

The role calls for someone who excels in an unstructured environment, is passionate about automation, and takes pride in optimizing performance at scale across the entire stack. They will be responsible for managing and improving the reliability of Hive's enterprise SaaS offering, working with containerization, orchestration, and various cloud services.

The position offers a competitive base salary range of $160,000 - $250,000, along with equity options and comprehensive benefits. The team is based in Seattle, working alongside offices in San Francisco and Delhi. This is an excellent opportunity to contribute to one of the fastest-growing AI startups, working on revolutionary technology that serves billions of customer API requests monthly.

The role requires strong expertise in Linux systems, containerization technologies like Docker and Kubernetes, and various infrastructure components. The successful candidate will be part of a team that manages everything from network hardware to cloud services, ensuring the reliable operation of Hive's AI infrastructure that powers content moderation, brand protection, and sponsorship measurement solutions for hundreds of major organizations worldwide.

Responsibilities For Senior Site Reliability Engineer

  • Automate manual operational processes
  • Improve workflows of developer, data, and machine learning teams
  • Manage secure integration and deployment tooling
  • Create, maintain, monitor, and audit secure infrastructure
  • Manage diverse technology platforms following best practices
  • Participate in on-call rotation and root cause analysis
  • Maintain awareness of industry best practices for data maintenance
  • Adhere to security policies and procedures
  • Report security violations/breaches to the appropriate authority

Requirements For Senior Site Reliability Engineer

Python
Node.js
Kubernetes
PostgreSQL
RabbitMQ
Linux
  • 3 - 5 years of experience in development, operations, IT, or related field
  • Comfortable working on Linux infrastructure (Debian) via the CLI
  • Able to learn quickly in a fast-paced environment
  • Able to debug, optimize, and automate routine tasks
  • Able to multitask, prioritize, and manage time efficiently
  • Able to physically lift equipment weighing at least 30 pounds
  • Can communicate effectively across teams and management levels
  • Degree in computer science or similar (preferred)

Benefits For Senior Site Reliability Engineer

  • Health insurance
  • Vision insurance
  • Dental insurance
  • Equity options
  • Paid vacation
  • Gym membership
