Staff SRE (Site Reliability Engineer)

Gigster builds enterprise software on cutting-edge technology, working with entrepreneurs and Fortune 500 companies to deliver innovative solutions.
Site Reliability
Staff Software Engineer
Remote
51 - 100 Employees
8+ years of experience
AI · Blockchain · Enterprise SaaS

Description For Staff SRE (Site Reliability Engineer)

Gigster is seeking highly skilled and experienced Staff Site Reliability Engineers (SREs) to join its dynamic team. As a member of the Gigster Network, you'll be responsible for ensuring the reliability, scalability, and performance of critical systems and services. You'll play a pivotal role in shaping infrastructure for clients and driving initiatives that improve overall service quality.

Key responsibilities include:

  1. System Design and Architecture: Design, build, and maintain scalable and reliable infrastructure. Collaborate with engineering teams and evaluate new technologies.
  2. Monitoring and Incident Management: Implement monitoring systems, lead incident response efforts, and conduct post-incident reviews.
  3. Automation and Optimization: Architect and build innovative automation projects, create scripts to automate mundane tasks, and develop infrastructure as code.
  4. Collaboration and Mentorship: Work with cross-functional teams, mentor junior SREs, and advocate for best practices.
  5. Continuous Improvement: Drive initiatives to improve service reliability, participate in capacity planning, and stay current with industry trends.

Requirements:

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
  • 8+ years of industry experience as a Software Engineer, SRE, or Platform Engineer
  • 3+ years of experience as a Platform Engineer or SRE
  • Deep understanding of Linux/Unix systems and networking
  • Proficiency in programming languages (e.g., Python, Go, Java)
  • Experience with cloud platforms, container orchestration, monitoring tools, and CI/CD pipelines
  • Strong problem-solving, communication, and leadership skills

Benefits include working on cutting-edge projects, 100% remote work, flexible hours, and being part of a world-class network of talented professionals.

The recruitment process involves an English proficiency assessment, a technical assessment, a recruiter screen, and a technical interview. Candidates must be able to work from 8am to 5pm Pacific time and be open to joining an on-call rotation.

Responsibilities For Staff SRE (Site Reliability Engineer)

  • Design, build, and maintain scalable and reliable infrastructure
  • Implement and maintain monitoring and alerting systems
  • Lead incident response efforts
  • Architect and build innovative automation projects
  • Develop and maintain infrastructure as code
  • Collaborate with cross-functional teams
  • Mentor and guide junior SREs
  • Drive initiatives to improve service reliability, capacity, and performance

Requirements For Staff SRE (Site Reliability Engineer)

Python · Go · Java · Linux · Kubernetes

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
  • 8+ years of industry experience as a Software Engineer, SRE, or Platform Engineer
  • 3+ years of experience as a Platform Engineer or SRE
  • Deep understanding of Linux/Unix systems and networking
  • Proficiency in programming languages (e.g., Python, Go, Java)
  • Experience with cloud platforms (AWS, Azure, GCP) and container orchestration (Docker, Kubernetes)
  • Strong knowledge of monitoring and logging tools
  • Familiarity with CI/CD pipelines and tools

Benefits For Staff SRE (Site Reliability Engineer)

  • World-class network of talented professionals
  • Work on cutting-edge projects
  • 100% remote and global work
  • Flexible work hours
  • Flexible offerings (choose work hours and earnings)
  • Swag
