Staff SRE (Site Reliability Engineer)

Gigster builds enterprise software on cutting-edge technology, working with entrepreneurs and Fortune 500 companies to deliver innovative solutions.
Site Reliability
Staff Software Engineer
Remote
51 - 100 Employees
8+ years of experience
AI · Blockchain · Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Staff Site Reliability Engineer

Staff Site Reliability Engineer position at Fivetran, focusing on infrastructure reliability, monitoring, and system evolution with hybrid work in Denver.

Production Support Engineering LMTS

Senior SRE position at Salesforce focusing on cloud infrastructure reliability, requiring U.S. citizenship and extensive experience with AWS, Kubernetes, and monitoring tools.

Site Reliability Engineer

Microsoft Site Reliability Engineer position in Cloud+AI team, focusing on secure infrastructure and Azure services deployment, offering hybrid work and competitive compensation.

Site Reliability Developer 3

Site Reliability Developer role at Oracle focusing on cloud infrastructure, automation, and system reliability with emphasis on security and scalability.

Site Reliability Developer 3

Site Reliability Developer role at Oracle focusing on cloud infrastructure, automation, and system reliability with emphasis on security and scalability.

Description For Staff SRE (Site Reliability Engineer)

Gigster is seeking highly skilled and experienced Staff Site Reliability Engineers (SRE) to join their dynamic team. As a member of the Gigster Network, you'll be responsible for ensuring the reliability, scalability, and performance of critical systems and services. You'll play a pivotal role in shaping infrastructure for clients and driving initiatives that improve overall service quality.

Key responsibilities include:

  1. System Design and Architecture: Design, build, and maintain scalable and reliable infrastructure. Collaborate with engineering teams and evaluate new technologies.
  2. Monitoring and Incident Management: Implement monitoring systems, lead incident response efforts, and conduct post-incident reviews.
  3. Automation and Optimization: Architect and build innovative automation projects, create scripts to automate mundane tasks, and develop infrastructure as code.
  4. Collaboration and Mentorship: Work with cross-functional teams, mentor junior SREs, and advocate for best practices.
  5. Continuous Improvement: Drive initiatives to improve service reliability, participate in capacity planning, and stay current with industry trends.

Requirements:

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
  • 8+ years of industry experience as a Software Engineer, SRE, or Platform Engineer
  • 3+ years of experience as a Platform Engineer or SRE
  • Deep understanding of Linux/Unix systems and networking
  • Proficiency in programming languages (e.g., Python, Go, Java)
  • Experience with cloud platforms, container orchestration, monitoring tools, and CI/CD pipelines
  • Strong problem-solving, communication, and leadership skills

Benefits include working on cutting-edge projects, 100% remote work, flexible hours, and being part of a world-class network of talented professionals.

The recruitment process involves an English Proficiency Assessment, Technical Assessment, Recruiter screen, and Technical Interview. Candidates must be able to work during Pacific time hours 8am - 5pm PST and be open to on-call rotation.

Last updated 8 months ago

Responsibilities For Staff SRE (Site Reliability Engineer)

  • Design, build, and maintain scalable and reliable infrastructure
  • Implement and maintain monitoring and alerting systems
  • Lead incident response efforts
  • Architect and build innovative automation projects
  • Develop and maintain infrastructure as code
  • Collaborate with cross-functional teams
  • Mentor and guide junior SREs
  • Drive initiatives to improve service reliability, capacity, and performance

Requirements For Staff SRE (Site Reliability Engineer)

Python
Go
Java
Linux
Kubernetes
  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience)
  • 8+ years of industry experience as a Software Engineer, SRE, or Platform Engineer
  • 3+ years of experience as a Platform Engineer or SRE
  • Deep understanding of Linux/Unix systems and networking
  • Proficiency in programming languages (e.g., Python, Go, Java)
  • Experience with cloud platforms (AWS, Azure, GCP) and container orchestration (Docker, Kubernetes)
  • Strong knowledge of monitoring and logging tools
  • Familiarity with CI/CD pipelines and tools

Benefits For Staff SRE (Site Reliability Engineer)

  • World-class network of talented professionals
  • Work on cutting-edge projects
  • 100% remote and global work
  • Flexible work hours
  • Flexible offerings (choose work hours and earnings)
  • Swag

Interested in this job?