Staff Site Reliability Engineer

2K is a global video game company, publishing titles developed by some of the most influential game development studios in the world. Founded in 2005, it is headquartered in Novato, California and is a wholly owned label of Take-Two Interactive Software, Inc.
$120,000 - $150,000
Site Reliability
Staff Software Engineer
Hybrid
7+ years of experience
Gaming
This job posting may no longer be active. You may be interested in these related jobs instead:
Technical Program Manager, Site Reliability Engineering

Technical Program Manager position at Google leading SRE initiatives, requiring 5+ years of program management experience and strong technical expertise.

Software Engineering Manager II, Site Reliability Engineering

Lead Google's Site Reliability Engineering team in building and maintaining large-scale distributed systems, managing technical projects, and ensuring service reliability.

Software Engineering Manager II, Site Reliability Engineering, Google Cloud

Lead Site Reliability Engineering team at Google Cloud, managing distributed systems and ensuring service reliability at global scale.

Staff Software Engineer, Site Reliability Engineering, Google Cloud

Lead Site Reliability Engineering role at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance optimization.

Senior Staff Software Engineer, Site Reliability Engineering

Senior Staff SRE position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services, requiring extensive experience in software development and system design.

Description For Staff Site Reliability Engineer

2K is seeking a Staff Site Reliability Engineer to support both production and production-dev environments. The ideal candidate will have robust systems and interpersonal skills to design solutions for a multi-datacenter environment and provide technical leadership and mentorship to a group of outstanding engineers.

Key Responsibilities:

  • Architect and operate highly resilient systems in a multi-datacenter global environment serving game and consumer services
  • Develop tools for the management and automation of systems and service infrastructure
  • Define and implement standards impacting systems, services, and multiple software environments
  • Provide domain expertise to internal customers on the full breadth of the development lifecycle
  • Diagnose and resolve technical issues from both internal and external customers
  • Participate in 24x7 on-call support for products

Required Qualifications:

  • 7+ years of experience
  • Proficiency in Python, Ruby, or Perl with a good understanding of code management
  • Background in architecture and maintenance of large-scale distributed infrastructure spanning terrestrial and cloud datacenters
  • Experience with LAN routing protocols, configuring network equipment, and parsing network traces
  • Hands-on expertise in data center management and understanding of signal flows
  • Experience with Unix/Linux operating systems and TCP/IP Networking Fundamentals

Bonus Skills:

  • Familiarity with Git technologies
  • Experience with Puppet roles and profiles, including custom module development
  • Understanding of encryption protocols, authentication, and intrusion detection
  • Background in building CI/CD pipelines in Jenkins, TeamCity, or Spinnaker
  • Proficiency in writing sophisticated SQL queries and optimizing queries and indexes

The role offers a competitive salary range of $120,000 to $150,000 per year for applicants based in Colorado, with potential for bonuses, equity awards, and a comprehensive benefits package. 2K is an equal opportunity employer committed to providing reasonable accommodations for qualified individuals with disabilities.

Last updated 4 months ago

Responsibilities For Staff Site Reliability Engineer

  • Architect and operate highly resilient systems in a multi-datacenter global environment
  • Develop tools for management and automation of systems and service infrastructure
  • Define and implement standards for systems, services and software environments
  • Provide domain expertise on the full development lifecycle
  • Diagnose and resolve technical issues from internal and external customers
  • Participate in 24x7 on-call support for products

Requirements For Staff Site Reliability Engineer

Python
Ruby
Linux
  • 7+ years experience
  • Proficient with Python, Ruby or Perl and code management
  • Background in architecture and maintenance of large scale distributed infrastructure
  • Experience with LAN routing protocols and configuring network equipment
  • Hands-on expertise in data center management
  • Experience with Unix/Linux operating systems and TCP/IP Networking Fundamentals

Benefits For Staff Site Reliability Engineer

Medical Insurance
Equity
  • Medical Insurance
  • Equity

Interested in this job?