Staff Site Reliability Engineer

2K is a global video game company, publishing titles developed by some of the most influential game development studios in the world. Founded in 2005, it is headquartered in Novato, California and is a wholly owned label of Take-Two Interactive Software, Inc.
$120,000 - $150,000
Site Reliability
Staff Software Engineer
Hybrid
7+ years of experience
Gaming

Description For Staff Site Reliability Engineer

2K is seeking a Staff Site Reliability Engineer to support both production and production-dev environments. The ideal candidate will have robust systems and interpersonal skills to design solutions for a multi-datacenter environment and provide technical leadership and mentorship to a group of outstanding engineers.

Key Responsibilities:

  • Architect and operate highly resilient systems in a multi-datacenter global environment serving game and consumer services
  • Develop tools for the management and automation of systems and service infrastructure
  • Define and implement standards impacting systems, services, and multiple software environments
  • Provide domain expertise to internal customers on the full breadth of the development lifecycle
  • Diagnose and resolve technical issues from both internal and external customers
  • Participate in 24x7 on-call support for products

Required Qualifications:

  • 7+ years of experience
  • Proficiency in Python, Ruby, or Perl with a good understanding of code management
  • Background in architecture and maintenance of large-scale distributed infrastructure spanning terrestrial and cloud datacenters
  • Experience with LAN routing protocols, configuring network equipment, and parsing network traces
  • Hands-on expertise in data center management and understanding of signal flows
  • Experience with Unix/Linux operating systems and TCP/IP networking fundamentals

Bonus Skills:

  • Familiarity with Git technologies
  • Experience with Puppet roles and profiles, including custom module development
  • Understanding of encryption protocols, authentication, and intrusion detection
  • Background in building CI/CD pipelines in Jenkins, TeamCity, or Spinnaker
  • Proficiency in writing sophisticated SQL queries and optimizing queries and indexes

The role offers a competitive salary range of $120,000 to $150,000 per year for applicants based in Colorado, with potential for bonuses, equity awards, and a comprehensive benefits package. 2K is an equal opportunity employer committed to providing reasonable accommodations for qualified individuals with disabilities.

Last updated 6 months ago

Responsibilities For Staff Site Reliability Engineer

  • Architect and operate highly resilient systems in a multi-datacenter global environment
  • Develop tools for management and automation of systems and service infrastructure
  • Define and implement standards for systems, services and software environments
  • Provide domain expertise on the full development lifecycle
  • Diagnose and resolve technical issues from internal and external customers
  • Participate in 24x7 on-call support for products

Requirements For Staff Site Reliability Engineer

  • 7+ years of experience
  • Proficiency in Python, Ruby, or Perl, and in code management
  • Background in architecture and maintenance of large-scale distributed infrastructure
  • Experience with LAN routing protocols and configuring network equipment
  • Hands-on expertise in data center management
  • Experience with Unix/Linux operating systems and TCP/IP networking fundamentals

Benefits For Staff Site Reliability Engineer

  • Medical Insurance
  • Equity
