Staff Site Reliability Engineer - Storage Engineering

LinkedIn is the world's largest professional network, built to create economic opportunity for every member of the global workforce.
$147,000 - $240,000
Site Reliability
Staff Software Engineer
Hybrid
4+ years of experience

Description For Staff Site Reliability Engineer - Storage Engineering

LinkedIn is the world's largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day. We're also committed to providing transformational opportunities for our own employees by investing in their growth. We aspire to create a culture that's built on trust, care, inclusion, and fun – where everyone can succeed.

LinkedIn is looking to hire a Staff Site Reliability Engineer within the production Storage Engineering group. The Storage Engineering group defines the strategy for LinkedIn's storage infrastructure and is responsible for the related architecture, design, tooling, and automation. The candidate will have wide latitude to contribute in several areas. In this highly visible role, you will work with members of the various engineering teams, providing guidance on technologies, product feature definitions, implementation and tradeoffs, and architecture definition.

This role offers a hybrid work option, meaning you can both work from home and commute to a LinkedIn office, depending on what's best for you and when it is important for your team to be together. Join us to transform the way the world works.

Last updated 4 months ago

Responsibilities For Staff Site Reliability Engineer - Storage Engineering

  • Design and develop strategy for storage system design and consumption
  • Influence the open source community in white-box storage developments and software-defined storage
  • Use communication skills to interact with peer groups and drive technical presentations
  • Maintain technical knowledge of storage platform industry directions and trends
  • Lead initial storage implementations, proofs of concept, and pilots
  • Maintain relationships with user groups, staying apprised of future projects, project growth, and schedule activities, and serving as a trusted point of contact
  • Adapt to the ever-evolving industry; learn and scrutinize new technologies, and envision their possible application toward the LinkedIn mission
  • Participate in a 12x7 rotation for second-tier escalations

Requirements For Staff Site Reliability Engineer - Storage Engineering

Python, Go, Rust, Linux
  • BA/BS Degree in Computer Science, Electrical Engineering, or related technical discipline, or related practical experience
  • 4+ years of experience with GPFS, GlusterFS, or other software-defined storage solutions
  • 4+ years of experience in large enterprise-class storage build and support
  • 4+ years of experience with a scripting or programming language such as Python, Go, or Rust
  • 4+ years of experience in distributed/clustered storage designs
  • 4+ years of experience with multiple storage protocols
  • 4+ years of experience in deploying, analyzing and debugging storage networks
  • 4+ years of experience in application-specific storage sub-system selection, design, and configuration (drives, controllers, interconnects)
  • 4+ years of experience in configuring and tuning operating systems for use with storage, including performance analysis
  • Excellent working knowledge of the Linux storage stack, at both the block and file-system levels
  • Hands-on experience with kernel debuggers, performance counters, and protocol analyzers
  • Excellent working knowledge of FC, FCoE, SCSI, iSCSI, iSER, PCIe, NVMe, RDMA, NFS and other storage protocols and interfaces
  • Excellent working knowledge of erasure coding, hardware and software RAID technologies, and data reduction methods
  • Deep exposure to SSD, HDD, RAID, and SAS drive and controller design
  • Working knowledge of congestion control mechanisms for high-speed storage networks
  • Industry knowledge about SDN technologies is desirable
  • Working knowledge of optical networking technologies, components and storage encapsulating protocols
  • Software engineering skills with efficient, maintainable and testable C/C++/Python
  • Experience deploying storage for shared-nothing applications
  • Experience leading cross-functional teams engaged in storage system design and deployment
  • Experience mentoring junior and mid-level engineers
  • Experience creating and presenting educational material related to the storage industry

Benefits For Staff Site Reliability Engineer - Storage Engineering

  • 401k
  • Commuter Benefits
  • Dental Insurance
  • Education Budget
  • Equity
  • Medical Insurance
  • Mental Health Assistance
  • Parental Leave
  • Relocation Benefits
  • Vision Insurance
