Senior Site Reliability Engineer - Storage Engineering

LinkedIn is the world's largest professional network, built to create economic opportunity for every member of the global workforce.
$121,000 - $198,000
Site Reliability
Senior Software Engineer
Hybrid
2+ years of experience
This job posting may no longer be active. You may be interested in these related jobs instead:
Sr. Software Engineer, Site Reliability

LinkedIn is hiring a Sr. Software Engineer for Site Reliability to maintain their large-scale Streaming ecosystem, including Kafka, processing 50 trillion messages daily.

Sr. Software Engineer, Site Reliability

Senior Software Engineer, Site Reliability role at LinkedIn, maintaining large-scale streaming systems and ensuring reliability of pubsub infrastructure.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on maintaining and optimizing large-scale distributed systems with competitive compensation and growth opportunities.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior Site Reliability Engineer position at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems, requiring 5+ years of software development experience.

Description For Senior Site Reliability Engineer - Storage Engineering

LinkedIn is the world's largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day. We're also committed to providing transformational opportunities for our own employees by investing in their growth. We aspire to create a culture that's built on trust, care, inclusion, and fun – where everyone can succeed.

LinkedIn is looking to hire a Senior Site Reliability Engineer within the production Storage Engineering group. The Storage Engineering group is defining the strategy for LinkedIn's storage infrastructure and is responsible for architecture, design, tooling and automation related to that. The candidate will have wide latitude in making contributions in several areas. In this highly visible role, you will work within the various members of the engineering teams providing guidance in technologies, product feature definitions, implementation and tradeoffs and architecture definition.

Responsibilities: • Design and develop strategy for storage system design and consumption • Influence open source community in white-box storage developments and software defined storage • Utilize communication skills in interacting with peer groups & drive technical presentations • Maintain technical knowledge of storage platform industry directions and trends • Lead initial storage implementations, proof-of-concepts and pilots • Maintain relationships with user groups, staying apprised of future projects, project growth, schedule activities, and being a trusted point of contact • Adapt to the ever-evolving industry; learn and scrutinize new technologies, and envision its possible application towards the LinkedIn mission • Participate in a 12x7 rotation for second-tier escalations

This role offers a hybrid work option, meaning you can both work from home and commute to a LinkedIn office, depending on what's best for you and when it is important for your team to be together. This role is based in our Sunnyvale, CA office location.

Join us to transform the way the world works!

Last updated 4 months ago

Responsibilities For Senior Site Reliability Engineer - Storage Engineering

  • Design and develop strategy for storage system design and consumption
  • Influence open source community in white-box storage developments and software defined storage
  • Utilize communication skills in interacting with peer groups & drive technical presentations
  • Maintain technical knowledge of storage platform industry directions and trends
  • Lead initial storage implementations, proof-of-concepts and pilots
  • Maintain relationships with user groups, staying apprised of future projects, project growth, schedule activities, and being a trusted point of contact
  • Adapt to the ever-evolving industry; learn and scrutinize new technologies, and envision its possible application towards the LinkedIn mission
  • Participate in a 12x7 rotation for second-tier escalations

Requirements For Senior Site Reliability Engineer - Storage Engineering

Python
Go
Rust
Linux
  • BA/BS Degree in Computer Science, Electrical Engineering, or related technical discipline, or related practical experience
  • 2+ years of experience with GPFS, GlusterFS or any Software-defined storage solutions
  • 2+ years of experience in large enterprise-class storage build and support
  • 3+ years of experience with any scripting language such as Python, Go. Rust, etc.
  • 3+ years of experience in distributed/clustered storage designs
  • 3+ years of experience with multiple storage protocols
  • 3+ years of experience in deploying, analyzing and debugging storage networks
  • 3+ years of experience in application specific storage sub-system selection, design and configuration (drives, controllers, interconnects)
  • 3+ years of experience in configuring, tuning operating systems for use with storage including performance analysis
  • Excellent working knowledge of the Linux storage stack, both block and file-system (e.g. LVM, VxVM, VxFS, ZFS, XFS, Lustre, GPFS, Gluster, Ceph, Swift, NFS)
  • Hands on experience with kernel debuggers, performance counters and protocol analyzers
  • Excellent working knowledge of FC, FCoE, SCSI, iSCSI, iSER, PCIe, NVMe, RDMA, NFS and other storage protocols and interfaces
  • Excellent working knowledge of erasure coding, hardware and software RAID technologies, and data reduction methods
  • Deep exposure to SSD, HDD, RAID, and SAS drive and controller design
  • Working knowledge of congestion control mechanisms for high-speed storage networks
  • Industry knowledge about SDN technologies is desirable
  • Working knowledge of optical networking technologies, components and storage encapsulating protocols (e.g. DWDM, CWDM, MMF, SMF, SR, LR, ZR, SONET, MPLS)
  • Software engineering skills with efficient, maintainable and testable C/C++/Python
  • Experience deploying storage for shared-nothing applications
  • Experience leading cross functional teams engaged in storage system design and deployment
  • Experience mentoring junior and mid-level engineers
  • Experience creating and presenting storage industry related education

Benefits For Senior Site Reliability Engineer - Storage Engineering

  • Hybrid work option

Interested in this job?