Sr. Software Engineer, Site Reliability

$121,000 - $198,000
Site Reliability
Senior Software Engineer
Hybrid
1,000 - 5,000 Employees
2+ years of experience
Enterprise SaaS

Description For Sr. Software Engineer, Site Reliability

LinkedIn is the world's largest professional network, built to help members of all backgrounds and experiences achieve more in their careers. Our vision is to create economic opportunity for every member of the global workforce. Every day our members use our products to make connections, discover opportunities, build skills and gain insights. We believe amazing things happen when we work together in an environment where everyone feels a true sense of belonging, and that what matters most in a candidate is having the skills needed to succeed. It inspires us to invest in our talent and support career growth. Join us to challenge yourself with work that matters.

The Streaming SRE role combines development and operational work to ensure the reliability of the centralized pubsub systems at LinkedIn. You will be expected to participate in an on-call rotation roughly once per month.

Come join the Software Engineering SRE team responsible for maintaining one of the largest streaming ecosystems on the planet, including Kafka. The LinkedIn Streaming SRE team maintains LinkedIn's Kafka, Samza, Flink, and overall pubsub ecosystem, which processes over 50 trillion messages per day across more than 150 clusters. The pubsub system is the de facto way of moving data at LinkedIn, powering everything from database replication to our metrics and log collection.

As a member of the Streaming SRE team, you will be responsible for helping our pubsub systems, including Kafka and stream-processing technologies like Samza and Flink, scale to meet LinkedIn's needs. This also includes writing code to automate solutions to new and exciting problems and working closely with our customers across all of LinkedIn Engineering. Additionally, as an embedded SRE, you will work closely with LinkedIn's pubsub development teams to ensure that the ecosystem of applications responsible for streaming data at LinkedIn continues to be reliable, operable, scalable, and maintainable.

At LinkedIn, we trust each other to do our best work where it works best for us and our teams. This role offers a hybrid work option, meaning you can both work from home and commute to a LinkedIn office, depending on what's best for you and when it is important for your team to be together.

This role will be based in Mountain View, CA.


Responsibilities For Sr. Software Engineer, Site Reliability

  • Serve as a primary point of contact responsible for the overall health, performance, and capacity of one or more of our Internet-facing services
  • Gain deep knowledge of our complex applications
  • Assist in the rollout and ramp-up of new product features and technologies to facilitate our rapid iteration and constant growth
  • Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale Linux environment
  • Function well in a fast-paced, rapidly changing environment
  • Participate in a 24x7 rotation for second-tier escalations

Requirements For Sr. Software Engineer, Site Reliability

Java
Python
Linux
Kafka
  • B.S. or higher in Computer Science or other technical discipline, or related practical experience
  • 2+ years of experience with administration and troubleshooting of Unix/Linux systems
  • Programming skills in one or more of Java, Rust, Go, Python, Ruby, C++
  • Knowledge of data structures, relational and non-relational databases, networking, Linux internals, filesystems, web architecture, and related topics

Benefits For Sr. Software Engineer, Site Reliability

  • Equity
