Staff Software Engineer, Capacity Engineering

A platform where millions of people come to find new ideas and inspiration to create a life they love.
$170,371 - $350,763
Machine Learning
Staff Software Engineer
Hybrid
1,000 - 5,000 Employees
8+ years of experience
AI · Enterprise SaaS

Description For Staff Software Engineer, Capacity Engineering

Pinterest, a platform where millions find daily inspiration, is seeking a Staff Software Engineer for their Capacity Engineering team. This role focuses on managing and optimizing ML infrastructure, one of the largest cloud-native infrastructures globally. The position is highly impactful, with efficiency being a strategic priority and direct visibility across Pinterest Engineering and leadership.

The role involves managing ML hardware capacity, improving infrastructure efficiency, and developing profiling capabilities at Pinterest's scale. The ideal candidate should have deep expertise in GPU architectures, PyTorch, and ML software stack components, along with strong experience in Kubernetes and cloud environments.

Working in a hybrid model through PinFlex, Pinterest's flexible approach to work, you'll collaborate with ML Platform, Infrastructure Engineering, and SRE teams. The position requires in-person collaboration 1-2 times per quarter, offering flexibility in work location across the country.

Pinterest offers a competitive compensation package ranging from $170,371 to $350,763 USD, plus equity. They maintain a strong commitment to workplace equity and inclusion, making decisions based on merit while considering candidates regardless of background or criminal history.

The role combines technical expertise with strategic thinking, requiring someone who can handle complex infrastructure challenges while contributing to Pinterest's mission of helping people find inspiration and create a life they love. The position offers an opportunity to work on large-scale systems while being part of a positive corner of the internet.

Last updated 12 days ago

Responsibilities For Staff Software Engineer, Capacity Engineering

  • Manage the ML hardware capacity that powers the models running at Pinterest
  • Improve the efficiency of ML Infrastructure at Pinterest
  • Build develop and mature profiling and optimization capabilities for ML Infrastructure at Pinterest scale
  • Collaborate with ML Platform, Infrastructure Engineering and SRE teams

Requirements For Staff Software Engineer, Capacity Engineering

Python
Java
Kubernetes
  • Deep understanding of GPU Architectures, Pytorch, etc.
  • Deep understanding of supporting parts of ML software stack like Scheduling, Data and Storage
  • Hands on experience with shared platforms like Kubernetes
  • Strong technical and performance engineering skills
  • Experience building and managing highly available distributed applications at scale
  • Proficiency in software development languages such as Java, Python and C++
  • Excellent skills in communicating complex technical issues
  • Understanding of ML Models, Kernels and optimization opportunities
  • Hands-on experience with large, cloud-native multi-tenant platforms at Internet scale
  • Experience with AWS or similar cloud environments
  • Deep understanding of infrastructure capacity and performance
  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience

Benefits For Staff Software Engineer, Capacity Engineering

Equity
  • Competitive salary
  • Equity compensation
  • Flexible work arrangements

Interested in this job?

Jobs Related To Pinterest Staff Software Engineer, Capacity Engineering

Manager II, Machine Learning - Search Retrieval

Lead Pinterest's ML Search Retrieval team, managing large-scale search systems and ML engineers to enhance search experience for 500M+ users.

Staff Machine Learning Engineer - Responsible AI

Lead Responsible AI initiatives at Pinterest, developing fair and ethical ML systems that serve 500M+ users while working on cutting-edge technology with the Advanced Technologies Group.

Staff Software Engineer, Ads ML Training Infrastructure

Lead ML infrastructure engineer role at Pinterest, focusing on ads model training systems and frameworks, requiring 8+ years of experience in large-scale ML systems.

Lead Software Engineer - Python & AI/ML

Lead Software Engineer position at JPMorgan Chase focusing on Python and AI/ML development, requiring 5+ years of experience and expertise in cloud platforms and machine learning technologies.

Senior Manager of Software Engineering AI/ML Solutions

Lead AI/ML engineering teams at JPMorgan Chase, focusing on Gen AI implementation, LLM fine-tuning, and MLOps while providing technical leadership and team mentorship.