Staff Software Engineer - Service & Development Infrastructure

Moloco is a machine learning company empowering organizations of all sizes to grow and unlock the full value of their unique first-party data, elevating the traditional path to performance advertising.
$155,904 - $233,856
Site Reliability
Staff Software Engineer
In-Person
501 - 1,000 Employees
8+ years of experience

Description For Staff Software Engineer - Service & Development Infrastructure

Moloco is a machine learning company that operates at massive scale, ingesting 10 petabytes of training data per day with blazingly fast models that return predictions in 10 milliseconds or less. As a Staff Software Engineer focused on Site Reliability, you'll be part of a team managing infrastructure for ML model serving, CI/CD, and developing tools to improve engineering productivity. You'll contribute to technical decisions, participate in code reviews, support other engineering teams, and work on capacity planning and scaling. Key responsibilities include:

  • Contributing to company-wide infrastructure adoption and standard methodologies
  • Leading research with other technical leaders
  • Improving tooling, automation, monitoring, and workflow management
  • Identifying and addressing performance and reliability bottlenecks
  • Ensuring high performance during traffic spikes
  • Collaborating on reusable solutions to reduce operational complexity and risk

To succeed, you'll need:

  • Hands-on experience with GCP or other cloud platforms
  • Practical knowledge of a high-level language (e.g., Go, Python)
  • Experience with infrastructure software (e.g., Kubernetes, Helm, Terraform)
  • At least 8 years of large-scale software development experience
  • Passion for operational excellence and customer support
  • Problem-solving skills and end-to-end ownership of issues

Moloco offers competitive compensation and benefits, including equity, medical insurance, and more. They value diversity and inclusion, with a commitment to creating an equitable workplace. Join Moloco to be part of a fast-growing, profitable unicorn valued at $2 billion, working on cutting-edge machine learning technologies in the advertising industry.

Last updated 4 months ago

Responsibilities For Staff Software Engineer - Service & Development Infrastructure

  • Contribute to company-wide infrastructure adoption and standard methodologies
  • Lead research with other technical leaders
  • Improve tooling, automation, monitoring, and workflow management
  • Identify and address performance and reliability bottlenecks
  • Ensure high performance during traffic spikes
  • Collaborate on reusable solutions to reduce operational complexity and risk
  • Participate in capacity planning and scaling
  • Support other engineering teams with operational guidance

Requirements For Staff Software Engineer - Service & Development Infrastructure

Go
Python
Kubernetes
  • Hands-on experience with GCP or other cloud platforms
  • Practical knowledge of a high-level language (e.g., Go, Python)
  • Experience with infrastructure software (e.g., Kubernetes, Helm, Terraform)
  • At least 8 years of large-scale software development experience
  • Passion for operational excellence and customer support
  • Problem-solving skills and end-to-end ownership of issues

Benefits For Staff Software Engineer - Service & Development Infrastructure

Equity
Medical Insurance
  • Equity
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
  • 401k

Interested in this job?

Jobs Related To Moloco Staff Software Engineer - Service & Development Infrastructure

Site Reliability Engineer (L5) - Security Engineering

Netflix seeks a Site Reliability Engineer (L5) for Security Engineering to enhance critical infrastructure reliability and support business growth in LIVE streaming, Gaming, and Ads.

Staff Software Engineer, Reliability Engineering

Staff Software Engineer for Site Reliability Engineering at Airbnb, developing tools and systems for service reliability and incident management.

Engineering Manager, Reliability Engineering

Airbnb seeks an Engineering Manager for Site Reliability to drive long-term strategy and ensure infrastructure performance.

Site Reliability Developer 4

Site Reliability Developer 4 at Oracle in Bengaluru, India. Design and deliver mission-critical stack with focus on security, resiliency, scale, and performance.