Software Engineer, Infrastructure

Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world.
$170,000 - $350,000
DevOps
Senior Software Engineer
Remote
11 - 50 Employees
5+ years of experience
AI

Description For Software Engineer, Infrastructure

Imbue is seeking a Senior Software Engineer, Infrastructure to join their team working on cutting-edge AI systems. This role is crucial in leveraging large amounts of compute to make their small research team more effective. The position focuses on enabling and supporting large-scale compute efforts and building software infrastructure that creates a seamless research experience.

The ideal candidate will be responsible for building scalable solutions that allow code to run efficiently on large GPU clusters, improving system observability through better logging and tracing, and designing robust systems for managing configurations and stateful components. You'll work directly with researchers and engineers to create abstractions that enable the team to work at a higher level while ensuring everything operates seamlessly.

The role requires strong software engineering skills, particularly in Python and bash, as the team follows an "infrastructure as code" philosophy. You should be passionate about DevOps, have experience with distributed systems, and understand various technology trade-offs. Attention to detail and a focus on correctness are crucial, as the work supports scientific research requiring robust and reliable results.

Imbue offers an exceptional compensation package ranging from $170,000 to $350,000 in cash, plus significant equity potential ($10,000–$2,000,000). Benefits include a generous $20K+ yearly budget for professional development, regular team events, and a culture that values learning and collaboration. The position offers the flexibility of remote work or being based in San Francisco.

This is an opportunity to work on the frontier of AI development, helping build systems that reason and code, while being part of a team that aims to transform computers into truly intelligent tools that empower users. The role combines technical depth with the chance to directly impact the future of AI research and development.

Last updated 2 days ago

Responsibilities For Software Engineer, Infrastructure

  • Build and manage wrapper tools that allow code written for single hosts to scale to large GPU clusters
  • Debug distributed exceptions and improve logging and tracing stack
  • Design improvements to systems that manage secrets, configurations, ongoing jobs, and other stateful components
  • Search out and prototype possible additions to software stack
  • Dig deeply into open-source or third-party code, including C/C++ libraries
  • Work collaboratively with team members to debug, provide guidance, and design resilient software

Requirements For Software Engineer, Infrastructure

Python
  • Good software engineering skills with Python and bash
  • Passionate about enabling other engineers and creating good tooling
  • Experienced with DevOps and understanding of various technology trade-offs
  • Careful and detail oriented with focus on robustness and correctness

Benefits For Software Engineer, Infrastructure

Education Budget
  • Generous compensation and equity
  • $20K+ yearly budget for self-improvement: coaching, courses, conferences, etc.
  • Actively co-create and participate in a positive, intentional team culture
  • Time for learning, reading papers, and understanding prior work
  • Frequent team events, dinners, off-sites, and hanging out
  • Equity ranging from $10,000 to $2,000,000

Interested in this job?

Jobs Related To Imbue Software Engineer, Infrastructure

Software Engineer - Developer Tools

Senior Software Engineer role at Apple focusing on developer tools and reporting infrastructure development.

Software Engineering Senior DevOps Engineer

Senior DevOps Engineer role at Apple, focusing on documentation engineering infrastructure and developer tools, offering competitive compensation and comprehensive benefits.

Enterprise Media Operations Regional Engineer

Senior Enterprise Media Operations Engineer role at Meta, focusing on AV/VC systems management and executive support in Menlo Park.

Sr System Dev Engineer III, North American Customer Fulfillment (NACF) Reliability, Maintenance, Engineering (RME)

Senior System Development Engineer role at Amazon combining software development with industrial automation expertise for fulfillment center operations.

Sr. IT Systems Engineer

Senior IT Systems Engineer position at SpaceX focusing on Microsoft-based technologies and infrastructure to support space exploration initiatives.