Software Engineer, Infrastructure

Imbue builds AI systems that reason and code, enabling AI agents to accomplish larger goals and safely work in the real world.
$170,000 - $350,000
DevOps
Senior Software Engineer
Remote
11 - 50 Employees
5+ years of experience
AI

Description For Software Engineer, Infrastructure

Imbue is seeking a Senior Software Engineer, Infrastructure to join their team working on cutting-edge AI systems. This role is crucial in leveraging large amounts of compute to make their small research team more effective. The position focuses on enabling and supporting large-scale compute efforts and building software infrastructure that creates a seamless research experience.

The ideal candidate will be responsible for building scalable solutions that allow code to run efficiently on large GPU clusters, improving system observability through better logging and tracing, and designing robust systems for managing configurations and stateful components. You'll work directly with researchers and engineers to create abstractions that enable the team to work at a higher level while ensuring everything operates seamlessly.

The role requires strong software engineering skills, particularly in Python and bash, as the team follows an "infrastructure as code" philosophy. You should be passionate about DevOps, have experience with distributed systems, and understand various technology trade-offs. Attention to detail and a focus on correctness are crucial, as the work supports scientific research requiring robust and reliable results.

Imbue offers an exceptional compensation package ranging from $170,000 to $350,000 in cash, plus significant equity potential ($10,000–$2,000,000). Benefits include a generous $20K+ yearly budget for professional development, regular team events, and a culture that values learning and collaboration. The position offers the flexibility of remote work or being based in San Francisco.

This is an opportunity to work on the frontier of AI development, helping build systems that reason and code, while being part of a team that aims to transform computers into truly intelligent tools that empower users. The role combines technical depth with the chance to directly impact the future of AI research and development.

Last updated 3 months ago

Responsibilities For Software Engineer, Infrastructure

  • Build and manage wrapper tools that allow code written for single hosts to scale to large GPU clusters
  • Debug distributed exceptions and improve logging and tracing stack
  • Design improvements to systems that manage secrets, configurations, ongoing jobs, and other stateful components
  • Search out and prototype possible additions to software stack
  • Dig deeply into open-source or third-party code, including C/C++ libraries
  • Work collaboratively with team members to debug, provide guidance, and design resilient software

Requirements For Software Engineer, Infrastructure

Python
  • Good software engineering skills with Python and bash
  • Passionate about enabling other engineers and creating good tooling
  • Experienced with DevOps and understanding of various technology trade-offs
  • Careful and detail oriented with focus on robustness and correctness

Benefits For Software Engineer, Infrastructure

Education Budget
  • Generous compensation and equity
  • $20K+ yearly budget for self-improvement: coaching, courses, conferences, etc.
  • Actively co-create and participate in a positive, intentional team culture
  • Time for learning, reading papers, and understanding prior work
  • Frequent team events, dinners, off-sites, and hanging out
  • Equity ranging from $10,000 to $2,000,000

Interested in this job?

Jobs Related To Imbue Software Engineer, Infrastructure

Site Reliability Engineer III- DevOps

Senior Site Reliability Engineer role at JPMorgan Chase focusing on DevOps, cloud infrastructure, and system reliability with competitive compensation.

Systems Development Engineer, Enterprise Engineering

Senior Systems Development Engineer role at Amazon's Enterprise Engineering team, focusing on unified communications and cloud infrastructure.

Senior Systems Engineer

Senior Systems Engineer role at Disney Entertainment focusing on content delivery infrastructure and streaming technology solutions.

Senior SWQA Test Development Engineer

Senior SWQA Test Development Engineer role at NVIDIA focusing on AI-powered testing and automation for software quality assurance.

Senior Software Engineer – AI Infrastructure and Tooling

Senior Software Engineer role at NVIDIA focusing on AI infrastructure automation and tooling, requiring expertise in DevOps, cloud technologies, and distributed systems.