Pod Software Engineer

Etched builds model-specific hardware for AI, focusing on single-model ASICs and large-scale inference workloads.
Distributed Systems
Senior Software Engineer
In-Person
5+ years of experience
AI · Hardware

Description For Pod Software Engineer

Etched is seeking a Pod Software Engineer to join their System Software team in San Jose. This role focuses on developing high-performance networking solutions for large-scale AI inference workloads. The position involves working with RDMA-based networking, kernel development, and system optimization for AI hardware acceleration.

The ideal candidate will work on critical projects including RDMA implementation, performance optimization, and system-level software development. They will collaborate with kernel, platform, and telemetry teams to enhance peer-to-peer RDMA efficiency. The role requires expertise in C/C++, networking technologies, and Linux systems.

Etched differentiates itself by focusing on model-specific hardware for AI, believing in the "Bitter Lesson" principle that progress in AI comes from using more FLOPs through specialized hardware. The company maintains a fully in-person work environment in West San Jose, emphasizing both engineering and research capabilities.

The position offers comprehensive benefits including full medical coverage, a generous housing subsidy for nearby residents, and daily meals. This is an excellent opportunity for experienced software engineers passionate about high-performance computing, networking, and AI hardware optimization.

Last updated 5 hours ago

Responsibilities For Pod Software Engineer

  • Design, develop, and implement RDMA based networking peering
  • Develop tests that qualify host processors, NICs, TORs and device network interfaces
  • Furnish burn-in teams with tests for device to device networking
  • Define key metrics for system software to maintain high availability
  • Analyze performance deviations and optimize network stack configurations
  • Implement and validate peer RDMA support for GPU-to-GPU communication
  • Profile and benchmark inter-node RDMA latency and bandwidth
  • Optimize NIC and switch configurations

Requirements For Pod Software Engineer

Python
Go
Linux
Rust
  • Proficiency in C/C++
  • Proficiency in at least one scripting language (Python, Bash, Go)
  • Strong experience with device-to-device networking technologies (RDMA, GPUDirect)
  • Experience with zero-copy networking, RDMA verbs and memory registration
  • Familiarity with queue pairs, completions queues, and transport types
  • Strong understanding of operating systems (Linux preferred) and server hardware architectures
  • Ability to analyze complex technical problems
  • Excellent communication and collaboration skills
  • Experience with version control systems
  • Experience with reading and interpreting hardware logs

Benefits For Pod Software Engineer

Medical Insurance
Dental Insurance
Vision Insurance
Relocation Benefits
  • Full medical, dental, and vision packages with 100% premium covered
  • Housing subsidy of $2,000/month for those living within walking distance
  • Daily lunch and dinner in office
  • Relocation support for those moving to West San Jose

Interested in this job?

Jobs Related To Etched Pod Software Engineer

Senior Software Engineer, Systems Infrastructure

Senior Software Engineer position at LinkedIn focusing on building and maintaining large-scale distributed systems and infrastructure platforms that power LinkedIn's core services.

Senior Software Engineer-Distributed Inference

Senior Software Engineer position at NVIDIA focusing on distributed inference and AI performance optimization, offering competitive compensation and remote work options.

Senior Software Engineer, Systems Infrastructure

Senior Software Engineer role at LinkedIn focusing on building and maintaining large-scale distributed systems and infrastructure platforms that power LinkedIn's core services.

Senior Software Engineer, Infrastructure, Google Cloud Networking

Senior Software Engineer position at Google Cloud focusing on infrastructure and networking systems, requiring expertise in C++ and distributed systems development.

Senior Software Engineer, Infrastructure, Core

Senior Software Engineer position at Google focusing on infrastructure and core systems development, offering competitive compensation and opportunity to work on large-scale distributed systems.