Boson AI, an innovative startup in the AI space, is seeking a Senior Software Engineer with deep expertise in Ceph management for their deep learning datacenter in Toronto. Founded by renowned experts Alex Smola and Mu Li, the company is at the forefront of developing generative AI models for language, audio, and entertainment.
The role offers an exciting opportunity to work with cutting-edge technology, including NVIDIA H100 and A100 GPUs, managing over 25PB of disk and 5PB flash storage, Terabit networking, and hundreds of computers. The position requires strong problem-solving skills and the ability to learn new tools quickly.
As a Senior Software Engineer, you'll be responsible for deploying and operating Ceph and its integration with various infrastructure technologies and hardware systems. The role involves working with advanced technologies like Slurm, MAAS, Infiniband, NVIDIA deepops, and Layer 3 networking. Hardware configuration experience is necessary.
The ideal candidate must have prior Ceph experience (this is a strict requirement). You'll be working in a hybrid environment with access to state-of-the-art infrastructure. The compensation range of $150,000 - $250,000 reflects the senior nature of the role and the expertise required.
This is an excellent opportunity for a seasoned DevOps engineer who wants to work at the intersection of infrastructure and AI, managing critical storage systems that power cutting-edge AI research and development. The role offers the chance to work with the latest technology stack and contribute to the advancement of AI infrastructure.