AION is revolutionizing the AI cloud platform landscape through its innovative decentralized approach to high-performance computing (HPC). As a Site Reliability Engineer at AION, you'll be at the forefront of building and maintaining the infrastructure that powers this cutting-edge platform. The company is well-funded by major VCs and led by experienced founders with previous successful exits.
The role demands a reliability-focused engineer with deep expertise in cloud-native systems and infrastructure automation. You'll be responsible for designing and implementing comprehensive monitoring solutions, creating self-healing infrastructure, and maintaining high availability across distributed systems. This position offers a unique opportunity to work with cutting-edge technologies while implementing SRE best practices at scale.
Your work will directly impact AION's mission of democratizing access to compute power for AI training, fine-tuning, inference, and data labeling. The platform's innovative Proof of Compute Contribution (PoCC) protocol and integration with Tether ensure a stable and efficient ecosystem. Working from the Bangalore office in a hybrid setup, you'll collaborate with top-tier talent from the tech industry while having the flexibility to work remotely for several months each year.
This role is perfect for someone who wants to make a significant impact at the intersection of web3 and AI, working on some of the most exciting challenges in the industry. You'll be joining at the ground floor of an AI startup, with substantial opportunity to influence both the company's and the industry's future. The position offers competitive compensation, professional growth opportunities, and the chance to work with a mission-driven team that's bridging the AI wealth gap through innovative solutions.