Extreme Networks is seeking a talented Edge AI Principal Engineer with specialized expertise in GPU/TPU acceleration to join their team. This role focuses on shaping the future of Edge AI solutions, leveraging GPU/TPU acceleration and enterprise-grade, large-scale edge compute. The ideal candidate will have extensive hands-on experience in local Large Language Models (LLM) inference with embedded GPU/TPU architectures.
As a Principal Engineer specializing in Edge AI, you will play a crucial role in influencing the Edge AI strategy, making critical decisions on technical directions, and developing AI inference models for edge devices. You'll work on implementing low-latency model inference pipelines, collaborating with cross-functional teams, and optimizing performance for GPU/TPU acceleration.
Key responsibilities include high-level design and architecture, team leadership, and staying current with advancements in GPU/TPU technologies. You'll lead a team of engineers, oversee project planning and execution, and foster a positive work environment.
The ideal candidate should have a strong background in computer science or engineering, with 5+ years of hands-on experience in AI model development and deployment. Proficiency in Python, C++, and LLM frameworks is essential, along with extensive experience in GPU/TPU acceleration for AI inference.
This role offers an exciting opportunity to shape the future of AI at the edge and revolutionize industries with innovative edge AI solutions. Join Extreme Networks in pushing the boundaries of edge computing and GPU/TPU acceleration, particularly in local LLM inference, and be part of a dynamic and collaborative team.