Sr Site Reliability Engineer, AI Platform Inference

Adobe is a company that changes the world through digital experiences, providing tools for artists and global brands to design and deliver exceptional digital experiences.
$153,600 - $286,600
Site Reliability
Senior Software Engineer
In-Person
5+ years of experience
AI

Description For Sr Site Reliability Engineer, AI Platform Inference

Adobe is seeking an outstanding Site Reliability Engineer for their AI Inference Platform, Adobe Firefly. This role is crucial in building, scaling, and securing the AI Platform that enables Firefly product teams to manage and deploy Machine Learning capabilities across Adobe client applications.

The position involves working with Applied Research groups who will deploy thousands of models onto this platform in various lifecycle stages. The platform offers ML model serving at scale, with high-cost efficiency, across multiple cloud platforms.

As an SRE, you'll be responsible for ensuring system reliability, implementing scalable solutions, and maintaining high service quality. You'll work with cutting-edge AI technology while collaborating with various Adobe teams to innovate on Generative AI as a Service.

The ideal candidate excels in undefined environments, stays current with industry trends, and has strong experience with containerization, orchestration, and modern development techniques. Your expertise in infrastructure management, observability tools, and understanding of AI/ML frameworks will be essential for success.

This is an exciting opportunity to join Adobe's innovative AI platform team, working on technology that impacts millions of users worldwide. You'll be part of a company that values creativity, innovation, and provides comprehensive benefits including competitive compensation, equity opportunities, and various insurance options.

The role offers significant growth potential in the rapidly evolving field of AI infrastructure, allowing you to work with the latest technologies while contributing to Adobe's mission of changing the world through digital experiences.

Last updated 21 days ago

Responsibilities For Sr Site Reliability Engineer, AI Platform Inference

  • Identify and implement methodologies to increase reliability, scalability, security, and efficiency
  • Ensure highest uptime and Quality of Service (QoS) for Adobe's customers
  • Define service level objectives (SLOs) and indicators (SLIs)
  • Support and maintain globally distributed, multi-cloud environments
  • Automate common, repeatable tasks at large scale
  • Identify areas to improve service resiliency
  • Coordinate with other Adobe platform teams and service providers

Requirements For Sr Site Reliability Engineer, AI Platform Inference

Python
Go
Kubernetes
  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field
  • Experience in building and scaling distributed systems
  • Production level expertise with containerization orchestration engines
  • Fundamental programming skills in Python, Go
  • Knowledge of infrastructure configuration management tools
  • Experience with observability and tracing tools
  • Understanding of AI/ML frameworks and solutions

Benefits For Sr Site Reliability Engineer, AI Platform Inference

401k
Medical Insurance
Dental Insurance
Vision Insurance
  • Competitive Salary
  • Annual Incentive Plan
  • Long-term incentives in form of equity awards

Interested in this job?

Jobs Related To Adobe Sr Site Reliability Engineer, AI Platform Inference

Site Reliability Engineer

Senior Site Reliability Engineer role at Adobe, focusing on cloud services optimization and automation, offering competitive compensation of $133,900-$242,000 in San Jose.

Site Reliability Engineer, Adobe Pass

Join Adobe as a Site Reliability Engineer for Adobe Pass, shaping the future of TV Everywhere technology and working with cutting-edge cloud services and infrastructure.

Site Reliability Engineer

Senior Site Reliability Engineer position at OneDegree, focusing on cloud infrastructure, monitoring, and automation for insurance and cybersecurity platforms in APAC.