Computer Scientist - II (SRE)

Changing the world through digital experiences is what Adobe's all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences!
Site Reliability
Mid-Level Software Engineer
In-Person
8+ years of experience
Enterprise SaaS

Description For Computer Scientist - II (SRE)

Adobe is seeking a talented Computer Scientist - II (SRE) to join our Site Reliability Engineering team as we embark on a new phase of growth for our product. We are a metrics-driven organization that strives to deliver world-class service both externally and internally. The team strongly believes in the DevOps methodology and works very closely with our peers on the development team.

As a Computer Scientist - II (SRE), you will play a crucial role in developing software/tools and providing hands-on technical expertise to design, deploy, and optimize Cloud services. You'll build automation using industry-standard tools such as Chef, Jenkins, Terraform, and Spinnaker to deploy services efficiently. Your responsibilities will include participating in release cycles, deploying code to staging and production environments, and integrating with continuous integration (CI) and continuous delivery (CD) tools.

You'll be tasked with improving the security and availability of our services, identifying single points of failure and high-risk architecture issues, and implementing more resilient solutions. Your expertise will be vital in identifying system bottlenecks and recommending fixes for availability issues. As part of the on-call rotation, you'll drive issues to resolution and contribute to post-mortems, ensuring we learn and improve from incidents.

We're looking for someone who can proactively work on efficiency and capacity planning, setting clear requirements to reduce system resource usage. You'll evangelize SRE principles and guide the development team in building reliable services. Your ability to build automation and tools that increase team productivity will be highly valued.

The ideal candidate will have at least 8 years of experience as an SRE in Cloud engineering, with a minimum of 5 years working with containerized environments like Kubernetes and Docker. Proficiency in multi-cloud environments (AWS, Azure) and experience with observability tools such as Prometheus, Grafana, and Splunk are essential. You should be comfortable writing applications in Go, Python, or JavaScript and have experience with CI/CD tools like Jenkins.

If you're passionate about building and maintaining large-scale, high-performance systems and enjoy working with a variety of services and technologies, this role offers an exciting opportunity to make a significant impact at Adobe. Join us in our mission to change the world through digital experiences!

Last updated 3 months ago

Responsibilities For Computer Scientist - II (SRE)

  • Develop software/tools and provide hands-on technical expertise to design, deploy, and optimize Cloud services
  • Build automation using industry-standard tools such as Chef, Jenkins, Terraform, and Spinnaker to deploy services
  • Participate in release cycles of our services, deploying code to staging and production environments
  • Develop plans to improve the security and availability of the services
  • Identify single points of failure and other high-risk architecture issues; propose and implement more resilient solutions
  • Identify system bottlenecks and recommend solutions to availability issues
  • Participate in the on-call rotation, drive issues to resolution, and contribute to post-mortems
  • Proactively work on efficiency and capacity planning, setting clear requirements to reduce system resource usage
  • Evangelize SRE principles and guide the development team in building reliable services
  • Build automation and tools that will increase the productivity of teams

Requirements For Computer Scientist - II (SRE)

  • At least 8 years of experience as an SRE in Cloud engineering
  • Minimum 5 years of experience with containerized environments: Kubernetes, Docker
  • Experience with Argo will be a plus
  • Experience in automation and tool development
  • At least 5 years of experience building Cloud services and distributed systems: deployment, monitoring, scaling, debugging
  • Proficient in multi-cloud environments: AWS, Azure
  • Experience writing applications using Go, Python, or JavaScript
  • Knowledge of well-known open-source tools for monitoring, trending, and configuration management
  • Familiarity with observability tools like Prometheus, Cortex, Grafana, New Relic, Datadog, and Splunk
  • Experience with CI/CD tools like Jenkins/Groovy DSL
  • Experience scaling high-throughput services to their limits
  • Enjoy working with a large variety of services and technologies
  • Experience providing detailed reporting and analysis from metrics and logs
  • Experience with New Relic, Splunk, and Prometheus will be a plus
  • Experience with Kubernetes architectures
  • Experience with big data services and architectures such as Hadoop and Databricks
  • Experience with designing Infrastructure solutions
  • Ability and determination to solve complex system/application problems
  • Experience managing uptime and performance using service level indicators and objectives (SLx)
  • CKA certification and cloud certifications are a must
