Software Engineer - Reliability

Celonis is the global leader in Process Mining technology and one of the world's fastest-growing SaaS firms.
$164,000 - $214,000
Site Reliability
Senior Software Engineer
In-Person
5+ years of experience
Enterprise SaaS
This job posting may no longer be active. You may be interested in these related jobs instead:
Site Reliability Engineer

Senior Site Reliability Engineer role at AION, building and maintaining infrastructure for a decentralized AI cloud platform with focus on automation and reliability.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, SRE, Cloud Incident Response

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.

Senior Software Engineer, Site Reliability Engineering

Senior Site Reliability Engineering role at Google, focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Description For Software Engineer - Reliability

We're Celonis, the global leader in Process Mining technology and one of the world's fastest-growing SaaS firms. We believe there is a massive opportunity to unlock productivity by placing data and intelligence at the core of business processes - and for that, we need you to join us.

The Role:

  • You will be part of a highly technical, collaborative and creative team, with a focus on SRE & Software Engineering.
  • Responsible for the design, implementation, reliability and management of cloud-based FedRAMP-compliant applications and platforms.
  • Responsible for application incident management escalations which involve troubleshooting complex technical problems and resolving application issues within defined service level objectives.
  • Design, write, and deliver software that enhances the availability, scalability, and efficiency of our services.
  • Partner with platform and application development teams to learn from incidents and improve the platform resiliency.
  • Share acquired knowledge and document accordingly while implementing SRE best practices.

The qualifications you need:

  • A bachelors or masters degree in a technical field (e.g. Computer Science, Software Engineering) or a comparable education.
  • Experience programming with Java, the Spring framework, and Python (or a similar scripting language in Linux environment).
  • A minimum of 5 years experience developing cloud based software applications.
  • Experience working with public cloud providers (AWS, Azure, or GCP) and modern cloud monitoring system observability frameworks (e.g., Datadog).
  • Experience in developing and running large-scale production services with elastic cloud services and Kubernetes.
  • Project experience of operation within the SRE domain.
  • Strong problem-solving skills and the ability to troubleshoot complex technical issues.
  • Excellent English verbal and written communication skills.

What Celonis can offer you:

  • The unique opportunity to work with industry-leading process mining technology
  • Investment in your personal growth and skill development
  • Great compensation and benefits packages (equity, life insurance, time off, generous leave for new parents from day one, and more)
  • Physical and mental well-being support
  • A global and growing team of Celonauts from diverse backgrounds
  • An open-minded culture with innovative, autonomous teams
  • Business Resource Groups to help you feel connected, valued and seen
  • A clear set of company values that guide everything we do

About Us: Since 2011, Celonis has helped thousands of the world's largest and most valued companies deliver immediate cash impact, radically improve customer experience and reduce carbon emissions. Its Process Intelligence platform uses industry-leading process mining technology and AI to present companies with a living digital twin of their end-to-end processes. Celonis is headquartered in Munich (Germany) and New York (USA) and has more than 20 offices worldwide.

Last updated 7 months ago

Responsibilities For Software Engineer - Reliability

  • Design, implement, and manage cloud-based FedRAMP-compliant applications and platforms
  • Handle application incident management escalations
  • Design and deliver software for service availability, scalability, and efficiency
  • Partner with teams to improve platform resiliency
  • Share knowledge and implement SRE best practices

Requirements For Software Engineer - Reliability

Java
Python
Kubernetes
  • Bachelor's or Master's degree in Computer Science, Software Engineering, or comparable education
  • Experience with Java, Spring framework, and Python
  • Minimum 5 years experience in cloud-based software development
  • Experience with public cloud providers (AWS, Azure, GCP) and monitoring frameworks
  • Experience with large-scale production services, elastic cloud services, and Kubernetes
  • Project experience in SRE domain
  • Strong problem-solving and troubleshooting skills
  • Excellent English communication skills

Benefits For Software Engineer - Reliability

  • Equity (restricted stock units)
  • Life insurance
  • Generous time off
  • Parental leave
  • Subsidized gym membership
  • Access to counseling
  • Career development opportunities
  • Global and diverse team

Interested in this job?