Site Reliability Engineer II - Guidewire Cloud Platform (Application)

Guidewire delivers software for Property and Casualty insurance companies, providing core applications for policy management, claims settlement, and customer billing.
Curitiba, State of Paraná, Brazil
Site Reliability
Mid-Level Software Engineer
Hybrid
3+ years of experience
Enterprise SaaS · Finance
This job posting may no longer be active. You may be interested in these related jobs instead:
Cloud Site Reliability Engineer (SRE)

Cloud SRE position at Incorta focusing on infrastructure reliability, automation, and DevOps practices, requiring 2-3 years of experience.

Site Reliability Engineer

Site Reliability Engineer position focused on managing and supporting cloud applications and infrastructure using AWS and Atlassian tools.

Software Engineer, Traffic Trust SRE, DoS Infrastructure

Site Reliability Engineer position at Google focusing on Traffic Trust and DoS Infrastructure, combining software engineering with systems operations to maintain large-scale distributed systems.

Software Engineer III, Site Reliability Engineer

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Databases Site Reliability Engineer

Site Reliability Engineer position at Google focusing on database systems, requiring expertise in distributed systems and infrastructure management.

Description For Site Reliability Engineer II - Guidewire Cloud Platform (Application)

Guidewire is seeking a Site Reliability Engineer II to join their cloud platform team, focusing on ensuring the reliability and performance of their insurance software solutions. The role combines software engineering with operational expertise to support Guidewire's cloud-based insurance platform.

As an SRE, you'll be responsible for maintaining and improving the reliability of applications running on the Guidewire Cloud Platform. This involves troubleshooting complex systems, developing automated solutions, and working closely with development teams to optimize performance. The position requires participation in on-call rotations to ensure 24/7 service reliability.

Guidewire's platform is crucial for Property and Casualty (P&C) insurance companies worldwide, handling billions of dollars in business. The company's mission is to provide essential tools and technology that help insurers protect and support their customers during critical times, including natural disasters, accidents, and cyber risks.

The ideal candidate will bring strong technical skills in Linux administration, cloud technologies (particularly AWS), and programming languages like Python, Go, or Java. Experience with monitoring tools, CICD pipelines, and infrastructure automation is essential. The role offers opportunities for growth, learning cutting-edge technologies, and making a real impact in the insurance industry.

Working at Guidewire means joining a mission-driven company with a culture that values innovation, teamwork, and work-life balance. The company offers competitive compensation, comprehensive benefits, and career development opportunities while working with talented peers on technology that makes a difference in people's lives.

Last updated a month ago

Responsibilities For Site Reliability Engineer II - Guidewire Cloud Platform (Application)

  • Assist in troubleshooting and resolving issues in collaboration with development teams
  • Develop and maintain automated runbooks for proactive issue resolution
  • Monitor applications and improve reliability and performance on the Guidewire Cloud Platform
  • Optimize systems and reduce manual tasks using software engineering skills
  • Document incidents and refine processes to prevent future occurrences
  • Participate in on-call rotations
  • Apply engineering principles and automation to enhance operating environments

Requirements For Site Reliability Engineer II - Guidewire Cloud Platform (Application)

Python
Go
Java
Linux
Kubernetes
  • Experience as an SRE or similar role
  • Strong problem-solving skills
  • Linux system administration skills
  • Programming/scripting skills in Python, Go, Java, or shell
  • Understanding of SLIs, SLOs, and Error Budgets
  • Experience with APM and telemetry tools
  • Experience with troubleshooting distributed systems on cloud infrastructure
  • Experience with CICD pipelines within K8S
  • Experience with Datadog monitoring tools
  • Experience with AWS or Kubernetes using Terraform
  • Knowledge of infrastructure configuration management (GitOps, Puppet, or Ansible)
  • Understanding of AWS cloud networking and security

Interested in this job?