Cloud Site Reliability Engineer I (CSRE I)

Zafin

SaaS product and pricing platform provider for top banks worldwide, simplifying core modernization and enabling collaborative product management.

Vancouver, BC, Canada

Site Reliability

Staff Software Engineer

Hybrid

8+ years of experience

Finance · Enterprise SaaS

Description For Cloud Site Reliability Engineer I (CSRE I)

Zafin, established in 2002, is a leading financial technology company providing SaaS product and pricing platforms for major banks worldwide. As a Cloud Site Reliability Engineer I (CSRE I), you'll play a crucial role in maintaining and optimizing Zafin's cloud infrastructure and applications. The position reports to the VP of Cloud Services and focuses on ensuring system reliability, scalability, and performance.

The role combines technical expertise with collaborative teamwork, requiring strong skills in Azure cloud platform, container orchestration, and incident management. You'll be responsible for level-3 technical support, conducting root cause analysis for critical incidents, and implementing cloud infrastructure optimizations. The position demands expertise in tools like Azure Kubernetes Service, monitoring systems, and automation scripting.

Zafin offers an excellent work environment that values diversity and teamwork. The company has a strong global presence with offices worldwide and serves prestigious clients including ING, CIBC, HSBC, Wells Fargo, PNC, and ANZ. As a certified Great Place to Work® in Canada, India, and the UK, Zafin provides competitive compensation, comprehensive benefits, and significant professional development opportunities.

The ideal candidate will have 8+ years of relevant experience, strong technical capabilities, and a dedication to maintaining high-quality service delivery. This role offers the opportunity to work with cutting-edge cloud technologies while contributing to the success of a leading financial technology provider.

Last updated a month ago

Responsibilities For Cloud Site Reliability Engineer I (CSRE I)

Act as a level-3 technical support expert for Zafin products and Azure cloud issues
Collaborate with Product, Platform Engineering, and DevOps teams
Conduct Root Cause Analysis (RCA) for Severity 1 and 2 incidents
Participate in external client escalation calls
Optimize cloud infrastructure for scalability, performance, and cost-effectiveness
Manage container orchestration platforms
Enhance monitoring and tracking tools
Implement best practices for Azure cloud deployment
Develop automation scripts
Maintain detailed documentation
Participate in rotating on-call schedule

Requirements For Cloud Site Reliability Engineer I (CSRE I)

Kubernetes

PostgreSQL

Python

Bachelor's degree in Computer Science, Engineering, or related field
8+ years of experience in cloud support, operations, or related role
Hands-on experience with Microsoft Azure
Proficiency in container orchestration platforms like AKS or OpenShift
Expertise in automated deployment pipelines, particularly Azure DevOps
Familiarity with enterprise monitoring platforms
Proficiency in scripting languages like PowerShell or Python
Proven experience in incident management
Knowledge of Postgres databases

Benefits For Cloud Site Reliability Engineer I (CSRE I)

Medical Insurance

Dental Insurance

Competitive salaries
Annual bonus potential
Generous paid time off
Paid volunteering days
Wellness benefits
Professional growth opportunities
Career advancement opportunities

Zafin

SaaS product and pricing platform provider for top banks worldwide, simplifying core modernization and enabling collaborative product management.

Vancouver, BC, Canada

Site Reliability

Staff Software Engineer

Hybrid

8+ years of experience

Finance · Enterprise SaaS

Interested in this job?

Jobs Related To Zafin Cloud Site Reliability Engineer I (CSRE I)

Cloud Site Reliability Engineer II (CSRE II)

Zafin

Lead cloud reliability initiatives and strategic operations for a global banking technology platform, managing Azure infrastructure and mentoring teams.

Cloud Site Reliability Engineer II (CSRE II)

Zafin

Lead cloud reliability initiatives and strategic operations for a global banking technology platform, managing Azure infrastructure and mentoring teams.

Cloud Site Reliability Engineer II (CSRE II)

Zafin

Lead cloud infrastructure reliability and performance optimization for a global banking software provider.

Cloud Site Reliability Engineer I (CSRE I)

Zafin

Cloud Site Reliability Engineer position at Zafin, focusing on maintaining and optimizing cloud infrastructure for a leading financial technology company.

Staff Software Engineer, Reliability Engineering

Airbnb

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, incident management, and building scalable systems with competitive compensation and remote work options.