Cloud Site Reliability Engineer I (CSRE I)

SaaS product and pricing platform provider for top banks worldwide, simplifying core modernization and enabling collaborative product management.
Site Reliability
Staff Software Engineer
Hybrid
8+ years of experience
Finance · Enterprise SaaS

Description For Cloud Site Reliability Engineer I (CSRE I)

Zafin, established in 2002, is a leading financial technology company providing SaaS product and pricing platforms for major banks worldwide. As a Cloud Site Reliability Engineer I (CSRE I), you'll play a crucial role in maintaining and optimizing Zafin's cloud infrastructure and applications. The position reports to the VP of Cloud Services and focuses on ensuring system reliability, scalability, and performance.

The role combines technical expertise with collaborative teamwork and requires strong skills in the Azure cloud platform, container orchestration, and incident management. You'll be responsible for level-3 technical support, root cause analysis for critical incidents, and cloud infrastructure optimization. The position demands expertise in Azure Kubernetes Service, monitoring systems, and automation scripting.

Zafin offers an excellent work environment that values diversity and teamwork. The company has a strong global presence with offices worldwide and serves prestigious clients including ING, CIBC, HSBC, Wells Fargo, PNC, and ANZ. As a certified Great Place to Work® in Canada, India, and the UK, Zafin provides competitive compensation, comprehensive benefits, and significant professional development opportunities.

The ideal candidate will have 8+ years of relevant experience, strong technical capabilities, and a dedication to maintaining high-quality service delivery. This role offers the opportunity to work with cutting-edge cloud technologies while contributing to the success of a leading financial technology provider.

Last updated 7 days ago

Responsibilities For Cloud Site Reliability Engineer I (CSRE I)

  • Act as a level-3 technical support expert for Zafin products and Azure cloud issues
  • Collaborate with Product, Platform Engineering, and DevOps teams
  • Conduct Root Cause Analysis (RCA) for Severity 1 and 2 incidents
  • Participate in external client escalation calls
  • Optimize cloud infrastructure for scalability, performance, and cost-effectiveness
  • Manage container orchestration platforms
  • Enhance monitoring and tracking tools
  • Implement best practices for Azure cloud deployment
  • Develop automation scripts
  • Maintain detailed documentation
  • Participate in a rotating on-call schedule

Requirements For Cloud Site Reliability Engineer I (CSRE I)

Skills: Kubernetes, PostgreSQL, Python
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 8+ years of experience in cloud support, operations, or a related role
  • Hands-on experience with Microsoft Azure
  • Proficiency in container orchestration platforms like AKS or OpenShift
  • Expertise in automated deployment pipelines, particularly Azure DevOps
  • Familiarity with enterprise monitoring platforms
  • Proficiency in scripting languages like PowerShell or Python
  • Proven experience in incident management
  • Knowledge of Postgres databases

Benefits For Cloud Site Reliability Engineer I (CSRE I)

  • Medical insurance
  • Dental insurance
  • Competitive salaries
  • Annual bonus potential
  • Generous paid time off
  • Paid volunteering days
  • Wellness benefits
  • Professional growth opportunities
  • Career advancement opportunities
