Site Reliability Engineer (SRE)

All-in-one productivity platform that replaces individual workplace tools with a unified platform for project management, document collaboration, and AI.
Site Reliability
Senior Software Engineer
Remote
4+ years of experience
Enterprise SaaS

Description For Site Reliability Engineer (SRE)

ClickUp, a rapidly growing SaaS company, is revolutionizing workplace productivity with their all-in-one platform. As a Site Reliability Engineer, you'll be at the forefront of maintaining and improving their globally distributed cloud infrastructure that serves thousands of users daily. The role combines technical expertise in cloud services, particularly AWS, with a focus on system reliability and performance optimization.

The position offers an opportunity to work with cutting-edge technologies including Kubernetes, Docker, and various AWS services. You'll be responsible for designing and implementing robust systems, improving monitoring capabilities, and ensuring high availability of services. The ideal candidate should have strong experience in cloud infrastructure, DevOps practices, and a proven track record in managing production environments.

ClickUp's culture emphasizes innovation, merit, and continuous growth. They've earned recognition on the Forbes Cloud 100 and Fast Company's Most Innovative Companies lists. The company values diversity and provides an inclusive environment where employees can do their best work. While based in San Diego, they offer remote work opportunities and potential visa sponsorship for engineering roles.

This role is perfect for someone who combines technical expertise with problem-solving abilities and excellent communication skills. You'll be part of a team that's directly impacting millions of users' productivity, with the ambitious goal of saving them at least one day every week. The position offers the excitement of working with a fast-growing company while tackling complex technical challenges in a dynamic environment.

Last updated 19 days ago

Responsibilities For Site Reliability Engineer (SRE)

  • Participate in designing and building systems for maximum performance, reliability, and scalability
  • Work with engineering teams on product design, decisions, and troubleshooting
  • Increase general stability, observability, and metrics for uptime and stability
  • Champion monitoring infrastructure
  • Implement and improve site reliability posture
  • Respond to and troubleshoot downtime events
  • Participate in brainstorming sessions with the engineering team

Requirements For Site Reliability Engineer (SRE)

Kubernetes
Redis
PostgreSQL
Node.js
Linux
  • 4-6+ years of knowledge of Amazon Web Services ecosystem
  • Experience working with Kubernetes
  • Experience in managing production-critical infrastructures and DevOps mindset
  • Familiar with SRE best practices and procedures
  • Experience with IaC (CDK, Terraform), CI/CD (GitHub Actions, ArgoCD)
  • Familiar with Containerisation (Docker)
  • Knowledgeable in network, firewall, and security best practices
  • Experience with self-healing automation and monitoring tools
  • Knowledge of relational databases, preferably PostgreSQL
  • Strong self-starter and problem-solver
  • Excellent interpersonal, written, and oral communication skills
  • Experience with Linux-based EC2 instances management

Benefits For Site Reliability Engineer (SRE)

Visa Sponsorship
  • Visa Sponsorship

Interested in this job?

Jobs Related To ClickUp Site Reliability Engineer (SRE)

Site Reliability Engineer

Senior Site Reliability Engineer position at OneDegree, focusing on cloud infrastructure, monitoring, and automation for insurance and cybersecurity platforms in APAC.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Prove, focusing on building and maintaining scalable, reliable systems for digital identity solutions.

Senior Site Reliability Engineer

Senior Site Reliability Engineer role at Prove, focusing on building and maintaining scalable, reliable systems for digital identity solutions.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and scalability.

Senior Software Engineer, Site Reliability Engineering, Google Cloud

Senior SRE position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.