PalUp is revolutionizing social interactions through their AI-driven platform that serves millions of users globally. As a Site Reliability Engineer, you'll be at the heart of their engineering team, ensuring the platform's stability, reliability, and efficiency.
The role demands a skilled engineer with 3+ years of SRE/DevOps experience who excels in cloud services (particularly GCP), Linux administration, and container orchestration with Kubernetes. You'll be working with cutting-edge technologies including Python, Golang, and modern monitoring tools like Grafana and Prometheus.
Your responsibilities will span from designing and implementing monitoring systems to optimizing CI/CD pipelines and managing cloud-based deployments. You'll be crucial in analyzing and improving system performance, ensuring high availability, and developing automation tools to streamline operations.
The ideal candidate values automation, proactive problem-solving, and collaborative teamwork. You'll thrive in PalUp's dynamic environment where innovation and technical excellence are paramount. The company emphasizes creating scalable solutions and empowering teams to deliver world-class experiences.
This is an excellent opportunity for a mid-level engineer passionate about site reliability and DevOps to make a significant impact in a growing AI-focused company. You'll work alongside talented engineers who value collaboration, fairness, and mutual respect, while helping shape the future of AI-driven social interactions.