The Senior DevOps engineer at Alarm.com designs and develops applications to deliver integration and automation solutions across Alarm.com's infrastructure and applications stack throughout the entire life-cycle. This position works closely with Engineering, Quality Engineering, Operations and Release Management in on-boarding of new and existing life safety and home automation applications to the cloud. The role serves as the main liaison between Release Management, Engineering and Operations in the delivery of automation solutions and software deployment frameworks from inception to development to production to maintenance. This role also works side-by-side with Engineering and Operations in all technical aspects of the Alarm.com cloud and applications with an ever-watchful eye on their availability, latency, performance, and capacity.
Key responsibilities include:
- Maintaining and supporting growing Kubernetes infrastructure
- Growing Kubernetes pipelines and capabilities
- Advancing administration and reliability through automation and testing
- Taking leadership of platform reliability, operational excellence and developer experience
- Providing technical direction on best practices for deploying and managing third party systems and custom software
- Architecting and implementing solutions to migrate on-premises workloads to the public cloud
- Mentoring and developing teammates
- Identifying and responding to production issues and participating in on-call support
- Building out Terraform, Ansible, Puppet, and other tooling
- Maintaining and growing data tier services, including Kafka, Redis, Elasticsearch, and NOSQL
The ideal candidate will have:
- 10+ years of experience
- B.A. or B.S. (or higher) in Computer Science or similar engineering program
- Strong knowledge of Kubernetes infrastructure and troubleshooting
- Experience with services such as Redis, Elasticsearch, Kafka, MySQL, NoSQL
- Experience with configuration management frameworks like Ansible, Puppet, Chef, Salt
- Knowledge of GitOps workflows
- Experience with cloud technologies like Azure, AWS, VMware
- Linux administration skills
- Experience in monitoring and alerting practices using tools like Data Dog, Grafana, Pager Duty
- Intermediate programming skills with Python, PowerShell or Go
- Experience with Kafka/Confluent capacity planning, installation, administration, and platform management
Alarm.com offers a collaborative environment, immediate impact opportunities, diverse experience, and a focus on team culture. The company values in-person collaboration, with employees working from the office 4 days a week. Benefits include competitive pay, subsidized medical plans, HSA with company contribution, 401(k) with employer match, paid holidays, wellness time, vacation, paid maternity and bonding leave, and more.