The Senior Kafka DevOps Engineer designs and develops applications to deliver integration and automation solutions across Alarm.com's infrastructure and applications stack throughout the entire life-cycle. This position works closely with Engineering, Quality Engineering, Operations and Release Management in on-boarding of new and existing life safety and home automation applications to the cloud and serves as the main liaison between Release Management, Engineering and Operations in the delivery of automation solutions and software deployment frameworks from inception to development to production to maintenance. This role also works side-by-side with Engineering and Operations in all technical aspects of the Alarm.com cloud and applications with an ever-watchful eye on their availability, latency, performance, and capacity.
Kafka Responsibilities
- Implement Confluent Kafka from the ground up using VMware and Cloud Providers.
- Troubleshoot and resolve Confluent Kafka issues as they occur.
- Deploy and manage Confluent Kafka using automation tools like Terraform, Ansible, and Puppet.
- Design and optimize event-driven and data streaming services from a Confluent Kafka perspective, including capacity planning.
- Deploy and maintain Confluent Kafka monitoring dashboards and alerts.
- Migrate Confluent Kafka into Kubernetes using GitOps workflows.
- Grow our Kubernetes pipelines and capabilities.
- Serve as Alarm.com's SME for Confluent/Kafka.
- Manage and optimize Linux VMs hosting Confluent/Kafka.
DevOps Responsibilities
- Advance administration and reliability of our environment through automation and testing.
- Maintain and grow data tier services, including Kafka, Redis, Elasticsearch, and NoSQL.
- Take leadership of platform reliability, operational excellence, and developer experience.
- Provide technical direction to engineering teams on best practices for deploying and managing third-party systems and custom software.
- Provide innovative solutions to increase the reliability, performance, and predictability of Alarm.com's systems and cloud.
- Architect and implement solutions to migrate on-premises workloads to the public cloud.
- Mentor and develop teammates.
- Identify and respond to production issues and participate in on-call support.
- Continue to build out Terraform, Ansible, Puppet, and other tooling.
Qualifications
- 10+ years of experience.
- B.A. or B.S. (or higher level degree) in Computer Science or a similar engineering program with strong academic performance preferred.
- Knowledge of Kubernetes infrastructure and troubleshooting using kubectl.
- Experience with services such as Redis, Elasticsearch, Kafka, MySQL, NoSQL, and other open-source technologies.
- Experience using a configuration management framework like Terraform, Ansible, Puppet, Chef, or Salt.
- Knowledge of GitOps workflows.
- Experience with cloud technologies like Azure, AWS, and VMware.
- Experience administering Linux.
- Experience in monitoring and alerting practices using tools such as DataDog, Grafana, PagerDuty.
- Intermediate programming skills with a language like Python, PowerShell, or Go.
Alarm.com is the leading cloud-based platform for smart security and the Internet of Things. More than 7.6 million home and business owners depend on our solutions every day to make their properties safer, smarter, and more efficient. We're innovating new technologies in rapidly evolving spaces including AI, video analytics, facial recognition, machine learning, energy analytics, and more. We're seeking those who are passionate about creating change through technology and who want to make a lasting impact on the world around them.