Senior Site Reliability Engineer

Material Bank is the world's largest material marketplace for the Architecture and Design industry, providing the fastest and most powerful way to search and sample materials.
Site Reliability
Senior Software Engineer
Hybrid
5+ years of experience

Description For Senior Site Reliability Engineer

Material Bank, the world's largest material marketplace for the Architecture and Design industry, is seeking a Senior Site Reliability Engineer to join their team. This role focuses on system-level optimization and implementing best practices to ensure high availability, reliability, and scalability.

As a Senior SRE at Material Bank, you'll be responsible for incident response, infrastructure management, monitoring & automation, process improvement, system debugging, and scalability planning. You'll work with cutting-edge technologies like Terraform, Kubernetes, and AWS to support the growth of Material Bank's platform, which serves hundreds of thousands of users.

Key responsibilities include:

  1. Participating in on-call rotation for incident response
  2. Managing and optimizing infrastructure using tools like Terraform and Kubernetes
  3. Developing monitoring systems for early issue detection
  4. Continuously refining operational processes
  5. Troubleshooting and resolving production issues
  6. Strategic planning for infrastructure expansion

The ideal candidate will have 5+ years of SRE experience with cloud platforms and Linux systems. You should be proficient in infrastructure as code, containerization, and have a strong understanding of operating systems, storage solutions, and networking. Experience with monitoring tools like New Relic and programming skills in Shell, GoLang, or Python are highly valued.

Material Bank offers a comprehensive benefits package, including medical, dental, and vision insurance, a 401(k) plan, generous PTO, and flexible work schedules. This is a hybrid position with options to work from New York City, Boston, or Miami-Boca Raton.

Join Material Bank and be part of a team that's transforming the architecture and design industry through innovative technology and efficient material sampling processes. Apply now to contribute to the growth and success of this fast-paced, high-growth company!

Last updated 7 months ago

Responsibilities For Senior Site Reliability Engineer

  • Participate in on-call rotation for incident response
  • Operate and manage infrastructure with tools like Terraform, GitHub/CodePipeline CI/CD, and Kubernetes
  • Develop monitoring systems for early detection of potential issues
  • Continuously refine operational processes
  • Troubleshoot and resolve production issues
  • Strategically plan infrastructure expansion and enhancement

Requirements For Senior Site Reliability Engineer

Kubernetes
Linux
MySQL
Redis
  • 5+ years of SRE practice experience with cloud platforms/providers and Linux systems
  • Experience with infrastructure as code (Terraform)
  • Experience with containerization (Kubernetes/ECS)
  • Ability to manage and troubleshoot operating systems, storage solutions, and networking
  • Experience with monitoring and instrumentation tools (New Relic)
  • Focus on engineering best practices
  • Proficiency in programming languages such as Shell, GoLang, and/or Python

Benefits For Senior Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
  • Generous PTO
  • Sick Days
  • Paid National Holidays
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
  • Short-term/long-term disability plans
  • 401(k)
  • Flexible Work Schedules

Interested in this job?

Jobs Related To Material Bank Senior Site Reliability Engineer

Site Reliability Engineer

Senior Site Reliability Engineer role at AION, building and maintaining infrastructure for a decentralized AI cloud platform with focus on automation and reliability.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, SRE, Cloud Incident Response

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.

Senior Software Engineer, Site Reliability Engineering

Senior Site Reliability Engineering role at Google, focusing on building and maintaining large-scale distributed systems for Google Cloud services.