Senior Site Reliability Engineer

Material Bank is the world's largest material marketplace for the Architecture and Design industry, providing the fastest and most powerful way to search and sample materials.
Site Reliability
Senior Software Engineer
Hybrid
5+ years of experience

Description For Senior Site Reliability Engineer

Material Bank, the world's largest material marketplace for the Architecture and Design industry, is seeking a Senior Site Reliability Engineer to join their team. This role focuses on system-level optimization and implementing best practices to ensure high availability, reliability, and scalability.

As a Senior SRE at Material Bank, you'll be responsible for incident response, infrastructure management, monitoring & automation, process improvement, system debugging, and scalability planning. You'll work with cutting-edge technologies like Terraform, Kubernetes, and AWS to support the growth of Material Bank's platform, which serves hundreds of thousands of users.

Key responsibilities include:

  1. Participating in on-call rotation for incident response
  2. Managing and optimizing infrastructure using tools like Terraform and Kubernetes
  3. Developing monitoring systems for early issue detection
  4. Continuously refining operational processes
  5. Troubleshooting and resolving production issues
  6. Strategic planning for infrastructure expansion

The ideal candidate will have 5+ years of SRE experience with cloud platforms and Linux systems. You should be proficient in infrastructure as code, containerization, and have a strong understanding of operating systems, storage solutions, and networking. Experience with monitoring tools like New Relic and programming skills in Shell, GoLang, or Python are highly valued.

Material Bank offers a comprehensive benefits package, including medical, dental, and vision insurance, a 401(k) plan, generous PTO, and flexible work schedules. This is a hybrid position with options to work from New York City, Boston, or Miami-Boca Raton.

Join Material Bank and be part of a team that's transforming the architecture and design industry through innovative technology and efficient material sampling processes. Apply now to contribute to the growth and success of this fast-paced, high-growth company!

Last updated 4 months ago

Responsibilities For Senior Site Reliability Engineer

  • Participate in on-call rotation for incident response
  • Operate and manage infrastructure with tools like Terraform, GitHub/CodePipeline CI/CD, and Kubernetes
  • Develop monitoring systems for early detection of potential issues
  • Continuously refine operational processes
  • Troubleshoot and resolve production issues
  • Strategically plan infrastructure expansion and enhancement

Requirements For Senior Site Reliability Engineer

Kubernetes
Linux
MySQL
Redis
  • 5+ years of SRE practice experience with cloud platforms/providers and Linux systems
  • Experience with infrastructure as code (Terraform)
  • Experience with containerization (Kubernetes/ECS)
  • Ability to manage and troubleshoot operating systems, storage solutions, and networking
  • Experience with monitoring and instrumentation tools (New Relic)
  • Focus on engineering best practices
  • Proficiency in programming languages such as Shell, GoLang, and/or Python

Benefits For Senior Site Reliability Engineer

Medical Insurance
Dental Insurance
Vision Insurance
401k
  • Generous PTO
  • Sick Days
  • Paid National Holidays
  • Medical Insurance
  • Dental Insurance
  • Vision Insurance
  • Short-term/long-term disability plans
  • 401(k)
  • Flexible Work Schedules

Interested in this job?

Jobs Related To Material Bank Senior Site Reliability Engineer

Senior Site Reliability Engineer - CTJ - POLY

Senior Site Reliability Engineer role at Microsoft working on Azure SQL services for government clouds, requiring security clearance and distributed systems expertise.

Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Senior Site Reliability Engineer position at Netflix focusing on cloud platform reliability for live streaming events, offering competitive compensation and comprehensive benefits.

Senior Site Reliability Engineer - AI Research Clusters

Senior Site Reliability Engineer position at NVIDIA focusing on AI research clusters, requiring 5+ years of experience in large-scale infrastructure and GPU computing.

AI & Machine Learning Site Reliability Engineer

Senior SRE position focusing on AI/ML infrastructure, requiring 5+ years of experience, offering remote work and comprehensive benefits at a growing Enterprise SaaS company.

Site Reliability Engineer, Managed Operations

Senior Site Reliability Engineer role at AWS Berlin, focusing on launching and managing the European Sovereign Cloud infrastructure and services.