Site Reliability Engineer

Portuguese IT consulting company founded in 2015, specializing in IT projects, cybersecurity assessments, and nearshoring services.
Site Reliability
Senior Software Engineer
Hybrid
3+ years of experience

Description For Site Reliability Engineer

Powertalent, established in 2015 as part of a technological disruption group, is a Portuguese company specializing in IT projects. With its extensive experience, it succeeds where others fail. The company offers tailored consulting solutions, performs cybersecurity assessments and diagnostics, and designs custom solutions for its clients. Taking advantage of Portugal's strengths, such as climate, gastronomy, and security, it provides nearshoring services to support its clients' growth. It also offers turnkey project services for a hands-off experience.

As a Site Reliability Engineer, you'll be joining a company that prides itself on being "a company of people, for people!" The role requires a strong background in SRE practices with at least 3 years of experience. You'll be working with modern technologies including SQL, Kibana, Elasticsearch, Prometheus, Grafana, and AWS, while collaborating through platforms like Jira, Confluence, and GitLab.

The ideal candidate will have a passion for monitoring systems, problem-solving, and continuous improvement. You'll be expected to proactively suggest improvements in logging systems, metrics, and alerting mechanisms. The role requires someone who can understand complex systems and their interactions within the ecosystem, while effectively communicating with various stakeholders from business to engineering teams.

The position offers attractive benefits including health insurance, meal allowance, and opportunities for continuous training. The hybrid work model provides flexibility while maintaining collaborative opportunities. Located in Lisbon, Portugal, you'll be part of a growing technology company that values innovation and personal growth.

Last updated 3 months ago

Requirements For Site Reliability Engineer

  • 3+ years of experience in application support within an SRE (Site Reliability Engineering) context
  • Knowledge of technologies: SQL, Kibana, Elasticsearch, Prometheus, Grafana, AWS and Shell
  • Knowledge of collaborative platforms: Jira, Confluence and GitLab
  • Ability to understand various distinct systems, their functionalities and where they fit in an ecosystem
  • Passion for monitoring, analyzing problems, collecting evidence, identifying causes and finding solutions
  • Proactivity in proposing and validating logs and metrics to detect potential problems
  • Proactivity in suggesting improvements in logging systems, metrics and alerts
  • Proactivity in suggesting improvements in platforms and/or tools to help with daily work
  • Able to adapt communication to various stakeholders (business, engineering, etc.)

Benefits For Site Reliability Engineer

  • Medical Insurance
  • Meal Allowance
  • Health Insurance
  • Hybrid Work
  • Continuous Training

Jobs Related To Powertalent Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer position at Oracle, focusing on cloud infrastructure and systems reliability with 3-5+ years of experience required.

Site Reliability Engineer

Senior Site Reliability Engineer role at AION, building and maintaining infrastructure for a decentralized AI cloud platform with focus on automation and reliability.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior Software Developer role in Site Reliability Engineering at Google Cloud, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Senior Software Developer, Site Reliability Engineering, Google Cloud

Senior SRE role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and comprehensive benefits.

Senior Software Engineer, SRE, Cloud Incident Response

Senior SRE position at Google focusing on Cloud Incident Response, requiring expertise in distributed systems and incident management.