AD - Data Engineer - 0007

A technology company specializing in data engineering and analytics solutions.
Bogotá, Bogota, Colombia
Data
Mid-Level Software Engineer
Remote
3+ years of experience
Enterprise SaaS

Description For AD - Data Engineer - 0007

We are seeking a skilled Data Engineer to join our dynamic team at Thaloz. The role focuses on designing, building, and maintaining scalable data pipelines while ensuring data integrity and availability for analysis. As a Data Engineer, you'll work with Python, SQL, and PySpark to process large datasets in a distributed computing environment. You'll collaborate closely with data scientists and analysts to support data-driven decision-making across the organization. The position requires expertise in data warehousing, ETL processes, and cloud-based technologies, particularly Databricks. Working in an Agile environment, you'll participate in sprint planning and use Jira for project management. The ideal candidate brings 3+ years of experience, strong problem-solving abilities, and excellent communication skills. This remote position offers an opportunity to work with cutting-edge data technologies while contributing to the company's data infrastructure and analytics capabilities.

Last updated a day ago

Responsibilities For AD - Data Engineer - 0007

  • Design, develop, and maintain robust data pipelines to process large volumes of data from various sources
  • Collaborate with data scientists and analysts to understand data requirements and deliver high-quality datasets
  • Optimize data storage and retrieval processes to improve performance and efficiency
  • Implement data quality checks and monitoring to ensure the accuracy and reliability of data
  • Work with cloud-based technologies, particularly Databricks, to manage and analyze data
  • Participate in Agile development processes, including sprint planning and retrospectives using Jira
  • Document data processes, architectures, and workflows for future reference and compliance
  • Stay updated with industry trends and best practices in data engineering and analytics

Requirements For AD - Data Engineer - 0007

Python
  • Proficient in Python as a core programming language for data manipulation and analysis
  • Strong knowledge of SQL for querying databases and performing data transformations
  • Experience with PySpark for processing large datasets in a distributed computing environment
  • Familiarity with Databricks is a plus, enabling efficient data processing and collaboration
  • Understanding of data warehousing concepts and ETL processes
  • Experience working in Agile environments, particularly with Scrum methodologies
  • Proficient in using Jira for project management and tracking progress
  • Minimum of 3 years of experience in data engineering or related fields
  • Proven track record of building and maintaining data pipelines in production environments
  • Excellent communication skills to effectively collaborate with cross-functional teams
  • Strong problem-solving abilities and attention to detail

Interested in this job?

Jobs Related To Thaloz AD - Data Engineer - 0007

Business Intelligence Engineer II, CMT

Business Intelligence Engineer role at Amazon focusing on pricing analytics and data-driven decision making, requiring 5+ years of experience in data analysis and BI tools.

Business Intelligence Engineer, AWS DC Bridge

Business Intelligence Engineer role at AWS DC Bridge team, focusing on data analysis and insights for AWS data centers, offering competitive compensation and growth opportunities.

Data Architect, Professional Services

AWS Data Architect role combining cloud expertise with customer consulting, implementing scalable solutions and driving cloud transformation.

Business Intelligence Engineer, AWS DC Bridge

Business Intelligence Engineer role at AWS DC Bridge team, focusing on data analysis and insights for data center operations, offering competitive compensation and benefits.

Software Engineer II, Search Science Data Infrastructure

Software Engineer II position at Amazon focusing on search infrastructure, ML data processing, and distributed systems development.