Data Engineer

A nonprofit authorized by Congress to support CDC's health protection mission through partnerships.
Cleveland, OH, USA
$103,500 - $143,500
Data
Mid-Level Software Engineer
Remote
501 - 1,000 Employees
3+ years of experience
Healthcare

Description For Data Engineer

The CDC Foundation is seeking a Data Engineer to join its Workforce Acceleration Initiative (WAI), a federally funded program aimed at enhancing public health agencies' technological capabilities. This remote role supports the Cleveland Department of Public Health and focuses on designing and maintaining data infrastructure for public health surveillance and analytics. The position offers a salary range of $103,500-$143,500 and involves collaborating with epidemiologists and data analysts to transform and integrate public health data.

The role combines technical expertise in data engineering with public health impact, requiring skills in data pipeline development, ETL processes, and cloud services (particularly Azure Databricks). The successful candidate will work on data integration projects, implement security measures, and ensure data quality for public health surveillance analyses.

Working within the Office of Epidemiology and Population Health, the Data Engineer will play a vital role in modernizing public health data systems. The position offers the opportunity to contribute to meaningful public health initiatives while working with cutting-edge data technologies. The CDC Foundation's mission of protecting public health through collaboration makes this an ideal role for someone passionate about using data engineering skills for social impact.

This grant-funded position offers the flexibility of remote work for U.S.-based candidates, combining the stability of an established organization with the opportunity to make a significant impact on public health data systems. The role requires a blend of technical expertise, collaborative skills, and public health interest, making it perfect for a data engineer looking to apply their skills in a meaningful context.


Responsibilities For Data Engineer

  • Design, build, and maintain data infrastructure
  • Use software engineering methods to integrate, process, and prepare data for analyses
  • Perform data linkages between public health surveillance data and geospatial data assets
  • Create and manage data pipelines and systems
  • Implement security measures to protect sensitive information
  • Design and manage data storage systems
  • Implement and maintain ETL processes
  • Provide technical guidance to other staff
  • Document data transformation processes

Requirements For Data Engineer

  • Python
  • Java
  • Bachelor's degree in computer science or information systems, or equivalent experience
  • Demonstrated ability in complex data management and data preparation
  • Experience working with data integration frameworks
  • Experience working with cloud services & infrastructure (Microsoft Azure Databricks preferred)
  • Experience in designing, writing, and delivering code in a team environment
  • Ability to thrive in a project-based, team environment

Benefits For Data Engineer

  • Medical Insurance
