Data Engineer

A Silicon Valley startup combining Generative AI with biology and medicine, pioneering pan-modal Large Biological Models (LBM) for healthcare transformation.
Data
Mid-Level Software Engineer
In-Person
11 - 50 Employees
3+ years of experience
AI · Healthcare · Biotech

Description For Data Engineer

GenBio is a pioneering Silicon Valley startup at the intersection of Generative AI and biomedicine. As the first mover in pan-modal Large Biological Models (LBM), we're revolutionizing healthcare through advanced AI applications. Our team consists of leading innovators in AI and Biological Science, working from our headquarters in Silicon Valley and our Paris office.

As a Data Engineer, you'll be integral to our mission of transforming biology and medicine through AI. You'll work on cutting-edge systems for foundation model development, collaborating with top minds in the field. Your role involves building scalable solutions for large-scale model training and deployment, while ensuring robust backend systems and APIs support our complex workflows.

The position offers unique exposure to both advanced AI technologies and biological sciences. You'll work with modern tools including Python, JavaScript, and various deep learning frameworks, while contributing to groundbreaking developments in biomedicine. Our strong R&D team and leadership in LLM and generative AI provide an exceptional environment for professional growth.

This role is perfect for someone who wants to be at the forefront of AI's application in healthcare and biology. You'll have the opportunity to work with state-of-the-art technologies while contributing to potentially life-changing medical advancements. Our inclusive culture and commitment to diversity create an environment where innovation thrives and every team member can make a significant impact.

Last updated 14 days ago

Responsibilities For Data Engineer

  • Design, develop, optimize, and maintain software systems across the foundation model development and deployment lifecycle
  • Build and maintain scalable codebases for large-scale foundation model training
  • Collaborate with data engineers and research scientists to integrate models into production
  • Implement software engineering best practices
  • Build and optimize back-end systems, APIs, and databases
  • Ensure code quality through testing and code reviews

Requirements For Data Engineer

Python · JavaScript · Node.js · PostgreSQL · MongoDB · Kubernetes

  • Bachelor's or Master's degree in Computer Science, Engineering, or related field
  • Strong programming skills in JavaScript, Python, and modern web development frameworks
  • Proficiency with deep learning frameworks (PyTorch, HuggingFace Transformers)
  • Familiarity with resource management systems (SLURM, Kubernetes)
  • Proficiency in back-end frameworks and database technologies
  • Expertise in distributed systems, cloud computing, and containerization tools

Jobs Related To GenBio Data Engineer

Data Integrations Engineer

Data Integrations Engineer role at ProPublica, leading development of publishing and audience data platforms, offering remote work and competitive compensation.

Data Engineer

Mid-senior Data Engineer position at GWI in Athens, focusing on business data platform development with hybrid work arrangement and comprehensive benefits.

Software Developer 3

Mid-level software developer position at Oracle focusing on data engineering, pipeline development, and secure data storage solutions.

Software Developer 3

Mid-level Software Developer role at Oracle focusing on Business Data Intelligence and Analytics platforms, requiring 3-5 years of experience in data engineering and modeling.