Bioinformatics Data Engineer
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Requirements
- Educational Background: Bachelor's or Master's degree in Bioinformatics, Computational Biology, Computer Science, or a related field with a heavy focus on the life sciences.
- Biological Data: Deep, hands-on familiarity with multiple biomedical data modalities (e.g., genomics, transcriptomics, spatial omics, protein structure, biomedical imaging, clinical/phenotypic data, etc.).
- Biological Tools: Familiar with Bioconda,Biopython,Bioconductor, samtools,bamtools,bcftools,gffutils etc.
- Scripting & Tooling: Strong programming skills in Python (Pandas, NumPy) and proficiency with standard bioinformatics workflow managers and tools (e.g., Ray, Kubeflow).
- Engineering Handoff: Experience writing clean, modular code that can be easily picked up by core data engineers for optimization in cloud environments (AWS/GCP/HF) and containerized setups (Docker).
- AI/ML Awareness: A solid understanding of machine learning workflows and how biological data must be formatted and batched for deep learning frameworks (e.g., PyTorch).
Additional Information
GenBio AI develops multiscale foundation models to decode and simulate human biology. Our team is accelerating towards an ambitious future where scientists can unlock humanity's biggest challenges in drug discovery, healthcare, and fundamental research with AIDO (AI-Driven Digital Organism): a unified framework for predicting, simulating, and programming biology across all scales. The foundation of this vision begins today as we engineer the virtual cell to model and simulate the fundamental unit of life. This vision has brought together a talent-dense group of product-minded researchers and engineers dedicated to bringing it to reality. Our team prides itself on our strong engineering culture and highly interdisciplinary and collaborative approach. We are based in Palo Alto, with satellite offices in Paris and Abu Dhabi. As our data ingestion needs grow, we are looking for a Bioinformatics Data Engineer to act as the crucial bridge between raw biological data and our scalable infrastructure. Reporting to the Data Engineering Lead, you will leverage your deep biological domain expertise to build the initial scripts and processing logic for complex datasets, ensuring they are primed for large-scale foundation model training. Join us as we embark on this journey to redefine the future of biology and medicine. We are an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. GenBio AI participates in the U.S. Department of Homeland Security's E-Verify program to confirm the employment eligibility of all newly hired employees. For more information on E-Verify, please visit www.e-verify.gov.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at genbio? Share your experience