Data Engineer - Computational Biology (Senior Associate)
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Requirements
- MS in Computational Biology, Biology, Physics, Statistics, or a related technical discipline OR
- BS with 2+ years of relevant research experience developing data products and data integration solutions
- Single‑cell/NGS, functional genomics, genetics, or proteomics data analysis experience
- Hands-on experience with workflow tools such as Nextflow, or strong interest in developing this expertise.
- Proficiency in Python and familiarity with software development practices.
- Experience troubleshooting technical problems and learning new analytical approaches.
- Excellent communication and collaboration skills
- Experience working effectively in cross-functional teams.
- Background or demonstrated interest in life sciences, pharmaceutical research, drug discovery, or bioinformatics.
- Exposure to software engineering best practices, including python package development, cloud computing environments, CI/CD, and engineering tooling.
- Hands-on experience handling, processing, integrating, and analyzing large heterogenous data sets data in a drug discovery research environment.
- Interest in using AI-assisted coding tools to improve development productivity.
- Work Location Assignment: This is a hybrid role requiring you to live within commuting distance and work on-site an average of 2.5 days per week or more as needed.
- Relocation assistance may be available based on business needs and/or eligibility.
- Candidates must be authorized to be employed in the U.S. by any employer.
- U.S. work visa sponsorship (such as TN, O-1, H-1B, etc.) is not available for this role now or in the future.
- Sunshine Act
Benefits
Additional Information
POSITION SUMMARY You will apply strong bioinformatics and cloud engineering practices to develop, operationalize and evolve production bioinformatics pipelines to deliver reliable data products for our Pfizer R&D research units. You will contribute to the development and operation of bioinformatics pipelines and data products, working closely with senior scientists and engineers to advance a cutting‑edge *omics ecosystem platform for Pfizer R&D. You will leverage your expertise to design innovative approaches that extract valuable insights from Pfizer's proprietary and external datasets, enabling the generation of testable hypotheses across the entire drug discovery value chain. POSITION RESPONSIBILITIES Contribute to the development, deployment, and operation of production‑grade Nextflow pipelines on cloud infrastructure, ensuring scalable and reproducible execution. Support pipeline lifecycle management, including upgrades, troubleshooting, performance tuning, and reliability improvements for reusable workflows. Learn and apply DevOps best practices for pipelines and platform services (e.g., CI/CD, automation, and engineering tooling). Work with wet-lab and research scientists, with guidance from senior team members to translate data analysis requirements into robust, production‑ready pipeline and platform solutions. Develop and evolve an omics data platform that enables efficient, scalable processing and delivery of *omics datasets as reliable data products. Participate in collaborations with external partners and vendors to strengthen pipeline quality, sustainability, and adoption of best practices. Help support the user community by answering questions, documenting workflows, and sharing best practices.
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at Pfizer? Share your experience