Skip to main content
Back to jobs

Staff Software Engineer, Data Extraction

External
Pivotal Health logoPivotal Health · Remote
$210K–$240K/yrFull-timeRemote1d ago
ClassificationData ModelingDesign SystemsDocumentationFHIRHL7
Cover LetterConnect

Prepare for this interview

Elite

AI-generated questions, company research, and talking points tailored to this role


About the role

Pivotal Health is expanding its investment in clinical data infrastructure as a core strategic initiative. We are building a state-of-the-art clinical data platform that collects, processes, and transforms complex healthcare information from a variety of sources and formats into structured, high-quality data assets. This platform serves as the foundation for a growing set of products, analytics capabilities, and operational workflows across the organization. As healthcare data continues to grow in volume and complexity, building systems that securely process, govern, and deliver high-quality clinical data has become a critical capability. This engineer will own key systems within that platform, helping define how clinical data is ingested, processed, normalized, and delivered at scale. As a Staff Software Engineer focused on Data Extraction, you will design and build the ingestion, parsing, normalization, and enrichment pipelines that turn messy healthcare data into structured assets. You'll work across OCR technologies, document processing frameworks, healthcare interoperability standards, and large-scale data extraction and processing systems. Your work will directly power claims evidence generation, provider advocacy initiatives, and efforts to improve Independent Dispute Resolution (IDR) outcomes and win rates. This role is ideal for someone who enjoys solving difficult data problems where there is no clean source of truth. You'll thrive here if you're excited by large-scale healthcare datasets, ambiguous technical challenges, and the opportunity to define foundational systems that will influence product strategy for years to come.

Responsibilities

  • Own clinical data extraction architecture. Design and evolve the systems that ingest, process, and normalize structured and unstructured clinical data from multiple healthcare sources.
  • Build document intelligence pipelines. Develop services that extract meaningful information from PDFs, scanned records, images, and clinical documentation.
  • Lead healthcare data modeling efforts. Create approaches for transforming raw clinical artifacts into structured, queryable datasets.
  • Develop scalable ingestion services. Design resilient systems capable of processing large volumes of healthcare records with high reliability and accuracy.
  • Build scalable data extraction systems. Leverage the best available technologies and architectural approaches to improve information extraction, classification, and normalization across diverse healthcare data sources.
  • Establish data quality standards. Create validation frameworks, monitoring systems, and feedback loops that improve extraction accuracy over time.
  • Influence and collaborate across clinical, product, engineering, data, and operational domains: Driving alignment on business objectives and translating them into technical systems that support provider evidence generation.
  • Drive technical strategy for unstructured healthcare data. Evaluate technologies, standards, and architectural approaches that accelerate clinical data adoption.
  • Mentor engineers across the organization. Raise engineering standards through design reviews, technical leadership, and hands-on collaboration.

Requirements

  • 8+ years building backend software systems in production environments.
  • Experience with healthcare data standards such as HL7, FHIR, CDA, or EHR integrations.
  • Familiarity with medical records, claims, or healthcare operations.
  • Experience designing large-scale data processing or document processing platforms, specifically working with clinical data - familiarity with medical terminology/coding practices.
  • Strong proficiency in Python and SQL, or similar backend languages.
  • Experience working with unstructured data and NLP systems.
  • Proven ability to design systems that operate reliably across imperfect or incomplete datasets.
  • Experience making architectural decisions that influence multiple teams or business functions.
  • Co

Benefits

Health insurance

Additional Information

About Pivotal Health Pivotal Health is the leading technology platform that helps healthcare providers get paid fairly in an increasingly complex reimbursement landscape. Today, many providers face persistent underpayment from health insurance companies, despite delivering high-quality care. While processes like IDR (Independent Dispute Resolution) were designed to promote fairness, they're often administrative-heavy, time-consuming, and difficult to navigate without the right tools. Pivotal Health combines software, data, and service into a seamlessly integrated, AI-driven platform that simplifies these complex reimbursement workflows. We help providers efficiently dispute underpaid claims, reduce administrative burden, and recover the reimbursement they're entitled to; without adding more work to already stretched teams. Our full-service IDR solution is just the starting point. We're building solutions that enable providers to operate with clarity, control, and confidence across the reimbursement journey.


Your Match

How well this role fits your profile.

Company Intel

What employees say

Worked at Pivotal Health? Share your experience

Interested in this role?

Apply on the company's website.

Cover LetterConnect