Truveta brand logo mark in teal on a black background, featuring stacked chevron shapes forming the Truveta symbol.

ISPOR 2026: Patient journey foundation models improve EHR data quality through unit of measure imputation

by | May 18, 2026

Authors: Ehsan Alipour, MD, PhD ⊕,Truveta, Inc, Bellevue, WA, Wilson Lau, PhD ⊕, Truveta, Inc, Bellevue, WA, Youngwon Kim, PhD ⊕, Truveta, Inc, Bellevue, WA, Sihang Zeng ⊕, Truveta, Inc, Bellevue, WA, Anand Oka, PhD ⊕, Truveta, Inc, Bellevue, WA, Jay Nanduri, MBA, MS  Truveta, Inc, Bellevue, WA
Patient journey foundation models improve EHR data quality through unit of measure imputation
  • Patient journey foundation models (PJFMs) achieved 98% accuracy in imputing missing or incorrect units of measure (UoMs) across 438 unique UCUM units in EHR data.
  • The model accurately distinguished between clinically plausible units, including overlapping laboratory units such as mg/dL versus mg/L.
  • These findings demonstrate how foundation models trained on longitudinal patient journeys can improve EHR data quality and support downstream clinical and research applications.
This report is an extension of our poster presented at ISPOR 2026, titled Patient journey foundational model for scalable imputation of missing units of measurement (UoM) in electronic health records data.

Electronic health records (EHRs) contain large amounts of longitudinal clinical information but are often incomplete or inconsistently represented. One common challenge is missing or incorrect units of measure (UoMs), which can substantially alter interpretation of laboratory and clinical values. For example, the same numeric value may represent very different clinical meanings depending on whether it is recorded in mg/dL or mmol/L.

Foundation models trained on longitudinal patient trajectories may help address this challenge by learning contextual relationships across clinical events. In this study, we evaluated whether a Patient Journey Foundation Model (PJFM) could accurately impute missing UoMs across diverse laboratory and observational data.

Methods

Using a subset of Truveta Data, we represented each patient journey as a chronological sequence of clinical events including diagnoses, laboratory values, units, and timing information. Events were encoded using SNOMED-CT, LOINC, and UCUM vocabularies.

The PJFM architecture was based on GPT-2 and extended to jointly model event type, medical code, value, unit, and time using transformer-based embeddings. The model was pretrained on the task of next event prediction with an auxiliary head trained to predict UoMs across 438 unique UCUM units for each event in the patient journey from the observations and laboratory test tables (Figure 1). Missing UoMs occurred in 19% of laboratory tests and 50% of observations. Performance was evaluated using accuracy, precision, recall, and weighted F1-score.

Diagram showing a patient journey foundation model architecture for predicting missing units of measure in electronic health record data. A sequence of longitudinal clinical events is displayed over time, including fasting blood glucose, LDL, and HDL laboratory events with associated numeric values. The model uses the chronological patient journey as input and predicts units of measure such as mmol/L and mg/dL based on longitudinal clinical context. Red outlines highlight examples where the same laboratory event appears at different points in the patient timeline. A caption states that the model uses longitudinal patient context to predict units of measure at the token level.

Results

The PJFM achieved 98% overall accuracy in imputing UoMs, with a weighted average F1-score of 0.90. Performance was highest for commonly occurring units such as mg/dL, %, mmol/L, and g/dL.

The model also performed well for measurements with multiple clinically plausible units. For example, body weight and height units were predicted with 95–100% accuracy across kilograms, pounds, centimeters, feet, and inches. The model incorporated broader physiologic context when making predictions. In a synthetic example, the same numeric values produced different predicted units depending on whether the resulting body mass index (BMI) was clinically plausible.

As an example for measurements with units of measurement with overlapping distributions, strong performance was observed for C-reactive protein (CRP), which may be reported in either mg/dL or mg/L. Despite overlapping numeric ranges, the model achieved 94% accuracy for mg/dL and 89% accuracy for mg/L.

Table showing examples of clinical measurements and the accuracy of a patient journey foundation model in predicting units of measure from electronic health record data. The table includes body weight, body height, and body temperature measurements with associated UCUM unit labels, UCUM names, and prediction accuracy percentages. The model achieved 99–100% accuracy for most units, including grams, kilograms, pounds, centimeters, feet, and temperature units in Celsius and Fahrenheit. Accuracy for meters was lower at 83%. A caption notes that the examples include height, weight, and temperature units.

Discussion

This study demonstrates that patient journey foundation models can accurately impute missing or incorrect units of measure across diverse EHR data. Beyond predictive modeling, PJFMs may provide important data quality improvements by harmonizing incomplete structured clinical information.

The model’s ability to use broader patient context rather than isolated values suggests foundation models can learn clinically meaningful physiologic relationships from longitudinal EHR data. Improving UoM consistency may enhance downstream clinical research, interoperability, and real-world evidence generation.

While future applications of PJFMs may include disease prediction and clinical decision support, this work highlights how even focused use cases such as UoM imputation can meaningfully improve healthcare data quality.

These are preliminary research findings and not peer reviewed. Data are constantly changing and updating. These findings are consistent with data accessed September 2025.

Citations

  1. A. Radford et al., Language Models are Unsupervised Multitask Learners (2019).
  2. J. Su et al., RoFormer: Enhanced Transformer with Rotary Position Embedding (2024).
  3. K. Bradwell et al., Harmonizing Units of Measure in Electronic Health Records (2022).