2025
Representation Learning to Advance Multi-institutional Studies with Electronic Health Record Data.
CoRR, February, 2025

ARCH: Large-scale knowledge graph via aggregated narrative codified health records analysis.
J. Biomed. Informatics, 2025

2024
Clustering Sequence Data with Mixture Markov Chains with Covariates Using Multiple Simplex Constrained Optimization Routine (MSiCOR).
J. Comput. Graph. Stat., 2024

2023
Informative missingness: What can we learn from patterns in missing laboratory data in the electronic health record?
J. Biomed. Informatics, March, 2023

2022
International electronic health record-derived post-acute sequelae profiles of COVID-19 patients.
npj Digit. Medicine, 2022

International comparisons of laboratory values from the 4CE collaborative to predict COVID-19 mortality.
npj Digit. Medicine, 2022

SurvMaximin: Robust federated approach to transporting survival risk prediction models.
J. Biomed. Informatics, 2022

2021
Validation of an internationally derived patient severity phenotype to support COVID-19 analytics from electronic health record data.
J. Am. Medical Informatics Assoc., 2021

2020
Automated ICD coding via unsupervised knowledge integration (UNITE).
Int. J. Medical Informatics, 2020

SAMGEP: A Novel Method for Prediction of Phenotype Event Times Using the Electronic Health Record.
Proceedings of the AMIA 2020, 2020

2015
Demonstrating the Advantages of Applying Data Mining Techniques on Time-Dependent Electronic Medical Records.
Proceedings of the AMIA 2015, 2015