Cao Xiao

Orcid: 0000-0002-3869-6942

According to our database1, Cao Xiao authored at least 140 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
FRAMM: Fair ranking with missing modalities for clinical trial site selection.
Patterns, March, 2024

Dynamic Uncertainty Ranking: Enhancing In-Context Learning for Long-Tail Knowledge in LLMs.
CoRR, 2024

Segment as You Wish - Free-Form Language-Based Segmentation for Medical Images.
CoRR, 2024

Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval.
CoRR, 2024

TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets.
CoRR, 2024

KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge.
CoRR, 2024

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement.
CoRR, 2024

TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

PILOT: Legal Case Outcome Prediction with Case Law.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

MediTab: Scaling Medical Tabular Data Predictors via Data Consolidation, Enrichment, and Refinement.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Recent Advances in Predictive Modeling with Electronic Health Records.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Certifiably Byzantine-Robust Federated Conformal Prediction.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Zero-Resource Hallucination Prevention for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Unlocking Memorization in Large Language Models with Dynamic Soft Prompting.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Unity in Diversity: Collaborative Pre-training Across Multimodal Medical Sources.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ConSequence: Synthesizing Logically Constrained Sequences for Electronic Health Record Generation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
PyTrial: A Comprehensive Platform for Artificial Intelligence for Drug Development.
CoRR, 2023

GraphCare: Enhancing Healthcare Predictions with Open-World Personalized Knowledge Graphs.
CoRR, 2023

AnyPredict: Foundation Model for Tabular Prediction.
CoRR, 2023

Synthesize Extremely High-dimensional Longitudinal Electronic Health Records via Hierarchical Autoregressive Language Model.
CoRR, 2023

MedLink: De-Identified Patient Health Record Linkage.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Fast Online Value-Maximizing Prediction Sets with Conformal Cost Control.
Proceedings of the International Conference on Machine Learning, 2023

AutoTrial: Prompting Language Models for Clinical Trial Design.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ClinicalRisk: A New Therapy-related Clinical Trial Dataset for Predicting Trial Status and Failure Reasons.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic Tree-based Memory Network.
Proceedings of the 14th ACM International Conference on Bioinformatics, 2023

SPOT: Sequential Predictive Modeling of Clinical Trial Outcome with Meta-Learning.
Proceedings of the 14th ACM International Conference on Bioinformatics, 2023

2022
CHEER: Rich Model Helps Poor Model via Knowledge Infusion.
IEEE Trans. Knowl. Data Eng., 2022

MOLER: Incorporate Molecule-Level Reward to Enhance Deep Generative Model for Molecule Optimization.
IEEE Trans. Knowl. Data Eng., 2022

HINT: Hierarchical interaction network for clinical-trial-outcome predictions.
Patterns, 2022

Clinical trial site matching with improved diversity using fair policy learning.
CoRR, 2022

PopNet: Real-Time Population-Level Disease Prediction with Data Latency.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

AutoMap: Automatic Medical Code Mapping for Clinical Prediction Model Deployment.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

ATD: Augmenting CP Tensor Decomposition by Self Supervision.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Differentiable Scaffolding Tree for Molecule Optimization.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Benchmarking Automated Clinical Language Simplification: Dataset, Algorithm, and Evaluation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

MedAttacker: Exploring Black-Box Adversarial Attacks on Risk Prediction Models in Healthcare.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

SCRIB: Set-Classifier with Class-Specific Risk Bounds for Blackbox Models.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
An Adaptive Pattern Learning Framework to Personalize Online Seizure Prediction.
IEEE Trans. Big Data, 2021

Machine learning applications for therapeutic tasks with genomics data.
Patterns, 2021

FLANNEL (Focal Loss bAsed Neural Network EnsembLe) for COVID-19 detection.
J. Am. Medical Informatics Assoc., 2021

STAN: spatio-temporal attention network for pandemic prediction using real-world evidence.
J. Am. Medical Informatics Assoc., 2021

MedAttacker: Exploring Black-Box Adversarial Attacks on Risk Prediction Models in Healthcare.
CoRR, 2021

Self-supervised EEG Representation Learning for Automatic Sleep Staging.
CoRR, 2021

Differentiable Scaffolding Tree for Molecular Optimization.
CoRR, 2021

Augmented Tensor Decomposition with Stochastic Optimization.
CoRR, 2021

SafeDrug: Dual Molecular Graph Encoders for Safe Drug Recommendations.
CoRR, 2021

Therapeutics Data Commons: Machine Learning Datasets and Tasks for Therapeutics.
CoRR, 2021

HINT: Hierarchical Interaction Network for Trial Outcome Prediction Leveraging Web Data.
CoRR, 2021

PyHealth: A Python Library for Health Predictive Models.
CoRR, 2021

SumGNN: multi-typed drug interaction prediction via efficient knowledge graph summarization.
Bioinform., 2021

MolTrans: Molecular Interaction Transformer for drug-target interaction prediction.
Bioinform., 2021

DeepPurpose: a deep learning library for drug-target interaction prediction.
Bioinform., 2021

MedPath: Augmenting Health Risk Prediction via Medical Knowledge Paths.
Proceedings of the WWW '21: The Web Conference 2021, 2021

AID: Active Distillation Machine to Leverage Pre-Trained Black-Box Models in Private Data Settings.
Proceedings of the WWW '21: The Web Conference 2021, 2021

UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced Data.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier Detection.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

EVA: Generating Longitudinal Electronic Health Records Using Conditional Variational Autoencoders.
Proceedings of the Machine Learning for Healthcare Conference, 2021

MTC: Multiresolution Tensor Completion from Partial and Coarse Observations.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Advances in Mining Heterogeneous Healthcare Data.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Probabilistic and Dynamic Molecule-Disease Interaction Modeling for Drug Discovery.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

SafeDrug: Dual Molecular Graph Encoders for Recommending Effective and Safe Drug Combinations.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Change Matters: Medication Change Prediction with Recurrent Residual Networks.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Multi-version Tensor Completion for Time-delayed Spatio-temporal Data.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

MedRetriever: Target-Driven Interpretable Health Risk Prediction via Retrieving Unstructured Medical Text.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

CUP: Cluster Pruning for Compressing Deep Neural Networks.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

SPEAR: self-supervised post-training enhancer for molecule optimization.
Proceedings of the BCB '21: 12th ACM International Conference on Bioinformatics, 2021

Fusion: Towards Automated ICD Coding via Feature Compression.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

STELAR: Spatio-temporal Tensor Factorization with Latent Epidemiological Regularization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

MIMOSA: Multi-constraint Molecule Sampling for Molecule Optimization.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Introduction to Deep Learning for Healthcare
Springer, ISBN: 978-3-030-82183-8, 2021

2020
Dr. Agent: Clinical predictive model via mimicked second opinions.
J. Am. Medical Informatics Assoc., 2020

A Benchmark Dataset for Understandable Medical Language Translation.
CoRR, 2020

DeepRite: Deep Recurrent Inverse TreatmEnt Weighting for Adjusting Time-varying Confounding in Modern Longitudinal Observational Data.
CoRR, 2020

MolDesigner: Interactive Design of Efficacious Drugs with Deep Learning.
CoRR, 2020

Fast Graph Attention Networks Using Effective Resistance Based Graph Sparsification.
CoRR, 2020

SkipGNN: Predicting Molecular Interactions with Skip-Graph Networks.
CoRR, 2020

DeepPurpose: a Deep Learning Based Drug Repurposing Toolkit.
CoRR, 2020

SUOD: A Scalable Unsupervised Outlier Detection Framework.
CoRR, 2020

CLARA: Clinical Report Auto-completion.
CoRR, 2020

DeepEnroll: Patient-Trial Matching with Deep Embedding and Entailment Prediction.
CoRR, 2020

Opportunities and challenges of deep learning methods for electrocardiogram data: A systematic review.
Comput. Biol. Medicine, 2020

Optimal collection of medical specimens and delivery to central laboratory.
Ann. Oper. Res., 2020

Patient-Trial Matching with Deep Embedding and Entailment Prediction.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

StageNet: Stage-Aware Neural Networks for Health Risk Prediction.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

REST: Robust and Efficient Neural Networks for Sleep Monitoring in the Wild.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Clinical Report Auto-completion.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

HiTANet: Hierarchical Time-Aware Attention Networks for Risk Prediction on Electronic Health Records.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

COMPOSE: Cross-Modal Pseudo-Siamese Network for Patient Trial Matching.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

LSAN: Modeling Long-term Dependencies and Short-term Correlations with Hierarchical Attention for Risk Prediction.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

Supervised Topic Compositional Neural Language Model for Clinical Narrative Understanding.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

α-MOP: Molecule optimization with α-divergence.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

CASTER: Predicting Drug Interactions with Chemical Substructure Representation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CORE: Automatic Molecule Optimization Using Copy & Refine Strategy.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CONAN: Complementary Pattern Augmentation for Rare Disease Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Doctor2Vec: Dynamic Doctor Representation Learning for Clinical Trial Recruitment.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CUP: Cluster Pruning for Compressing Deep Neural Networks.
CoRR, 2019

GENN: Predicting Correlated Drug-drug Interactions with Graph Energy Neural Networks.
CoRR, 2019

Predicting Treatment Initiation from Clinical Time Series Data via Graph-Augmented Time-Sensitive Model.
CoRR, 2019

Rare Disease Detection by Sequence Modeling with Generative Adversarial Networks.
CoRR, 2019

Improved Maximum Margin Clustering via the Bundle Method.
IEEE Access, 2019

Longitudinal Adversarial Attack on Electronic Health Records Data.
Proceedings of the World Wide Web Conference, 2019

EEGtoText: Learning to Write Medical Reports from EEG Recordings.
Proceedings of the Machine Learning for Healthcare Conference, 2019

SLEEPER: interpretable Sleep staging via Prototypes from Expert Rules.
Proceedings of the Machine Learning for Healthcare Conference, 2019

Tutorial: Data Mining Methods for Drug Discovery and Development.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Retaining Privileged Information for Multi-Task Learning.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Pre-training of Graph Augmented Transformers for Medication Recommendation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

MINA: Multilevel Knowledge-Guided Attention for Modeling Electrocardiography Signals.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

RDPD: Rich Data Helps Poor Data via Imitation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

DDL: Deep Dictionary Learning for Predictive Phenotyping.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

PEARL: Prototype Learning via Rule Learning.
Proceedings of the 10th ACM International Conference on Bioinformatics, 2019

GAMENet: Graph Augmented MEmory Networks for Recommending Medication Combination.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Optimal Expert Knowledge Elicitation for Bayesian Network Structure Identification.
IEEE Trans Autom. Sci. Eng., 2018

Switching-State Dynamical Modeling of Daily Behavioral Data.
J. Heal. Informatics Res., 2018

Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review.
J. Am. Medical Informatics Assoc., 2018

AWE: Asymmetric Word Embedding for Textual Entailment.
CoRR, 2018

RDPD: Rich Data Helps Poor Data via Imitation.
CoRR, 2018

Health-ATM: A Deep Architecture for Multifaceted Patient Health Record Representation and Risk Prediction.
Proceedings of the 2018 SIAM International Conference on Data Mining, 2018

Constrained Generation of Semantically Valid Graphs via Regularizing Variational Autoencoders.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

MiME: Multilevel Medical Embedding of Electronic Health Records for Predictive Healthcare.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Pairwise-Ranking based Collaborative Recurrent Neural Networks for Clinical Event Prediction.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Drug Similarity Integration Through Attentive Multi-view Graph Auto-Encoders.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

FastGCN: Fast Learning with Graph Convolutional Networks via Importance Sampling.
Proceedings of the 6th International Conference on Learning Representations, 2018

Heterogeneous Hyper-Network Embedding.
Proceedings of the IEEE International Conference on Data Mining, 2018

Uncovering Dynamic Functional Connectivity of Parkinson's Disease Using Topological Features and Sparse Group Lasso.
Proceedings of the Brain Informatics - International Conference, 2018

2017
Unsupervised Sequential Outlier Detection With Deep Architectures.
IEEE Trans. Image Process., 2017

An RNN Architecture with Dynamic Temporal Matching for Personalized Predictions of Parkinson's Disease.
Proceedings of the 2017 SIAM International Conference on Data Mining, 2017

Patient Subtyping via Time-Aware LSTM Networks.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Subtyping Parkinson's Disease with Recurrent Neural Network Models.
Proceedings of the AMIA 2017, 2017

Adverse Drug Reaction Prediction with Symbolic Latent Dirichlet Allocation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Multitask Dyadic Prediction and Its Application in Prediction of Adverse Drug-Drug Interaction.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Optimization Models for Feature Selection of Decomposed Nearest Neighbor.
IEEE Trans. Syst. Man Cybern. Syst., 2016

A Patient-Specific Model for Predicting Tibia Soft Tissue Insertions From Bony Outlines Using a Spatial Structure Supervised Learning Framework.
IEEE Trans. Hum. Mach. Syst., 2016

An integrated feature ranking and selection framework for ADHD characterization.
Brain Informatics, 2016

A Novel Mutual-Information-Guided Sparse Feature Selection Approach for Epilepsy Diagnosis Using Interictal EEG Signals.
Proceedings of the Brain Informatics and Health - International Conference, 2016

An Efficient Time Series Subsequence Pattern Mining and Prediction Framework with an Application to Respiratory Motion Prediction.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Detecting Clusters of Fake Accounts in Online Social Networks.
Proceedings of the 8th ACM Workshop on Artificial Intelligence and Security, 2015


  Loading...