Hongyu Zhao

Orcid: 0000-0003-1195-9607

Affiliations:
  • Yale University, New Haven, CT, USA


According to our database1, Hongyu Zhao authored at least 110 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
StrokeClassifier: ischemic stroke etiology classification by ensemble consensus modeling using electronic health records.
npj Digit. Medicine, 2024

Joint modeling of human cortical structure: Genetic correlation network and composite-trait genetic correlation.
NeuroImage, 2024

LDER-GE estimates phenotypic variance component of gene-environment interactions in human complex traits accurately with GE interaction summary statistics and full LD information.
Briefings Bioinform., 2024

2023
Genomic risk prediction of cardiovascular diseases among type 2 diabetes patients in the UK Biobank.
Frontiers Bioinform., May, 2023

HBV-infected hepatocellular carcinoma can be robustly classified into three clinically relevant subgroups by a novel analytical protocol.
Briefings Bioinform., March, 2023

A novel Bayesian framework for harmonizing information across tissues and studies to increase cell type deconvolution accuracy.
Briefings Bioinform., January, 2023

CARE as a wearable derived feature linking circadian amplitude to human cognitive functions.
npj Digit. Medicine, 2023

Comprehensive Analysis of Ubiquitously Expressed Genes in Humans from A Data-driven Perspective.
Genom. Proteom. Bioinform., 2023

MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
phyloMDA: an R package for phylogeny-aware microbiome data analysis.
BMC Bioinform., December, 2022

A Manifold Proximal Linear Method for Sparse Spectral Clustering with Application to Single-Cell RNA Sequencing Data Analysis.
INFORMS J. Optim., April, 2022

Non-linear archetypal analysis of single-cell RNA-seq data by deep autoencoders.
PLoS Comput. Biol., 2022

An unbiased kinship estimation method for genetic data analysis.
BMC Bioinform., 2022

fastANCOM: a fast method for analysis of compositions of microbiomes.
Bioinform., 2022

ResPAN: a powerful batch correction model for scRNA-seq data through residual adversarial networks.
Bioinform., 2022

A Markov random field model-based approach for differentially expressed gene detection from single-cell RNA-seq data.
Briefings Bioinform., 2022

MZINBVA: variational approximation for multilevel zero-inflated negative-binomial models for association analysis in microbiome surveys.
Briefings Bioinform., 2022

A general kernel boosting framework integrating pathways for predictive modeling based on genomic data.
Proceedings of the BCB '22: 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, Northbrook, Illinois, USA, August 7, 2022

2021
Roles of statistical modeling in characterizing the genetic basis of human diseases and traits.
Quant. Biol., December, 2021

Transcriptome wide association studies: general framework and methods.
Quant. Biol., 2021

Integrative modeling of transmitted and <i>de novo</i> variants identifies novel risk genes for congenital heart disease.
Quant. Biol., 2021

Special issue on genome wide association study.
Quant. Biol., 2021

Model-Based Microbiome Data Ordination: A Variational Approximation Approach.
J. Comput. Graph. Stat., 2021

Calculating Orthologous Protein-Coding Sequence Set Probability Using the Poisson Process.
J. Comput. Biol., 2021

<i>RePhine</i>: An Integrative Method for Identification of Drug Response-related Transcriptional Regulators.
Genom. Proteom. Bioinform., 2021

A Markov random field model for network-based differential expression analysis of single-cell RNA-seq data.
BMC Bioinform., 2021

Transformation and differential abundance analysis of microbiome data incorporating phylogeny.
Bioinform., 2021

Comparison of methods for estimating genetic correlation between complex traits using GWAS summary statistics.
Briefings Bioinform., 2021

2020
Leveraging functional annotation to identify genes associated with complex diseases.
PLoS Comput. Biol., November, 2020

Leveraging effect size distributions to improve polygenic risk scores derived from summary statistics of genome-wide association studies.
PLoS Comput. Biol., 2020

An empirical Bayes approach to normalization and differential abundance testing for microbiome data.
BMC Bioinform., 2020

Predicting viral exposure response from modeling the changes of co-expression networks using time series gene expression data.
BMC Bioinform., 2020

NITUMID: Nonnegative matrix factorization-based Immune-TUmor MIcroenvironment Deconvolution.
Bioinform., 2020

Inference of Dynamic Graph Changes for Functional Connectome.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Network Clustering Analysis Using Mixture Exponential-Family Random Graph Models and Its Application in Genetic Interaction Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2019

ProteomicsBrowser: MS/proteomics data visualization and investigation.
Bioinform., 2019

An evaluation of noncoding genome annotation tools through enrichment analysis of 15 genome-wide association studies.
Briefings Bioinform., 2019

2018
Pivotal variable detection of the covariance matrix and its application to high-dimensional factor models.
Stat. Comput., 2018

Estimating a sparse reduction for general regression in high dimensions.
Stat. Comput., 2018

Spectral clustering based on learning similarity matrix.
Bioinform., 2018

Improving SNP prioritization and pleiotropic architecture estimation by incorporating prior knowledge using graph-GPA.
Bioinform., 2018

2017
Variable importance-weighted random forests.
Quant. Biol., 2017

Leveraging functional annotations in genetic risk prediction for human complex diseases.
PLoS Comput. Biol., 2017

graph-GPA: A graphical model for prioritizing GWAS results and investigating pleiotropic architecture.
PLoS Comput. Biol., 2017

gCoda: Conditional Dependence Network Inference for Compositional Data.
J. Comput. Biol., 2017

A comparison of graph- and kernel-based -omics data integration algorithms for classifying complex traits.
BMC Bioinform., 2017

GRAPE: a pathway template method to characterize tissue-specific functionality from gene expression profiles.
BMC Bioinform., 2017

2016
Efficient Drug-Pathway Association Analysis via Integrative Penalized Matrix Decomposition.
IEEE ACM Trans. Comput. Biol. Bioinform., 2016

Leveraging protein quaternary structure to identify oncogenic driver mutations.
BMC Bioinform., 2016

NBLDA: negative binomial linear discriminant analysis for RNA-Seq data.
BMC Bioinform., 2016

GenoWAP: GWAS signal prioritization through integrated analysis of genomic functional annotation.
Bioinform., 2016

Predicting synergistic effects between compounds through their structural similarity and effects on transcriptomes.
Bioinform., 2016

2015
Functional Module Analysis for Gene Coexpression Networks with Network Integration.
IEEE ACM Trans. Comput. Biol. Bioinform., 2015

The application of sparse estimation of covariance matrix to quadratic discriminant analysis.
BMC Bioinform., 2015

M<sup>3</sup>-S: a genotype calling method incorporating information from samples with known genotypes.
BMC Bioinform., 2015

CCLasso: correlation inference for compositional data through Lasso.
Bioinform., 2015

2014
Low-Rank Modeling and Its Applications in Image Analysis.
ACM Comput. Surv., 2014

Ttn as a likely causal gene for QTL of alcohol preference on mouse chromosome 2.
BMC Bioinform., 2014

A graph theoretic approach to utilizing protein structure to identify non-random somatic mutations.
BMC Bioinform., 2014

A spatial simulation approach to account for protein structure when identifying non-random somatic mutations.
BMC Bioinform., 2014

2013
Multisample aCGH Data Analysis via Total Variation and Spectral Regularization.
IEEE ACM Trans. Comput. Biol. Bioinform., 2013

HapBoost: A Fast Approach to Boosting Haplotype Association Analyses in Genome-Wide Association Studies.
IEEE ACM Trans. Comput. Biol. Bioinform., 2013

Sparse principal component analysis by choice of norm.
J. Multivar. Anal., 2013

Utilizing protein structure to identify non-random somatic mutations.
BMC Bioinform., 2013

Differential expression analysis for paired RNA-seq data.
BMC Bioinform., 2013

Accounting for non-genetic factors by low-rank representation and sparse regression for eQTL mapping.
Bioinform., 2013

Joint analysis of expression profiles from multiple cancers improves the identification of microRNA-gene interactions.
Bioinform., 2013

2012
Improved mean estimation and its application to diagonal discriminant analysis.
Bioinform., 2012

FacPad: Bayesian sparse factor modeling for the inference of pathways responsive to drug treatment.
Bioinform., 2012

iFad: an integrative factor analysis model for drug-pathway association inference.
Bioinform., 2012

M<sup>3</sup>: an improved SNP calling algorithm for Illumina BeadArray data.
Bioinform., 2012

2011
Some theoretical properties of Silverman's method for Smoothed functional principal component analysis.
J. Multivar. Anal., 2011

Bias Detection and Correction in RNA-Sequencing Data.
BMC Bioinform., 2011

Score regularization for peptide identification.
BMC Bioinform., 2011

COSINE: COndition-SpecIfic sub-NEtwork identification using a global optimization method.
Bioinform., 2011

Reverse Engineering of Gene Regulation Networks with an Application to the DREAM4 <i>in silico</i> Network Challenge.
Proceedings of the Handbook of Statistical Bioinformatics., 2011

2010
Pathway analysis using random forests with bivariate node-split for survival outcomes.
Bioinform., 2010

2009
Genotyping and inflated type I error rate in genome-wide association case/control studies.
BMC Bioinform., 2009

Effect of false positive and false negative rates on inference of binding target conservation across different conditions and species from ChIP-chip data.
BMC Bioinform., 2009

2008
Is Subcellular Localization Informative for Modeling Protein-Protein Interaction Signal?
J. Electr. Comput. Eng., 2008

Building pathway clusters from Random Forests classification using class votes.
BMC Bioinform., 2008

Statistical methods to infer cooperative binding among transcription factors in <i>Saccharomyces cerevisiae</i>.
Bioinform., 2008

Considering dependence among genes and markers for false discovery control in eQTL mapping.
Bioinform., 2008

Estimating dynamic models for gene regulation networks.
Bioinform., 2008

2007
Integrating domain knowledge with statistical and data mining methods for high-density genomic SNP disease association analysis.
J. Biomed. Informatics, 2007

A new protocol of analyzing isotope-coded affinity tag data from high-resolution LC-MS spectrometry.
Comput. Biol. Chem., 2007

2006
Multiple Peak Alignment in Sequential Data Analysis: A Scale-Space-Based Approach.
IEEE ACM Trans. Comput. Biol. Bioinform., 2006

Detecting and aligning peaks in mass spectrometry data with applications to MALDI.
Comput. Biol. Chem., 2006

PSMIX: an R package for population structure inference via maximum likelihood method.
BMC Bioinform., 2006

Pathway analysis using random forests classification and regression.
Bioinform., 2006

2005
Integrating mRNA Decay Information into Co-Regulation Study.
J. Comput. Sci. Technol., 2005

Case Report: A High Productivity/Low Maintenance Approach to High-performance Computation for Biomedicine: Four Case Studies.
J. Am. Medical Informatics Assoc., 2005

A statistical method to detect chromosomal regions with DNA copy number alterations using SNP-array-based CGH data.
Comput. Biol. Chem., 2005

Are scale-free networks robust to measurement errors?
BMC Bioinform., 2005

HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.
Bioinform., 2005

Inferring protein-protein interactions through high-throughput interaction data from diverse organisms.
Bioinform., 2005

Detection of DNA copy number alterations using penalized least squares regression.
Bioinform., 2005

VitaPad: visualization tools for the analysis of pathway data.
Bioinform., 2005

A semiparametric approach for marker gene selection based on gene expression data.
Bioinform., 2005

Negative correlation between compositional symmetries and local recombination rates.
Bioinform., 2005

Aligning Peaks Across Multiple Mass Spectrometry Data Sets Using A Scale-Space Based Approach.
Proceedings of the Fourth International IEEE Computer Society Computational Systems Bioinformatics Conference Workshops & Poster Abstracts, 2005

2004
BGI-RIS: an integrated information resource and comparative analysis workbench for rice genomics.
Nucleic Acids Res., 2004

Handling multiple testing while interpreting microarrays with the Gene Ontology Database.
BMC Bioinform., 2004

A computational approach for ordering signal transduction pathway components from genomics and proteomics Data.
BMC Bioinform., 2004

Information assessment on predicting protein-protein interactions.
BMC Bioinform., 2004

A statistical method for identifying differential gene-gene co-expression patterns.
Bioinform., 2004

2003
PathMAPA: a tool for displaying gene expression and performing statistical tests on metabolic pathways at multiple levels for <i>Arabidopsis</i>.
BMC Bioinform., 2003

Comparison of statistical methods for classification of ovarian cancer using mass spectrometry data.
Bioinform., 2003

2002

2001
On Relationship Inference Using Gamete Identity by Descent Data.
J. Comput. Biol., 2001


  Loading...