Fengzhu Sun

Orcid: 0000-0002-8552-043X

According to our database1, Fengzhu Sun authored at least 91 papers between 1995 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A comparative analysis of mutual information methods for pairwise relationship detection in metagenomic data.
BMC Bioinform., December, 2024

DeepDecon accurately estimates cancer cell fractions in bulk RNA-seq data.
Patterns, 2024

Imputing Metagenomic Hi-C Contacts Facilitates the Integrative Contig Binning Through Constrained Random Walk with Restart.
J. Comput. Biol., 2024

DeepLINK-T: deep learning inference for time series data using knockoffs and LSTM.
CoRR, 2024

ImputeCC Enhances Integrative Hi-C-Based Metagenomic Binning Through Constrained Random-Walk-Based Imputation.
Proceedings of the Research in Computational Molecular Biology, 2024

2023
Batch normalization followed by merging is powerful for phenotype prediction integrating multiple heterogeneous studies.
PLoS Comput. Biol., October, 2023

Identifying local associations in biological time series: algorithms, statistical significance, and applications.
Briefings Bioinform., September, 2023

2022
AC-PCoA: Adjustment for confounding factors using principal coordinate analysis.
PLoS Comput. Biol., 2022

A model-based approach to assess reproducibility for large-scale high-throughput MRI-based studies.
NeuroImage, 2022

Michael Waterman's Contributions to Computational Biology and Bioinformatics.
J. Comput. Biol., 2022

Special Issue: Professor Michael Waterman's 80th Birthday, Part 1.
J. Comput. Biol., 2022

Normalizing Metagenomic Hi-C Data and Detecting Spurious Contacts Using Zero-Inflated Negative Binomial Regression.
J. Comput. Biol., 2022

A New Context Tree Inference Algorithm for Variable Length Markov Chain Model with Applications to Biological Sequence Analyses.
J. Comput. Biol., 2022

HiFine: integrating Hi-C-based and shotgun-based methods to refine binning of metagenomic contigs.
Bioinform., 2022

2021
CRAFT: Compact genome Representation toward large-scale Alignment-Free daTabase.
Bioinform., 2021

KIMI: Knockoff Inference for Motif Identification from molecular sequences with controlled false discovery rate.
Bioinform., 2021

Corrigendum to: Assessment of metagenomic assemblers based on hybrid reads of real and simulated metagenomic sequences.
Briefings Bioinform., 2021

2020
Confidence intervals for Markov chain transition probabilities based on next generation sequencing reads data.
Quant. Biol., 2020

Identifying viruses from metagenomic data using deep learning.
Quant. Biol., 2020

International Workshop on Applications of Probability and Statistics to Biology, July 11-13, 2019.
Quant. Biol., 2020

Assessment of metagenomic assemblers based on hybrid reads of real and simulated metagenomic sequences.
Briefings Bioinform., 2020

2019
Statistical significance approximation for local similarity analysis of dependent time series data.
BMC Bioinform., December, 2019

NetGO: improving large-scale protein function prediction with massive network information.
Nucleic Acids Res., 2019

SolidBin: improving metagenome binning with semi-supervised normalized cut.
Bioinform., 2019

A new statistic for efficient detection of repetitive sequences.
Bioinform., 2019

2018
GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank.
Bioinform., 2018

2017
CAFE: aCcelerated Alignment-FrEe sequence analysis.
Nucleic Acids Res., 2017

Prediction of virus-host infectious association by supervised learning methods.
BMC Bioinform., 2017

Improving contig binning of metagenomic data using (d<sub>2</sub><sup>S</sup>) oligonucleotide frequency dissimilarity.
BMC Bioinform., 2017

COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.
Bioinform., 2017

Sparse generalized linear model with L 0 approximation for feature selection and prediction with big omics data.
BioData Min., 2017

2016
Inference of domain-disease associations from domain-protein, protein-disease and disease-disease relationships.
BMC Syst. Biol., 2016

Inference of Markovian properties of molecular sequences from NGS data and applications to comparative genomics.
Bioinform., 2016

2015
Finding Genetic Overlaps Among Diseases Based on Ranked Gene Lists.
J. Comput. Biol., 2015

Statistical significance approximation in local trend analysis of high-throughput time-series data using the theory of Markov chains.
BMC Bioinform., 2015

Network tuned multiple rank aggregation and applications to gene ranking.
BMC Bioinform., 2015

Multilevel regularized regression for simultaneous taxa selection and network construction with metagenomic count data.
Bioinform., 2015

2014
DiseaseConnect: a comprehensive web server for mechanism-based disease-disease connections.
Nucleic Acids Res., 2014

Alignment-free protein interaction network comparison.
Bioinform., 2014

New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing.
Briefings Bioinform., 2014

Integrative approaches for predicting protein function and prioritizing genes for complex phenotypes using protein interaction networks.
Briefings Bioinform., 2014

2013
Meeting report on RECOMB 2013 (the 17th Annual International Conference on Research in Computational Molecular Biology).
Quant. Biol., 2013

Preface: Research in Computational Molecular Biology (RECOMB 2013).
J. Comput. Biol., 2013

Alignment-Free Sequence Comparison Based on Next-Generation Sequencing Reads.
J. Comput. Biol., 2013

A unified approach for allele frequency estimation, SNP detection and association studies based on pooled sequencing data using EM algorithms.
BMC Genom., 2013

Efficient statistical significance approximation for local similarity analysis of high-throughput time series data.
Bioinform., 2013

Multiple alignment-free sequence comparison.
Bioinform., 2013

2012
CEDER: Accurate Detection of Differentially Expressed Genes by Combining Significance of Exons Using RNA-Seq.
IEEE ACM Trans. Comput. Biol. Bioinform., 2012

Normal and Compound Poisson Approximations for Pattern Occurrences in NGS Reads.
J. Comput. Biol., 2012

Preface: Special Issue.
J. Comput. Biol., 2012

Alignment-Free Sequence Comparison Based on Next Generation Sequencing Reads: Extended Abstract.
Proceedings of the Research in Computational Molecular Biology, 2012

2011
Sequence Alignment as Hypothesis Testing.
J. Comput. Biol., 2011

DomainRBF: a Bayesian regression approach to the prioritization of candidate domains for complex diseases.
BMC Syst. Biol., 2011

Extended local similarity analysis (eLSA) of microbial community and other time series data with replicates.
BMC Syst. Biol., 2011

Integrating multiple protein-protein interaction networks to prioritize disease genes: a Bayesian regression approach.
BMC Bioinform., 2011

Variance adjusted weighted UniFrac: a powerful beta diversity measure for comparing communities based on phylogeny.
BMC Bioinform., 2011

Inferring Signaling and Gene Regulatory Network from Genetic and Genomic Information.
Proceedings of the Handbook of Statistical Bioinformatics., 2011

2010
The Power of Detecting Enriched Patterns: An HMM Approach.
J. Comput. Biol., 2010

Alignment-Free Sequence Comparison (II): Theoretical Power of Comparison Statistics.
J. Comput. Biol., 2010

2009
Alignment-Free Sequence Comparison (I): Statistics and Power.
J. Comput. Biol., 2009

Bayesian Models and Gibbs Sampling Strategies for Local Graph Alignment and Motif Identification in Stochastic Biological Networks.
Commun. Inf. Syst., 2009

Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach.
BMC Bioinform., 2009

2008
Compound Poisson Approximation of the Number of Occurrences of a Position Frequency Matrix (PFM) on Both Strands.
J. Comput. Biol., 2008

The effects of protein interactions, gene essentiality and regulatory regions on expression variation.
BMC Syst. Biol., 2008

Testing gene set enrichment for subset of genes: Sub-GSE.
BMC Bioinform., 2008

2007
Inferring activity changes of transcription factors by binding association with sorted expression profiles.
BMC Bioinform., 2007

CGI: a new approach for prioritizing genes by combining gene expression and protein-protein interaction data.
Bioinform., 2007

2006
A model-based approach to selection of tag SNPs.
BMC Bioinform., 2006

An integrated approach to the prediction of domain-domain interactions.
BMC Bioinform., 2006

Searching for interpretable rules for disease mutations: a simulated annealing bump hunting strategy.
BMC Bioinform., 2006

A dynamic programming algorithm for binning microbial community profiles.
Bioinform., 2006

Local similarity analysis reveals unique associations among marine bacterioplankton species and environmental factors.
Bioinform., 2006

MARD: a new method to detect differential gene expression in treatment-control time courses.
Bioinform., 2006

An integrative approach for causal gene identification and gene regulatory pathway inference.
Proceedings of the Proceedings 14th International Conference on Intelligent Systems for Molecular Biology 2006, 2006

2005
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.
Bioinform., 2005

HapBlock: haplotype block partitioning and tag SNP selection software using a set of dynamic programming algorithms.
Bioinform., 2005

2004
Extreme Value Distribution Based Gene Selection Criteria for Discriminant Microarray Data Analysis Using Logistic Regression.
J. Comput. Biol., 2004

An Integrated Probabilistic Model for Functional Prediction of Proteins.
J. Comput. Biol., 2004

Mapping gene ontology to proteins based on protein-protein interaction data.
Bioinform., 2004

A Comparison of Different Strategies for Computing Confidence Intervals of the Linkage Disequilibrium Measure.
Proceedings of the Biocomputing 2004, 2004

2003
The Mutation Process of Microsatellites During the Polymerase Chain Reaction.
J. Comput. Biol., 2003

Prediction of Protein Function Using Protein-Protein Interaction Data.
J. Comput. Biol., 2003

Dynamic programming algorithms for haplotype block partitioning: applications to human chromosome 21 haplotype data.
Proceedings of the Sventh Annual International Conference on Computational Biology, 2003

Assessment of the reliability of protein-protein interactions and protein function prediction.
Proceedings of the 8th Pacific Symposium on Biocomputing, 2003

2002
Dynamic Programming Algorithms for Haplotype Block Partitioning and Tag SNP Selection Using Haplotype Data or Genotype Data.
Proceedings of the Computational Methods for SNPs and Haplotype Inference, 2002

Inferring domain-domain interactions from protein-protein interactions.
Proceedings of the Sixth Annual International Conference on Computational Biology, 2002

2000
Estimation of the Mutation Rate During Error-prone Polymerase Chain Reaction.
J. Comput. Biol., 2000

1999
Modeling DNA Shuffling.
J. Comput. Biol., 1999

1998
Optimizing Nonadaptive Group Tests for Objects with Heterogeneous Priors.
SIAM J. Appl. Math., 1998

1997
Pooling Strategies for Establishing Physical Genome Maps Using FISH.
J. Comput. Biol., 1997

1995
The Polymerase Chain Reaction and Branching Processes.
J. Comput. Biol., 1995


  Loading...