Zengyou He

Orcid: 0000-0001-9526-8816

According to our database1, Zengyou He authored at least 107 papers between 2002 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Interpretable sequence clustering.
Inf. Sci., 2025

2024
Node Centrality Inference via Hypothesis Testing.
Stat. Anal. Data Min., October, 2024

Central node identification via weighted kernel density estimation.
Data Min. Knowl. Discov., May, 2024

A randomized algorithm for clustering discrete sequences.
Pattern Recognit., 2024

Random subsequence forests.
Inf. Sci., 2024

Conjunction Subspaces Test for Conformal and Selective Classification.
CoRR, 2024

Interpretable Clustering: A Survey.
CoRR, 2024

Interpretable Multi-View Clustering.
CoRR, 2024

2023
Random forest clustering for discrete sequences.
Pattern Recognit. Lett., October, 2023

The statistical nature of h-index of a network node and its extensions.
J. Informetrics, August, 2023

Mining Statistically Significant Communities From Weighted Networks.
IEEE Trans. Knowl. Data Eng., June, 2023

On the Statistical Significance of a Community Structure.
IEEE Trans. Knowl. Data Eng., March, 2023

Decision Tree for Sequences.
IEEE Trans. Knowl. Data Eng., 2023

Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence Classification.
CoRR, 2023

A testing-based approach to assess the clusterability of categorical data.
CoRR, 2023

Personalized Interpretable Classification.
CoRR, 2023

2022
Detecting Statistically Significant Communities.
IEEE Trans. Knowl. Data Eng., 2022

Significance-Based Essential Protein Discovery.
IEEE ACM Trans. Comput. Biol. Bioinform., 2022

Significance-Based Categorical Data Clustering.
CoRR, 2022

The statistical nature of h-index of a network node.
CoRR, 2022

2021
Essential Protein Recognition via Community Significance.
IEEE ACM Trans. Comput. Biol. Bioinform., 2021

A graph-traversal approach to identify influential nodes in a network.
Patterns, 2021

Instance-Based Classification Through Hypothesis Testing.
IEEE Access, 2021

2020
Protein Complexes Identification with Family-Wise Error Rate Control.
IEEE ACM Trans. Comput. Biol. Bioinform., 2020

Computing exact P-values for community detection.
Data Min. Knowl. Discov., 2020

Reference-Based Sequence Classification.
IEEE Access, 2020

2019
Reinforce: An Ensemble Approach for Inferring PPI Network from AP-MS Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2019

面向PCP-MS数据的PPI网络推断算法 (PPI Network Inference Algorithm for PCP-MS Data).
计算机科学, 2019

Mining conditional discriminative sequential patterns.
Inf. Sci., 2019

Significance-based discriminative sequential pattern mining.
Expert Syst. Appl., 2019

Correlation-Based Community Detection.
CoRR, 2019

2018
On the statistical significance of protein complex.
Quant. Biol., 2018

Detecting Statistically Significant Communities.
CoRR, 2018

Protein complexes identification based on go attributed network embedding.
BMC Bioinform., 2018

PCM: A Pairwise Correlation Mining Package for Biological Network Inference.
Proceedings of the Intelligent Computing Theories and Application, 2018

RPML: A Learning-Based Approach for Reranking Protein-Spectrum Matches.
Proceedings of the Intelligent Computing Theories and Application, 2018

2017
Inferring MicroRNA-Disease Associations by Random Walk on a Heterogeneous Network with Multiple Data Sources.
IEEE ACM Trans. Comput. Biol. Bioinform., 2017

Conditional discriminative pattern mining: Concepts and algorithms.
Inf. Sci., 2017

A two-step framework for inferring direct protein-protein interaction network from AP-MS data.
BMC Syst. Biol., 2017

2016
Computing exact permutation p-values for association rules.
Inf. Sci., 2016

Protein inference: A protein quantification perspective.
Comput. Biol. Chem., 2016

2015
BagReg: Protein inference through machine learning.
Comput. Biol. Chem., 2015

Network inference from AP-MS data: computational challenges and solutions.
Briefings Bioinform., 2015

Discriminative pattern mining and its applications in bioinformatics.
Briefings Bioinform., 2015

High Dimensional Explicit Feature Biased Matrix Factorization Recommendation.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2015

A Semi-Supervised Framework for Social Spammer Detection.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

2014
Improved and Promising Identificationof Human MicroRNAs by Incorporatinga High-Quality Negative Set.
IEEE ACM Trans. Comput. Biol. Bioinform., 2014

Mining Conditional Phosphorylation Motifs.
IEEE ACM Trans. Comput. Biol. Bioinform., 2014

Decoy-free protein-level false discovery rate estimation.
Bioinform., 2014

Data construction for phosphorylation site prediction.
Briefings Bioinform., 2014

2013
A Combinatorial Perspective of the Protein Inference Problem.
IEEE ACM Trans. Comput. Biol. Bioinform., 2013

ProteinLasso: A Lasso regression approach to protein inference problem in shotgun proteomics.
Comput. Biol. Chem., 2013

2012
Peptide Reranking with Protein-Peptide Correspondence and Precursor Peak Intensity Information.
IEEE ACM Trans. Comput. Biol. Bioinform., 2012

On unbiased performance evaluation for protein inference
CoRR, 2012

Protein Inference and Protein Quantification: Two Sides of the Same Coin
CoRR, 2012

A linear programming model for protein inference problem in shotgun proteomics.
Bioinform., 2012

Comments on 'MMFPh: A Maximal Motif Finder for Phosphoproteomics Datasets'.
Bioinform., 2012

Protein inference: a review.
Briefings Bioinform., 2012

2011
A Partial Set Covering Model for Protein Mixture Identification Using Mass Spectrometry Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2011

Clustering categorical data streams.
J. Comput. Methods Sci. Eng., 2011

Attribute value weighting in k-modes clustering.
Expert Syst. Appl., 2011

The choice of null distributions for detecting gene-gene interactions in genome-wide association studies.
BMC Bioinform., 2011

Score regularization for peptide identification.
BMC Bioinform., 2011

Motif-All: discovering all phosphorylation motifs.
BMC Bioinform., 2011

2010
G-ANMI: A mutual information based genetic clustering algorithm for categorical data.
Knowl. Based Syst., 2010

Optimization-Based Peptide Mass Fingerprinting for Protein Mixture Identification.
J. Comput. Biol., 2010

Efficient K-Nearest Neighbor Join Algorithms for High Dimensional Sparse Data
CoRR, 2010

Stable feature selection for biomarker discovery.
Comput. Biol. Chem., 2010

2009
Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis.
BMC Bioinform., 2009

SNPHarvester: a filtering-based approach for detecting epistatic interactions in genome-wide association studies.
Bioinform., 2009

Improving peptide identification with single-stage mass spectrum peaks.
Bioinform., 2009

Dynamic Self-configuration of User QoS for Next Generation Network.
Proceedings of the NPC 2009, 2009

2008
k-ANMI: A mutual information based clustering algorithm for categorical data.
Inf. Fusion, 2008

A SCOR Reference Model of the Supply Chain Management System in an Enterprise.
Int. Arab J. Inf. Technol., 2008

Peak bagging for peptide mass fingerprinting.
Bioinform., 2008

2007
On the Impact of Dissimilarity Measure in k-Modes Clustering Algorithm.
IEEE Trans. Pattern Anal. Mach. Intell., 2007

K-means Clustering - A Survey.
Proceedings of the 2007 International Conference on Information & Knowledge Engineering, 2007

2006
Mining top-k strongly correlated item pairs without minimum correlation threshold.
Int. J. Knowl. Based Intell. Eng. Syst., 2006

Farthest-Point Heuristic based Initialization Methods for K-Modes Clustering
CoRR, 2006

Approximation Algorithms for K-Modes Clustering
CoRR, 2006

Improving categorical data clustering algorithm by weighting uncommon attribute value matches.
Comput. Sci. Inf. Syst., 2006

A Fast Greedy Algorithm for Outlier Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2006

Approximation Algorithms for <i>K</i>-Modes Clustering.
Proceedings of the Computational Intelligence, 2006

2005
TCSOM: Clustering Transactions Using Self-Organizing Map.
Neural Process. Lett., 2005

A cluster ensemble method for clustering categorical data.
Inf. Fusion, 2005

Scalable algorithms for clustering large datasets with mixed type attributes.
Int. J. Intell. Syst., 2005

Mining action rules from scratch.
Expert Syst. Appl., 2005

K-Histograms: An Efficient Clustering Algorithm for Categorical Dataset
CoRR, 2005

Clustering Mixed Numeric and Categorical Data: A Cluster Ensemble Approach
CoRR, 2005

A Fast Greedy Algorithm for Outlier Mining
CoRR, 2005

A Unified Subspace Outlier Ensemble Framework for Outlier Detection in High Dimensional Spaces
CoRR, 2005

Mining Top-k Approximate Frequent Patterns
CoRR, 2005

Data Mining for Actionable Knowledge: A Survey
CoRR, 2005

FP-outlier: Frequent pattern based outlier detection.
Comput. Sci. Inf. Syst., 2005

A Unified Subspace Outlier Ensemble Framework for Outlier Detection.
Proceedings of the Advances in Web-Age Information Management, 2005

An Optimization Model for Outlier Detection in Categorical Data.
Proceedings of the Advances in Intelligent Computing, 2005

An FP-Tree Based Approach for Mining All Strongly Correlated Item Pairs.
Proceedings of the Computational Intelligence and Security, International Conference, 2005

Improving K-Modes Algorithm Considering Frequencies of Attribute Values in Mode.
Proceedings of the Computational Intelligence and Security, International Conference, 2005

2004
Mining class outliers: concepts, algorithms and applications in CRM.
Expert Syst. Appl., 2004

A FP-Tree Based Approach for Mining All Strongly Correlated Pairs without Candidate Generation
CoRR, 2004

A Link Clustering Based Approach for Clustering Categorical Data
CoRR, 2004

Modeling Complex Higher Order Patterns
CoRR, 2004

A Frequent Pattern Discovery Method for Outlier Detection.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

Mining Class Outliers: Concepts, Algorithms and Applications.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004

2003
Discovering cluster-based local outliers.
Pattern Recognit. Lett., 2003

2002
Squeezer: An Efficient Algorithm for Clustering Categorical Data.
J. Comput. Sci. Technol., 2002

Outlier Detection Integrating Semantic Knowledge.
Proceedings of the Advances in Web-Age Information Management, 2002


  Loading...