2025
BG<sup>2</sup>VN: benchmark graph generator for vital node recognition.
Frontiers Comput. Sci., September, 2025
Clusterability test for categorical data.
Knowl. Inf. Syst., May, 2025
Interpretable multi-view clustering.
Pattern Recognit., 2025
Interpretable categorical data clustering via hypothesis testing.
Pattern Recognit., 2025
Significance-based decision tree for interpretable categorical data clustering.
Inf. Sci., 2025
Community structure testing by counting frequent common neighbor sets.
Inf. Sci., 2025
Conjunction subspaces test for conformal and selective classification.
Inf. Sci., 2025
Significance-based interpretable sequence clustering.
Inf. Sci., 2025
Interpretable sequence clustering.
Inf. Sci., 2025
2024
Node Centrality Inference via Hypothesis Testing.
Stat. Anal. Data Min., October, 2024
Central node identification via weighted kernel density estimation.
Data Min. Knowl. Discov., May, 2024
A randomized algorithm for clustering discrete sequences.
Pattern Recognit., 2024
Random subsequence forests.
Inf. Sci., 2024
Interpretable Clustering: A Survey.
CoRR, 2024
Essential protein discovery on weighted PPI networks via statistical information fusion.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024
Integrating topology and biological information to predict essential proteins via Shannon entropy.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024
KAS-former: a transformer-based model for predicting histone modifications using KAS-seq.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024
EEG-based Seizure Type Classification with Temporal-Spatial-Spectral Attention.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024
Partition, Predict and Assemble: Targeting Long RNA Secondary Structure Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2024
2023
Random forest clustering for discrete sequences.
Pattern Recognit. Lett., October, 2023
The statistical nature of h-index of a network node and its extensions.
J. Informetrics, August, 2023
Mining Statistically Significant Communities From Weighted Networks.
IEEE Trans. Knowl. Data Eng., June, 2023
On the Statistical Significance of a Community Structure.
IEEE Trans. Knowl. Data Eng., March, 2023
Decision Tree for Sequences.
IEEE Trans. Knowl. Data Eng., 2023
Hamming Encoder: Mining Discriminative k-mers for Discrete Sequence Classification.
CoRR, 2023
A testing-based approach to assess the clusterability of categorical data.
CoRR, 2023
Personalized Interpretable Classification.
CoRR, 2023
2022
Detecting Statistically Significant Communities.
IEEE Trans. Knowl. Data Eng., 2022
Significance-Based Essential Protein Discovery.
IEEE ACM Trans. Comput. Biol. Bioinform., 2022
Significance-Based Categorical Data Clustering.
CoRR, 2022
The statistical nature of h-index of a network node.
CoRR, 2022
2021
Essential Protein Recognition via Community Significance.
IEEE ACM Trans. Comput. Biol. Bioinform., 2021
A graph-traversal approach to identify influential nodes in a network.
Patterns, 2021
Instance-Based Classification Through Hypothesis Testing.
IEEE Access, 2021
2020
Protein Complexes Identification with Family-Wise Error Rate Control.
IEEE ACM Trans. Comput. Biol. Bioinform., 2020
Computing exact P-values for community detection.
Data Min. Knowl. Discov., 2020
Reference-Based Sequence Classification.
IEEE Access, 2020
2019
Reinforce: An Ensemble Approach for Inferring PPI Network from AP-MS Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2019
面向PCP-MS数据的PPI网络推断算法 (PPI Network Inference Algorithm for PCP-MS Data).
计算机科学, 2019
Mining conditional discriminative sequential patterns.
Inf. Sci., 2019
Significance-based discriminative sequential pattern mining.
Expert Syst. Appl., 2019
Correlation-Based Community Detection.
CoRR, 2019
2018
On the statistical significance of protein complex.
Quant. Biol., 2018
Detecting Statistically Significant Communities.
CoRR, 2018
Protein complexes identification based on go attributed network embedding.
BMC Bioinform., 2018
PCM: A Pairwise Correlation Mining Package for Biological Network Inference.
Proceedings of the Intelligent Computing Theories and Application, 2018
RPML: A Learning-Based Approach for Reranking Protein-Spectrum Matches.
Proceedings of the Intelligent Computing Theories and Application, 2018
2017
Inferring MicroRNA-Disease Associations by Random Walk on a Heterogeneous Network with Multiple Data Sources.
IEEE ACM Trans. Comput. Biol. Bioinform., 2017
Conditional discriminative pattern mining: Concepts and algorithms.
Inf. Sci., 2017
A two-step framework for inferring direct protein-protein interaction network from AP-MS data.
BMC Syst. Biol., 2017
2016
Computing exact permutation p-values for association rules.
Inf. Sci., 2016
Protein inference: A protein quantification perspective.
Comput. Biol. Chem., 2016
2015
BagReg: Protein inference through machine learning.
Comput. Biol. Chem., 2015
Network inference from AP-MS data: computational challenges and solutions.
Briefings Bioinform., 2015
Discriminative pattern mining and its applications in bioinformatics.
Briefings Bioinform., 2015
High Dimensional Explicit Feature Biased Matrix Factorization Recommendation.
Proceedings of the Trends and Applications in Knowledge Discovery and Data Mining, 2015
A Semi-Supervised Framework for Social Spammer Detection.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015
2014
Improved and Promising Identificationof Human MicroRNAs by Incorporatinga High-Quality Negative Set.
IEEE ACM Trans. Comput. Biol. Bioinform., 2014
Mining Conditional Phosphorylation Motifs.
IEEE ACM Trans. Comput. Biol. Bioinform., 2014
Decoy-free protein-level false discovery rate estimation.
Bioinform., 2014
Data construction for phosphorylation site prediction.
Briefings Bioinform., 2014
2013
A Combinatorial Perspective of the Protein Inference Problem.
IEEE ACM Trans. Comput. Biol. Bioinform., 2013
ProteinLasso: A Lasso regression approach to protein inference problem in shotgun proteomics.
Comput. Biol. Chem., 2013
2012
Peptide Reranking with Protein-Peptide Correspondence and Precursor Peak Intensity Information.
IEEE ACM Trans. Comput. Biol. Bioinform., 2012
On unbiased performance evaluation for protein inference
CoRR, 2012
Protein Inference and Protein Quantification: Two Sides of the Same Coin
CoRR, 2012
A linear programming model for protein inference problem in shotgun proteomics.
Bioinform., 2012
Comments on 'MMFPh: A Maximal Motif Finder for Phosphoproteomics Datasets'.
Bioinform., 2012
Protein inference: a review.
Briefings Bioinform., 2012
2011
A Partial Set Covering Model for Protein Mixture Identification Using Mass Spectrometry Data.
IEEE ACM Trans. Comput. Biol. Bioinform., 2011
Clustering categorical data streams.
J. Comput. Methods Sci. Eng., 2011
Attribute value weighting in k-modes clustering.
Expert Syst. Appl., 2011
The choice of null distributions for detecting gene-gene interactions in genome-wide association studies.
BMC Bioinform., 2011
Score regularization for peptide identification.
BMC Bioinform., 2011
Motif-All: discovering all phosphorylation motifs.
BMC Bioinform., 2011
2010
G-ANMI: A mutual information based genetic clustering algorithm for categorical data.
Knowl. Based Syst., 2010
Optimization-Based Peptide Mass Fingerprinting for Protein Mixture Identification.
J. Comput. Biol., 2010
Efficient K-Nearest Neighbor Join Algorithms for High Dimensional Sparse Data
CoRR, 2010
Stable feature selection for biomarker discovery.
Comput. Biol. Chem., 2010
2009
Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis.
BMC Bioinform., 2009
SNPHarvester: a filtering-based approach for detecting epistatic interactions in genome-wide association studies.
Bioinform., 2009
Improving peptide identification with single-stage mass spectrum peaks.
Bioinform., 2009
Dynamic Self-configuration of User QoS for Next Generation Network.
Proceedings of the NPC 2009, 2009
2008
k-ANMI: A mutual information based clustering algorithm for categorical data.
Inf. Fusion, 2008
A SCOR Reference Model of the Supply Chain Management System in an Enterprise.
Int. Arab J. Inf. Technol., 2008
Peak bagging for peptide mass fingerprinting.
Bioinform., 2008
2007
On the Impact of Dissimilarity Measure in k-Modes Clustering Algorithm.
IEEE Trans. Pattern Anal. Mach. Intell., 2007
K-means Clustering - A Survey.
Proceedings of the 2007 International Conference on Information & Knowledge Engineering, 2007
2006
Mining top-k strongly correlated item pairs without minimum correlation threshold.
Int. J. Knowl. Based Intell. Eng. Syst., 2006
Farthest-Point Heuristic based Initialization Methods for K-Modes Clustering
CoRR, 2006
Approximation Algorithms for K-Modes Clustering
CoRR, 2006
Improving categorical data clustering algorithm by weighting uncommon attribute value matches.
Comput. Sci. Inf. Syst., 2006
A Fast Greedy Algorithm for Outlier Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2006
Approximation Algorithms for <i>K</i>-Modes Clustering.
Proceedings of the Computational Intelligence, 2006
2005
TCSOM: Clustering Transactions Using Self-Organizing Map.
Neural Process. Lett., 2005
A cluster ensemble method for clustering categorical data.
Inf. Fusion, 2005
Scalable algorithms for clustering large datasets with mixed type attributes.
Int. J. Intell. Syst., 2005
Mining action rules from scratch.
Expert Syst. Appl., 2005
K-Histograms: An Efficient Clustering Algorithm for Categorical Dataset
CoRR, 2005
Clustering Mixed Numeric and Categorical Data: A Cluster Ensemble Approach
CoRR, 2005
A Fast Greedy Algorithm for Outlier Mining
CoRR, 2005
A Unified Subspace Outlier Ensemble Framework for Outlier Detection in High Dimensional Spaces
CoRR, 2005
Mining Top-k Approximate Frequent Patterns
CoRR, 2005
Data Mining for Actionable Knowledge: A Survey
CoRR, 2005
FP-outlier: Frequent pattern based outlier detection.
Comput. Sci. Inf. Syst., 2005
A Unified Subspace Outlier Ensemble Framework for Outlier Detection.
Proceedings of the Advances in Web-Age Information Management, 2005
An Optimization Model for Outlier Detection in Categorical Data.
Proceedings of the Advances in Intelligent Computing, 2005
An FP-Tree Based Approach for Mining All Strongly Correlated Item Pairs.
Proceedings of the Computational Intelligence and Security, International Conference, 2005
Improving K-Modes Algorithm Considering Frequencies of Attribute Values in Mode.
Proceedings of the Computational Intelligence and Security, International Conference, 2005
2004
Mining class outliers: concepts, algorithms and applications in CRM.
Expert Syst. Appl., 2004
A FP-Tree Based Approach for Mining All Strongly Correlated Pairs without Candidate Generation
CoRR, 2004
A Link Clustering Based Approach for Clustering Categorical Data
CoRR, 2004
Modeling Complex Higher Order Patterns
CoRR, 2004
A Frequent Pattern Discovery Method for Outlier Detection.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004
Mining Class Outliers: Concepts, Algorithms and Applications.
Proceedings of the Advances in Web-Age Information Management: 5th International Conference, 2004
2003
Discovering cluster-based local outliers.
Pattern Recognit. Lett., 2003
2002
Squeezer: An Efficient Algorithm for Clustering Categorical Data.
J. Comput. Sci. Technol., 2002
Outlier Detection Integrating Semantic Knowledge.
Proceedings of the Advances in Web-Age Information Management, 2002