Jian Pei

Orcid: 0000-0002-2200-8711

Affiliations:
  • Simon Fraser University, Burnaby, Canada


According to our database1, Jian Pei authored at least 497 papers between 2000 and 2024.

Collaborative distances:

Awards

ACM Fellow

ACM Fellow 2015, "For contributions to the foundation, methodology and applications of data mining.".

IEEE Fellow

IEEE Fellow 2014, "For contributions to data mining and knowledge discovery".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-Interest Multi-Round Conversational Recommendation System with Fuzzy Feedback Based User Simulator.
Trans. Recomm. Syst., December, 2024

Sentiment classification of movie reviews: a powerful method based on ensemble of classifiers and features.
Int. J. Mach. Learn. Cybern., December, 2024

Optimization of Graph Clustering Inspired by Dynamic Belief Systems.
IEEE Trans. Knowl. Data Eng., November, 2024

Bias-Reduced SDR Method for Locating a Noncooperative Moving Source Using TOAs and FOAs.
IEEE Trans. Aerosp. Electron. Syst., October, 2024

Counterfactual Explanation of the Shapley Value in Data Coalitions.
Proc. VLDB Endow., July, 2024

FairSample: Training Fair and Accurate Graph Convolutional Neural Networks Efficiently.
IEEE Trans. Knowl. Data Eng., April, 2024

Marginal likelihood estimation for the negative binomial INGARCH model.
Commun. Stat. Simul. Comput., April, 2024

Trustworthy Graph Neural Networks: Aspects, Methods, and Trends.
Proc. IEEE, February, 2024

Fast Shapley Value Computation in Data Assemblage Tasks as Cooperative Simple Games.
Proc. ACM Manag. Data, February, 2024

Database Native Model Selection: Harnessing Deep Neural Networks in Database Systems.
Proc. VLDB Endow., January, 2024

Protecting Data Buyer Privacy in Data Markets.
IEEE Internet Comput., 2024

Ask Questions with Double Hints: Visual Question Generation with Answer-awareness and Region-reference.
CoRR, 2024

EHRmonize: A Framework for Medical Concept Abstraction from Electronic Health Records using Large Language Models.
CoRR, 2024

A Comprehensive Survey on Data Augmentation.
CoRR, 2024

Powering In-Database Dynamic Model Slicing for Structured Data Analytics.
CoRR, 2024

Anytime Neural Architecture Search on Tabular Data.
CoRR, 2024

TrustLLM: Trustworthiness in Large Language Models.
CoRR, 2024

Linear-Time Graph Neural Networks for Scalable Recommendations.
Proceedings of the ACM on Web Conference 2024, 2024

Applications and Computation of the Shapley Value in Databases and Machine Learning.
Proceedings of the Companion of the 2024 International Conference on Management of Data, 2024

The Fourth International Workshop on Smart Data for Blockchain and Distributed Ledger (SDBD'24).
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024


Fair and Efficient Contribution Valuation for Vertical Federated Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Cost-Sensitive Learning for Medical Insurance Fraud Detection With Temporal Information.
IEEE Trans. Knowl. Data Eng., October, 2023

DP2-Pub: Differentially Private High-Dimensional Data Publication With Invariant Post Randomization.
IEEE Trans. Knowl. Data Eng., October, 2023

Reducing Bias for Multistatic Localization of a Moving Object by Transmitter at Unknown Position.
IEEE Trans. Aerosp. Electron. Syst., October, 2023

Decentralized Composite Optimization in Stochastic Networks: A Dual Averaging Approach With Linear Convergence.
IEEE Trans. Autom. Control., August, 2023

Permutation-Equivariant and Proximity-Aware Graph Neural Networks With Stochastic Message Passing.
IEEE Trans. Knowl. Data Eng., June, 2023

Multi-Behavior Sequential Recommendation With Temporal Graph Transformer.
IEEE Trans. Knowl. Data Eng., June, 2023

Fast Markov Clustering Algorithm Based on Belief Dynamics.
IEEE Trans. Cybern., June, 2023

Efficiently Cleaning Structured Event Logs: A Graph Repair Approach.
ACM Trans. Database Syst., March, 2023

Eigen-GNN: A Graph Structure Preserving Plug-in for GNNs.
IEEE Trans. Knowl. Data Eng., March, 2023

A Two-Step Simulated Annealing Algorithm for Spectral Data Feature Extraction.
Sensors, January, 2023

Applications of Differential Privacy in Social Network Analysis: A Survey.
IEEE Trans. Knowl. Data Eng., 2023

Design and Study of a Reflector-Separated Light Dispersion-Compensated 3D Microscopy System.
Sensors, 2023

Data and AI Model Markets: Opportunities for Data and Model Sharing, Discovery, and Integration.
Proc. VLDB Endow., 2023

Efficient Sampling Approaches to Shapley Value Approximation.
Proc. ACM Manag. Data, 2023

Graph Neural Networks for Natural Language Processing: A Survey.
Found. Trends Mach. Learn., 2023

A Summary of ICDE 2022 Research Session Panels.
IEEE Data Eng. Bull., 2023

Coherent Entity Disambiguation via Modeling Topic and Categorical Dependency.
CoRR, 2023

Instructed Language Models with Retrievers Are Powerful Entity Linkers.
CoRR, 2023

A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT.
CoRR, 2023

Bridge the Gap between Language models and Tabular Understanding.
CoRR, 2023

Knowledge-enhanced Neural Machine Reasoning: A Review.
CoRR, 2023

Offline Policy Evaluation in Large Action Spaces via Outcome-Oriented Action Grouping.
Proceedings of the ACM Web Conference 2023, 2023


Typos-aware Bottlenecked Pre-Training for Robust Dense Retrieval.
Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023

Preface: The 2023 ACM SIGKDD Workshop on Causal Discovery, Prediction and Decision.
Proceedings of the KDD'23 Workshop on Causal Discovery, 2023

Deep Learning on Graphs: Methods and Applications (DLG-KDD2023).
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Serverless Federated AUPRC Optimization for Multi-Party Collaborative Imbalanced Data Mining.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Graph Neural Networks: Foundation, Frontiers and Applications.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation.
Proceedings of the International Conference on Machine Learning, 2023

Disentangled Graph Social Recommendation.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Dynamic Shapley Value Computation.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Bias Reduced Semidefinite Relaxation Method for Multistatic Localization in the Absence of Transmitter Position And Its Synchronization.
Proceedings of the IEEE International Conference on Acoustics, 2023

Factual Observation Based Heterogeneity Learning for Counterfactual Prediction.
Proceedings of the Conference on Causal Learning and Reasoning, 2023

RUEL: Retrieval-Augmented User Representation with Edge Browser Logs for Sequential Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Clinical Assessment of Pneumocystosis with MIMIC Data.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

Structural Contrastive Pretraining for Cross-Lingual Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Alleviating Over-smoothing for Unsupervised Sentence Representation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

A Graph Fusion Approach for Cross-Lingual Machine Reading Comprehension.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Identify Event Causality with Knowledge and Analogy.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Two-Dimensional Functional Principal Component Analysis for Image Feature Extraction.
J. Comput. Graph. Stat., October, 2022

A Survey on Data Pricing: From Economics to Data Science.
IEEE Trans. Knowl. Data Eng., 2022

Optimal Estimation of Low-Rank Factors via Feature Level Data Fusion of Multiplex Signal Systems.
IEEE Trans. Knowl. Data Eng., 2022

Finding Multidimensional Simpson's Paradox.
SIGKDD Explor., 2022

Study of the Off-Axis Fresnel Zone Plate of a Microscopic Tomographic Aberration.
Sensors, 2022

On Shapley Value in Data Assemblage Under Independent Utility.
Proc. VLDB Endow., 2022

Data pricing in machine learning pipelines.
Knowl. Inf. Syst., 2022

Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization.
J. Mach. Learn. Res., 2022

Distilling Causal Metaknowledge from Knowledge Graphs.
IEEE Data Eng. Bull., 2022

Knowledge-Injected Federated Learning.
CoRR, 2022

Revealing Unfair Models by Mining Interpretable Evidence.
CoRR, 2022

Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation.
CoRR, 2022

Multi-level Contrastive Learning for Cross-lingual Spoken Language Understanding.
CoRR, 2022

Membership Privacy Protection for Image Translation Models via Adversarial Knowledge Distillation.
CoRR, 2022

Fair and efficient contribution valuation for vertical federated learning.
CoRR, 2022

Multiple Choice Questions based Multi-Interest Policy Learning for Conversational Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Robust Self-Supervised Structural Graph Neural Network for Social Network Prediction.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Heterogeneous Global Graph Neural Networks for Personalized Session-based Recommendation.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Transformer-Empowered Content-Aware Collaborative Filtering.
Proceedings of the Fourth Knowledge-aware and Conversational Recommender Systems Workshop co-located with 16th ACM Conference on Recommender Systems (RecSys 2022), 2022

Combining Unstructured Content and Knowledge Graphs into Recommendation Datasets.
Proceedings of the Fourth Knowledge-aware and Conversational Recommender Systems Workshop co-located with 16th ACM Conference on Recommender Systems (RecSys 2022), 2022

Revisiting Graph Contrastive Learning from the Perspective of Graph Spectrum.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Mining Minority-Class Examples with Uncertainty Estimates.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Improving Social Network Embedding via New Second-Order Continuous Graph Neural Networks.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Deep Learning on Graphs: Methods and Applications (DLG-KDD2022).
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Communication-Efficient Robust Federated Learning with Noisy Labels.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Toward Unified Data and Algorithm Fairness via Adversarial Data Augmentation and Adaptive Model Fine-tuning.
Proceedings of the IEEE International Conference on Data Mining, 2022

vChain+: Optimizing Verifiable Blockchain Boolean Range Queries.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Spatial-Temporal Hypergraph Self-Supervised Learning for Crime Prediction.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Improving Fairness for Data Valuation in Horizontal Federated Learning.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Lexicon-Enhanced Self-Supervised Training for Multilingual Dense Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Label-aware Multi-level Contrastive Learning for Cross-lingual Spoken Language Understanding.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

TrustLOG: The First Workshop on Trustworthy Learning on Graphs.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

From Good to Best: Two-Stage Training for Cross-Lingual Machine Reading Comprehension.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Cosine Model Watermarking against Ensemble Distillation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Visually aware recommendation with aesthetic features.
VLDB J., 2021

Influence Analysis in Evolving Networks: A Survey.
IEEE Trans. Knowl. Data Eng., 2021

Skyline Diagram: Efficient Space Partitioning for Skyline Queries.
IEEE Trans. Knowl. Data Eng., 2021

Group-Based Skyline for Pareto Optimal Groups.
IEEE Trans. Knowl. Data Eng., 2021

Demonstration of Dealer: An End-to-End Model Marketplace with Differential Privacy.
Proc. VLDB Endow., 2021

Dealer: An End-to-End Model Marketplace with Differential Privacy.
Proc. VLDB Endow., 2021

Comprehensible Counterfactual Explanation on Kolmogorov-Smirnov Test.
Proc. VLDB Endow., 2021

SlimChain: Scaling Blockchain Transactions through Off-Chain Storage and Parallel Processing.
Proc. VLDB Endow., 2021

Model complexity of deep learning: a survey.
Knowl. Inf. Syst., 2021

Multi-Choice Questions based Multi-Interest Policy Learning for Conversational Recommendation.
CoRR, 2021

From Good to Best: Two-Stage Training for Cross-lingual Machine Reading Comprehension.
CoRR, 2021

Improving Fairness for Data Valuation in Federated Learning.
CoRR, 2021

Achieving Model Fairness in Vertical Federated Learning.
CoRR, 2021

FedFair: Training Fair Models In Cross-Silo Federated Learning.
CoRR, 2021

CalibreNet: Calibration Networks for Multilingual Sequence Labeling.
Proceedings of the WSDM '21, 2021

Robust Counterfactual Explanations on Graph Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

The Third International Workshop on Smart Data for Blockchain and Distributed Ledger (SDBD2021): Joint Workshop with SIGKDD 2021 Trust Day.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Towards Fair Federated Learning.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

AsySQN: Faster Vertical Federated Learning Algorithms with Better Computation Resource Utilization.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

The Sixth International Workshop on Deep Learning on Graphs - Methods and Applications (DLG-KDD'21).
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Language Scaling: Applications, Challenges and Approaches.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Data Pricing and Data Asset Governance in the AI Era.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Auto-Split: A General Framework of Collaborative Edge-Cloud AI.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Automating Entity Matching Model Development.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Eclipse: Generalizing kNN and Skyline.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Finding Representative Interpretations on Convolutional Neural Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Modeling Event-Pair Relations in External Knowledge Graphs for Script Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Reasoning over Entity-Action-Location Graph for Procedural Text Understanding.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Reinforced Multi-Teacher Selection for Knowledge Distillation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Knowledge-Enhanced Hierarchical Graph Transformer Network for Multi-Behavior Recommendation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Personalized Cross-Silo Federated Learning on Non-IID Data.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Mining top-k sequential patterns in transaction database graphs.
World Wide Web, 2020

VLDB SI 2018 editorial.
VLDB J., 2020

Efficient Contour Computation of Group-Based Skyline.
IEEE Trans. Knowl. Data Eng., 2020

Continuous Influence Maximization.
ACM Trans. Knowl. Discov. Data, 2020

On spatial keyword covering.
Knowl. Inf. Syst., 2020

Feature Extraction of EEG Signals Based on Local Mean Decomposition and Fuzzy Entropy.
Int. J. Pattern Recognit. Artif. Intell., 2020

Comprehensible Counterfactual Interpretation on Kolmogorov-Smirnov Test.
CoRR, 2020

Differential Privacy and Its Applications in Social Network Analysis: A Survey.
CoRR, 2020

Accelerated Zeroth-Order Momentum Methods from Mini to Minimax Optimization.
CoRR, 2020

Personalized Federated Learning: An Attentive Collaboration Approach.
CoRR, 2020

Recent Advances on Graph Analytics and Its Applications in Healthcare.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Mining Implicit Relevance Feedback from User Behavior for Web Question Answering.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Data Pricing - From Economics to Data Science.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Measuring Model Complexity of Neural Networks with Curve Activation Functions.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

AM-GCN: Adaptive Multi-channel Graph Convolutional Networks.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Sinkhorn Regression.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Momentum-Based Policy Gradient Methods.
Proceedings of the 37th International Conference on Machine Learning, 2020

Exact and Consistent Interpretation of Piecewise Linear Models Hidden behind APIs: A Closed Form Solution.
Proceedings of the 36th IEEE International Conference on Data Engineering, 2020

LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Discrete Model Compression With Resource Constraint for Deep Neural Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Graph Representation of Semi-structured Data for Web Question Answering.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Cross-lingual Machine Reading Comprehension with Language Branch Knowledge Distillation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
SimRank*: effective and scalable pairwise similarity search based on graph topology.
VLDB J., 2019

Secure and Efficient Skyline Queries on Encrypted Data.
IEEE Trans. Knowl. Data Eng., 2019

A Survey on Network Embedding.
IEEE Trans. Knowl. Data Eng., 2019

Online Density Bursting Subgraph Detection from Temporal Graphs.
Proc. VLDB Endow., 2019

Finding Theme Communities from Database Networks.
Proc. VLDB Endow., 2019

Classification with label noise: a Markov chain sampling framework.
Data Min. Knowl. Discov., 2019

Nonconvex Zeroth-Order Stochastic ADMM Methods with Lower Function Query Complexity.
CoRR, 2019

Detecting Customer Complaint Escalation with Recurrent Neural Networks and Manually-Engineered Features.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Tackle Balancing Constraint for Incremental Semi-Supervised Support Vector Learning.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

AutoNE: Hyperparameter Optimization for Massive Network Embedding.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Mining Graphs and Networks: A 15-Year Journey.
Proceedings of the 8th International Workshop on Big Data, 2019

Learning From Networks: Algorithms, Theory, and Applications.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

ProGAN: Network Embedding via Proximity Generative Adversarial Network.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Conditional Random Field Enhanced Graph Convolutional Neural Networks.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Multi-Horizon Time Series Forecasting with Temporal Attention Learning.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Demystifying Dropout.
Proceedings of the 36th International Conference on Machine Learning, 2019

Is There a Data Science and Engineering Brain Drain? If So, How Can We Rebalance Them?
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Practicing the Art of Data Science.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

SkyRec: Finding Pareto Optimal Groups.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Tracking Top-k Influential Users with Relative Errors.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Association Rules.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

High-Order Proximity Preserved Embedding for Dynamic Networks.
IEEE Trans. Knowl. Data Eng., 2018

Cleaning Crowdsourced Labels Using Oracles For Statistical Classification.
Proc. VLDB Endow., 2018

Front Matter.
Proc. VLDB Endow., 2018

Subspace multi-clustering: a review.
Knowl. Inf. Syst., 2018

Tracking Top-K Influential Vertices in Dynamic Networks.
CoRR, 2018

Online Compact Convexified Factorization Machine.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics.
Proceedings of the 2018 International Conference on Management of Data, 2018

Arbitrary-Order Proximity Preserved Network Embedding.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Sketched Follow-The-Regularized-Leader for Online Factorization Machine.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Exact and Consistent Interpretation for Piecewise Linear Neural Networks: A Closed Form Solution.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Finding Maximal Significant Linear Representation between Long Time Series.
Proceedings of the IEEE International Conference on Data Mining, 2018

Mining Density Contrast Subgraphs.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Skyline Diagram: Finding the Voronoi Counterpart for Skyline Queries.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

TIMERS: Error-Bounded SVD Restart on Dynamic Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Tracking Influential Individuals in Dynamic Networks.
IEEE Trans. Knowl. Data Eng., 2017

Activity Maximization by Effective Information Diffusion in Social Networks.
IEEE Trans. Knowl. Data Eng., 2017

Editorial.
IEEE Trans. Knowl. Data Eng., 2017

Efficient Mining of Regional Movement Patterns in Semantic Trajectories.
Proc. VLDB Endow., 2017

Measuring in-network node similarity based on neighborhoods: a unified parametric approach.
Knowl. Inf. Syst., 2017

Finding multiple stable clusterings.
Knowl. Inf. Syst., 2017

Multidimensional Business Benchmarking Analysis on Data Warehouses.
Int. J. Data Warehous. Min., 2017

<i>JASIST</i> special issue on biomedical information retrieval.
J. Assoc. Inf. Sci. Technol., 2017

Multidimensional benchmarking in data warehouses.
Intell. Data Anal., 2017

Finding Theme Communities from Database Networks: from Mining to Indexing and Query Answering.
CoRR, 2017

Preference-driven similarity join.
Proceedings of the International Conference on Web Intelligence, 2017

Schemaless Join for Result Set Preferences.
Proceedings of the 2017 IEEE International Conference on Information Reuse and Integration, 2017

Secure Skyline Queries on Cloud Platform.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Principal Patern Mining on Graphs.
Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, Sydney, Australia, July 31, 2017

Community Preserving Network Embedding.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Online Visual Analytics of Text Streams.
IEEE Trans. Vis. Comput. Graph., 2016

EIC Editorial.
IEEE Trans. Knowl. Data Eng., 2016

State of the Journal Editorial.
IEEE Trans. Knowl. Data Eng., 2016

Scalable and Accurate Online Feature Selection for Big Data.
ACM Trans. Knowl. Discov. Data, 2016

Continuous similarity search for evolving queries.
Knowl. Inf. Syst., 2016

Efficient discovery of contrast subspaces for object explanation and characterization.
Knowl. Inf. Syst., 2016

Preface.
J. Comput. Sci. Technol., 2016

Discovering outlying aspects in large datasets.
Data Min. Knowl. Discov., 2016

Tracking Influential Nodes in Dynamic Networks.
CoRR, 2016

Activity Maximization by Effective Information Diffusion in Social Networks.
CoRR, 2016

Using Computer Intelligence for Depression Diagnosis and Crowdsourcing.
Computer, 2016

Preface.
Big Data Res., 2016

Continuous Influence Maximization: What Discounts Should We Offer to Social Network Users?
Proceedings of the 2016 International Conference on Management of Data, 2016

Asymmetric Transitivity Preserving Graph Embedding.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

When Social Influence Meets Item Inference.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Finding Gangs in War from Signed Networks.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Finding the minimum spatial keyword cover.
Proceedings of the 32nd IEEE International Conference on Data Engineering, 2016

Urban Traffic Prediction through the Second Use of Inexpensive Big Data from Buildings.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

Tradeoffs between density and size in extracting dense subgraphs: A unified framework.
Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2016

2015
Editorial.
IEEE Trans. Knowl. Data Eng., 2015

Classification with Streaming Features: An Emerging-Pattern Mining Approach.
ACM Trans. Knowl. Discov. Data, 2015

Proximity-Aware Local-Recoding Anonymization with MapReduce for Scalable Big Data Privacy Preservation in Cloud.
IEEE Trans. Computers, 2015

Finding Pareto Optimal Groups: Group-based Skyline.
Proc. VLDB Endow., 2015

ALID: Scalable Dominant Cluster Detection.
Proc. VLDB Endow., 2015

Mining multidimensional contextual outliers from categorical relational data.
Intell. Data Anal., 2015

Mining outlying aspects on numeric data.
Data Min. Knowl. Discov., 2015

In-Network Neighborhood-Based Node Similarity Measure: A Unified Parametric Model.
CoRR, 2015

Scalable Outlying-Inlying Aspects Discovery via Feature Ranking.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

Reliable Early Classification on Multivariate Time Series with Numerical and Categorical Attributes.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2015

COSNET: Connecting Heterogeneous Social Networks with Local and Global Consistency.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Tornado Forecasting with Multiple Markov Boundaries.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Cleaning structured event logs: A graph repair approach.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Mining Frequent Co-occurrence Patterns across Multiple Data Streams.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Efficiently Computing Top-K Shortest Path Join.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

2014
Mining most frequently changing component in evolving graphs.
World Wide Web, 2014

Malicious URL detection by dynamically mining patterns without pre-defined elements.
World Wide Web, 2014

Consensus-Based Ranking of Multivalued Objects: A Generalized Borda Count Approach.
IEEE Trans. Knowl. Data Eng., 2014

EIC Editorial.
IEEE Trans. Knowl. Data Eng., 2014

Editorial [State of the Transactions].
IEEE Trans. Knowl. Data Eng., 2014

Email mining: tasks, common techniques, and tools.
Knowl. Inf. Syst., 2014

A spatiotemporal compression based approach for efficient big data processing on Cloud.
J. Comput. Syst. Sci., 2014

Managing Data-Intensive Applications in the Cloud.
Computer, 2014

Shortest Unique Queries on Strings.
Proceedings of the String Processing and Information Retrieval, 2014

Efficient Matching of Substrings in Uncertain Sequences.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

How Can I Index My Thousands of Photos Effectively and Automatically? An Unsupervised Feature Selection Approach.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

An Iterative Fusion Approach to Graph-Based Semi-Supervised Learning from Multiple Views.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Mining Contrast Subspaces.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Structure-Aware Distance Measures for Comparing Clusterings in Graphs.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2014

Distance metric learning using dropout: a structured regularization approach.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Towards Scalable and Accurate Online Feature Selection for Big Data.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

SNOC: Streaming Network Node Classification.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

An Appliance-Driven Approach to Detection of Corrupted Load Curve Data.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Within-Network Classification Using Radius-Constrained Neighborhood Patterns.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Do neighbor buddies make a difference in reblog likelihood? An analysis on SINA Weibo data.
Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2014

Pattern-Growth Methods.
Proceedings of the Frequent Pattern Mining, 2014

2013
Finding email correspondents in online social networks.
World Wide Web, 2013

A vlHMM approach to context-aware search.
ACM Trans. Web, 2013

Editorial [2012 & 2013 Associate Editors].
IEEE Trans. Knowl. Data Eng., 2013

Clustering Uncertain Data Based on Probability Distribution Similarity.
IEEE Trans. Knowl. Data Eng., 2013

Introduction to the Special Issue ACM SIGKDD 2012.
ACM Trans. Knowl. Discov. Data, 2013

Mining search and browse logs for web search: A Survey.
ACM Trans. Intell. Syst. Technol., 2013

More is Simpler: Effectively and Efficiently Assessing Node-Pair Similarities Based on Hyperlinks.
Proc. VLDB Endow., 2013

A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series.
Proc. VLDB Endow., 2013

Skyline distance: a measure of multidimensional competence.
Knowl. Inf. Syst., 2013

Recommendations for two-way selections using skyline view queries.
Knowl. Inf. Syst., 2013

What distinguish one from its peers in social networks?
Data Min. Knowl. Discov., 2013

Is My Electricity Bill Accurate? A Model-Driven Approach to Corrupted Load Data Identification.
CoRR, 2013

Some New Progress in Analyzing and Mining Uncertain and Probabilistic Data for Big Data Analytics.
Proceedings of the Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing, 2013

Parallel field alignment for cross media retrieval.
Proceedings of the ACM Multimedia Conference, 2013

Price Information Patterns in Web Search Advertising: An Empirical Case Study on Accommodation Industry.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Mining Statistically Significant Sequential Patterns.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Mining Probabilistic Frequent Spatio-Temporal Sequential Patterns with Gap Constraints from Uncertain Databases.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

On shortest unique substring queries.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Towards Cohesive Anomaly Mining.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays.
IEEE Trans. Parallel Distributed Syst., 2012

Aggregate keyword search on large relational databases.
Knowl. Inf. Syst., 2012

Early classification on time series.
Knowl. Inf. Syst., 2012

Probabilistic skylines on uncertain data: model and bounding-pruning-refining methods.
J. Intell. Inf. Syst., 2012

Efficient and Effective Aggregate Keyword Search on Relational Databases.
Int. J. Data Warehous. Min., 2012

Clustering in applications with multiple data sources - A mutual subspace clustering approach.
Neurocomputing, 2012

Top-10 Data Mining Case Studies.
Int. J. Inf. Technol. Decis. Mak., 2012

Multi-level relationship outlier detection.
Int. J. Bus. Intell. Data Min., 2012

Mining query subtopics from search log data.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

A practical method for estimating performance degradation on multicore processors, and its application to HPC workloads.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

Community Preserving Lossy Compression of Social Networks.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Random Error Reduction in Similarity Search on Time Series: A Statistical Approach.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Aggregate queries on probabilistic record linkages.
Proceedings of the 15th International Conference on Extending Database Technology, 2012

On compressing weighted time-evolving graphs.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Ranking Queries on Uncertain Data
Advances in Database Systems 42, Kluwer, ISBN: 978-1-4419-9379-3, 2011

Ranking queries on uncertain data.
VLDB J., 2011

Can the Utility of Anonymized Data be Used for Privacy Breaches?
ACM Trans. Knowl. Discov. Data, 2011

Mining Concept Sequences from Large-Scale Search Logs for Context-Aware Query Suggestion.
ACM Trans. Intell. Syst. Technol., 2011

On Pruning for Top-K Ranking in Uncertain Databases.
Proc. VLDB Endow., 2011

Best papers from the Fifth International Conference on Advanced Data Mining and Applications (ADMA 2009).
Knowl. Inf. Syst., 2011

The <i>k</i>-anonymity and <i>l</i>-diversity approaches for privacy preservation in social networks against neighborhood attacks.
Knowl. Inf. Syst., 2011

Ranking uncertain sky: The probabilistic top-k skyline operator.
Inf. Syst., 2011

Publishing anonymous survey rating data.
Data Min. Knowl. Discov., 2011

Multidimensional mining of large-scale search logs: a topic-concept cube approach.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

Citation recommendation without author supervision.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

On k-skip shortest paths.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Privacy-aware data management in information networks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Enhancing web search by mining search and browse logs.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Extracting Interpretable Features for Early Classification on Time Series.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Towards bounding sequential patterns.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Outlier detection on uncertain data: Objects, instances, and inferences.
Proceedings of the 27th International Conference on Data Engineering, 2011

Early Classification on Temporal Sequences.
Proceedings of the Extraction et gestion des connaissances (EGC'2011), 2011

Data Mining: Concepts and Techniques, 3rd edition
Morgan Kaufmann, ISBN: 978-0123814791, 2011

2010
Mining discriminative items in multiple data streams.
World Wide Web, 2010

Threshold-based probabilistic top-<i>k</i> dominating queries.
VLDB J., 2010

Superseding Nearest Neighbor Search on Uncertain Spatial Databases.
IEEE Trans. Knowl. Data Eng., 2010

Probabilistic Reverse Nearest Neighbor Queries on Uncertain Data.
IEEE Trans. Knowl. Data Eng., 2010

A brief survey on sequence classification.
SIGKDD Explor., 2010

Special issue on the best papers of SDM'10.
Stat. Anal. Data Min., 2010

Computing Closed Skycubes.
Proc. VLDB Endow., 2010

A binary decision diagram based approach for mining frequent subsequences.
Knowl. Inf. Syst., 2010

Exploring Disease Association from the NHANES Data: Data Mining, Pattern Summarization, and Visual Analytics.
Int. J. Data Warehous. Min., 2010

Towards Progressive and Load Balancing Distributed Computation: A Case Study on Skyline Analysis.
J. Comput. Sci. Technol., 2010

Document clustering of scientific texts using citation contexts.
Inf. Retr., 2010

Hierarchical distributed data classification in wireless sensor networks.
Comput. Commun., 2010

Web search/browse log mining: challenges, methods, and applications.
Proceedings of the 19th International Conference on World Wide Web, 2010

Context-aware citation recommendation.
Proceedings of the 19th International Conference on World Wide Web, 2010

Logging every footstep: quantile summaries for the entire history.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Context-aware ranking in web search.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Search and browse log mining for web information retrieval: challenges, methods, and applications.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Neighbor query friendly compression of social networks.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Probabilistic Inference Protection on Anonymized Data.
Proceedings of the ICDM 2010, 2010

Correlation hiding by independence masking.
Proceedings of the 26th International Conference on Data Engineering, 2010

Probabilistic path queries in road networks: traffic uncertainty aware path selection.
Proceedings of the EDBT 2010, 2010

2009
Association Rules.
Proceedings of the Encyclopedia of Database Systems, 2009

Top-<i>k</i> typicality queries and efficient query answering methods on large databases.
VLDB J., 2009

Anonymization-based attacks in privacy-preserving data publishing.
ACM Trans. Database Syst., 2009

Online Skyline Analysis with Dynamic Preferences on Nominal Attributes.
IEEE Trans. Knowl. Data Eng., 2009

Continuous K-Means Monitoring with Low Reporting Cost in Sensor Networks.
IEEE Trans. Knowl. Data Eng., 2009

Link spam target detection using page farms.
ACM Trans. Knowl. Discov. Data, 2009

Mining frequent cross-graph quasi-cliques.
ACM Trans. Knowl. Discov. Data, 2009

Summary of the first ACM SIGKDD workshop on knowledge discovery from uncertain data (U'09).
SIGKDD Explor., 2009

PADS: a simple yet effective pattern-aware dynamic search method for fast maximal frequent pattern mining.
Knowl. Inf. Syst., 2009

Continuously monitoring top-<i>k</i> uncertain data streams: a probabilistic threshold method.
Distributed Parallel Databases, 2009

Anonymization with Worst-Case Distribution-Based Background Knowledge
CoRR, 2009

OrthoClusterDB: an online platform for synteny blocks.
BMC Bioinform., 2009

News article extraction with template-independent wrapper.
Proceedings of the 18th International Conference on World Wide Web, 2009

Towards context-aware search by learning a very large variable length hidden markov model from search logs.
Proceedings of the 18th International Conference on World Wide Web, 2009

MobileMiner: a real world case study of data mining in mobile communication.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2009

Understanding Importance of Collaborations in Co-authorship Networks: A Supportiveness Analysis Approach.
Proceedings of the SIAM International Conference on Data Mining, 2009

Debt Detection in Social Security by Sequence Classification Using Both Positive and Negative Patterns.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

OLAP on search logs: an infrastructure supporting data-driven applications in search engines.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Can we learn a template-independent wrapper for news article extraction from a single training site?
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Early Prediction on Time Series: A Nearest Neighbor Approach.
Proceedings of the IJCAI 2009, 2009

Distance-Based Representative Skyline.
Proceedings of the 25th International Conference on Data Engineering, 2009

Privacy Preserving Publishing on Multiple Quasi-identifiers.
Proceedings of the 25th International Conference on Data Engineering, 2009

Online Interval Skyline Queries on Time Series.
Proceedings of the 25th International Conference on Data Engineering, 2009

Answering aggregate keyword queries on relational databases using minimal group-bys.
Proceedings of the EDBT 2009, 2009

Continuous privacy preserving publishing of data streams.
Proceedings of the EDBT 2009, 2009

Efficiently indexing shortest paths by exploiting symmetry in graphs.
Proceedings of the EDBT 2009, 2009

Personalizing entity detection and recommendation with a fusion of web log mining techniques.
Proceedings of the EDBT 2009, 2009

MAPO: Mining and Recommending API Usage Patterns.
Proceedings of the ECOOP 2009, 2009

Detecting topic evolution in scientific literature: how can citations help?
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Towards Web Search Engine Scale Data Mining.
Proceedings of the Eighth Australasian Data Mining Conference, AusDM 2009, Melbourne, 2009

2008
Privacy-Preserving Data Stream Classification.
Proceedings of the Privacy-Preserving Data Mining - Models and Algorithms, 2008

A Survey of Utility-based Privacy-Preserving Data Transformation Methods.
Proceedings of the Privacy-Preserving Data Mining - Models and Algorithms, 2008

Anonymization by Local Recoding in Data with Attribute Hierarchical Taxonomies.
IEEE Trans. Knowl. Data Eng., 2008

A brief survey on anonymization techniques for privacy preserving publishing of social network data.
SIGKDD Explor., 2008

Advances in information and knowledge management.
SIGIR Forum, 2008

Efficient skyline querying with variable user preferences on nominal attributes.
Proc. VLDB Endow., 2008

Clustering by Pattern Similarity.
J. Comput. Sci. Technol., 2008

Managing Uncertain Data: Probabilistic Approaches.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

PLEDS: A Personalized Entity Detection System Based on Web Log Mining Techniques.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

Query answering techniques on uncertain and probabilistic data: tutorial summary.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Ranking queries on uncertain data: a probabilistic threshold approach.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

DiMaC: a system for cleaning disguised missing data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

A Spamicity Approach to Web Spam Detection.
Proceedings of the SIAM International Conference on Data Mining, 2008

Mining Sequence Classifiers for Early Prediction.
Proceedings of the SIAM International Conference on Data Mining, 2008

Fast and quality-guaranteed data streaming in resource-constrained sensor networks.
Proceedings of the 9th ACM Interational Symposium on Mobile Ad Hoc Networking and Computing, 2008

Mining preferences from superior and inferior examples.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

DiMaC: a disguised missing data cleaning tool.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Context-aware query suggestion by mining click-through and session data.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Publishing Sensitive Transactions for Itemset Utility.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Preserving Privacy in Social Networks Against Neighborhood Attacks.
Proceedings of the 24th International Conference on Data Engineering, 2008

Efficiently Answering Probabilistic Threshold Top-k Queries on Uncertain Data.
Proceedings of the 24th International Conference on Data Engineering, 2008

OrthoCluster: a new tool for mining synteny blocks and applications in comparative genomics.
Proceedings of the EDBT 2008, 2008

Anonymity for continuous data publishing.
Proceedings of the EDBT 2008, 2008

2007
Multi-Dimensional Analysis of Data Streams Using Stream Cubes.
Proceedings of the Data Streams - Models and Algorithms, 2007

Sequence Data Mining
Advances in Database Systems 33, Kluwer, ISBN: 978-0-387-69936-3, 2007

An Energy-Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation.
IEEE Trans. Parallel Distributed Syst., 2007

Efficient Skyline and Top-k Retrieval in Subspaces.
IEEE Trans. Knowl. Data Eng., 2007

Introduction to the special issue on data mining for health informatics.
SIGKDD Explor., 2007

Mining gene-sample-time microarray data: a coherent gene cluster discovery approach.
Knowl. Inf. Syst., 2007

Answering ad hoc aggregate queries from data streams using prefix aggregate trees.
Knowl. Inf. Syst., 2007

Constraint-based sequential pattern mining: the pattern-growth methods.
J. Intell. Inf. Syst., 2007

Active Rules Termination Analysis Through Conditional Formula Containing Updatable Variable.
Proceedings of the Advances in Data and Web Management, 2007

(<i>alpha</i>, <i>k</i>)-anonymity Based Privacy Preservation by Lossy Join.
Proceedings of the Advances in Data and Web Management, 2007

Minimality Attack in Privacy Preserving Data Publishing.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Probabilistic Skylines on Uncertain Data.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Efficiently Answering Top-k Typicality Queries on Large Databases.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Maintaining K-Anonymity against Incremental Updates.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

Mining API patterns as partial orders from source code: from usage scenarios to specifications.
Proceedings of the 6th joint meeting of the European Software Engineering Conference and the ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2007

Sketching Landscapes of Page Farms.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

WAT: Finding Top-K Discords in Time Series Database.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

Mining Frequent Trajectory Patterns for Activity Monitoring Using Radio Frequency Tag Arrays.
Proceedings of the Fifth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom 2007), 2007

Mining favorable facets.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Cleaning disguised missing data: a heuristic approach.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Mining Software Engineering Data.
Proceedings of the 29th International Conference on Software Engineering (ICSE 2007), 2007

Computing Compressed Multidimensional Skyline Cubes Efficiently.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Ix-cubes: iceberg cubes for data warehousing and olap on xml data.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

TS-Trees: A Non-Alterable Search Tree Index for Trustworthy Databases on Write-Once-Read-Many (WORM) Storage.
Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007

2006
Towards multidimensional subspace skyline analysis.
ACM Trans. Database Syst., 2006

Closed Constrained Gradient Mining in Retail Databases.
IEEE Trans. Knowl. Data Eng., 2006

Discovering Frequent Closed Partial Orders from Strings.
IEEE Trans. Knowl. Data Eng., 2006

Regression Cubes with Lossless Compression and Aggregation.
IEEE Trans. Knowl. Data Eng., 2006

Utility-based anonymization for privacy preservation with less information loss.
SIGKDD Explor., 2006

Mining changing regions from access-constrained snapshots: a cluster-embedded decision tree approach.
J. Intell. Inf. Syst., 2006

Mining Co-Location Patterns with Rare Events from Spatial Data Sets.
GeoInformatica, 2006

An Erratum on "Pushing Convertible Constraints in Frequent Itemset Mining".
Data Min. Knowl. Discov., 2006

Using High Dimensional Indexes to Support Relevance Feedback Based Interactive Images Retrival.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

MAPO: mining API usages from open source repositories.
Proceedings of the 2006 International Workshop on Mining Software Repositories, 2006

Utility-based anonymization using local recoding.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Suppressing model overfitting in mining concept-drifting data streams.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

On privacy preservation against adversarial data mining.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

An effective approach to entity resolution problem using quasi-clique and its application to digital libraries.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2006

Improving Grouped-Entity Resolution Using Quasi-Cliques.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

SUBSKY: Efficient Computation of Skylines in Subspaces.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Granularity Adaptive Density Estimation and on Demand Clustering of Concept-Drifting Data Streams.
Proceedings of the Data Warehousing and Knowledge Discovery, 8th International Conference, 2006

Achieving <i>k</i>-Anonymity by Clustering in Attribute Hierarchical Structures.
Proceedings of the Data Warehousing and Knowledge Discovery, 8th International Conference, 2006

Classification spanning correlated data streams.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Minimum Description Length Principle: Generators Are Preferable to Closed Patterns.
Proceedings of the Proceedings, 2006

2005
An Interactive Approach to Mining Gene Expression Data.
IEEE Trans. Knowl. Data Eng., 2005

Preference-Based Frequent Pattern Mining.
Int. J. Data Warehous. Min., 2005

Book Review on "Out of Their Minds: The Lives and Discoveries of 15 Great Computer Scientists".
J. Comput. Sci. Technol., 2005

Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams.
Distributed Parallel Databases, 2005

A Stratification-Based Approach to Accurate and Fast Image Annotation.
Proceedings of the Advances in Web-Age Information Management, 2005

Catching the Best Views of Skyline: A Semantic Approach Based on Decisive Subspaces.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Mining Most General Multidimensional Summarization of Probably Groups in Data Warehouses.
Proceedings of the 17th International Conference on Scientific and Statistical Database Management, 2005

GraphMiner: a structural pattern-mining system for large disk-based graph databases and its applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

A dynamic clustering and scheduling approach to energy saving in data collection from wireless sensor networks.
Proceedings of the Second Annual IEEE Communications Society Conference on Sensor and Ad Hoc Communications and Networks, 2005

Cross Table Cubing: Mining Iceberg Cubes from Data Warehouses.
Proceedings of the 2005 SIAM International Conference on Data Mining, 2005

A Random Method for Quantifying Changing Distributions in Data Streams.
Proceedings of the Knowledge Discovery in Databases: PKDD 2005, 2005

Pattern-based similarity search for microarray data.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

On mining cross-graph quasi-cliques.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Efficiently Mining Frequent Closed Partial Orders.
Proceedings of the 5th IEEE International Conference on Data Mining (ICDM 2005), 2005

Online Mining of Data Streams: Applications, Techniques and Progress.
Proceedings of the 21st International Conference on Data Engineering, 2005

Mining Cross-Graph Quasi-Cliques in Gene Expression and Protein Interaction Data.
Proceedings of the 21st International Conference on Data Engineering, 2005

A General Approach to Mining Quality Pattern-Based Clusters from Microarray Data.
Proceedings of the Database Systems for Advanced Applications, 2005

Mining Succinct Systems of Minimal Generators of Formal Concepts.
Proceedings of the Database Systems for Advanced Applications, 2005

2004
Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach.
IEEE Trans. Knowl. Data Eng., 2004

Mining Constrained Gradients in Large Databases.
IEEE Trans. Knowl. Data Eng., 2004

Mining Condensed Frequent-Pattern Bases.
Knowl. Inf. Syst., 2004

From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach.
J. Comput. Sci. Technol., 2004

Pushing Convertible Constraints in Frequent Itemset Mining.
Data Min. Knowl. Discov., 2004

Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach.
Data Min. Knowl. Discov., 2004

GPX: Interactive Mining of Gene Expression Data.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

A Fast Algorithm for Subspace Clustering by Pattern Similarity.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

Efficient Pattern-Growth Methods for Frequent Tree Pattern Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2004

Scalable mining of large disk-based graph databases.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Mining coherent gene clusters from gene-sample-time microarray data.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

A rank sum test method for informative gene discovery.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Data Mining for Intrusion Detection: Techniques, Applications and Systems.
Proceedings of the 20th International Conference on Data Engineering, 2004

Preface to CoMWIM 2004.
Proceedings of the Conceptual Modeling for Advanced Application Domains, 2004


2003
Towards interactive exploration of gene expression patterns.
SIGKDD Explor., 2003

Recent Progress on Selected Topics in Database Research - A Report by Nine Young Chinese Researchers Working in the United States.
J. Comput. Sci. Technol., 2003

Efficacious Data Cube Exploration by Semantic Summarization and Compression.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

SOCQET: Semantic OLAP with Compressed Cube and Summarization.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

QC-Trees: An Efficient Summary Structure for Semantic OLAP.
Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, 2003

ApproxMAP: Approximate Mining of Consensus Sequential Patterns.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

Mining Confident Colocation Rules without A Support Threshold.
Proceedings of the 2003 ACM Symposium on Applied Computing (SAC), 2003

CLOSET+: searching for the best strategies for mining frequent closed itemsets.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Mining phenotypes and informative genes from gene expression data.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Interactive exploration of coherent patterns in time-series gene expression data.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

MaPle: A Fast Algorithm for Maximal Pattern-based Clustering.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

A General Model for Online Analytical Processing of Complex Data.
Proceedings of the Conceptual Modeling, 2003

DHC: A Density-Based Hierarchical Clustering Method for Time Series Gene Expression Dat.
Proceedings of the 3rd IEEE International Symposium on BioInformatics and BioEngineering (BIBE 2003), 2003

2002
Constrained frequent pattern mining: a pattern-growth view.
SIGKDD Explor., 2002

Quotient Cube: How to Summarize the Semantics of a Data Cube.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

COMMIX: towards effective web information extraction, integration and query answering.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

CubeExplorer: online exploration of data cubes.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

On Computing Condensed Frequent Pattern Bases.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

Online Analytical Processing Stream Data: Is It Feasible?
Proceedings of the 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2002

Mining sequential patterns with constraints in large databases.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

2001
Mining Multi-Dimensional Constrained Gradients in Data Cubes.
Proceedings of the VLDB 2001, 2001

DNA-Miner: A System Prototype for Mining DNA Sequences.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Efficient Computation of Iceberg Cubes with Complex Measures.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Scalable frequent-pattern mining methods: an overview.
Proceedings of the Tutorial notes of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth.
Proceedings of the 17th International Conference on Data Engineering, 2001

Mining Frequent Item Sets with Convertible Constraints.
Proceedings of the 17th International Conference on Data Engineering, 2001

Fault-Tolerant Frequent Pattern Mining: Problems and Challenges.
Proceedings of the 2001 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2001

Multi-Dimensional Sequential Pattern Mining.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
Mining Frequent Patterns by Pattern-Growth: Methodology and Implications.
SIGKDD Explor., 2000

Towards Data Mining Benchmarking: A Testbed for Performance Study of Frequent Pattern Mining.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

Mining Frequent Patterns without Candidate Generation.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

Mining Access Patterns Efficiently from Web Logs.
Proceedings of the Knowledge Discovery and Data Mining, 2000

Can we push more constraints into frequent pattern mining?
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

FreeSpan: frequent pattern-projected sequential pattern mining.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets.
Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2000


  Loading...