Jiawei Han

Orcid: 0000-0002-3629-2696

Affiliations:
  • University of Illinois at Urbana-Champaign, Department of Computer Science, IL, USA
  • Simon Fraser University, School of Computing Science, Canada (former)


According to our database1, Jiawei Han authored at least 1,148 papers between 1985 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Large Language Models on Graphs: A Comprehensive Survey.
IEEE Trans. Knowl. Data Eng., December, 2024

TeKo: Text-Rich Graph Neural Networks With External Knowledge.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

Schema-Guided Culture-Aware Complex Event Simulation with Multi-Agent Role-Play.
CoRR, 2024

Scaling Diffusion Language Models via Adaptation from Autoregressive Models.
CoRR, 2024

RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph.
CoRR, 2024

InstructG2I: Synthesizing Images from Multimodal Attributed Graphs.
CoRR, 2024

Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG.
CoRR, 2024

Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval.
CoRR, 2024

A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts.
CoRR, 2024

Law of the Weakest Link: Cross Capabilities of Large Language Models.
CoRR, 2024

CoDiCast: Conditional Diffusion Model for Weather Prediction with Uncertainty Quantification.
CoRR, 2024

Investigating Instruction Tuning Large Language Models on Graphs.
CoRR, 2024

Unsupervised Episode Detection for Large-Scale News Events.
CoRR, 2024

Establishing Knowledge Preference in Language Models.
CoRR, 2024

F-FOMAML: GNN-Enhanced Meta-Learning for Peak Period Demand Forecasting with Proxy Data.
CoRR, 2024

KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge.
CoRR, 2024

Automated Construction of Theme-specific Knowledge Graphs.
CoRR, 2024

Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs.
CoRR, 2024

TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision.
CoRR, 2024

Multi-LoRA Composition for Image Generation.
CoRR, 2024

A Unified Taxonomy-Guided Instruction Tuning Framework for Entity Set Expansion and Taxonomy Expansion.
CoRR, 2024

Investigating Data Contamination for Pre-training Language Models.
CoRR, 2024

TrustLLM: Trustworthiness in Large Language Models.
CoRR, 2024

Molecule Maker Lab Institute: Accelerating, advancing, and democratizing molecular innovation.
AI Mag., 2024

Improving Retrieval in Theme-specific Applications using a Corpus Topical Taxonomy.
Proceedings of the ACM on Web Conference 2024, 2024

Bridging Text Data and Graph Data: Towards Semantics and Structure-aware Knowledge Discovery.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Geospatial Topological Relation Extraction from Text with Knowledge Augmentation.
Proceedings of the 2024 SIAM International Conference on Data Mining, 2024

TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

MART: Improving LLM Safety with Multi-round Automatic Red-Teaming.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Automated Mining of Structured Knowledge from Text in the Era of Large Language Models.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Ontology Enrichment for Effective Fine-grained Entity Typing.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

OntoType: Ontology-Guided and Pre-Trained Language Model Assisted Fine-Grained Entity Typing.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

SLOG: An Inductive Spectral Graph Neural Network Beyond Polynomial Filter.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Structured Chemistry Reasoning with Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Models as Semantic Indexers.
Proceedings of the Forty-first International Conference on Machine Learning, 2024


Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Representation Deficiency in Masked Language Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Instruct, Not Assist: LLM-based Multi-Turn Planning and Hierarchical Questioning for Socratic Code Debugging.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Taxonomy-guided Semantic Indexing for Academic Paper Search.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Topic-Oriented Open Relation Extraction with A Priori Seed Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

ActionIE: Action Extraction from Scientific Literature with Programming Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Text2DB: Integration-Aware Information Extraction with Large Language Model Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Seed-Guided Fine-Grained Entity Typing in Science and Engineering Domains.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
GCN for HIN via Implicit Utilization of Attention and Meta-Paths.
IEEE Trans. Knowl. Data Eng., April, 2023

Entity Linking Meets Deep Learning: Techniques and Solutions.
IEEE Trans. Knowl. Data Eng., March, 2023

Embedding text-rich graph neural networks with sequence and topical semantic structures.
Knowl. Inf. Syst., February, 2023

The MAPLE Benchmark for Graph Mining.
Dataset, February, 2023

The MAPLE Benchmark for Graph Mining.
Dataset, February, 2023

The MAPLE Benchmark for Scientific Literature Tagging.
Dataset, February, 2023

Structured Chemistry Reasoning with Large Language Models.
CoRR, 2023

"Why Should I Review This Paper?" Unifying Semantic, Topic, and Citation Factors for Paper-Reviewer Matching.
CoRR, 2023

The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions.
CoRR, 2023

Learning Multiplex Embeddings on Text-rich Networks with One Text Encoder.
CoRR, 2023

Explaining and Adapting Graph Conditional Shift.
CoRR, 2023

PromptClass: Weakly-Supervised Text Classification with Prompting Enhanced Noise-Robust Self-Training.
CoRR, 2023

OntoType: Ontology-Guided Zero-Shot Fine-Grained Entity Typing with Weak Supervision from Pre-Trained Language Models.
CoRR, 2023

MEGClass: Text Classification with Extremely Weak Supervision via Mutually-Enhancing Text Granularities.
CoRR, 2023

The Effect of Metadata on Scientific Literature Tagging: A Cross-Field Cross-Model Study.
Proceedings of the ACM Web Conference 2023, 2023

SCStory: Self-supervised and Continual Online Story Discovery.
Proceedings of the ACM Web Conference 2023, 2023

PDSum: Prototype-driven Continuous Summarization of Evolving Multi-document Sets Stream.
Proceedings of the ACM Web Conference 2023, 2023

Unsupervised Event Chain Mining from Multiple Documents.
Proceedings of the ACM Web Conference 2023, 2023


Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

FineSum: Target-Oriented, Fine-Grained Opinion Summarization.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Unsupervised Story Discovery from Continuous News Streams via Scalable Thematic Embedding.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Taxonomy-Guided Fine-Grained Entity Set Expansion.
Proceedings of the 2023 SIAM International Conference on Data Mining, 2023

Corpus-Based Relation Extraction by Identifying and Refining Relation Patterns.
Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

Weakly Supervised Multi-Label Classification of Full-Text Scientific Papers.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Pretrained Language Representations for Text Understanding: A Weakly-Supervised Perspective.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Heterformer: Transformer-based Deep Node Representation Learning on Heterogeneous Text-Rich Networks.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations at Twitter.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning.
Proceedings of the International Conference on Machine Learning, 2023

Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Revisiting Citation Prediction with Cluster-Aware Text-Enhanced Heterogeneous Graph Neural Networks.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

Geospatial Knowledge Hypercube.
Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems, 2023

PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

RESIN-EDITOR: A Schema-guided Hierarchical Event Graph Visualizer and Editor.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GLEN: General-Purpose Event Detection for Thousands of Types.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MEGClass: Extremely Weakly Supervised Text Classification via Mutually-Enhancing Text Granularities.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Instruct and Extract: Instruction Tuning for On-Demand Information Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Augmenting Zero-Shot Dense Retrievers with Plug-in Mixture-of-Memories.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Reaction Miner: An Integrated System for Chemical Reaction Extraction from Textual Data.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Large Language Models Can Self-Improve.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Mining Structures from Massive Texts by Exploring the Power of Pre-trained Language Models.
Proceedings of the Proceedings 26th International Conference on Extending Database Technology, 2023

Large Foundational Model is a Blessing to Natural Language Understanding and Data Mining.
Proceedings of the 5th IEEE International Conference on Cognitive Machine Intelligence, 2023

ReactIE: Enhancing Chemical Reaction Extraction with Weak Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Compositional Data Augmentation for Abstractive Conversation Summarization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Patton: Language Model Pretraining on Text-Rich Networks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Text Augmented Open Knowledge Graph Completion via Pre-Trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

DynaMiTE: Discovering Explosive Topic Evolutions with User Guidance.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Automated Taxonomy Discovery and Exploration
Synthesis Lectures on Data Mining and Knowledge Discovery, Springer, ISBN: 978-3-031-11405-2, 2022

Heterogeneous Network Representation Learning: A Unified Framework With Survey and Benchmark.
IEEE Trans. Knowl. Data Eng., 2022

Toward Tweet Entity Linking With Heterogeneous Information Networks.
IEEE Trans. Knowl. Data Eng., 2022

Guest Editorial: Special Issue on Network Structural Modeling and Learning in Big Data.
IEEE Trans. Big Data, 2022

Heterogeneous Information Networks: the Past, the Present, and the Future.
Proc. VLDB Endow., 2022

Towards a Unified Multi-Dimensional Evaluator for Text Generation.
CoRR, 2022

Few-shot Text Classification with Dual Contrastive Consistency.
CoRR, 2022

TwHIN-BERT: A Socially-Enriched Pre-trained Language Model for Multilingual Tweet Representations.
CoRR, 2022

TeKo: Text-Rich Graph Neural Networks with External Knowledge.
CoRR, 2022

All Birds with One Stone: Multi-task Text Classification for Efficient Inference with One Forward Pass.
CoRR, 2022

Heterformer: A Transformer Architecture for Node Representation Learning on Heterogeneous Text-Rich Networks.
CoRR, 2022

CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation.
CoRR, 2022

Shift-Robust Node Classification via Graph Adversarial Clustering.
CoRR, 2022

PILED: An Identify-and-Localize Framework for Few-Shot Event Detection.
CoRR, 2022

Unsupervised Summarization with Customized Granularities.
CoRR, 2022

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

KoMen: Domain Knowledge Guided Interaction Recommendation for Emerging Scenarios.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

TaxoEnrich: Self-Supervised Taxonomy Completion via Structure-Semantic Representations.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022


MotifClass: Weakly Supervised Text Classification with Higher-order Metadata Information.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Generating Training Data with Language Models: Towards Zero-Shot Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Unsupervised Key Event Detection from Massive Text Corpora.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

New Frontiers of Scientific Text Mining: Tasks, Data, and Tools.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Adapting Pretrained Representations for Text Mining.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Chemical-Reaction-Aware Molecule Representation Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators.
Proceedings of the Tenth International Conference on Learning Representations, 2022

THINK: Temporal Hypergraph Hyperbolic Network.
Proceedings of the IEEE International Conference on Data Mining, 2022

Unsupervised Multi-Granularity Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Towards a Unified Multi-Dimensional Evaluator for Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PALT: Parameter-Lite Transfer of Language Models for Knowledge Graph Completion.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CiteSum: Citation Text-guided Scientific Extreme Summarization and Domain Adaptation with Limited Supervision.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Open Relation and Event Type Discovery with Type Abstraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Open-Vocabulary Argument Role Prediction For Event Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

MentorGNN: Deriving Curriculum for Pre-Training GNNs.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Entity Set Co-Expansion in StackOverflow.
Proceedings of the IEEE International Conference on Big Data, 2022

REACTCLASS: Cross-Modal Supervision for Subword-Guided Reactant Entity Classification.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2022

Schema-Guided Event Graph Completion.
Proceedings of the 4th Conference on Automated Knowledge Base Construction, 2022

Eider: Empowering Document-level Relation Extraction with Efficient Evidence Extraction and Inference-stage Fusion.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Phrase-aware Unsupervised Constituency Parsing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Fine-Grained Opinion Summarization with Minimal Supervision.
CoRR, 2021

Multi-head or Single-head? An Empirical Comparison for Transformer Training.
CoRR, 2021

Eider: Evidence-enhanced Document-level Relation Extraction.
CoRR, 2021

Future is not One-dimensional: Graph Modeling based Complex Event Schema Induction for Event Prediction.
CoRR, 2021

Who Should Go First? A Self-Supervised Concept Sorting Model for Improving Taxonomy Expansion.
CoRR, 2021

Minimally-Supervised Structure-Rich Text Categorization via Learning on Text-Rich Networks.
Proceedings of the WWW '21: The Web Conference 2021, 2021

MATCH: Metadata-Aware Text Classification in A Large Hierarchy.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Hierarchical Metadata-Aware Document Categorization under Weak Supervision.
Proceedings of the WSDM '21, 2021

BiTe-GCN: A New GCN Architecture via Bidirectional Convolution of Topology and Features on Text-Rich Networks.
Proceedings of the WSDM '21, 2021


SUMDocS: Surrounding-aware Unsupervised Multi-Document Summarization.
Proceedings of the 2021 SIAM International Conference on Data Mining, 2021

Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Shift-Robust GNNs: Overcoming the Limitations of Localized Graph Training data.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Universal Graph Convolutional Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Open-Domain Question Answering with Pre-Constructed Question Spaces.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021

Event Time Extraction and Propagation via Graph Attention Networks.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Demonstrations, 2021

TaxoClass: Hierarchical Multi-Label Text Classification Using Only Class Names.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Document-Level Event Argument Extraction by Conditional Generation.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

On the Transformer Growth for Progressive BERT Training.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

UCPhrase: Unsupervised Context-aware Quality Phrase Tagging.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

On the Power of Pre-Trained Text Representations: Models and Applications in Text Mining.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

TAXOGAN: Hierarchical Network Representation Learning via Taxonomy Guided Generative Adversarial Networks (Extended Abstract).
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding.
Proceedings of the 9th International Conference on Learning Representations, 2021

AS-GCN: Adaptive Semantic Architecture of Graph Convolutional Networks for Text-Rich Networks.
Proceedings of the IEEE International Conference on Data Mining, 2021

Out-of-Category Document Identification Using Target-Category Names as Weak Supervision.
Proceedings of the IEEE International Conference on Data Mining, 2021

ChemNER: Fine-Grained Chemistry Named Entity Recognition with Ontology-Guided Distant Supervision.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Corpus-based Open-Domain Event Type Induction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

The Future is not One-dimensional: Complex Event Schema Induction by Graph Modeling for Event Prediction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Few-Shot Named Entity Recognition: An Empirical Baseline Study.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Training ELECTRA Augmented with Multi-word Selection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Reader-Guided Passage Reranking for Open-Domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Generation-Augmented Retrieval for Open-Domain Question Answering.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Vision Paper: Grand Challenges in Resilience: Autonomous System Resilience through Design and Runtime Measures.
IEEE Open J. Comput. Soc., 2020

Deep multiplex graph infomax: Attentive multiplex network embedding using global information.
Knowl. Based Syst., 2020

Unsupervised Word Embedding Learning by Incorporating Local and Global Contexts.
Frontiers Big Data, 2020

Few-Shot Named Entity Recognition: A Comprehensive Study.
CoRR, 2020

Constrained Abstractive Summarization: Preserving Factual Consistency with Constrained Generation.
CoRR, 2020

BiTe-GCN: A New GCN Architecture via BidirectionalConvolution of Topology and Features on Text-Rich Networks.
CoRR, 2020

COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation.
CoRR, 2020

Partially-Typed NER Datasets Integration: Connecting Practice to Theory.
CoRR, 2020

Automatic Textual Evidence Mining in COVID-19 Literature.
CoRR, 2020

Heterogeneous Network Representation Learning: Survey, Benchmark, Evaluation, and Beyond.
CoRR, 2020

Comprehensive Named Entity Recognition on CORD-19 with Distant or Weak Supervision.
CoRR, 2020

cube2net: Efficient Query-Specific Network Construction with Data Cube Organization.
CoRR, 2020

Multiscale online media simulation with SocialCube.
Comput. Math. Organ. Theory, 2020

Collective Multi-type Entity Alignment Between Knowledge Graphs.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

TaxoExpan: Self-supervised Taxonomy Expansion with Position-Enhanced Graph Neural Network.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

NetTaxo: Automated Topic Taxonomy Construction from Text-Rich Network.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Discriminative Topic Mining via Category-Name Guided Text Embedding.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Guiding Corpus-based Set Expansion by Auxiliary Sets Generation and Co-Expansion.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Generating Representative Headlines for News Stories.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Relation Learning on Social Networks with Multi-Modal Graph Edge Variational Autoencoders.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Inf-VAE: A Variational Autoencoder Framework to Integrate Homophily and Influence in Diffusion Prediction.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Joint Aspect-Sentiment Analysis with Minimal User Guidance.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Minimally Supervised Categorization of Text with Metadata.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Neural Concept Map Generation for Effective Document Classification with Interpretable Structured Summarization.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

FUSE: Multi-faceted Set Expansion by Coherent Clustering of Skip-Grams.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2020

A Data-Driven Graph Generative Model for Temporal Interaction Networks.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

MultiSage: Empowering GCN with Contextualized Multi-Embeddings on Web-Scale Multipartite Networks.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Unsupervised Differentiable Multi-aspect Network Embedding.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Embedding-Driven Multi-Dimensional Topic Mining and Text Analysis.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Octet: Online Catalog Taxonomy Enrichment with Self-Supervision.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

When Do GNNs Work: Understanding and Improving Neighborhood Aggregation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

On the Variance of the Adaptive Learning Rate and Beyond.
Proceedings of the 8th International Conference on Learning Representations, 2020

Co-Embedding Network Nodes and Hierarchical Labels with Taxonomy Based Generative Adversarial Networks.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

Mining Text Outliers in Document Directories.
Proceedings of the 20th IEEE International Conference on Data Mining, 2020

SynSetExpan: An Iterative Framework for Joint Entity Set Expansion and Synonym Discovery.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Near-imperceptible Neural Linguistic Steganography via Self-Adjusting Arithmetic Coding.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Text Classification Using Label Names Only: A Language Model Self-Training Approach.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Understanding the Difficulty of Training Transformers.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Weakly-Supervised Aspect-Based Sentiment Analysis via Joint Aspect-Sentiment Topic Embedding.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Graph Clustering with Embedding Propagation.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Textual Evidence Mining via Spherical Heterogeneous Information Network Embedding.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Pattern-enhanced Named Entity Recognition with Distant Supervision.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

Fine-Grained Named Entity Recognition with Distant Supervision in COVID-19 Literature.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

Empower Entity Set Expansion via Language Model Probing.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

EVIDENCEMINER: Textual Evidence Discovery for Life Sciences.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Facet-Aware Evaluation for Extractive Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Unsupervised Attributed Multiplex Network Embedding.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multidimensional Mining of Massive Text Data
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01914-2, 2019

TextCube: Automated Construction and Multidimensional Exploration.
Proc. VLDB Endow., 2019

Grand Challenges of Resilience: Autonomous System Resilience through Design and Runtime Measures.
CoRR, 2019

CubeNet: Multi-Facet Hierarchical Heterogeneous Network Construction, Analysis, and Mining.
CoRR, 2019

Facet-Aware Evaluation for Extractive Text Summarization.
CoRR, 2019

CatE: Category-Name GuidedWord Embedding.
CoRR, 2019

Raw-to-End Name Entity Recognition in Social Media.
CoRR, 2019

Cross-type biomedical named entity recognition with deep multi-task learning.
Bioinform., 2019

STFNets: Learning Sensing Signals from the Time-Frequency Perspective with Short-Time Fourier Neural Networks.
Proceedings of the World Wide Web Conference, 2019

Place Deduplication with Embeddings.
Proceedings of the World Wide Web Conference, 2019

Integrating Local Context and Global Cohesiveness for Open Information Extraction.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

Arabic Named Entity Recognition: What Works and What's Next.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

Constrained Sequence-to-sequence Semitic Root Extraction for Enriching Word Embeddings.
Proceedings of the Fourth Arabic Natural Language Processing Workshop, 2019

GeoAttn: Localization of Social Media Messages via Attentional Memory Network.
Proceedings of the 2019 SIAM International Conference on Data Mining, 2019

User-Guided Clustering in Heterogeneous Information Networks via Motif-Based Comprehensive Transcription.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

Spherical Text Embedding.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Biomedical Event Extraction based on Knowledge-driven Tree-LSTM.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Constructing and Mining Heterogeneous Information Networks from Massive Text.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

Neural Embedding Propagation on Heterogeneous Networks.
Proceedings of the 2019 IEEE International Conference on Data Mining, 2019

cube2net: Efficient Query-Specific Network Construction with Data Cube Organization.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

CrossWeigh: Training Named Entity Tagger from Imperfect Annotations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Hierarchical Text Classification with Reinforced Label Assignment.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Query-Specific Knowledge Summarization with Entity Evolutionary Networks.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Discovering Hypernymy in Text-Rich Heterogeneous Information Network by Exploiting Context Granularity.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Task-Guided Pair Embedding in Heterogeneous Network.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

From Unstructured Text to TextCube: Automated Construction and Multidimensional Exploration.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Mining News Events from Comparable News Corpora: A Multi-Attribute Proximity Network Modeling Approach.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Parsimonious Morpheme Segmentation with an Application to Enriching Word Embeddings.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Distantly Supervised Biomedical Named Entity Recognition with Dictionary Expansion.
Proceedings of the 2019 IEEE International Conference on Bioinformatics and Biomedicine, 2019

Reliability-aware Dynamic Feature Composition for Name Tagging.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Mining Entity Synonyms with Efficient Neural Set Generation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Weakly-Supervised Hierarchical Text Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Mining Structures of Factual Knowledge from Text: An Effort-Light Approach
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01912-8, 2018

Role Discovery.
Proceedings of the Encyclopedia of Social Network Analysis and Mining, 2nd Edition, 2018

Ranking Methods for Networks.
Proceedings of the Encyclopedia of Social Network Analysis and Mining, 2nd Edition, 2018

Pattern-Growth Methods.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Frequent Itemsets and Association Rules.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Stream Mining.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Data Mining.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Towards Quality Aware Information Integration in Distributed Sensing Systems.
IEEE Trans. Parallel Distributed Syst., 2018

SHINE+: A General Framework for Domain-Specific Entity Linking with Heterogeneous Information Networks.
IEEE Trans. Knowl. Data Eng., 2018

Automated Phrase Mining from Massive Text Corpora.
IEEE Trans. Knowl. Data Eng., 2018

DPPred: An Effective Prediction Framework with Concise Discriminative Patterns.
IEEE Trans. Knowl. Data Eng., 2018

GeoBurst+: Effective and Real-Time Local Event Detection in Geo-Tagged Tweet Streams.
ACM Trans. Intell. Syst. Technol., 2018

pg-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data.
IEEE Trans. Big Data, 2018

Unsupervised meta-path selection for text similarity measure based on heterogeneous information networks.
Data Min. Knowl. Discov., 2018

Higher-Order Clustering in Heterogeneous Information Networks.
CoRR, 2018

Integrating Local Context and Global Cohesiveness for Open Information Extraction.
CoRR, 2018

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling.
CoRR, 2018

Expert Finding in Heterogeneous Bibliographic Networks with Locally-trained Embeddings.
CoRR, 2018

Contrast Subgraph Mining from Coherent Cores.
CoRR, 2018

mvn2vec: Preservation and Collaboration in Multi-View Network Embedding.
CoRR, 2018

Open Information Extraction with Global Structure Constraints.
Proceedings of the Companion of the The Web Conference 2018 on The Web Conference 2018, 2018

Weakly-supervised Relation Extraction by Pattern-enhanced Embedding Learning.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Indirect Supervision for Relation Extraction using Question-Answer Pairs.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Curriculum Learning for Heterogeneous Star Network Embedding via Deep Reinforcement Learning.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

On the Power of Massive Text Data.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Entity Set Search of Scientific Literature: An Unsupervised Ranking Approach.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

AspEm: Embedding Learning by Aspects in Heterogeneous Information Networks.
Proceedings of the 2018 SIAM International Conference on Data Mining, 2018

Annotating gene sets by mining large literature collections with protein networks.
Proceedings of the Biocomputing 2018: Proceedings of the Pacific Symposium, 2018

Similarity Modeling on Heterogeneous Networks via Automatic Path Discovery.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2018

Multi-Domain Integration and Correlation Engine.
Proceedings of the 2018 IEEE Military Communications Conference, 2018

TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

I Know You'll Be Back: Interpretable New User Clustering and Churn Prediction on a Mobile Social Application.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Easing Embedding Learning by Comprehensive Transcription of Heterogeneous Information Networks.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

TruePIE: Discovering Reliable Patterns in Pattern-Based Information Extraction.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Dynamic Truth Discovery on Numerical Data.
Proceedings of the IEEE International Conference on Data Mining, 2018

Meta-Graph Based HIN Spectral Embedding: Methods, Analyses, and Insights.
Proceedings of the IEEE International Conference on Data Mining, 2018

Active Learning on Heterogeneous Information Networks: A Multi-armed Bandit Approach.
Proceedings of the IEEE International Conference on Data Mining, 2018

Doc2Cube: Allocating Documents to Text Cube Without Labeled Data.
Proceedings of the IEEE International Conference on Data Mining, 2018

Did You Enjoy the Ride? Understanding Passenger Experience via Heterogeneous Network Embedding.
Proceedings of the 34th IEEE International Conference on Data Engineering, 2018

Learning Named Entity Tagger using Domain-Specific Dictionary.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Investigating Rumor News Using Agreement-Aware Search.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Weakly-Supervised Neural Text Classification.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Open-Schema Event Profiling for Massive News Corpora.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Pattern Discovery for Wide-Window Open Information Extraction in Biomedical Literature.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

PENNER: Pattern-enhanced Nested Named Entity Recognition in Biomedical Literature.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Open Information Extraction with Meta-pattern Discovery in Biomedical Literature.
Proceedings of the 2018 ACM International Conference on Bioinformatics, 2018

Node, Motif and Subgraph: Leveraging Network Functional Blocks Through Structural Convolution.
Proceedings of the IEEE/ACM 2018 International Conference on Advances in Social Networks Analysis and Mining, 2018

End-to-End Reinforcement Learning for Automatic Taxonomy Induction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

A Spherical Hidden Markov Model for Semantics-Rich Human Mobility Modeling.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Spatiotemporal Activity Modeling Under Data Scarcity: A Graph-Regularized Cross-Modal Embedding Approach.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Empower Sequence Labeling with Task-Aware Neural Language Model.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Phrase Mining from Massive Text and Its Applications
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01910-4, 2017

Quality Threshold Clustering.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Partitional Clustering.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Mean Shift.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Locality Sensitive Hashing Based Clustering.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

<i>K</i>-Way Spectral Clustering.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

<i>K</i>-Medoids Clustering.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

<i>K</i>-Means Clustering.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

Expectation Maximization Clustering.
Proceedings of the Encyclopedia of Machine Learning and Data Mining, 2017

OLAP, Spatial.
Proceedings of the Encyclopedia of GIS., 2017

Embedding Learning with Events in Heterogeneous Information Networks.
IEEE Trans. Knowl. Data Eng., 2017

Detection of Complexes in Biological Networks Through Diversified Dense Subgraph Mining.
J. Comput. Biol., 2017

Graph Clustering with Dynamic Embedding.
CoRR, 2017

Overcoming Limited Supervision in Relation Extraction: A Pattern-enhanced Distributional Representation Approach.
CoRR, 2017

Liberal Entity Extraction: Rapid Construction of Fine-Grained Entity Typing Systems.
Big Data, 2017

Regions, Periods, Activities: Uncovering Urban Dynamics via Cross-Modal Representation Learning.
Proceedings of the 26th International Conference on World Wide Web, 2017

CoType: Joint Extraction of Typed Entities and Relations with Knowledge Bases.
Proceedings of the 26th International Conference on World Wide Web, 2017

Constructing Structured Information Networks from Massive Text Corpora.
Proceedings of the 26th International Conference on World Wide Web Companion, 2017

PRED: Periodic Region Detection for Mobility Modeling of Social Media Users.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Comparative Document Analysis for Large Text Corpora.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Mining Structures from Massive Text Data: A Data-Driven Approach.
Proceedings of the 4th Annual International Symposium on Information Management and Big Data, 2017

Building Structured Databases of Factual Knowledge from Massive Text Corpora.
Proceedings of the 2017 ACM International Conference on Management of Data, 2017

Construction of Structured Heterogeneous Networks from Massive Text Data: Extended Abstract.
Proceedings of the 2nd International Workshop on Network Data Analytics, 2017

ReAct: Online Multimodal Embedding for Recency-Aware Spatiotemporal Activity Modeling.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

Computing Restricted Voronoi Diagram on Graphics Hardware.
Proceedings of the 25th Pacific Conference on Computer Graphics and Applications, 2017

TrioVecEvent: Embedding-Based Online Local Event Detection in Geo-Tagged Tweet Streams.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

PReP: Path-Based Relevance from a Probabilistic Perspective in Heterogeneous Information Networks.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Automatic Synonym Discovery with Knowledge Bases.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

MetaPAD: Meta Pattern Discovery from Massive Text Corpora.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Mining structures from massive text data: will it help software engineering?
Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, 2017

Bringing Semantics to Spatiotemporal Data Mining: Challenges, Methods, and Applications.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Link Prediction across Aligned Networks with Sparse and Low Rank Matrix Estimation.
Proceedings of the 33rd IEEE International Conference on Data Engineering, 2017

Identifying Semantically Deviating Outlier Documents.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Heterogeneous Supervision for Relation Extraction: A Representation Learning Approach.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

ClaimVerif: A Real-time Claim Verification System Using the Web and Fact Databases.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Detecting Multiple Periods and Periodic Patterns in Event Time Sequences.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Distant Meta-Path Similarities for Text-Based Heterogeneous Information Networks.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

An Attention-based Collaboration Framework for Multi-View Network Representation Learning.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Urbanity: A System for Interactive Exploration of Urban Dynamics from Streaming Human Sensing Data.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Unsupervised Concept Categorization and Extraction from Scientific Document Titles.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Life-iNet: A Structured Network-Based Knowledge Exploration and Analytics System for Life Sciences.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Conflicts to Harmony: A Framework for Resolving Conflicts in Heterogeneous Data by Truth Discovery.
IEEE Trans. Knowl. Data Eng., 2016

Recurring and Novel Class Detection Using Class-Based Ensemble for Evolving Data Stream.
IEEE Trans. Knowl. Data Eng., 2016

World Knowledge as Indirect Supervision for Document Clustering.
ACM Trans. Knowl. Discov. Data, 2016

A probabilistic approach to detect mixed periodic patterns from moving object data.
GeoInformatica, 2016

Multi-Dimensional, Phrase-Based Summarization in Text Cubes.
IEEE Data Eng. Bull., 2016

p-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data.
CoRR, 2016

Meta-Path Guided Embedding for Similarity Search in Large-Scale Heterogeneous Information Networks.
CoRR, 2016

Towards Mobile Query Auto-Completion: An Efficient Mobile Application-Aware Approach.
Proceedings of the 25th International Conference on World Wide Web, 2016

Automatic Entity Recognition and Typing in Massive Text Corpora.
Proceedings of the 25th International Conference on World Wide Web, 2016

Representing Documents via Latent Keyphrase Inference.
Proceedings of the 25th International Conference on World Wide Web, 2016

Entity Disambiguation with Linkless Knowledge Bases.
Proceedings of the 25th International Conference on World Wide Web, 2016

Modeling Check-in Preferences with Multidimensional Knowledge: A Minimax Entropy Approach.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Knowledge Fragment Enrichment Using Domain Knowledge Base.
Proceedings of the Social Media Processing - 5th National Conference, 2016

Automatic Entity Recognition and Typing in Massive Text Data.
Proceedings of the 2016 International Conference on Management of Data, 2016

GeoBurst: Real-Time Local Event Detection in Geo-Tagged Tweet Streams.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

RelSim: Relation Similarity Search in Schema-Rich Heterogeneous Information Networks.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

DPClass: An Effective but Concise Discriminative Patterns-Based Classification Framework.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

MACFP: Maximal Approximate Consecutive Frequent Pattern Mining under Edit Distance.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

Complexes Detection in Biological Networks via Diversified Dense Subgraphs Mining.
Proceedings of the Research in Computational Molecular Biology - 20th Annual Conference, 2016


GMove: Group-Level Mobility Modeling Using Geo-Tagged Social Media.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

From Truth Discovery to Trustworthy Opinion Discovery: An Uncertainty-Aware Quantitative Modeling Approach.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

CatchTartan: Representing and Summarizing Dynamic Multicontextual Behaviors.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Mining Reliable Information from Passively and Actively Crowdsourced Data.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Collaborative Multi-Level Embedding Learning from Reviews for Rating Prediction.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Hostname Preference to Enhance Search Relevance.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Community Cores: Removing Size Bias from Community Detection.
Proceedings of the Tenth International Conference on Web and Social Media, 2016

Towards Faster Rates and Oracle Property for Low-Rank Matrix Estimation.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Large-Scale Embedding Learning in Heterogeneous Event Data.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

AFET: Automatic Fine-Grained Entity Typing by Hierarchical Partial-Label Embedding.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

FastHybrid: A Hybrid Model for Efficient Answer Selection.
Proceedings of the COLING 2016, 2016

Data-Driven Contextual Valence Shifter Quantification for Multi-Theme Sentiment Analysis.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

FacetGist: Collective Extraction of Document Facets in Large Technical Corpora.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

On the power of big data: Mining structures from massive, unstructured text data.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Mining Discriminative Patterns to Predict Health Status for Cardiopulmonary Patients.
Proceedings of the 7th ACM International Conference on Bioinformatics, 2016

On predicting social unrest using social media.
Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2016

Downside management in recommender systems.
Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2016

Liberal Event Extraction and Event Schema Induction.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Text Classification with Heterogeneous Information Network Kernels.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

EKNOT: Event Knowledge from News and Opinions in Twitter.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Mining Latent Entity Structures
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01907-4, 2015

Power-Based Diagnosis of Node Silence in Remote High-End Sensing Systems.
ACM Trans. Sens. Networks, 2015

Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions.
IEEE Trans. Knowl. Data Eng., 2015

ePeriodicity: Mining Event Periodicity from Incomplete Observations.
IEEE Trans. Knowl. Data Eng., 2015

A Unifying Framework of Mining Trajectory Patterns of Various Temporal Tightness.
IEEE Trans. Knowl. Data Eng., 2015

A Framework of Mining Trajectories from Untrustworthy Data in Cyber-Physical System.
ACM Trans. Knowl. Discov. Data, 2015

Mining Trajectory Data and Geotagged Data in Social Media for Road Map Inference.
Trans. GIS, 2015

A Survey on Truth Discovery.
SIGKDD Explor., 2015

Truth Discovery and Crowdsourcing Aggregation: A Unified Perspective.
Proc. VLDB Endow., 2015

Constructing topical hierarchies in heterogeneous information networks.
Knowl. Inf. Syst., 2015

KnowEnG: a knowledge engine for genomics.
J. Am. Medical Informatics Assoc., 2015

Mining strong relevance between heterogeneous entities from unstructured biomedical data.
Data Min. Knowl. Discov., 2015

Leveraging Pattern Semantics for Extracting Entities in Enterprises.
Proceedings of the 24th International Conference on World Wide Web, 2015

Network A/B Testing: From Sampling to Estimation.
Proceedings of the 24th International Conference on World Wide Web, 2015

On Integrating Network and Community Discovery.
Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, 2015

Mining Quality Phrases from Massive Text Corpora.
Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, Melbourne, Victoria, Australia, May 31, 2015

adaQAC: Adaptive Query Auto-Completion via Implicit Negative Feedback.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Graph Regularized Meta-path Based Transductive Regression in Heterogeneous Information Network.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

GIN: A Clustering Model for Capturing Dual Heterogeneity in Networked Data.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

Robust Classification of Information Networks by Consistent Graph Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Fast Inbound Top-K Query for Random Walk with Restart.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

Debiasing Crowdsourced Batches.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Modeling Truth Existence in Truth Discovery.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Assembler: Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Incorporating World Knowledge to Document Clustering via Heterogeneous Information Networks.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Towards Interactive Construction of Topical Hierarchy: A Recursive Tensor Decomposition Approach.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Automatic Entity Recognition and Typing from Massive Text Corpora: A Phrase and Network Mining Approach.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

FaitCrowd: Fine Grained Truth Discovery for Crowdsourced Data Aggregation.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

On the Discovery of Evolving Truth.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

TOPTRAC: Topical Trajectory Pattern Mining.
Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

Constrained Information-Theoretic Tripartite Graph Clustering to Identify Semantically Similar Relations.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Mining Multi-aspect Reflection of News Events in Twitter: Discovery, Linking and Presentation.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

KnowSim: A Document Similarity Measure on Structured Heterogeneous Information Networks.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

STREAMCUBE: Hierarchical spatio-temporal hashtag clustering for event exploration over the Twitter stream.
Proceedings of the 31st IEEE International Conference on Data Engineering, 2015

Query-Based Outlier Detection in Heterogeneous Information Networks.
Proceedings of the 18th International Conference on Extending Database Technology, 2015

Classification with Active Learning and Meta-Paths in Heterogeneous Information Networks.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

Context-aware Entity Morph Decoding.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Successful Data Mining Methods for NLP.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Outlier Detection for Temporal Data
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01905-0, 2014

Role Discovery.
Encyclopedia of Social Network Analysis and Mining, 2014

Ranking Methods for Networks.
Encyclopedia of Social Network Analysis and Mining, 2014

Troubleshooting interactive complexity bugs in wireless sensor networks using data mining techniques.
ACM Trans. Sens. Networks, 2014

Co-Evolution of Multi-Typed Objects in Dynamic Star Networks.
IEEE Trans. Knowl. Data Eng., 2014

Outlier Detection for Temporal Data: A Survey.
IEEE Trans. Knowl. Data Eng., 2014

Uncovering deception in social media.
Soc. Netw. Anal. Min., 2014

Exploring and inferring user-user pseudo-friendship for sentiment analysis with heterogeneous networks.
Stat. Anal. Data Min., 2014

Splitter: Mining Fine-Grained Sequential Patterns in Semantic Trajectories.
Proc. VLDB Endow., 2014

MoveMine 2.0: Mining Object Relationships from Movement Data.
Proc. VLDB Endow., 2014

A Confidence-Aware Approach for Truth Discovery on Long-Tail Data.
Proc. VLDB Endow., 2014

Scalable Topical Phrase Mining from Text Corpora.
Proc. VLDB Endow., 2014

Scalable and Robust Construction of Topical Hierarchies.
CoRR, 2014

A network-assisted co-clustering algorithm to discover cancer subtypes based on gene expression.
BMC Bioinform., 2014

Learning to efficiently rank on big data.
Proceedings of the 23rd International World Wide Web Conference, 2014

A hierarchical Dirichlet model for taxonomy expansion for search engines.
Proceedings of the 23rd International World Wide Web Conference, 2014

Personalized entity recommendation: a heterogeneous information network approach.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

On building entity recommender systems using user click log and freebase knowledge.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Heterogeneous graph-based intent learning with queries, web pages and Wikipedia concepts.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Leveraging periodicity in human mobility for next place prediction.
Proceedings of the IEEE Wireless Communications and Networking Conference, 2014

Batch-Mode Active Learning via Error Bound Minimization.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

NewsNetExplorer: automatic construction and exploration of news information networks.
Proceedings of the International Conference on Management of Data, 2014

A probabilistic model for linking named entities in web text with heterogeneous information networks.
Proceedings of the International Conference on Management of Data, 2014

Resolving conflicts in heterogeneous data by truth discovery and source reliability estimation.
Proceedings of the International Conference on Management of Data, 2014

Mining latent entity structures from massive unstructured and interconnected data.
Proceedings of the International Conference on Management of Data, 2014

Local Learning for Mining Outlier Subgraphs from Network Datasets.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Automatic Construction and Ranking of Topical Keyphrases on Collections of Short Documents.
Proceedings of the 2014 SIAM International Conference on Data Mining, 2014

Generalized Decision Aggregation in Distributed Sensing Systems.
Proceedings of the IEEE 35th IEEE Real-Time Systems Symposium, 2014

Scalable Moment-Based Inference for Latent Dirichlet Allocation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2014

Robust Tensor Decomposition with Gross Corruption.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Towards scalable critical alert mining.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

ClusCite: effective citation recommendation by information network-based clustering.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Bringing structure to text: mining phrases, entities, topics, and hierarchies.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Mining Query-Based Subnetwork Outliers in Heterogeneous Information Networks.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Online Spectral Learning on a Graph with Bandit Feedback.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Top-K interesting subgraph discovery in information networks.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Privacy Risk in Anonymized Heterogeneous Information Networks.
Proceedings of the 17th International Conference on Extending Database Technology, 2014

The Wisdom of Minority: Unsupervised Slot Filling Validation based on Multi-dimensional Truth-Finding.
Proceedings of the COLING 2014, 2014

Modeling Topic Diffusion in Multi-Relational Bibliographic Information Networks.
Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, 2014

Regularized l1-Graph for Data Clustering.
Proceedings of the British Machine Vision Conference, 2014

Be Appropriate and Funny: Automatic Entity Morph Encoding.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Sequential Pattern Mining.
Proceedings of the Frequent Pattern Mining, 2014

Pattern-Growth Methods.
Proceedings of the Frequent Pattern Mining, 2014

Mining Graph Patterns.
Proceedings of the Frequent Pattern Mining, 2014

Probabilistic Models for Classification.
Proceedings of the Data Classification: Algorithms and Applications, 2014

Active Learning: A Survey.
Proceedings of the Data Classification: Algorithms and Applications, 2014

2013
Perspective on Measurement Metrics for Community Detection Algorithms.
Proceedings of the Mining Social Networks and Security Informatics, 2013

The parallel path framework for entity discovery on the web.
ACM Trans. Web, 2013

On Exploiting Transient Social Contact Patterns for Data Forwarding in Delay-Tolerant Networks.
IEEE Trans. Mob. Comput., 2013

Classification and Adaptive Novel Class Detection of Feature-Evolving Data Streams.
IEEE Trans. Knowl. Data Eng., 2013

Reinforced Similarity Integration in Image-Rich Information Networks.
IEEE Trans. Knowl. Data Eng., 2013

Revealing Density-Based Clustering Structure from the Core-Connected Tree of a Network.
IEEE Trans. Knowl. Data Eng., 2013

A Graph-Based Consensus Maximization Approach for Combining Multiple Supervised and Unsupervised Models.
IEEE Trans. Knowl. Data Eng., 2013

PathSelClus: Integrating Meta-Path Selection with User-Guided Object Clustering in Heterogeneous Information Networks.
ACM Trans. Knowl. Discov. Data, 2013

Parallel Field Ranking.
ACM Trans. Knowl. Discov. Data, 2013

A framework of traveling companion discovery on trajectory data streams.
ACM Trans. Intell. Syst. Technol., 2013

MiTexCube: MicroTextCluster Cube for online analysis of text cells and its applications.
Stat. Anal. Data Min., 2013

Trustworthiness analysis of sensor data in cyber-physical systems.
J. Comput. Syst. Sci., 2013

KERT: Automatic Extraction and Ranking of Topical Keyphrases from Content-Representative Document Titles.
CoRR, 2013

Mining collective intelligence in diverse groups.
Proceedings of the 22nd International World Wide Web Conference, 2013

Exploring structure and content on the web: extraction and integration of the semi-structured web.
Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, 2013

Research-insight: providing insight on research by publication network analysis.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Multi-View Clustering via Joint Nonnegative Matrix Factorization.
Proceedings of the 13th SIAM International Conference on Data Mining, 2013

On the Detectability of Node Grouping in Networks.
Proceedings of the 13th SIAM International Conference on Data Mining, 2013

A Text Cube Approach to Human, Social and Cultural Behavior in the Twitter Stream.
Proceedings of the Social Computing, Behavioral-Cultural Modeling and Prediction, 2013

Recommendation in heterogeneous information networks with implicit user feedback.
Proceedings of the Seventh ACM Conference on Recommender Systems, 2013

Community Distribution Outlier Detection in Heterogeneous Information Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2013

Social patterns: Community detection using behavior-generated network datasets.
Proceedings of the 2nd IEEE Network Science Workshop, 2013

Research on the Construction of Dynamic Chinese Natural Sign Language Corpus.
Proceedings of the IEEE 9th International Conference on Mobile Ad-hoc and Sensor Networks, 2013

A phrase mining framework for recursive construction of a topical hierarchy.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

EventCube: multi-dimensional search and mining of structured and text data.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Mining lines in the sand: on trajectory discovery from untrustworthy data in cyber-physical system.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Ranking-based name matching for author disambiguation in bibliographic data.
Proceedings of the 2013 KDD Cup 2013 Workshop, 2013

Mining evidences for named entity disambiguation.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Spatiotemporal periodical pattern mining in traffic data.
Proceedings of the 2nd ACM SIGKDD International Workshop on Urban Computing, 2013

Selective sampling on graphs for classification.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

AMETHYST: a system for mining and exploring topical hierarchies of heterogeneous data.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Large-Scale Spectral Clustering on Graphs.
Proceedings of the IJCAI 2013, 2013

Constructing Topical Hierarchies in Heterogeneous Information Networks.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Semantic Frame-Based Document Representation for Comparable Corpora.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

A Feature-Enhanced Ranking-Based Classifier for Multimodal Data and Heterogeneous Information Networks.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

gIceberg: Towards iceberg analysis in large graphs.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

Inferring human mobility patterns from taxicab location traces.
Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2013

Content coverage maximization on word networks for hierarchical topic summarization.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Channeling the deluge: research challenges for big data and information systems.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

An exploration of discussion threads in social news sites: a case study of the Reddit community.
Proceedings of the Advances in Social Networks Analysis and Mining 2013, 2013

On detecting association-based clique outliers in heterogeneous information networks.
Proceedings of the Advances in Social Networks Analysis and Mining 2013, 2013

Clustered Support Vector Machines.
Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 2013

Unsupervised Link Selection in Networks.
Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics, 2013

MedRank: Discovering Influential Medical Treatments from Literature by Information Network Analysis.
Proceedings of the Twenty-Fourth Australasian Database Conference, 2013

Resolving Entity Morphs in Censored Data.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

A Survey of RFID Data Processing.
Proceedings of the Managing and Mining Sensor Data, 2013

A Survey of Datamining Methods for Sensor Network Bug Diagnosis.
Proceedings of the Managing and Mining Sensor Data, 2013

Efficient Direct Mining of Selective Discriminative Patterns for Classification.
Proceedings of the Contrast Data Mining: Concepts, Algorithms, and Applications, 2013

Spectral Clustering.
Proceedings of the Data Clustering: Algorithms and Applications, 2013

Probabilistic Models for Clusterings.
Proceedings of the Data Clustering: Algorithms and Applications, 2013

2012
Mining Heterogeneous Information Networks: Principles and Methodologies
Synthesis Lectures on Data Mining and Knowledge Discovery, Morgan & Claypool Publishers, ISBN: 978-3-031-01902-9, 2012

Locally Discriminative Coclustering.
IEEE Trans. Knowl. Data Eng., 2012

Latent Community Topic Analysis: Integration of Community Discovery with Topic Modeling.
ACM Trans. Intell. Syst. Technol., 2012

Mining heterogeneous information networks: a structural analysis approach.
SIGKDD Explor., 2012

A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration.
Proc. VLDB Endow., 2012

Mining Knowledge from Interconnected Data: A Heterogeneous Information Network Analysis Approach.
Proc. VLDB Endow., 2012

Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes.
Proc. VLDB Endow., 2012

Uncertainty Reduction for Knowledge Discovery and Information Extraction on the World Wide Web.
Proc. IEEE, 2012

Mining diversity on social media networks.
Multim. Tools Appl., 2012

A Variance Minimization Criterion to Active Learning on Graphs.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

Locality Preserving Feature Learning.
Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, 2012

RankCompete: Simultaneous ranking and clustering of information networks.
Neurocomputing, 2012

Multidimensional Sensor Data Analysis in Cyber-Physical System: An Atypical Cube Approach.
Int. J. Distributed Sens. Networks, 2012

Learning influence from heterogeneous social networks.
Data Min. Knowl. Discov., 2012

Mining periodic behaviors of object movements for animal and biological sustainability studies.
Data Min. Knowl. Discov., 2012

BlueFinder: estimate where a beach photo was taken.
Proceedings of the 21st World Wide Web Conference, 2012

When will it happen?: relationship prediction in heterogeneous information networks.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Construction of Web-Based, Service-Oriented Information Networks: A Data Mining Perspective - (Abstract).
Proceedings of the Web-Age Information Management - 13th International Conference, 2012

SocialCube: A Text Cube Framework for Analyzing Social Media Data.
Proceedings of the 2012 International Conference on Social Informatics (SocialInformatics), 2012

Optimizing index for taxonomy keyword search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2012

Citation Prediction in Heterogeneous Bibliographic Networks.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Learning Hierarchical Relationships among Partially Ordered Objects with Heterogeneous Attributes and Links.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

IntruMine: Mining Intruders in Untrustworthy Data of Cyber-physical Systems.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Evaluating Event Credibility on Twitter.
Proceedings of the Twelfth SIAM International Conference on Data Mining, 2012

Quality of Information Based Data Selection and Transmission in Wireless Sensor Networks.
Proceedings of the 33rd IEEE Real-Time Systems Symposium, 2012

Community Trend Outlier Detection Using Soft Temporal Pattern Mining.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2012

Selective Labeling via Error Bound Minimization.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Query-driven discovery of semantically similar substructures in heterogeneous networks.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Integrating meta-path selection with user-guided object clustering in heterogeneous information networks.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Event-based social networks: linking the online and offline social worlds.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Mining event periodicity from incomplete observations.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Mining heterogeneous information networks: the next frontier.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Integrating community matching and outlier detection for mining evolutionary community outliers.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

Modeling and exploiting heterogeneous bibliographic networks for expertise ranking.
Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries, 2012

A Simple Algorithm for Semi-supervised Learning with Improved Generalization Error Bound.
Proceedings of the 29th International Conference on Machine Learning, 2012

Predicting Links in Multi-relational and Heterogeneous Networks.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

ETM: Entity Topic Models for Mining Documents Associated with Entities.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Towards Active Learning on Graphs: An Error Bound Minimization Approach.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Stream Classification with Recurring and Novel Class Detection Using Class-Based Ensemble.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

On Discovery of Traveling Companions from Streaming Trajectories.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Multidimensional Analysis of Atypical Events in Cyber-Physical Data.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Mining Knowledge from Data: An Information Network Analysis Approach.
Proceedings of the IEEE 28th International Conference on Data Engineering (ICDE 2012), 2012

Building enriched web page representations using link paths.
Proceedings of the 23rd ACM Conference on Hypertext and Social Media, 2012

Tweet Ranking Based on Heterogeneous Networks.
Proceedings of the COLING 2012, 2012

Analysis and Refinement of Cross-Lingual Entity Linking.
Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics, 2012

User guided entity similarity search using meta-path selection in heterogeneous information networks.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Document-topic hierarchies from document graphs.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Efficient mining of correlated sequential patterns based on null hypothesis.
Proceedings of the 2012 international workshop on Web-scale knowledge representation, 2012

Predicting future popularity trend of events in microblogging platforms.
Proceedings of the Information, Interaction, Innovation: Celebrating the Past, Constructing the Present and Creating the Future, 2012

Probabilistic Models for Text Mining.
Proceedings of the Mining Text Data, 2012

2011
Speed up kernel discriminant analysis.
VLDB J., 2011

Cloud-based malware detection for evolving data streams.
ACM Trans. Manag. Inf. Syst., 2011

Classification and Novel Class Detection in Concept-Drifting Data Streams under Time Constraints.
IEEE Trans. Knowl. Data Eng., 2011

Mining Discriminative Patterns for Classifying Trajectories on Road Networks.
IEEE Trans. Knowl. Data Eng., 2011

Laplacian Regularized Gaussian Mixture Model for Data Clustering.
IEEE Trans. Knowl. Data Eng., 2011

Efficient Keyword-Based Search for Top-K Cells in Text Cube.
IEEE Trans. Knowl. Data Eng., 2011

Locally Consistent Concept Factorization for Document Clustering.
IEEE Trans. Knowl. Data Eng., 2011

Collection-based sparse label propagation and its application on social group suggestion from photos.
ACM Trans. Intell. Syst. Technol., 2011

MoveMine: Mining moving object data for discovery of animal movement patterns.
ACM Trans. Intell. Syst. Technol., 2011

Survey on web spam detection: principles and algorithms.
SIGKDD Explor., 2011

Heterogeneous network-based trust analysis: a survey.
SIGKDD Explor., 2011

A general framework for efficient clustering of large datasets based on activity detection.
Stat. Anal. Data Min., 2011

Mining Top-K Large Structural Patterns in a Massive Network.
Proc. VLDB Endow., 2011

PathSim: Meta Path-Based Top-K Similarity Search in Heterogeneous Information Networks.
Proc. VLDB Endow., 2011

On Link-based Similarity Join.
Proc. VLDB Endow., 2011

SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks.
Proc. VLDB Endow., 2011

Mining Flipping Correlations from Large Datasets with Taxonomies.
Proc. VLDB Endow., 2011

Graph Regularized Nonnegative Matrix Factorization for Data Representation.
IEEE Trans. Pattern Anal. Mach. Intell., 2011

Facing the reality of data stream classification: coping with scarcity of labeled data.
Knowl. Inf. Syst., 2011

Methods for mining frequent items in data streams: an overview.
Knowl. Inf. Syst., 2011

Geographical topic discovery and comparison.
Proceedings of the 20th International Conference on World Wide Web, 2011

Growing parallel paths for entity-page discovery.
Proceedings of the 20th International Conference on World Wide Web, 2011

Trust analysis with clustering.
Proceedings of the 20th International Conference on World Wide Web, 2011

HyLiEn: a hybrid approach to general list extraction on the web.
Proceedings of the 20th International Conference on World Wide Web, 2011

Generalized Fisher Score for Feature Selection.
Proceedings of the UAI 2011, 2011

CUNY-UIUC-SRI TAC-KBP2011 Entity Linking System Description.
Proceedings of the Fourth Text Analysis Conference, 2011

Retrieving k-Nearest Neighboring Trajectories by a Set of Point Locations.
Proceedings of the Advances in Spatial and Temporal Databases, 2011

Mining Significant Time Intervals for Relationship Detection.
Proceedings of the Advances in Spatial and Temporal Databases, 2011

Finding Top-k Shortest Path Distance Changes in an Evolutionary Network.
Proceedings of the Advances in Spatial and Temporal Databases, 2011

Graph cube: on warehousing and OLAP multidimensional networks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

WINACS: construction and analysis of web-based computer science information networks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Differentially private data cubes: optimizing noise sources and consistency.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

Learning online discussion structures by conditional random fields.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Learning relevance from heterogeneous social network and its application in online targeting.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Authorship classification: a discriminative syntactic tree mining approach.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Learning search tasks in queries and web pages via graph regularization.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Collective topic modeling for heterogeneous networks.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Hierarchical aggregate classification with limited supervision for data reduction in wireless sensor networks.
Proceedings of the 9th International Conference on Embedded Networked Sensor Systems, 2011

Diversified Trajectory Pattern Ranking in Geo-tagged Social Media.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Unsupervised Disaggregation of Low Frequency Power Measurements.
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011

Construction and Analysis of Web-Based Computer Science Information Networks.
Proceedings of the Rough Sets, Fuzzy Sets, Data Mining and Granular Computing, 2011

Efficient Mining of Top Correlated Patterns Based on Null-Invariant Measures.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

Linear Discriminant Dimensionality Reduction.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

LikeMiner: a system for mining the power of 'like' in social media networks.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Ranking-based classification of heterogeneous information networks.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

Probabilistic topic models with biased propagation on heterogeneous information networks.
Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011

FAST Sequence Mining Based on Sparse Id-Lists.
Proceedings of the Foundations of Intelligent Systems - 19th International Symposium, 2011

Power watermarking: Facilitating power-based diagnosis of node silence in remote high-end sensing systems.
Proceedings of the 10th International Conference on Information Processing in Sensor Networks, 2011

Apollo: Towards factfinding in participatory sensing.
Proceedings of the 10th International Conference on Information Processing in Sensor Networks, 2011

DustDoctor: A self-healing sensor data collection system.
Proceedings of the 10th International Conference on Information Processing in Sensor Networks, 2011

Consensus extraction from heterogeneous detectors to improve performance over network traffic anomaly detection.
Proceedings of the INFOCOM 2011. 30th IEEE International Conference on Computer Communications, 2011

Joint Feature Selection and Subspace Learning.
Proceedings of the IJCAI 2011, 2011

On Trivial Solution and Scale Transfer Problems in Graph Regularized NMF.
Proceedings of the IJCAI 2011, 2011

Extracting General Lists from Web Documents: A Hybrid Approach.
Proceedings of the Modern Approaches in Applied Intelligence, 2011

Exposing Complex Bug-Triggering Conditions in Distributed Systems via Graph Mining.
Proceedings of the International Conference on Parallel Processing, 2011

LPTA: A Probabilistic Model for Latent Periodic Topic Analysis.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Detecting Recurring and Novel Classes in Concept-Drifting Data Streams.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

The Joint Inference of Topic Diffusion and Evolution in Social Communities.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Signature Pattern Covering via Local Greedy Algorithm and Pattern Shrink.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Patent Maintenance Recommendation with Patent Information Network Model.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

A Spectral Framework for Detecting Inconsistency across Multi-source Object Relationships.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Bidirectional mining of non-redundant recurrent rules from a sequence database.
Proceedings of the 27th International Conference on Data Engineering, 2011

The Sparse Regression Cube: A Reliable Modeling Technique for Open Cyber-Physical Systems.
Proceedings of the 2011 IEEE/ACM International Conference on Cyber-Physical Systems, 2011

On Bayesian interpretation of fact-finding in information networks.
Proceedings of the 14th International Conference on Information Fusion, 2011

Efficient Topological OLAP on Information Networks.
Proceedings of the Database Systems for Advanced Applications, 2011

TEXplorer: keyword-based object search and exploration in multidimensional text databases.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Correlated multi-label feature selection.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Towards feature selection in network.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

MiTexCube: MicroTextCluster Cube for Online Analysis of Text Cells.
Proceedings of the 2011 Conference on Intelligent Data Understanding, 2011

Geo-Friends Recommendation in GPS-based Cyber-physical Social Network.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

Is Objective Function the Silver Bullet? A Case Study of Community Detection Algorithms on Social Networks.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

Dynamic Social Influence Analysis through Time-Dependent Factor Graphs.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

Co-author Relationship Prediction in Heterogeneous Bibliographic Networks.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

Evolutionary Clustering and Analysis of Bibliographic Networks.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

SCENE: Structural Conversation Evolution NEtwork.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2011

Learning a Kernel for Multi-Task Clustering.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

An Overview of Social Tagging and Applications.
Proceedings of the Social Network Data Analytics, 2011

Data Mining: Concepts and Techniques, 3rd edition
Morgan Kaufmann, ISBN: 978-0123814791, 2011

2010
Graph Indexing.
Proceedings of the Managing and Mining Graph Data, 2010

Mining Graph Patterns.
Proceedings of the Managing and Mining Graph Data, 2010

Quality Threshold Clustering.
Proceedings of the Encyclopedia of Machine Learning, 2010

Partitional Clustering.
Proceedings of the Encyclopedia of Machine Learning, 2010

Mean Shift.
Proceedings of the Encyclopedia of Machine Learning, 2010

Locality Sensitive Hashing Based Clustering.
Proceedings of the Encyclopedia of Machine Learning, 2010

<i>K</i>-Way Spectral Clustering.
Proceedings of the Encyclopedia of Machine Learning, 2010

<i>K</i>-Medoids Clustering.
Proceedings of the Encyclopedia of Machine Learning, 2010

<i>K</i>-Means Clustering.
Proceedings of the Encyclopedia of Machine Learning, 2010

Expectation Maximization Clustering.
Proceedings of the Encyclopedia of Machine Learning, 2010

Mining Concept-Drifting Data Streams.
Proceedings of the Data Mining and Knowledge Discovery Handbook, 2nd ed., 2010

Modeling Massive RFID Data Sets: A Gateway-Based Movement Graph Approach.
IEEE Trans. Knowl. Data Eng., 2010

Unexpected results in automatic list extraction on the web.
SIGKDD Explor., 2010

Survey on social tagging techniques.
SIGKDD Explor., 2010

Discriminative frequent subgraph mining with optimality guarantees.
Stat. Anal. Data Min., 2010

On Graph Query Optimization in Large Networks.
Proc. VLDB Endow., 2010

Swarm: Mining Relaxed Temporal Moving Object Clusters.
Proc. VLDB Endow., 2010

The Inverse Classification Problem.
J. Comput. Sci. Technol., 2010

Re-examination of interestingness measures in pattern mining: a unified framework.
Data Min. Knowl. Discov., 2010

Social group suggestion from user image collections.
Proceedings of the 19th International Conference on World Wide Web, 2010

LINKREC: a unified framework for link recommendation with user attributes and graph structure.
Proceedings of the 19th International Conference on World Wide Web, 2010

CETR: content extraction via tag ratios.
Proceedings of the 19th International Conference on World Wide Web, 2010

Entity relation discovery from web tables and links.
Proceedings of the 19th International Conference on World Wide Web, 2010

Keyword extraction for social snippets.
Proceedings of the 19th International Conference on World Wide Web, 2010

Topic initiator detection on the world wide web.
Proceedings of the 19th International Conference on World Wide Web, 2010

iRIN: image retrieval in image-rich information networks.
Proceedings of the 19th International Conference on World Wide Web, 2010

RankCompete: simultaneous ranking and clustering of web photos.
Proceedings of the 19th International Conference on World Wide Web, 2010

MoveMine: mining moving object databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Mining knowledge from databases: an information network analysis approach.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2010

Privacy-aware regression modeling of participatory sensing data.
Proceedings of the 8th International Conference on Embedded Networked Sensor Systems, 2010

Classification and Novel Class Detection of Data Streams in a Dynamic Feature Space.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

NDPMine: Efficiently Mining Discriminative Numerical Features for Pattern-Based Classification.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Graph Regularized Transductive Classification on Heterogeneous Information Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Structure Is Informative: On Mining Structured Information Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Classification and Novel Class Detection in Data Streams with Active Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

Hierarchical Web-Page Clustering via In-Page and Cross-Page Link Structures.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

The wisdom of social multimedia: using flickr for prediction and forecast.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Community evolution detection in dynamic heterogeneous information networks.
Proceedings of the Eighth Workshop on Mining and Learning with Graphs, 2010

Mining advisor-advisee relationships from research publication networks.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

PET: a statistical model for popular events tracking in social communities.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

Mining periodic behaviors for moving objects.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

On community outliers and their efficient detection in information networks.
Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2010

DIAMOND: Correlation-Based Anomaly Monitoring Daemon for DIME.
Proceedings of the 12th IEEE International Symposium on Multimedia, 2010

Diagnostic powertracing for sensor node failure analysis.
Proceedings of the 9th International Conference on Information Processing in Sensor Networks, 2010

Tru-Alarm: Trustworthiness Analysis of Sensor Networks in Cyber-Physical Systems.
Proceedings of the ICDM 2010, 2010

gSkeletonClu: Density-Based Network Clustering via Structure-Connected Tree Division or Agglomeration.
Proceedings of the ICDM 2010, 2010

Addressing Concept-Evolution in Concept-Drifting Data Streams.
Proceedings of the ICDM 2010, 2010

Top-K aggregation queries over large networks.
Proceedings of the 26th International Conference on Data Engineering, 2010

TopCells: Keyword-based search of top-k aggregated documents in text cube.
Proceedings of the 26th International Conference on Data Engineering, 2010

Progressive clustering of networks using Structure-Connected Order of Traversal.
Proceedings of the 26th International Conference on Data Engineering, 2010

A worldwide tourism recommendation system based on geotaggedweb photos.
Proceedings of the IEEE International Conference on Acoustics, 2010

Region-based online promotion analysis.
Proceedings of the EDBT 2010, 2010

Fast computation of SimRank for static and dynamic information networks.
Proceedings of the EDBT 2010, 2010

Aggregation of Multiple Judgments for Evaluating Ordered Lists.
Proceedings of the Advances in Information Retrieval, 2010

Mining Diversity on Networks.
Proceedings of the Database Systems for Advanced Applications, 2010

Incremental Clustering for Trajectories.
Proceedings of the Database Systems for Advanced Applications, 2010

Mining Moving Object, Trajectory and Traffic Data.
Proceedings of the Database Systems for Advanced Applications, 2010

Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions.
Proceedings of the COLING 2010, 2010

Mapping web pages to database records via link paths.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Mining topic-level influence in heterogeneous networks.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Visual cube and on-line analytical processing of images.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

SHRINK: a structural clustering algorithm for detecting hierarchical communities in networks.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Keyword Search in Text Cube: Finding Top-k Aggregated Cell Documents.
Proceedings of the 2010 Conference on Intelligent Data Understanding, 2010

A Unified Framework for Link Recommendation Using Random Walks.
Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, 2010

Veracity Analysis and Object Distinction.
Proceedings of the Link Mining: Models, Algorithms, and Applications, 2010

Scalable Link-Based Similarity Computation and Clustering.
Proceedings of the Link Mining: Models, Algorithms, and Applications, 2010

Integrating Clustering with Ranking in Heterogeneous Information Networks Analysis.
Proceedings of the Link Mining: Models, Algorithms, and Applications, 2010

Discriminative Frequent Pattern-Based Graph Classification.
Proceedings of the Link Mining: Models, Algorithms, and Applications, 2010

InfoNetOLAP: OLAP and Mining of Information Networks.
Proceedings of the Link Mining: Models, Algorithms, and Applications, 2010

Mining Large Information Networks by Graph Summarization.
Proceedings of the Link Mining: Models, Algorithms, and Applications, 2010

Ranking Gradients in Multi-Dimensional Spaces.
Proceedings of the Complex Data Warehousing and Knowledge Discovery for Advanced Retrieval Development, 2010

2009
Stream Mining.
Proceedings of the Encyclopedia of Database Systems, 2009

Data Mining.
Proceedings of the Encyclopedia of Database Systems, 2009

Pattern-Growth Methods.
Proceedings of the Encyclopedia of Database Systems, 2009

Frequent Itemsets and Association Rules.
Proceedings of the Encyclopedia of Database Systems, 2009

Topic modeling for OLAP on multidimensional text databases: topic cube and its applications.
Stat. Anal. Data Min., 2009

iNextCube: Information Network-Enhanced Text Cube.
Proc. VLDB Endow., 2009

Promotion Analysis in Multi-Dimensional Space.
Proc. VLDB Endow., 2009

A Particle-and-Density Based Evolutionary Clustering Method for Dynamic Networks.
Proc. VLDB Endow., 2009

Mining Graph Patterns Efficiently via Randomized Summaries.
Proc. VLDB Endow., 2009

Graph OLAP: a multi-dimensional framework for graph data analysis.
Knowl. Inf. Syst., 2009

Top-down mining of frequent closed patterns from very high dimensional data.
Inf. Sci., 2009

Exploring optimization of semantic relationship graph for multi-relational Bayesian classification.
Decis. Support Syst., 2009

MLR-Index: An Index Structure for Fast and Scalable Similarity Search in High Dimensions.
Proceedings of the Scientific and Statistical Database Management, 2009

Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases.
Proceedings of the SIAM International Conference on Data Mining, 2009

Near-optimal Supervised Feature Selection among Frequent Subgraphs.
Proceedings of the SIAM International Conference on Data Mining, 2009

GAD: General Activity Detection for Fast Clustering on Large Data.
Proceedings of the SIAM International Conference on Data Mining, 2009

Integrating Novel Class Detection with Classification for Concept-Drifting Data Streams.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

A Multi-partition Multi-chunk Ensemble Technique to Classify Concept-Drifting Data Streams.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2009

Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

VideoMule: a consensus learning approach to multi-label classification from noisy user-generated videos.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Exploring social tagging graph for web object classification.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Ranking-based clustering of heterogeneous information networks with star network schema.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Classification of software behaviors for failure detection: a discriminative pattern mining approach.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Heterogeneous source consensus learning via decision propagation and negotiation.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009


Locality Preserving Nonnegative Matrix Factorization.
Proceedings of the IJCAI 2009, 2009

Filtering and Refinement: A Two-Stage Approach for Efficient and Effective Anomaly Detection.
Proceedings of the ICDM 2009, 2009

Mining Personal Image Collection for Social Group Suggestion.
Proceedings of the ICDM Workshops 2009, 2009

iTopicModel: Information Network-Integrated Topic Modeling.
Proceedings of the ICDM 2009, 2009

Parallel PathFinder Algorithms for Mining Structures from Graphs.
Proceedings of the ICDM 2009, 2009

Temporal Outlier Detection in Vehicle Traffic Data.
Proceedings of the 25th International Conference on Data Engineering, 2009

Efficient Mining of Closed Repetitive Gapped Subsequences from a Sequence Database.
Proceedings of the 25th International Conference on Data Engineering, 2009

Modeling Probabilistic Measurement Correlations for Problem Determination in Large-Scale Distributed Systems.
Proceedings of the 29th IEEE International Conference on Distributed Computing Systems (ICDCS 2009), 2009

Real-Time Knowledge Discovery and Dissemination for Intelligence Analysis.
Proceedings of the 42st Hawaii International International Conference on Systems Science (HICSS-42 2009), 2009

RankClus: integrating clustering with ranking for heterogeneous information network analysis.
Proceedings of the EDBT 2009, 2009

Scalable OLAP and mining of information networks.
Proceedings of the EDBT 2009, 2009

Subspace Discovery for Promotion: A Cell Clustering Approach.
Proceedings of the Discovery Science, 12th International Conference, 2009

CHRONICLE: A Two-Stage Density-Based Clustering Algorithm for Dynamic Networks.
Proceedings of the Discovery Science, 12th International Conference, 2009

Finding Symbolic Bug Patterns in Sensor Networks.
Proceedings of the Distributed Computing in Sensor Systems, 2009

P-Rank: a comprehensive structural similarity measure over information networks.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Mining Heterogeneous Information Networks by Exploring the Power of Links.
Proceedings of the Algorithmic Learning Theory, 20th International Conference, 2009

2008
Approximate Frequent Itemset Mining In the Presence of Random Noise.
Proceedings of the Soft Computing for Knowledge Discovery and Data Mining, 2008

Anomaly Detection in Moving Object.
Proceedings of the Intelligence and Security Informatics, Techniques and Applications, 2008

OLAP, Spatial.
Proceedings of the Encyclopedia of GIS., 2008

A Systematic Study of Failure Proximity.
IEEE Trans. Software Eng., 2008

Truth Discovery with Multiple Conflicting Information Providers on the Web.
IEEE Trans. Knowl. Data Eng., 2008

Learning a Maximum Margin Subspace for Image Retrieval.
IEEE Trans. Knowl. Data Eng., 2008

SRDA: An Efficient Algorithm for Large-Scale Discriminant Analysis.
IEEE Trans. Knowl. Data Eng., 2008

<i>TraClass</i>: trajectory classification using hierarchical region-based and trajectory-based clustering.
Proc. VLDB Endow., 2008

On Having Search Engines Deliver Hierarchies of Web Pages.
J. Object Technol., 2008

Efficiently Identifying Max-Gap Clusters in Pairwise Genome Comparison.
J. Comput. Biol., 2008

Classifying Data Streams with Skewed Class Distributions and Concept Drifts.
IEEE Internet Comput., 2008

PerRank: Personalized Rank Retrieval with Categorical and Numerical Attributes.
Proceedings of the Ninth International Conference on Web-Age Information Management, 2008

Mining significant graph patterns by leap search.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

ARCube: supporting ranking aggregate queries in partially materialized data cubes.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

BibNetMiner: mining bibliographic information networks.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Sampling cube: a framework for statistical olap over sampling data.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2008

Dustminer: troubleshooting interactive complexity bugs in sensor networks.
Proceedings of the 6th International Conference on Embedded Networked Sensor Systems, 2008

CISpan: Comprehensive Incremental Mining Algorithms of Closed Sequential Patterns for Multi-Versional Software Mining.
Proceedings of the SIAM International Conference on Data Mining, 2008

SpaRClus: Spatial Relationship Pattern-Based Hierarchial Clustering.
Proceedings of the SIAM International Conference on Data Mining, 2008

Mining preferences from superior and inferior examples.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Knowledge transfer via multiple model local structure mapping.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Direct mining of discriminative and essential frequent patterns via model-based search tree.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Data Mining for Diagnostic Debugging in Sensor Networks: Preliminary Evidence and Lessons Learned.
Proceedings of the Knowledge Discovery from Sensor Data, 2008

Exploring the Power of Heuristics and Links in Multi-relational Data Mining.
Proceedings of the Foundations of Intelligent Systems, 17th International Symposium, 2008

Stream Sequential Pattern Mining with Precise Error Bounds.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

A Practical Approach to Classify Evolving Data Streams: Training with Limited Amount of Labeled Data.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Text Cube: Computing IR Measures for Multidimensional Text Database Analysis.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Graph OLAP: Towards Online Analytical Processing on Graphs.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

Non-negative Matrix Factorization on Manifold.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

P-Cube: Answering Preference Queries in Multi-Dimensional Space.
Proceedings of the 24th International Conference on Data Engineering, 2008

Trajectory Outlier Detection: A Partition-and-Detect Framework.
Proceedings of the 24th International Conference on Data Engineering, 2008

Direct Discriminative Pattern Mining for Effective Classification.
Proceedings of the 24th International Conference on Data Engineering, 2008

Training Linear Discriminant Analysis in Linear Time.
Proceedings of the 24th International Conference on Data Engineering, 2008

Peer to peer botnet detection for cyber-security: a data mining approach.
Proceedings of the 4th annual workshop on Cyber security and information intelligence research, 2008

Data mining for image/video processing: a promising research frontier.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

On effective presentation of graph patterns: a structural representative approach.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Modeling hidden topics on document manifold.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Sparse Projections over Graph.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

Mining Frequent Approximate Sequential Patterns.
Proceedings of the Next Generation of Data Mining., 2008

Research Challenges for Data Mining in Science and Engineering.
Proceedings of the Next Generation of Data Mining., 2008


2007
Multi-Dimensional Analysis of Data Streams Using Stream Cubes.
Proceedings of the Data Streams - Models and Algorithms, 2007

On Clustering Massive Data Streams: A Summarization Paradigm.
Proceedings of the Data Streams - Models and Algorithms, 2007

Computing Iceberg Cubes by Top-Down and Bottom-Up Integration: The StarCubing Approach.
IEEE Trans. Knowl. Data Eng., 2007

Frequent Closed Sequence Mining without Candidate Maintenance.
IEEE Trans. Knowl. Data Eng., 2007

Semantic annotation of frequent patterns.
ACM Trans. Knowl. Discov. Data, 2007

Introduction.
ACM Trans. Knowl. Discov. Data, 2007

Constraint-based sequential pattern mining: the pattern-growth methods.
J. Intell. Inf. Syst., 2007

On compressing frequent patterns.
Data Knowl. Eng., 2007

CrossClus: user-guided multi-relational clustering.
Data Min. Knowl. Discov., 2007

Frequent pattern mining: current status and future directions.
Data Min. Knowl. Discov., 2007

TagSmart: analysis and visualization for yeast mutant fitness data measured by tag microarrays.
BMC Bioinform., 2007

Exploring the Power of Links in Data Mining.
Proceedings of the Advances in Data and Web Management, 2007

DataScope: Viewing Database Contents in Google Maps' Way.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Mining Approximate Top-K Subspace Anomalies in Multi-Dimensional Time-Series Data.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Adaptive Fastest Path Computation on a Road Network: A Traffic Mining Approach.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

Towards Graph Containment Search and Indexing.
Proceedings of the 33rd International Conference on Very Large Data Bases, 2007

On Efficient Processing of Subspace Skyline Queries on High Dimensional Data.
Proceedings of the 19th International Conference on Scientific and Statistical Database Management, 2007

Traffic Density-Based Discovery of Hot Routes in Road Networks.
Proceedings of the Advances in Spatial and Temporal Databases, 2007

Finding frequent items in data streams using hierarchical information.
Proceedings of the IEEE International Conference on Systems, 2007

Progressive and selective merge: computing top-k with ad-hoc ranking functions.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

Trajectory clustering: a partition-and-group framework.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2007

ROAM: Rule- and Motif-Based Anomaly Detection in Massive Moving Object Data Sets.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions.
Proceedings of the Seventh SIAM International Conference on Data Mining, 2007

Association Mining in Large Databases: A Re-examination of Its Measures.
Proceedings of the Knowledge Discovery in Databases: PKDD 2007, 2007

gPrune: A Constraint Pushing Framework for Graph Pattern Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Research Frontiers in Advanced Data Mining Technologies and Applications.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Spectral regression: a unified subspace learning framework for content-based image retrieval.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Mining, Indexing, and Searching Graphs in Large Data Sets.
Proceedings of the Mining and Learning with Graphs, 2007

Regularized regression on image manifold for retrieval.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Locality Sensitive Discriminant Analysis.
Proceedings of the IJCAI 2007, 2007

Indexing Noncrashing Failures: A Dynamic Program Slicing-Based Approach.
Proceedings of the 23rd IEEE International Conference on Software Maintenance (ICSM 2007), 2007

Efficient Discovery of Frequent Approximate Sequential Patterns.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

On Appropriate Assumptions to Mine Data Streams: Analysis and Practice.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

gApprox: Mining Frequent Approximate Patterns from a Massive Network.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Efficient Kernel Discriminant Analysis via Spectral Regression.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Spectral Regression: A Unified Approach for Sparse Subspace Learning.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Mining Colossal Frequent Patterns by Core Pattern Fusion.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Object Distinction: Distinguishing Objects with Identical Names.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Integrating OLAP and Ranking: The Ranking-Cube Methodology.
Proceedings of the 23rd International Conference on Data Engineering Workshops, 2007

The Multi-Relational Skyline Operator.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Cost-Conscious Cleaning of Massive RFID Data Sets.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Discriminative Frequent Pattern Analysis for Effective Classification.
Proceedings of the 23rd International Conference on Data Engineering, 2007

Semi-supervised Discriminant Analysis.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Spectral Regression for Efficient Regularized Subspace Learning.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Learning a Spatially Smooth Subspace for Face Recognition.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Regularized locality preserving indexing via spectral regression.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007

Isometric Projection.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Statistical Debugging: A Hypothesis Testing-Based Approach.
IEEE Trans. Software Eng., 2006

Feature-based similarity search in graph structures.
ACM Trans. Database Syst., 2006

Efficient Classification across Multiple Database Relations: A CrossMine Approach.
IEEE Trans. Knowl. Data Eng., 2006

Closed Constrained Gradient Mining in Retail Databases.
IEEE Trans. Knowl. Data Eng., 2006

Regression Cubes with Lossless Compression and Aggregation.
IEEE Trans. Knowl. Data Eng., 2006

A Framework for On-Demand Classification of Evolving Data Streams.
IEEE Trans. Knowl. Data Eng., 2006

Orthogonal Laplacianfaces for Face Recognition.
IEEE Trans. Image Process., 2006

An Erratum on "Pushing Convertible Constraints in Frequent Itemset Mining".
Data Min. Knowl. Discov., 2006

Integrative Array Analyzer: a software package for analysis of cross-platform and cross-species microarray data.
Bioinform., 2006

Error-Adaptive and Time-Aware Maintenance of Frequency Counts over Data Streams.
Proceedings of the Advances in Web-Age Information Management, 2006

LinkClus: Efficient Clustering via Heterogeneous Semantic Links.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Answering Top-k Queries with Multi-Dimensional Selections: The Ranking Cube Approach.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Towards Robust Indexing for Ranked Queries.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

FlowCube: Constructuing RFID FlowCubes for Multi-Dimensional Analysis of Commodity Flows.
Proceedings of the 32nd International Conference on Very Large Data Bases, 2006

Failure proximity: a fault localization-based approach.
Proceedings of the 14th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2006

Ranking objects based on relationships.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2006

Tensor space model for document analysis.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Mining Control Flow Abnormality for Logic Error Isolation.
Proceedings of the Sixth SIAM International Conference on Data Mining, 2006

Mining Interesting Patterns from Very High Dimensional Data: A Top-Down Row Enumeration Approach.
Proceedings of the Sixth SIAM International Conference on Data Mining, 2006

Ranking Outliers Using Symmetric Neighborhood Relationship.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2006

Discovering interesting patterns through user's interactive feedback.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Extracting redundancy-aware top-k patterns.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Generating semantic annotations for frequent patterns with context analysis.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

GPLAG: detection of software plagiarism by program dependence graph analysis.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Motion-Alert: Automatic Anomaly Detection in Massive Moving Objects.
Proceedings of the Intelligence and Security Informatics, 2006

How Bayesians Debug.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery.
Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 2006

Searching Substructures with Superimposed Distance.
Proceedings of the 22nd International Conference on Data Engineering, 2006

C-Cubing: Efficient Computation of Closed Cubes by Aggregation-Based Checking.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Top-Down Mining of Interesting Patterns from Very High Dimensional Data.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Mining, Indexing, and Similarity Search in Graphs and Complex Structures.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Warehousing and Analyzing Massive RFID Data Sets.
Proceedings of the 22nd International Conference on Data Engineering, 2006

On the Inverse Classification Problem and its Applications.
Proceedings of the 22nd International Conference on Data Engineering, 2006

Mining compressed commodity workflows from massive RFID data sets.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

Warehousing and Mining Massive RFID Data Sets.
Proceedings of the Advanced Data Mining and Applications, Second International Conference, 2006

Data Mining: Concepts and Techniques, Second Edition.
The Morgan Kaufmann series in data management systems, Elsevier, ISBN: 978-1-55860-901-3, 2006

2005
Graph indexing based on discriminative frequent structure analysis.
ACM Trans. Database Syst., 2005

Divide-and-Approximate: A Novel Constraint Push Strategy for Iceberg Cube Mining.
IEEE Trans. Knowl. Data Eng., 2005

TFP: An Efficient Algorithm for Mining Top-K Frequent Closed Itemsets.
IEEE Trans. Knowl. Data Eng., 2005

Document Clustering Using Locality Preserving Indexing.
IEEE Trans. Knowl. Data Eng., 2005

A tribute to Professor Hongjun Lu.
SIGMOD Rec., 2005

TSP: Mining top-<i>k</i> closed sequential patterns.
Knowl. Inf. Syst., 2005

Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams.
Distributed Parallel Databases, 2005

Making SVMs Scalable to Large Data Sets using Hierarchical Cluster Indexing.
Data Min. Knowl. Discov., 2005

On High Dimensional Projected Clustering of Data Streams.
Data Min. Knowl. Discov., 2005

Mining Compressed Frequent-Pattern Sets.
Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30, 2005

Searching for Related Objects in Relational Databases.
Proceedings of the 17th International Conference on Scientific and Statistical Database Management, 2005

SOBER: statistical model-based bug localization.
Proceedings of the 10th European Software Engineering Conference held jointly with 13th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2005

Substructure Similarity Search in Graph Databases.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

GraphMiner: a structural pattern-mining system for large disk-based graph databases and its applications.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2005

Mining Behavior Graphs for "Backtrace" of Noncrashing Bugs.
Proceedings of the 2005 SIAM International Conference on Data Mining, 2005

SeqIndex: Indexing Sequences by Sequential Pattern Analysis.
Proceedings of the 2005 SIAM International Conference on Data Mining, 2005

A sampling-based framework for parallel data mining.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2005

Efficient Classification from Multiple Heterogeneous Databases.
Proceedings of the Knowledge Discovery in Databases: PKDD 2005, 2005

Efficient Processing of Ranked Queries with Sweeping Selection.
Proceedings of the Knowledge Discovery in Databases: PKDD 2005, 2005

Community Mining from Multi-relational Networks.
Proceedings of the Knowledge Discovery in Databases: PKDD 2005, 2005

Image clustering with tensor representation.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Cross-relational clustering with user's guidance.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Summarizing itemset patterns: a profile-based approach.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Parallel mining of closed sequential patterns.
Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005

Accelerating DNA sequencing-by-hybridization with noise.
Proceedings of the 5th international workshop on Bioinformatics, 2005

Mining hidden community in heterogeneous social networks.
Proceedings of the 3rd international workshop on Link discovery, 2005

Mining coherent dense subgraphs across massive biological networks for functional discovery.
Proceedings of the Proceedings Thirteenth International Conference on Intelligent Systems for Molecular Biology 2005, 2005

Mining Closed Relational Graphs with Connectivity Constraints.
Proceedings of the 21st International Conference on Data Engineering, 2005

Mining Evolving Customer-Product Relationships in Multi-Dimensional Space.
Proceedings of the 21st International Conference on Data Engineering, 2005

Survey of Biodata Analysis from a Data Mining Perspective.
Proceedings of the Data Mining in Bioinformatics, 2005

Mining Data Streams.
Proceedings of the Data Mining and Knowledge Discovery Handbook., 2005

2004
PEBL: Web Page Classification without Negative Examples.
IEEE Trans. Knowl. Data Eng., 2004

Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach.
IEEE Trans. Knowl. Data Eng., 2004

Mining Constrained Gradients in Large Databases.
IEEE Trans. Knowl. Data Eng., 2004

The third SIGKDD workshop on mining temporal and sequential data (KDD/TDM 2004).
SIGKDD Explor., 2004

Mining Condensed Frequent-Pattern Bases.
Knowl. Inf. Syst., 2004

From Sequential Pattern Mining to Structured Pattern Mining: A Pattern-Growth Approach.
J. Comput. Sci. Technol., 2004

Pushing Convertible Constraints in Frequent Itemset Mining.
Data Min. Knowl. Discov., 2004

Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach.
Data Min. Knowl. Discov., 2004

High-Dimensional OLAP: A Minimal Cubing Approach.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

A Framework for Projected Clustering of High Dimensional Data Streams.
Proceedings of the (e)Proceedings of the Thirtieth International Conference on Very Large Data Bases, VLDB 2004, Toronto, Canada, August 31, 2004

MM-Cubing: Computing Iceberg Cubes by Factorizing the Lattice Space.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

Continuous K-Nearest Neighbor Search for Moving Objects.
Proceedings of the 16th International Conference on Scientific and Statistical Database Management (SSDBM 2004), 2004

Graph Indexing: A Frequent Structure-based Approach.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

MAIDS: Mining Alarming Incidents from Data Streams.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2004

Mining Thick Skylines over Large Databases.
Proceedings of the Knowledge Discovery in Databases: PKDD 2004, 2004

CCMine: Efficient Mining of Confidence-Closed Correlated Patterns.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2004

Mining scale-free networks using geodesic clustering.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Clustering moving objects.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Discovering complex matchings across web query interfaces: a correlation mining approach.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

IncSpan: incremental mining of sequential patterns in large database.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

On demand classification of data streams.
Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2004

Scalable Construction of Topic Directory with Nonparametric Closed Termset Mining.
Proceedings of the 4th IEEE International Conference on Data Mining (ICDM 2004), 2004

CrossMine: Efficient Classification Across Multiple Database Relations.
Proceedings of the 20th International Conference on Data Engineering, 2004

BIDE: Efficient Mining of Frequent Closed Sequences.
Proceedings of the 20th International Conference on Data Engineering, 2004

Mining complex matchings across Web query interfaces.
Proceedings of the 9th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2004

2003
Pushing Support Constraints Into Association Rules Mining.
IEEE Trans. Knowl. Data Eng., 2003

Efficient Mining of Intertransaction Association Rules.
IEEE Trans. Knowl. Data Eng., 2003

Cancer classification using gene expression data.
Inf. Syst., 2003

Profile-Based Object Matching for Information Integration.
IEEE Intell. Syst., 2003

Star-Cubing: Computing Iceberg Cubes by Top-Down and Bottom-Up Integration.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

A Framework for Clustering Evolving Data Streams.
Proceedings of 29th International Conference on Very Large Data Bases, 2003

CPAR: Classification based on Predictive Association Rules.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

CloSpan: Mining Closed Sequential Patterns in Large Datasets.
Proceedings of the Third SIAM International Conference on Data Mining, 2003

Classifying large data sets using SVMs with hierarchical clusters.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

CloseGraph: mining closed frequent graph patterns.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

CLOSET+: searching for the best strategies for mining frequent closed itemsets.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Mining concept-drifting data streams using ensemble classifiers.
Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 24, 2003

Object Matching for Information Integration: A Profiler-Based Approach.
Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), 2003

TSP: Mining Top-K Closed Sequential Patterns.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

CoMine: Efficient Mining of Correlated Patterns.
Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 2003

Pushing Aggregate Constraints by Divide-and-Approximate.
Proceedings of the 19th International Conference on Data Engineering, 2003

Mining Typical Preferences of Collaborative User Groups.
Proceedings of the Conceptual Modeling, 2003

Discovering Compact and Highly Discriminative Features or Feature Combinations of Drug Activities Using Support Vector Machines.
Proceedings of the 2nd IEEE Computer Society Bioinformatics Conference, 2003

Text classification from positive and unlabeled documents.
Proceedings of the 2003 ACM CIKM International Conference on Information and Knowledge Management, 2003

2002
A template model for multidimensional inter-transactional association rules.
VLDB J., 2002

CLARANS: A Method for Clustering Objects for Spatial Data Mining.
IEEE Trans. Knowl. Data Eng., 2002

Database Research at the University of Illinois at Urbana-Champaign.
SIGMOD Rec., 2002

Constrained frequent pattern mining: a pattern-growth view.
SIGKDD Explor., 2002

Data Mining for Web Intelligence.
Computer, 2002

Emerging scientific applications in data mining.
Commun. ACM, 2002

Quotient Cube: How to Summarize the Semantics of a Data Cube.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Multi-Dimensional Regression Analysis of Time-Series Data Streams.
Proceedings of 28th International Conference on Very Large Data Bases, 2002

Mining long sequential patterns in a noisy environment.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

CubeExplorer: online exploration of data cubes.
Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, 2002

Top Down FP-Growth for Association Rule Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2002

PEBL: positive example based learning for Web page classification using SVM.
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002

Mining frequent item sets by opportunistic projection.
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2002

How Can Data Mining Help Bio-Data Analysis?
Proceedings of the 2nd ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD 2002), 2002

Heterogeneous Learner for Web Page Classification.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

gSpan: Graph-Based Substructure Pattern Mining.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

On Computing Condensed Frequent Pattern Bases.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

Mining Top-K Frequent Closed Patterns without Minimum Support.
Proceedings of the 2002 IEEE International Conference on Data Mining (ICDM 2002), 2002

Profit Mining: From Patterns to Actions.
Proceedings of the Advances in Database Technology, 2002

Online Analytical Processing Stream Data: Is It Feasible?
Proceedings of the 2002 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2002

Mining sequential patterns with constraints in large databases.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002

2001
Reminiscences on Influential Papers.
SIGMOD Rec., 2001

Mining Multi-Dimensional Constrained Gradients in Data Cubes.
Proceedings of the VLDB 2001, 2001

DNA-Miner: A System Prototype for Mining DNA Sequences.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Efficient Computation of Iceberg Cubes with Complex Measures.
Proceedings of the 2001 ACM SIGMOD international conference on Management of data, 2001

Mining top-n local outliers in large databases.
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

Scalable frequent-pattern mining methods: an overview.
Proceedings of the Tutorial notes of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining, 2001

Constraint-based clustering in large databases.
Proceedings of the Database Theory, 2001

H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules.
Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November, 2001

Spatial Clustering in the Presence of Obstacles.
Proceedings of the 17th International Conference on Data Engineering, 2001

PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth.
Proceedings of the 17th International Conference on Data Engineering, 2001

Mining Frequent Item Sets with Convertible Constraints.
Proceedings of the 17th International Conference on Data Engineering, 2001

Fault-Tolerant Frequent Pattern Mining: Problems and Challenges.
Proceedings of the 2001 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2001

RecTree: An Efficient Collaborative Filtering Method.
Proceedings of the Data Warehousing and Knowledge Discovery, 2001

Multi-Dimensional Sequential Pattern Mining.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
Beyond intratransaction association analysis: mining multidimensional intertransaction association rules.
ACM Trans. Inf. Syst., 2000

Object-Based Selective Materialization for Efficient Implementation of Spatial Data Cubes.
IEEE Trans. Knowl. Data Eng., 2000

Discovering Geographic Knowledge in Data Rich Environments: A Report on a Specialist Meeting.
SIGKDD Explor., 2000

Mining Frequent Patterns by Pattern-Growth: Methodology and Implications.
SIGKDD Explor., 2000

Efficient Rule-Based Attribute-Oriented Induction for Data Mining.
J. Intell. Inf. Syst., 2000

Mining Frequent Itemsets Using Support Constraints.
Proceedings of the VLDB 2000, 2000

Mining Frequent Patterns without Candidate Generation.
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000

COE: Clustering with Obstacles Entities. A Preliminary Study.
Proceedings of the Knowledge Discovery and Data Mining, 2000

Mining Access Patterns Efficiently from Web Logs.
Proceedings of the Knowledge Discovery and Data Mining, 2000

Geo-Spatial Clustering with User-Specified Constraints.
Proceedings of the International Workshop on Multimedia Data Mining, 2000

Can we push more constraints into frequent pattern mining?
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

FreeSpan: frequent pattern-projected sequential pattern mining.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

Mining Recurrent Items in Multimedia with Progressive Resolution Refinement.
Proceedings of the 16th International Conference on Data Engineering, San Diego, California, USA, February 28, 2000

Discovering spatial associations in images.
Proceedings of the Data Mining and Knowledge Discovery: Theory, 2000

CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets.
Proceedings of the 2000 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 2000

AIM: Approximate Intelligent Matching for Time Series Data.
Proceedings of the Data Warehousing and Knowledge Discovery, 2000

Data Mining: Concepts and Techniques
Morgan Kaufmann, ISBN: 1-55860-489-8, 2000

1999
Join Index Hierarchy: An Indexing Structure for Efficient Navigation in Object-Oriented Databases.
IEEE Trans. Knowl. Data Eng., 1999

Mining Multiple-Level Association Rules in Large Databases.
IEEE Trans. Knowl. Data Eng., 1999

Review - An Array-Based Algorithm for Simultaneous Multidimensional Aggregates.
ACM SIGMOD Digit. Rev., 1999

Constraint-Based Multidimensional Data Mining.
Computer, 1999

Efficient Polygon Amalgamation Methods for Spatial OLAP and Spatial Data Mining.
Proceedings of the Advances in Spatial Databases, 1999

Exploratory Mining via Constrained Frequent Set Queries.
Proceedings of the SIGMOD 1999, 1999

Optimization of Constrained Frequent Set Queries with 2-variable Constraints.
Proceedings of the SIGMOD 1999, 1999

Breaking the Barrier of Transactions: Mining Inter-Transaction Association Rules.
Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1999

Efficient Mining of Partial Periodic Patterns in Time Series Database.
Proceedings of the 15th International Conference on Data Engineering, 1999

Plan Mining by Divide-and-Conquer.
Proceedings of the 1999 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, 1999

Mining Inter-Transaction Associations with Templates.
Proceedings of the 1999 ACM CIKM International Conference on Information and Knowledge Management, 1999

1998
Towards On-Line Analytical Mining in Large Databases.
SIGMOD Rec., 1998

Knowledge representation and discovery based on linguistic atoms.
Knowl. Based Syst., 1998

Evaluation of Declarative n-Queens Recursion: A Deductive Database Approach.
Inf. Sci., 1998

Generalization-Based Data Mining in Object-Oriented Databases Using an Object Cube Model.
Data Knowl. Eng., 1998

Editorial.
Data Min. Knowl. Discov., 1998

WebML: Querying the World-Wide Web for Resources and Knowledge.
Proceedings of the CIKM'98 First Workshop on Web Information and Data Management (WIDM'98), 1998

MultiMediaMiner: A System Prototype for Multimedia Data Mining.
Proceedings of the SIGMOD 1998, 1998

Exploratory Mining and Pruning Optimizations of Constrained Association Rules.
Proceedings of the SIGMOD 1998, 1998

Selective Materialization: An Efficient Method for Spatial Data Cube Construction.
Proceedings of the Research and Development in Knowledge Discovery and Data Mining, 1998

Mining Segment-Wise Periodic Patterns in Time-Related Databases.
Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98), 1998

Mining multimedia data.
Proceedings of the 1998 conference of the Centre for Advanced Studies on Collaborative Research, November 30, 1998

Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs.
Proceedings of the IEEE Forum on Research and Technology Advances in Digital Libraries, 1998

1997
GeoMiner: A System Prototype for Spatial Data Mining.
Proceedings of the SIGMOD 1997, 1997

Generalization and Decision Tree Induction: Efficient Classification in Data Mining.
Proceedings of the 7th International Workshop on Research Issues in Data Engineering (RIDE '97) High Performance Database Management for Large-Scale Applications, 1997

Metarule-Guided Mining of Multi-Dimensional Association Rules Using Data Cubes.
Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97), 1997

Data Mining: Where is it Heading? (Panel).
Proceedings of the Thirteenth International Conference on Data Engineering, 1997

OLAP Mining: Integration of OLAP with Data Mining.
Proceedings of the Data Mining and Reverse Engineering: Searching for Semantics, 1997

DBMiner: a system for data mining in relational databases and data warehouses.
Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborative Research, 1997

1996
Index Nesting - An Efficient Approach to Indexing in Object-Oriented Databases
VLDB J., 1996

Intelligent Query Answering by Knowledge Discovery Techniques.
IEEE Trans. Knowl. Data Eng., 1996

Data Mining: An Overview from a Database Perspective.
IEEE Trans. Knowl. Data Eng., 1996

DBMiner: Interactive Mining of Multiple-Level Knowledge in Relational Databases.
Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, 1996

Data Mining Techniques.
Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, 1996

A Fast Distributed Algorithm for Mining Association Rules.
Proceedings of the Fourth International Conference on Parallel and Distributed Information Systems, 1996

DBMiner: A System for Mining Knowledge in Large Relational Databases.
Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96), 1996

Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique.
Proceedings of the Twelfth International Conference on Data Engineering, February 26, 1996

Spatial Data Mining: Progress and Challenges.
Proceedings of the Workshop on Research Issues on Data Mining and Knowledge Discovery, 1996

Attribute-Oriented Induction in data Mining.
Proceedings of the Advances in Knowledge Discovery and Data Mining., 1996

1995
Chain-Split Evaluation in Deductive Databases.
IEEE Trans. Knowl. Data Eng., 1995

Evaluation of Regular Nonlinear Recursions by Deductive Database Techniques.
Inf. Syst., 1995

Query evaluation and optimization in deductive and object-oriented spatial databases.
Inf. Softw. Technol., 1995

Fact Updates in Logic Databases.
Int. J. Softw. Eng. Knowl. Eng., 1995

Information-Associated Join Indices for Spatial Range Search.
Int. J. Geogr. Inf. Sci., 1995

Discovery of Multiple-Level Association Rules from Large Databases.
Proceedings of the VLDB'95, 1995

Discovery of Spatial Association Rules in Geographic Information Databases.
Proceedings of the Advances in Spatial Databases, 1995

Resource and Knowledge Discovery in Global Information Systems: A Preliminary Design and Experiment.
Proceedings of the First International Conference on Knowledge Discovery and Data Mining (KDD-95), 1995

Advances of the DBLearn System for Knowledge Discovery in Large Databases.
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995

Normalization and Compilation of Deductive and Object-Oriented Databases Programs for Efficient Query Evaluation.
Proceedings of the Deductive and Object-Oriented Databases, 1995

Meta-Rule-Guided Mining of Association Rules in Relational Databases.
Proceedings of the DOOD 1995 Post-Conference Workshops on Integration of Knowledge Discovery in Databases with Deductive and Object-Oriented Databases (KDOOD) and Temporal Reasoning in Deductive and Object-Oriented Databases (TDOOD), 1995

Maintenance of Discovered Knowledge: A Strategy for Updating Association Rules (Abstract).
Proceedings of the DOOD 1995 Post-Conference Workshops on Integration of Knowledge Discovery in Databases with Deductive and Object-Oriented Databases (KDOOD) and Temporal Reasoning in Deductive and Object-Oriented Databases (TDOOD), 1995

Mining Knowledge at Multiple Concept Levels.
Proceedings of the CIKM '95, Proceedings of the 1995 International Conference on Information and Knowledge Management, November 28, 1995

1994
A Study on the Structure of Linear Recursion.
IEEE Trans. Knowl. Data Eng., 1994

Constraint-Based Query Evaluation in Deductive Databases.
IEEE Trans. Knowl. Data Eng., 1994

Towards Efficient Induction Mechanisms in Database Systems.
Theor. Comput. Sci., 1994

Multichannel Delay Sensitive Data Retrieval.
Multim. Syst., 1994

Delay-Sensitive Multimedia on Disks.
IEEE Multim., 1994

Join Index Hierarchies for Supporting Efficient Navigations in Object-Oriented Databases.
Proceedings of the VLDB'94, 1994

Efficient and Effective Clustering Methods for Spatial Data Mining.
Proceedings of the VLDB'94, 1994

DBLearn: A System Prototype for Knowledge Discovery in Relational Databases.
Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, 1994

Dynamic Generation and Refinement of Concept Hierarchies for Knowledge Discovery in Databases.
Proceedings of the Knowledge Discovery in Databases: Papers from the 1994 AAAI Workshop, 1994

Incremental Processing of Logic Database Relations.
Proceedings of the Methodologies for Intelligent Systems, 8th International Symposium, 1994

A Case-Based Reasoning Approach for Associative Query Answering.
Proceedings of the Methodologies for Intelligent Systems, 8th International Symposium, 1994

How Does Knowledge Discovery Cooperate with Active Database Techniques in Controlling Dynamic Environment?
Proceedings of the Database and Expert Systems Applications, 5th International Conference, 1994

Cooperative Query Answering Using Multiple Layered Databases.
Proceedings of the Second International Conference on Cooperative Information Systems, 1994

LogicBase: A Deductive Database System Prototype.
Proceedings of the Third International Conference on Information and Knowledge Management (CIKM'94), Gaithersburg, Maryland, USA, November 29, 1994

1993
Data-Driven Discovery of Quantitative Rules in Relational Databases.
IEEE Trans. Knowl. Data Eng., 1993

Compilation and evaluation of linear mutual recursions.
Inf. Sci., 1993

Evaluation of Regular Nonlinear Recursive Programs.
Proceedings of the Logic Programming, 1993

An Attribute-Oriented Rough Set Approach for Knowledge Discovery in Databases.
Proceedings of the Rough Sets, 1993

Human-Computer Interfaces: DBLEARN and System X.
Proceedings of the Rough Sets, 1993

Homomorphic Tree Embeddings and Their Applications to Recursive Program Optimization
Proceedings of the Eighth Annual Symposium on Logic in Computer Science (LICS '93), 1993

Normalization of Linear Recursions in Deductive Databases.
Proceedings of the Ninth International Conference on Data Engineering, 1993

Outline of the LogicBase Demonstration.
Proceedings of the Workshop on Programming with Logic Databases. In Conjunction with ILPS, 1993

LogicBase: A System Prototype for Deductive Query Evaluation.
Proceedings of the Workshop on Programming with Logic Databases. In Conjunction with ILPS, 1993

1992
Classification and Compilation of Linear Recursive Queries in Deductive Databases.
IEEE Trans. Knowl. Data Eng., 1992

Automatic generation of compiled forms for linear recursions.
Inf. Syst., 1992

Binding Propagation Beyond the Reach of Rule/Goal Graphs.
Inf. Process. Lett., 1992

Deductive-ER: deductive entity-relationship data model and its data language.
Inf. Softw. Technol., 1992

On the Power of Query-Independent Compilation.
Int. J. Softw. Eng. Knowl. Eng., 1992

Knowledge Discovery in Databases: An Attribute-Oriented Approach.
Proceedings of the 18th International Conference on Very Large Data Bases, 1992

Chain-Based Evaluation - A Bridge Linking Recursive and Nonrecursive Query Evaluation.
Proceedings of the RIDE-TQP '92, 1992

Distance-Associated Join Indices for Spatial Range Search.
Proceedings of the Eighth International Conference on Data Engineering, 1992

Compilation-Based List Processing in Deductive Databases.
Proceedings of the Advances in Database Technology, 1992

Compressed Counting Method.
Proceedings of the Workshop on Deductive Databases held in conjunction with the Joint International Conference and Symposium on Logic Programming, 1992

N-Queens Problem Revisited: A Deductive Database Approach.
Proceedings of the Workshop on Deductive Databases held in conjunction with the Joint International Conference and Symposium on Logic Programming, 1992

Efficient Deduction and Induction: Key to the Success of Data-Intensive Knowledge-Base Systems.
Proceedings of the Formal Methods in Databases and Software Engineering, 1992

1991
Efficient Evaluation of Multiple Linear Recursions.
IEEE Trans. Software Eng., 1991

Evaluation of functional linear recursions: a compilation approach.
Inf. Syst., 1991

Learning in relational databases: an attribute-oriented approach.
Comput. Intell., 1991

Efficient Evaluation of Linear Recursions with Function Sysmbols in Deductive Databases.
Proceedings of the Methodologies for Intelligent Systems, 6th International Symposium, 1991

Constraint-Based Reasoning in Deductive Databases.
Proceedings of the Seventh International Conference on Data Engineering, 1991

Efficient Access Methods in Deductive and Object-Oriented Databases.
Proceedings of the Deductive and Object-Oriented Databases, 1991

Attribute-Oriented Induction in Relational Databases.
Proceedings of the Knowledge Discovery in Databases, 1991

1990
From transitive closure recursions to single-chain recursions.
Inf. Syst., 1990

Path: An Approach to Incorporate List Processing in A Relational Database.
Proceedings of the SEKE'90, 1990

An Attribute-Oriented Approach for Learning Classification Rules from Relational Databases.
Proceedings of the Sixth International Conference on Data Engineering, 1990

1989
Asynchronous Chain Recursions.
IEEE Trans. Knowl. Data Eng., 1989

Multi-way counting method.
Inf. Syst., 1989

Compiling general linear recursions by variable connection graph analysis.
Comput. Intell., 1989

Processing Multiple Linear Recursions.
Proceedings of the Logic Programming, 1989

Derivation of Magic Sets by Compilation.
Proceedings of the SEKE'89, 1989

The Level-Cycle Merging Method.
Proceedings of the Deductive and Object-Oriented Databases, 1989

1988
Classification of Recursive Formulas in Deductive Databases.
Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data, 1988

One-Directional Recursive Formulas.
Proceedings of the Third International Conference on Data and Knowledge Bases: Improving Usability and Responsiveness, 1988

Selection of Processing Strategies for Different Recursive Queries.
Proceedings of the Third International Conference on Data and Knowledge Bases: Improving Usability and Responsiveness, 1988

What Kinds of Recursion Can Be Processed by Transitive Closure Strategies?
Proceedings of the Methodologies for Intelligent Systems, 1988

Semantic Query Optimization in Recursive Databases.
Proceedings of the Fourth International Conference on Data Engineering, 1988

The Processing and Evaluation of Transitive Closure Queries.
Proceedings of the Advances in Database Technology, 1988

1987
Handling Redundancy in the Processing of Recursive Database Queries.
Proceedings of the Association for Computing Machinery Special Interest Group on Management of Data 1987 Annual Conference, 1987

1986
Some Performance Results on Recursive Query Processing in Relational Database Systems.
Proceedings of the Second International Conference on Data Engineering, 1986

1985
Using Expert Knowledge in Database-Oriented Problem Solving.
Proceedings of the 6th International Conference on Information Systems, 1985

Knowledge-Directed Recursive Rule Compilation in Expert Database Systems.
Proceedings of the Artificial Intelligence Applications, 1985


  Loading...