Tat-Seng Chua

This is a disambiguation page: it contains multiple papers by persons with the same or a similar name.

Bibliography

2024
Enhancing Video-Language Representations With Structural Spatio-Temporal Alignment.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Behavioral Intention Prediction in Driving Scenes: A Survey.
IEEE Trans. Intell. Transp. Syst., August, 2024

MultiCBR: Multi-view Contrastive Learning for Bundle Recommendation.
ACM Trans. Inf. Syst., July, 2024

Filter-based Stance Network for Rumor Verification.
ACM Trans. Inf. Syst., July, 2024

Enhancing Out-of-distribution Generalization on Graphs via Causal Attention Learning.
ACM Trans. Knowl. Discov. Data, June, 2024

Robust Collaborative Filtering to Popularity Distribution Shift.
ACM Trans. Inf. Syst., May, 2024

Rule-Guided Counterfactual Explainable Recommendation.
IEEE Trans. Knowl. Data Eng., May, 2024

Computational Technologies for Fashion Recommendation: A Survey.
ACM Comput. Surv., May, 2024

Learning to Double-Check Model Prediction From a Causal Perspective.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

SLED: Structure Learning based Denoising for Recommendation.
ACM Trans. Inf. Syst., March, 2024

Dynamic Multimodal Fusion via Meta-Learning Towards Micro-Video Recommendation.
ACM Trans. Inf. Syst., March, 2024

Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems.
IEEE Trans. Knowl. Data Eng., February, 2024

Causal Disentangled Recommendation against User Preference Shifts.
ACM Trans. Inf. Syst., January, 2024

Context-Aware Dynamic Word Embeddings for Aspect Term Extraction.
IEEE Trans. Affect. Comput., 2024

Multiple-environment Self-adaptive Network for aerial-view geo-localization.
Pattern Recognit., 2024

Cross-view hypergraph contrastive learning for attribute-aware recommendation.
Inf. Process. Manag., 2024

Efficient Inference for Large Language Model-based Generative Recommendation.
CoRR, 2024

Grammar Induction from Visual, Speech and Text.
CoRR, 2024

AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models.
CoRR, 2024

MASKDROID: Robust Android Malware Detection with Masked Graph Representations.
CoRR, 2024

Scene-Text Grounding for Text-Based Video Question Answering.
CoRR, 2024

ExpLLM: Towards Chain of Thought for Facial Expression Recognition.
CoRR, 2024

VideoQA in the Era of LLMs: An Empirical Study.
CoRR, 2024

Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation.
CoRR, 2024

Harnessing Large Language Models for Multimodal Product Bundling.
CoRR, 2024

A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting.
CoRR, 2024

Disentangling Masked Autoencoders for Unsupervised Domain Generalization.
CoRR, 2024

Language Models Encode Collaborative Signals in Recommendation.
CoRR, 2024

On Softmax Direct Preference Optimization for Recommendation.
CoRR, 2024

Hello Again! LLM-powered Personalized Agent for Long-term Dialogue.
CoRR, 2024

Unified Text-to-Image Generation and Retrieval.
CoRR, 2024

Towards Semantic Equivalence of Tokenization in Multimodal LLM.
CoRR, 2024

Instructing Prompt-to-Prompt Generation for Zero-Shot Learning.
CoRR, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness.
CoRR, 2024

ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation.
CoRR, 2024

Towards Comprehensive and Efficient Post Safety Alignment of Large Language Models via Safety Patching.
CoRR, 2024

Co-Matching: Towards Human-Machine Collaborative Legal Case Matching.
CoRR, 2024

Learnable Tokenizer for LLM-based Generative Recommendation.
CoRR, 2024

A Survey on RAG Meets LLMs: Towards Retrieval-Augmented Large Language Models.
CoRR, 2024

A Survey of Generative Search and Recommendation in the Era of Large Language Models.
CoRR, 2024

Concept - An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors.
CoRR, 2024

A Picture Is Worth a Graph: Blueprint Debate on Graph for Multimodal Reasoning.
CoRR, 2024

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images.
CoRR, 2024

Think Twice Before Assure: Confidence Estimation for Large Language Models through Reflection on Multiple Answers.
CoRR, 2024

Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning.
CoRR, 2024

Contrastive Pre-training for Deep Session Data Understanding.
CoRR, 2024

Learning to Ask Critical Questions for Assisting Product Search.
CoRR, 2024

Abductive Ego-View Accident Video Understanding for Safe Driving Perception.
CoRR, 2024

Prospect Personalized Recommendation on Large Language Model-based Agent Platform.
CoRR, 2024

GraphEdit: Large Language Models for Graph Structure Learning.
CoRR, 2024

Gotcha! Don't trick me with unanswerable questions! Self-aligning Large Language Models for Responding to Unknown Questions.
CoRR, 2024

LLM-based Federated Recommendation.
CoRR, 2024

Smart Fitting Room: A Generative Approach to Matching-aware Virtual Try-On.
CoRR, 2024

TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data.
CoRR, 2024

Instilling Multi-round Thinking to Text-guided Image Generation.
CoRR, 2024

CPT: Colorful Prompt Tuning for pre-trained vision-language models.
AI Open, 2024

Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks.
Proceedings of the ACM on Web Conference 2024, 2024

Uplift Modeling for Target User Attacks on Recommender Systems.
Proceedings of the ACM on Web Conference 2024, 2024

Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models.
Proceedings of the ACM on Web Conference 2024, 2024

FashionReGen: LLM-Empowered Fashion Report Generation.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Large Language Model Powered Agents in the Web.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Leveraging Multimodal Features and Item-level User Feedback for Bundle Construction.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

Denoising Diffusion Recommender Model.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

I3: Intent-Introspective Retrieval Conditioned on Instructions.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Data-efficient Fine-tuning for LLM-based Recommendation.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Towards Human-centered Proactive Conversational Agents.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

A Taxation Perspective for Fair Re-ranking.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

On Generative Agents in Recommendation.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Large Language Model Powered Agents for Information Retrieval.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Information-Controllable Graph Contrastive Learning for Recommendation.
Proceedings of the 18th ACM Conference on Recommender Systems, 2024

A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

CIRP: Cross-Item Relational Pre-training for Multimodal Product Bundling.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Causal-driven Large Language Models with Faithful Reasoning for Knowledge Question Answering.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Smart Fitting Room: A One-stop Framework for Matching-aware Virtual Try-On.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

EE-LCE: An Event Extraction Framework Based on LLM-Generated CoT Explanation.
Proceedings of the Knowledge Science, Engineering and Management, 2024

Adversary and Attention Guided Knowledge Graph Reasoning Based on Reinforcement Learning.
Proceedings of the Knowledge Science, Engineering and Management, 2024

LARP: Language Audio Relational Pre-training for Cold-Start Playlist Continuation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Bridging Items and Language: A Transition Paradigm for Large Language Model-Based Recommendation.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

A Survey on Neural Question Generation: Methods, Applications, and Prospects.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

NExT-Chat: An LMM for Chat, Detection and Segmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

NExT-GPT: Any-to-Any Multimodal LLM.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Auto-Encoding Morph-Tokens for Multimodal LLM.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

InstructVid2Vid: Controllable Video Editing with Natural Language Instructions.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Towards 3D Molecule-Text Interpretation in Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

TAT-LLM: A Specialized Language Model for Discrete Reasoning over Financial Tabular and Textual Data.
Proceedings of the 5th ACM International Conference on AI in Finance, 2024

Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Ask-before-Plan: Proactive Language Agents for Real-World Planning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Beyond Persuasion: Towards Conversational Recommender System with Credible Explanations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Don't Just Say "I don't know"! Self-aligning Large Language Models for Responding to Unknown Questions with Explanations.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A Study of Implicit Ranking Unfairness in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching.
Proceedings of the Computer Vision - ECCV 2024, 2024

Hop-based Heterogeneous Graph Transformer.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Can I Trust Your Answer? Visually Grounded Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Discriminative Probing and Tuning for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Abductive Ego-View Accident Video Understanding for Safe Driving Perception.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

LASO: Language-Guided Affordance Segmentation on 3D Object.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Dysen-VDM: Empowering Dynamics-Aware Text-to-Video Diffusion with LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Learnable Item Tokenization for Generative Recommendation.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Chain-of-Exemplar: Enhancing Distractor Generation for Multimodal Educational Question Generation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ProtT3: Protein-to-Text Generation for Text-based Protein Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text Pretraining.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Distillation Enhanced Generative Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

On the Multi-turn Instruction Following for Conversational Web Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

GOODAT: Towards Test-Time Graph Out-of-Distribution Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Temporally and Distributionally Robust Optimization for Cold-Start Recommendation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Towards Generative Search and Recommendation: A keynote at RecSys 2023.
SIGIR Forum, December, 2023

Causal Inference for Knowledge Graph Based Recommendation.
IEEE Trans. Knowl. Data Eng., November, 2023

Contrastive Video Question Answering via Video Graph Transformer.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Personalized Latent Structure Learning for Recommendation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Multi-view Consistent Generative Adversarial Networks for Compositional 3D-Aware Image Synthesis.
Int. J. Comput. Vis., August, 2023

Report on the 16th ACM International Conference on Web Search and Data Mining (WSDM 2023).
SIGIR Forum, June, 2023

User Perception of Recommendation Explanation: Are Your Explanations What Users Need?
ACM Trans. Inf. Syst., April, 2023

On the Robustness of Aspect-based Sentiment Analysis: Rethinking Model, Data, and Training.
ACM Trans. Inf. Syst., April, 2023

A Unified Multi-task Learning Framework for Multi-goal Conversational Recommender Systems.
ACM Trans. Inf. Syst., 2023

State Graph Reasoning for Multimodal Conversational Recommendation.
IEEE Trans. Multim., 2023

Self-Supervised Learning for Multimedia Recommendation.
IEEE Trans. Multim., 2023

Cross-GCN: Enhancing Graph Convolutional Network with $k$-Order Feature Interactions.
IEEE Trans. Knowl. Data Eng., 2023

Learning Relation Prototype From Unlabeled Texts for Long-Tail Relation Extraction.
IEEE Trans. Knowl. Data Eng., 2023

ACM WSDM 2023 Report.
SIGWEB Newsl., 2023

Reinforced Causal Explainer for Graph Neural Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Towards Goal-oriented Intelligent Tutoring Systems in Online Education.
CoRR, 2023

Structured, Complex and Time-complete Temporal Event Forecasting.
CoRR, 2023

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
CoRR, 2023

Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatially Relation Matching.
CoRR, 2023

Do LLMs Implicitly Exhibit User Discrimination in Recommendation? An Empirical Study.
CoRR, 2023

NExT-Chat: An LMM for Chat, Detection and Segmentation.
CoRR, 2023

On Generative Agents in Recommendation.
CoRR, 2023

A Multi-facet Paradigm to Bridge Large Language Model and Recommendation.
CoRR, 2023

Progressive Text-to-3D Generation for Automatic 3D Prototyping.
CoRR, 2023

Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models.
CoRR, 2023

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.
CoRR, 2023

SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning.
CoRR, 2023

XNLP: An Interactive Demonstration System for Universal Structured NLP.
CoRR, 2023

Revisiting Conversation Discourse for Dialogue Disentanglement.
CoRR, 2023

LLMDet: A Large Language Models Detection Tool.
CoRR, 2023

Robust Instruction Optimization for Large Language Models with Distribution Shifts.
CoRR, 2023

Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration.
CoRR, 2023

Doc2SoarGraph: Discrete Reasoning over Visually-Rich Table-Text Documents with Semantic-Oriented Hierarchical Graphs.
CoRR, 2023

Transfer Visual Prompt Generator across LLMs.
CoRR, 2023

Search-in-the-Chain: Towards the Accurate, Credible and Traceable Content Generation for Complex Knowledge-intensive Tasks.
CoRR, 2023

Generative Recommendation: Towards Next-generation Recommender Paradigm.
CoRR, 2023

Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting.
CoRR, 2023

SoarGraph: Numerical Reasoning over Financial Table-Text Data via Semantic-Oriented Hierarchical Graphs.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Invariant Collaborative Filtering to Popularity Distribution Shift.
Proceedings of the ACM Web Conference 2023, 2023

A Dual Prompt Learning Framework for Few-Shot Dialogue State Tracking.
Proceedings of the ACM Web Conference 2023, 2023

Cooperative Explanations of Graph Neural Networks.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Strategy-aware Bundle Recommender System.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

LightGT: A Light Graph Transformer for Multimedia Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Diffusion Recommender Model.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Learnable Pillar-based Re-ranking for Image-Text Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Dual Semantic Knowledge Composed Multimodal Dialog Systems.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Rethinking Conversational Agents in the Era of LLMs: Proactivity, Non-collaborativity, and Beyond.
Proceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, 2023

Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

VPGTrans: Transfer Visual Prompt Generator across LLMs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

UAVM '23: 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Deconfounded Multimodal Learning for Spatio-temporal Video Grounding.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LGM3A '23: 1st Workshop on Large Generative Models Meet Multimodal Applications.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Equivariant Learning for Out-of-Distribution Cold-start Recommendation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Deep Multimodal Learning for Information Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Redundancy-aware Transformer for Video Question Answering.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Context-aware Event Forecasting via Graph Disentanglement.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Discovering Dynamic Causal Space for DAG Structure Learning.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

FLOOD: A Flexible Invariant Learning Framework for Out-of-Distribution Generalization on Graphs.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

A Survey on Proactive Dialogue Systems: Problems, Methods, and Prospects.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Boosting Causal Discovery via Adaptive Sample Reweighting.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Discovering Spatio-Temporal Rationales for Video Question Answering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LLMDet: A Third Party Large Language Models Generated Text Detection Tool.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Robust Prompt Optimization for Large Language Models Against Distribution Shifts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Popularity-aware Distributionally Robust Optimization for Recommendation System.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Diffusion Variational Autoencoder for Tackling Stochasticity in Multi-Step Regression Stock Price Prediction.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Goal Awareness for Conversational AI: Proactivity, Non-collaborativity, and Beyond.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2023

Generating Visual Spatial Description via Holistic 3D Scene Understanding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Improving Named Entity Recognition via Bridge-based Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Hypothetical Training for Robust Machine Reading Comprehension of Tabular Context.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

DiaASQ: A Benchmark of Conversational Aspect-based Sentiment Quadruple Analysis.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Two Heads Are Better Than One: Improving Fake News Video Detection by Correlating with Neighbors.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Constructing Code-mixed Universal Dependency Forest for Unbiased Cross-lingual Relation Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Reasoning Implicit Sentiment with Chain-of-Thought Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Visually Grounded Commonsense Knowledge Acquisition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Video-Audio Domain Generalization via Confounder Disentanglement.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Introduction to the Special Section on Graph Technologies for User Modeling and Recommendation, Part 2.
ACM Trans. Inf. Syst., 2022

Graph Technologies for User Modeling and Recommendation: Introduction to the Special Issue - Part 1.
ACM Trans. Inf. Syst., 2022

Hierarchical User Intent Graph Network for Multimedia Recommendation.
IEEE Trans. Multim., 2022

Modeling Instant User Intent and Content-Level Transition for Sequential Fashion Recommendation.
IEEE Trans. Multim., 2022

Leveraging Multiple Relations for Fashion Trend Forecasting Based on Social Media.
IEEE Trans. Multim., 2022

Mixed Dish Recognition With Contextual Relation and Domain Alignment.
IEEE Trans. Multim., 2022

A Patience-Aware Recommendation Scheme for Shared Accounts on Mobile Devices.
IEEE Trans. Knowl. Data Eng., 2022

Topic-Guided Conversational Recommender in Multiple Domains.
IEEE Trans. Knowl. Data Eng., 2022

Conditional Hyper-Network for Blind Super-Resolution With Multiple Degradations.
IEEE Trans. Image Process., 2022

Video Moment Retrieval With Cross-Modal Neural Architecture Search.
IEEE Trans. Image Process., 2022

Entity Slot Filling for Visual Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Affective Image Content Analysis: Two Decades Review and New Perspectives.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Meta-Transfer Learning Through Hard Tasks.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding.
CoRR, 2022

Cognitive Accident Prediction in Driving Scenes: A Multimodality Benchmark.
CoRR, 2022

Behavioral Intention Prediction in Driving Scenes: A Survey.
CoRR, 2022

CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation.
CoRR, 2022

Subband-based Generative Adversarial Network for Non-parallel Many-to-many Voice Conversion.
CoRR, 2022

RDU: A Region-based Approach to Form-style Document Understanding.
CoRR, 2022

Differentiable Invariant Causal Discovery.
CoRR, 2022

3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective.
CoRR, 2022

KuaiRec: A Fully-observed Dataset for Recommender Systems.
CoRR, 2022

Deconfounding to Explanation Evaluation in Graph Neural Networks.
CoRR, 2022

Prompt Learning for Few-Shot Dialogue State Tracking.
CoRR, 2022

Training Free Graph Neural Networks for Graph Matching.
CoRR, 2022

Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Causal Representation Learning for Out-of-Distribution Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

SHADEWATCHER: Recommendation-guided Cyber Threat Analysis using System Audit Records.
Proceedings of the 43rd IEEE Symposium on Security and Privacy, 2022

Structured and Natural Responses Co-generation for Conversational Search.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

User-controllable Recommendation Against Filter Bubbles.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Interacting with Non-Cooperative User: A New Paradigm for Proactive Dialogue Policy.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Complex Document Understanding By Discrete Reasoning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Reflecting on Experiences for Response Generation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MCFR'22: 1st Workshop on Multimedia Computing towards Fashion Recommendation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Equivariant and Invariant Grounding for Video Question Answering.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Causal Attention for Interpretable and Generalizable Graph Classification.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Intelligent Request Strategy Design in Recommender System.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

CrossCBR: Cross-view Contrastive Learning for Bundle Recommendation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Let Invariant Rationale Discovery Inspire Graph Contrastive Learning.
Proceedings of the International Conference on Machine Learning, 2022

Discovering Invariant Rationales for Graph Neural Networks.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Video Question Answering: Datasets, Algorithms and Challenges.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ConReader: Exploring Implicit Relations in Contracts for Contract Clause Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Semi-supervised New Slot Discovery with Incremental Clustering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Fine-Grained Scene Graph Generation with Data Transfer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Video Graph Transformer for Video Question Answering.
Proceedings of the Computer Vision - ECCV 2022, 2022

Invariant Grounding for Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

KuaiRec: A Fully-observed Dataset and Insights for Evaluating Recommender Systems.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Learning to Imagine: Integrating Counterfactual Thinking in Neural Discrete Reasoning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Video as Conditional Graph Hierarchy for Multi-Granular Question Answering.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Rethinking the Two-Stage Framework for Grounded Situation Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-start Users.
ACM Trans. Inf. Syst., 2021

Generating Face Images With Attributes for Free.
IEEE Trans. Neural Networks Learn. Syst., 2021

A Hybrid Approach for Detecting Prerequisite Relations in Multi-Modal Food Recipes.
IEEE Trans. Multim., 2021

Learning to Recommend With Multiple Cascading Behaviors.
IEEE Trans. Knowl. Data Eng., 2021

Graph Adversarial Training: Dynamically Regularizing Based on Graph Structure.
IEEE Trans. Knowl. Data Eng., 2021

A Study of Multi-Task and Region-Wise Deep Learning for Food Ingredient Recognition.
IEEE Trans. Image Process., 2021

Dialogue State Tracking with Incremental Reasoning.
Trans. Assoc. Comput. Linguistics, 2021

3-D Relation Network for visual relation recognition in videos.
Neurocomputing, 2021

Deconfounded Training for Graph Neural Networks.
CoRR, 2021

Learning Robust Recommender from Noisy Implicit Feedback.
CoRR, 2021

GRCN: Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback.
CoRR, 2021

Hierarchical User Intent Graph Network for Multimedia Recommendation.
CoRR, 2021

How Knowledge Graph and Attention Help? A Quantitative Analysis into Bag-level Relation Extraction.
CoRR, 2021

A-FMI: Learning Attributions from Deep Networks via Feature Map Importance.
CoRR, 2021

Retrieving and Reading: A Comprehensive Survey on Open-domain Question Answering.
CoRR, 2021

Advances and challenges in conversational recommender systems: A survey.
AI Open, 2021

Learning Intents behind Interactions with Knowledge Graph for Recommendation.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Multi-domain Dialogue State Tracking with Recursive Inference.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Denoising Implicit Feedback for Recommendation.
Proceedings of the WSDM '21, 2021

CauseRec: Counterfactual User Sequence Synthesis for Sequential Recommendation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Deconfounded Video Moment Retrieval with Causal Intervention.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Clicks can be Cheating: Counterfactual Recommendation for Mitigating Clickbait Issue.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

MMConv: An Environment for Multimodal Conversational Search across Multiple Domains.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Hybrid Learning to Rank for Financial Event Ranking.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Towards Multi-Grained Explainability for Graph Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Why Do We Click: Visual Impression-aware News Recommendation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Contrastive Learning for Cold-Start Recommendation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Video Visual Relation Detection via Iterative Inference.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

UrbanMM'21: 1st International Workshop on Multimedia Computing for Urban Data.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

The Next Generation Multimodal Conversational Search and Recommendation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Interventional Video Relation Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

VidVRD 2021: The Third Grand Challenge on Video Relation Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Towards Enriching Responses with Crowd-sourced Knowledge for Task-oriented Dialogue.
Proceedings of the MuCAI'21: Proceedings of the 2nd ACM Multimedia Workshop on Multimodal Conversational AI, 2021

Multi-Perspective Video Captioning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Leveraging Two Types of Global Graph for Sequential Fashion Recommendation.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Deconfounded Recommendation for Alleviating Bias Amplification.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Time horizon-aware modeling of financial texts for stock price prediction.
Proceedings of the ICAIF'21: 2nd ACM International Conference on AI in Finance, Virtual Event, November 3, 2021

Pre-training and evaluation of numeracy-oriented language model.
Proceedings of the ICAIF'21: 2nd ACM International Conference on AI in Finance, Virtual Event, November 3, 2021

Few-Shot 3D Point Cloud Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Empowering Language Understanding with Counterfactual Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Conceptualized and Contextualized Gaussian Embedding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Have We Solved The Hard Problem? It's Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Learning Visual Elements of Images for Discovery of Brand Posts.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Fast Matrix Factorization With Nonuniform Weights on Missing Data.
IEEE Trans. Neural Networks Learn. Syst., 2020

Hierarchical Attention Network for Visually-Aware Food Recommendation.
IEEE Trans. Multim., 2020

Adversarial Training Towards Robust Multimedia Recommender System.
IEEE Trans. Knowl. Data Eng., 2020

Unsupervised Video Action Clustering via Motion-Scene Interaction Constraint.
IEEE Trans. Circuits Syst. Video Technol., 2020

MGAT: Multimodal Graph Attention Network for Recommendation.
Inf. Process. Manag., 2020

HoAFM: A High-order Attentive Factorization Machine for CTR Prediction.
Inf. Process. Manag., 2020

Special issue on deep learning in image and video retrieval.
Int. J. Multim. Inf. Retr., 2020

Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method.
CoRR, 2020

Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment.
CoRR, 2020

"Click" Is Not Equal to "Like": Counterfactual Recommendation for Mitigating Clickbait Issue.
CoRR, 2020

Language Guided Networks for Cross-modal Moment Retrieval.
CoRR, 2020

Cross-GCN: Enhancing Graph Convolutional Network with k-Order Feature Interactions.
CoRR, 2020

Reinforced Negative Sampling over Knowledge Graph for Recommendation.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020

Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Tree-Augmented Cross-Modal Encoding for Complex-Query Video Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Disentangled Graph Collaborative Filtering.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Hierarchical Fashion Graph Network for Personalized Outfit Recommendation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Conversational Recommendation: Formulation, Methods, and Evaluation.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

FinIR 2020: The First Workshop on Information Retrieval in Finance.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Enhancing Text Classification via Discovering Additional Semantic Clues from Logograms.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Neural Sparse Voxel Fields.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Crowd Knowledge Enhanced Multimodal Conversational Assistant in Travel Domain.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Video Relation Detection via Multiple Hypothesis Association.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Multi-modal Cooking Workflow Construction for Food Recipes.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Heterogeneous Fusion of Semantic and Collaborative Information for Visually-Aware Food Recommendation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Enhancing Anomaly Detection in Surveillance Videos with Transfer Learning from Action Recognition.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

How to Learn Item Representation for Cold-Start Multimedia Recommendation?
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Knowledge Enhanced Neural Fashion Trend Forecasting.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Interactive Path Reasoning on Graph for Conversational Recommendation.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

PS2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Exploring and Evaluating Attributes, Values, and Structures for Entity Alignment.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Re-examining the Role of Schema Linking in Text-to-SQL.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Visual Relation Grounding in Videos.
Proceedings of the Computer Vision - ECCV 2020, 2020

SESS: Self-Ensembling Semi-Supervised 3D Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Hyperbolic Visual Embedding Learning for Zero-Shot Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semantic Graphs for Generating Deep Questions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Heuristic Black-Box Adversarial Attacks on Video Recognition Models.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Image Enhanced Event Detection in News Articles.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

PEIA: Personality and Emotion Integrated Attentive Model for Music Recommendation on Social Media Platforms.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Source Domain Adaptation for Visual Sentiment Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Zero-Shot Ingredient Recognition by Multi-Relational Graph Convolutional Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Solving Sequential Text Classification as Board-Game Playing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Introduction to the Special Issue on the Cross-Media Analysis for Visual Question Answering.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Attentive Aspect Modeling for Review-Aware Recommendation.
ACM Trans. Inf. Syst., 2019

Temporal Relational Ranking for Stock Prediction.
ACM Trans. Inf. Syst., 2019

Modeling Embedding Dimension Correlations via Convolutional Neural Collaborative Filtering.
ACM Trans. Inf. Syst., 2019

Discovering Latent Discriminative Patterns for Multi-Mode Event Representation.
IEEE Trans. Multim., 2019

More is Better: Precise and Detailed Image Captioning Using Online Positive Recall and Missing Concepts Mining.
IEEE Trans. Image Process., 2019

PS^2-Net: A Locally and Globally Aware Network for Point-Based Semantic Segmentation.
CoRR, 2019

Deep Conversational Recommender in Travel.
CoRR, 2019

Learning to Self-Train for Semi-Supervised Few-Shot Classification.
CoRR, 2019

Recent Advances in Neural Question Generation.
CoRR, 2019

LCC: Learning to Customize and Combine Neural Networks for Few-Shot Learning.
CoRR, 2019

Neural Multimodal Belief Tracker with Adaptive Attention for Dialogue Systems.
Proceedings of the World Wide Web Conference, 2019

Unifying Knowledge Graph Learning and Recommendation: Towards a Better Understanding of User Preferences.
Proceedings of the World Wide Web Conference, 2019

Interpretable Fashion Matching with Rich Attributes.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Neural Graph Collaborative Filtering.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Learning to Self-Train for Semi-Supervised Few-Shot Classification.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Legal and Ethical Challenges in Multimedia Research.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Relation Understanding in Videos: A Grand Challenge Overview.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning Using Privileged Information for Food Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Who, Where, and What to Wear?: Extracting Fashion Knowledge from Social Media.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Automatic Fashion Knowledge Extraction from Social Media.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning Subjective Attributes of Images from Auxiliary Sources.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Mixed-dish Recognition with Contextual Relation Networks.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

DietLens-Eout: Large Scale Restaurant Food Photo Recognition.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Annotating Objects and Relations in User-Generated Videos.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Keynote: Towards Explainability in AI and Multimedia Research.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

KGAT: Knowledge Graph Attention Network for Recommendation.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Enhancing Stock Movement Prediction with Adversarial Training.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Neural Multi-task Recommendation from Multi-behavior Data.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Semi-supervised Entity Alignment via Joint Knowledge Embedding Model and Cross-graph Model.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Revisit Automatic Error Detection for Wrong and Missing Translation - A Supervised Approach.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Low-Resource Name Tagging Learned with Weakly Labeled Data.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Meta-Transfer Learning for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning and Reasoning on Graph for Recommendation.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Mixed Dish Recognition through Multi-Label Learning.
Proceedings of the 11th Workshop on Multimedia for Cooking and Eating Activities, 2019

Generating Expensive Relationship Features from Cheap Objects.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Multiple Hypothesis Video Relation Detection.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Graph Neural Networks with Generated Parameters for Relation Extraction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multi-Channel Graph Neural Network for Entity Alignment.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

TransNFCM: Translation-Based Neural Fashion Compatibility Modeling.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Explainable Reasoning over Knowledge Graphs for Recommendation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

A Whole New Ball Game: Harvesting Game Data for Player Profiling.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Attributed Social Network Embedding.
IEEE Trans. Knowl. Data Eng., 2018

NAIS: Neural Attentive Item Similarity Model for Recommendation.
IEEE Trans. Knowl. Data Eng., 2018

Real-Time Multimedia Social Event Detection in Microblog.
IEEE Trans. Cybern., 2018

Predicting Personalized Image Emotion Perceptions in Social Networks.
IEEE Trans. Affect. Comput., 2018

Learning a Disentangled Embedding for Monocular 3D Shape Retrieval and Pose Estimation.
CoRR, 2018

Fast Matrix Factorization with Non-Uniform Weights on Missing Data.
CoRR, 2018

Improving Stock Movement Prediction with Adversarial Training.
CoRR, 2018

Visually-aware Collaborative Food Recommendation.
CoRR, 2018

Learning Recommender Systems from Multi-Behavior Data.
CoRR, 2018

TEM: Tree-enhanced Embedding Model for Explainable Recommendation.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Learning on Partial-Order Hypergraphs.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

fBGD: Learning Embeddings From Positive Unlabeled Data with BGD.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

Attentive Moment Retrieval in Videos.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Adversarial Personalized Ranking for Recommendation.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Rethinking Summarization and Storytelling for Modern Social Multimedia.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Food Photo Recognition for Dietary Tracking: System and Experiment.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Venue Prediction for Social Images by Exploiting Rich Temporal Patterns in LBSNs.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Cross-modal Moment Localization in Videos.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Knowledge-aware Multimodal Fashion Chatbot.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Knowledge-aware Multimodal Dialogue Systems.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Interpretable Multimodal Retrieval for Fashion Products.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Beyond the Product: Discovering Image Posts for Brands in Social Media.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Recommendation Technologies for Multimedia Content.
Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, 2018

Affective Image Content Analysis: A Comprehensive Survey.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Quality Matters: Assessing cQA Pair Quality via Transductive Multi-View Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Cross-Domain Depression Detection via Harvesting Social Media.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Improving Implicit Recommender Systems with View Data.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Outer Product-based Neural Collaborative Filtering.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Temporally Grounding Natural Sentence in Video.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Compact Indexing and Judicious Searching for Billion-Scale Microblog Retrieval.
ACM Trans. Inf. Syst., 2017

Unifying Virtual and Physical Worlds: Learning Toward Local and Global Consistency.
ACM Trans. Inf. Syst., 2017

Tweet Can Be Fit: Integrating Data from Wearable Sensors and Multiple Social Networks for Wellness Profile Learning.
ACM Trans. Inf. Syst., 2017

Cross-Platform App Recommendation by Jointly Modeling Ratings and Texts.
ACM Trans. Inf. Syst., 2017

VideoWhisper: Toward Discriminative Unsupervised Video Feature Learning With Attention-Based Recurrent Neural Networks.
IEEE Trans. Multim., 2017

Matryoshka Peek: Toward Learning Fine-Grained, Robust, Discriminative Features for Product Search.
IEEE Trans. Multim., 2017

I Know What You Want to Express: Sentence Element Inference by Incorporating External Knowledge Base.
IEEE Trans. Knowl. Data Eng., 2017

Detecting Stress Based on Social Interactions in Social Networks.
IEEE Trans. Knowl. Data Eng., 2017

Wellness Representation of Users in Social Media: Towards Joint Modelling of Heterogeneity and Temporality.
IEEE Trans. Knowl. Data Eng., 2017

Learning User Attributes via Mobile Social Multimedia Analytics.
ACM Trans. Intell. Syst. Technol., 2017

Cost-Optimized Microblog Distribution over Geo-Distributed Data Centers: Insights from Cross-Media Analysis.
ACM Trans. Intell. Syst. Technol., 2017

Version-sensitive mobile App recommendation.
Inf. Sci., 2017

Startup Takes On Growing Demand for Visual Search Technology.
IEEE Multim., 2017

User-Generated Content in Social Media (Dagstuhl Seminar 17301).
Dagstuhl Reports, 2017

Document Visualization using Topic Clouds.
CoRR, 2017

Neural Collaborative Filtering.
Proceedings of the 26th International Conference on World Wide Web, 2017

Leveraging Behavioral Factorization and Prior Knowledge for Community Discovery and Profiling.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

Item Silk Road: Recommending Items from Information Domains to Social Users.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Computational Social Indicators: A Case Study of Chinese University Ranking.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Cross-Domain Recommendation via Clustering on Multi-Layer Graphs.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Exploring User-Specific Information in Music Retrieval.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Attentive Collaborative Filtering: Multimedia Recommendation with Item- and Component-Level Attention.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Embedding Factorization Models for Jointly Recommending Items and User Generated Lists.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Neural Factorization Machines for Sparse Predictive Analytics.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Video Visual Relation Detection.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Laplacian-Steered Neural Style Transfer.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

How Personality Affects our Likes: Towards a Better Understanding of Actionable Images.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Cross-modal Recipe Retrieval with Rich Food Attributes.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

PIC2DISH: A Customized Cooking Assistant System.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Representativeness-aware Aspect Analysis for Brand Monitoring in Social Media.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Gender and emotion recognition with implicit user signals.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

VIDEOWHISPER: Towards unsupervised learning of discriminative features of videos with RNN.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Object trajectory proposal.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Visual Translation Embedding Network for Visual Relation Detection.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Discovering gender differences in facial emotion recognition via implicit behavioral cues.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction, 2017

TweetFit: Fusing Multiple Social Media and Sensor Data for Wellness Profile Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Towards User Personality Profiling from Multiple Social Networks.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Learning from Multiple Social Networks.
Synthesis Lectures on Information Concepts, Retrieval, and Services, Morgan & Claypool Publishers, ISBN: 978-3-031-02300-2, 2016

Learning from Collective Intelligence: Feature Learning Using Social Images and Tags.
ACM Trans. Multim. Comput. Commun. Appl., 2016

Volunteerism Tendency Prediction via Harvesting Multiple Social Networks.
ACM Trans. Inf. Syst., 2016

Filtering of Brand-Related Microblogs Using Social-Smooth Multiview Embedding.
IEEE Trans. Multim., 2016

Capturing the Semantics of Key Phrases Using Multiple Languages for Question Retrieval.
IEEE Trans. Knowl. Data Eng., 2016

Generating Incremental Length Summary Based on Hierarchical Topic Coverage Maximization.
ACM Trans. Intell. Syst. Technol., 2016

Deep Fusion of Multiple Semantic Cues for Complex Event Recognition.
IEEE Trans. Image Process., 2016

Joint Content Replication and Request Routing for Social Video Distribution Over Cloud CDN: A Community Clustering Method.
IEEE Trans. Circuits Syst. Video Technol., 2016

"360° user profiling: past, future, and applications" by Aleksandr Farseev, Mohammad Akbari, Ivan Samborskii and Tat-Seng Chua with Martin Vesely as coordinator.
SIGWEB Newsl., 2016

Big data meets multimedia analytics.
Signal Process., 2016

Accurate online video tagging via probabilistic hybrid modeling.
Multim. Syst., 2016

Resolving local cuisines for tourists with multi-source social media contents.
Multim. Syst., 2016

Learning content-social influential features for influence analysis.
Int. J. Multim. Inf. Retr., 2016

Social-Sensed Multimedia Computing.
IEEE Multim., 2016

Towards organizing health knowledge on community-based health services.
EURASIP J. Bioinform. Syst. Biol., 2016

Image-embodied Knowledge Representation Learning.
CoRR, 2016

Generative Topic Embedding: a Continuous Representation of Documents (Extended Version with Proofs).
CoRR, 2016

SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning.
CoRR, 2016

Computational Intelligence for Big Social Data Analysis [Guest Editorial].
IEEE Comput. Intell. Mag., 2016

Discrete Collaborative Filtering.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Fast Matrix Factorization for Online Recommendation with Implicit Feedback.
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Smart Ambient Sound Analysis via Structured Statistical Modeling.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Deep Learning Generic Features for Cross-Media Retrieval.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Mental Visual Browsing.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Predicting Personalized Emotion Perceptions of Social Images.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Play and Rewind: Optimizing Binary Representations of Videos by Self-Supervised Temporal Hashing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Shorter-is-Better: Venue Category Estimation from Micro-Video.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Mental Visual Indexing: Towards Fast Video Browsing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

An Intention-Aware Interactive System for Mobile Video Browsing.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

bBridge: A Big Data Platform for Social Multimedia Analytics.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Beyond Language and Vision, Towards Truly Multimedia Integration.
Proceedings of the 2016 ACM workshop on Vision and Language Integration Meets Multimedia Fusion, 2016

From Personal Wellness to Healthcare Support Systems: A Big Data Driven Approach.
Proceedings of the 2016 ACM Workshop on Multimedia for Personal Health and Health Care, 2016

Micro Tells Macro: Predicting the Popularity of Micro-Videos via a Transductive Model.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

What Does Social Media Say about Your Stress?
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

DrMAD: Distilling Reverse-Mode Automatic Differentiation for Optimizing Hyperparameters of Deep Neural Networks.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Saliency meets spatial quantization: A practical framework for large scale product search.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

Online Collaborative Learning for Open-Vocabulary Visual Classifiers.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Scrutinizing Mobile App Recommendation: Identifying Important App-Related Indicators.
Proceedings of the Information Retrieval Technology, 2016

Generative Topic Embedding: a Continuous Representation of Documents.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Discrete Image Hashing Using Large Weakly Annotated Photo Collections.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

From Tweets to Wellness: Wellness Event Detection from Twitter Streams.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Enhancing Video Event Recognition Using Automatically Constructed Semantic-Visual Knowledge Base.
IEEE Trans. Multim., 2015

Corrections to "Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss".
IEEE Trans. Multim., 2015

Semantic-Based Location Recommendation With Multimodal Venue Semantics.
IEEE Trans. Multim., 2015

Multimedia Summarization for Social Events in Microblog Stream.
IEEE Trans. Multim., 2015

Bridging the Vocabulary Gap between Health Seekers and Healthcare Knowledge.
IEEE Trans. Knowl. Data Eng., 2015

Disease Inference from Health-Related Questions via Sparse Deep Learning.
IEEE Trans. Knowl. Data Eng., 2015

Robust Multiview Feature Learning for RGB-D Image Understanding.
ACM Trans. Intell. Syst. Technol., 2015

Toward an SDN-enabled big data platform for social TV analytics.
IEEE Netw., 2015

Resolving polysemy and pseudonymity in entity linking with comprehensive name and context modeling.
Inf. Sci., 2015

aMM: Towards adaptive ranking of multi-modal documents.
Int. J. Multim. Inf. Retr., 2015

Cross-Social Network Collaborative Recommendation.
Proceedings of the ACM Web Science Conference, 2015

Multiple Social Network Learning and Its Application in Volunteerism Tendency Prediction.
Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2015

Multimedia Social Event Detection in Microblog.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Learning Features from Large-Scale, Noisy and Social Image-Tag Collection.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Beyond Doctors: Future Health Prediction from Multimedia and Multimodal Observations.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Online Multimodal Co-indexing and Retrieval of Weakly Labeled Web Image Collections.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Harvesting Multiple Sources for User Profile Learning: a Big Data Study.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Discriminative Feature Selection for Multiple Ocular Diseases Classification by Sparse Induced Graph Regularized Group Lasso.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, 2015

Catch the Black Sheep: Unified Framework for Shilling Attack Detection Based on Fraudulent Action Propagation.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Interest Inference via Structure-Constrained Multi-Source Multi-Task Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Learning Image and User Features for Recommendation in Social Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Task-based recommendation on a web-scale.
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015

Exploring Key Concept Paraphrasing Based on Pivot Language Translation for Question Retrieval.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Topical Word Embeddings.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Attribute-Augmented Semantic Hierarchy: Towards a Unified Framework for Content-Based Image Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Memory recall based video search: Finding videos you have seen before based on your memory.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Learning to Recommend Descriptive Tags for Questions in Social Forums.
ACM Trans. Inf. Syst., 2014

Social-Sensed Image Search.
ACM Trans. Inf. Syst., 2014

Exploiting Web Images for Semantic Video Indexing Via Robust Sample-Specific Loss.
IEEE Trans. Multim., 2014

Product Aspect Ranking and Its Applications.
IEEE Trans. Knowl. Data Eng., 2014

Personalized Recommendations of Locally Interesting Venues to Tourists via Cross-Region Community Matching.
ACM Trans. Intell. Syst. Technol., 2014

Robust (Semi) Nonnegative Graph Embedding.
IEEE Trans. Image Process., 2014

Discovering high quality answers in community question answering archives using a hierarchy of classifiers.
Inf. Sci., 2014

Toward Multiscreen Social TV with Geolocation-Aware Social Sense.
IEEE Multim., 2014

Toward Scalable Systems for Big Data Analytics: A Technology Tutorial.
IEEE Access, 2014

The design of a live social observatory system.
Proceedings of the 23rd International World Wide Web Conference, 2014

WenZher: comprehensive vertical search for healthcare domain.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

A Joint Local-Global Approach for Medical Terminology Assignment.
Proceedings of the Medical Information Retrieval Workshop at SIGIR co-located with the 37th annual international ACM SIGIR conference (ACM SIGIR 2014), 2014

New and improved: modeling versions to improve app recommendation.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Predicting trending messages and diffusion participants in microblogging network.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Toward a biometric-aware cloud service engine for multi-screen video applications.
Proceedings of the ACM SIGCOMM 2014 Conference, 2014

Social TV analytics: a novel paradigm to transform TV watching experience.
Proceedings of the Multimedia Systems Conference 2014, 2014

Searching for Recent Celebrity Images in Microblog Platform.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Exploring Principles-of-Art Features For Image Emotion Recognition.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

One of a Kind: User Profiling by Social Curation.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

The Multimedia Challenges in Social Media Analytics.
Proceedings of the 3rd International Workshop on Socially-Aware Multimedia, 2014

Image Tagging with Social Assistance.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Brand Data Gathering From Live Social Media Streams.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Community based effective social video contents placement in cloud centric CDN network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Customized Organization of Social Media Contents using Focused Topic Hierarchy.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

A Dynamic Reconstruction Approach to Topic Summarization of User-Generated-Content.
Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, 2014

User Intent Identification from Online Discussions Using a Joint Aspect-Action Topic Model.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

Community Understanding in Location-based Social Networks.
Human-Centered Social Media Analytics, 2014

2013
Hierarchical Organization of Collaboratively Constructed Content.
The People's Web Meets NLP, Collaboratively Constructed Language Resources, 2013

GPSView: A scenic driving route planner.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Detecting profilable and overlapping communities with user-generated multimedia contents in LBSNs.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Towards optimizing human labeling for interactive image tagging.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Robust image annotation via simultaneous feature and sample outlier pursuit.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Large-scale multilabel propagation based on efficient sparse graph construction.
ACM Trans. Multim. Comput. Commun. Appl., 2013

When Amazon Meets Google: Product Visualization by Exploring Multiple Web Sources.
ACM Trans. Internet Techn., 2013

Beyond Text QA: Multimedia Answer Generation by Harvesting Web Information.
IEEE Trans. Multim., 2013

Detecting Group Activities With Multi-Camera Context.
IEEE Trans. Circuits Syst. Video Technol., 2013

Label-specific training set construction from web resource for image annotation.
Signal Process., 2013

Multimedia encyclopedia construction by mining web knowledge.
Signal Process., 2013

Video recommendation over multiple information sources.
Multim. Syst., 2013

Topic hierarchy construction for the organization of multi-source user generated contents.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Addressing cold-start in app recommendation: latent user models constructed from twitter followers.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Emerging topic detection for organizations from microblogs.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Venue Semantics: Multimedia Topic Modeling of Social Media Contents.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Robust Semantic Video Indexing by Harvesting Web Images.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

NExT-Live: A Live Observatory on Social Media.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Hyperspectral Image Classification by Using Pixel Spatial Correlation.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Attribute-augmented semantic hierarchy: towards bridging semantic gap and intention gap in image retrieval.
Proceedings of the ACM Multimedia Conference, 2013

Summary abstract for the 1st ACM international workshop on personal data meets distributed multimedia.
Proceedings of the ACM Multimedia Conference, 2013

Multimedia summarization for trending topics in microblogs.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

From Interest to Function: Location Estimation in Social Media.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

A Pattern Matching Based Model for Implicit Opinion Question Identification.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Label-to-region with continuity-biased bi-layer sparsity priors.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Image label completion by pursuing contextual decomposability.
ACM Trans. Multim. Comput. Commun. Appl., 2012

In-video product annotation with web information mining.
ACM Trans. Multim. Comput. Commun. Appl., 2012

Oracle in Image Search: A Content-Based Approach to Performance Prediction.
ACM Trans. Inf. Syst., 2012

Interactive Video Indexing With Statistical Active Learning.
IEEE Trans. Multim., 2012

Movie2Comics: Towards a Lively Video Content Presentation.
IEEE Trans. Multim., 2012

Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification.
IEEE Trans. Multim., 2012

Sparse Ensemble Learning for Concept Detection.
IEEE Trans. Multim., 2012

Mining Travel Patterns from Geotagged Photos.
ACM Trans. Intell. Syst. Technol., 2012

Semantic-Gap-Oriented Active Learning for Multilabel Image Annotation.
IEEE Trans. Image Process., 2012

Camera Constraint-Free View-Based 3-D Object Retrieval.
IEEE Trans. Image Process., 2012

Social media mining and search.
Multim. Tools Appl., 2012

Exploring probabilistic localized video representation for human action recognition.
Multim. Tools Appl., 2012

Semantic multimedia.
Multim. Tools Appl., 2012

Guest editorial: Special issue on information retrieval for social media.
Inf. Retr., 2012

Multimedia semantics-aware query-adaptive hashing with bits reconfigurability.
Int. J. Multim. Inf. Retr., 2012

Large-Scale Multimedia Data Collections.
IEEE Multim., 2012

Multimedia Question Answering.
IEEE Multim., 2012

NExT: NUS-Tsinghua Center for Extreme Search of User-Generated Content.
IEEE Multim., 2012

Assistive tagging: A survey of multimedia tagging with human-computer joint exploration.
ACM Comput. Surv., 2012

Mining slang and urban opinion words and phrases from cQA services: an optimization approach.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

Transforming mobile personal life log into autobiographical multimedia eChronicles.
Proceedings of the 10th International Conference on Advances in Mobile Computing & Multimedia, 2012

On Video Recommendation over Social Network.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Video Browser Showdown by NUS.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012

Attribute feedback.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Attribute feedback.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Harvesting visual concepts for image search with complex queries.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Visual query attributes suggestion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Answering Opinion Questions on Products by Exploiting Hierarchical Organization of Consumer Reviews.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

SSHLDA: A Semi-Supervised Hierarchical Topic Model.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Robust Non-negative Graph Embedding: Towards noisy data, unreliable graphs, and noisy labels.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

The Use of Dependency Relation Graph to Enhance the Term Weighting in Question Retrieval.
Proceedings of COLING 2012, 2012

A Semi-Supervised Bayesian Network Model for Microblog Topic Classification.
Proceedings of COLING 2012, 2012

Automatic labeling hierarchical topics.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Mining sentiment terminology through time.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Community Answer Summarization for Multi-Sentence Question with Group L1 Regularization.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, July 8-14, 2012, Jeju Island, Korea

Text Mining in Multimedia.
Mining Text Data, 2012

2011
Video accessibility enhancement for hearing-impaired users.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Beyond search: Event-driven summarization for web videos.
ACM Trans. Multim. Comput. Commun. Appl., 2011

Utilizing Related Samples to Enhance Interactive Concept-Based Video Search.
IEEE Trans. Multim., 2011

Image annotation by kNN-sparse graph-based label propagation over noisily tagged web images.
ACM Trans. Intell. Syst. Technol., 2011

Localized Multiple Kernel Learning for Realistic Human Action Recognition in Videos.
IEEE Trans. Circuits Syst. Video Technol., 2011

Research and applications on georeferenced multimedia: a survey.
Multim. Tools Appl., 2011

Hot research topics - guest editorial.
Multim. Tools Appl., 2011

Survey papers in multimedia - guest editorial.
Multim. Tools Appl., 2011

Interactive multimedia computing.
Multim. Syst., 2011

VisionGo: Towards video retrieval with joint exploration of human and computer.
Inf. Sci., 2011

Label-Specific Training Set Construction from Web Resource for Image Annotation.
CoRR, 2011

Hierarchical organization of unstructured consumer reviews.
Proceedings of the 20th International Conference on World Wide Web, 2011

Multimedia answering: enriching text QA with media information.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Product comparison using comparative relations.
Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Mining Travel Patterns from GPS-Tagged Photos.
Proceedings of the Advances in Multimedia Modeling, 2011

Generating Representative Views of Landmarks via Scenic Theme Detection.
Proceedings of the Advances in Multimedia Modeling, 2011

Integrating rich information for video recommendation with multi-task rank aggregation.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Learning concept bundles for video search with complex queries.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Towards multi-semantic image annotation with graph regularized exclusive group lasso.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Locally regressive G-optimal design for image retrieval.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Learning reconfigurable hashing for diverse semantics.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

ShotTagger: tag location for internet videos.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Affective Video Summarization and Story Board Generation Using Pupillary Dilation and Eye Gaze.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

Multi-label visual classification with label exclusive context.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Domain-Assisted Product Aspect Hierarchy Generation: Towards Hierarchical Organization of Unstructured Consumer Reviews.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Visual query suggestion: Towards capturing user intent in internet image search.
ACM Trans. Multim. Comput. Commun. Appl., 2010

Combining relations for information extraction from free text.
ACM Trans. Inf. Syst., 2010

Image Annotation by Graph-Based Inference With Integrated Multiple/Single Instance Representations.
IEEE Trans. Multim., 2010

Efficient Mining of Multiple Partial Near-Duplicate Alignments by Temporal Network.
IEEE Trans. Circuits Syst. Video Technol., 2010

Multimedia Question Answering.
Scholarpedia, 2010

Question Answering over Community-Contributed Web Videos.
IEEE Multim., 2010

TRECVID 2010 Known-item Search by NUS.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Segmentation of multi-sentence questions: towards effective question retrieval in cQA services.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Prototype hierarchy based clustering for the categorization and navigation of web collections.
Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Vocabulary Filtering for Term Weighting in Archived Question Search.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

VDictionary: Automatically Generate Visual Dictionary via Wikimedias.
Proceedings of the Advances in Multimedia Modeling, 2010

Estimating Poses of World's Photos with Geographic Metadata.
Proceedings of the Advances in Multimedia Modeling, 2010

Learning Cooking Techniques from YouTube.
Proceedings of the Advances in Multimedia Modeling, 2010

Mediapedia: Mining Web Knowledge to Construct Multimedia Encyclopedia.
Proceedings of the Advances in Multimedia Modeling, 2010

Video Reference: A Video Question Answering Engine.
Proceedings of the Advances in Multimedia Modeling, 2010

One person labels one million images.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Making computers look the way we look: exploiting visual attention for image understanding.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

ACM international workshop on very-large-scale multimedia corpus, mining and retrieval (VLS-MCMR'10).
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Movie2Comics: a feast of multimedia artwork.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Dynamic captioning: video accessibility enhancement for hearing impairment.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

iComics: automatic conversion of movie into comics.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

W2Go: a travel guidance system by automatic landmark ranking.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Efficient large-scale image annotation by probabilistic collaborative multi-label propagation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A distribution based video representation for human action recognition.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

An Eye Fixation Database for Saliency Detection in Images.
Proceedings of the Computer Vision, 2010

Semantic context modeling with maximal margin Conditional Random Fields for automatic image annotation.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Automatic Generation of Semantic Fields for Annotating Web Images.
Proceedings of COLING 2010, 2010

Exploiting Salient Patterns for Question Detection and Question Retrieval in Community-based Question Answering.
Proceedings of the COLING 2010, 2010

Utilizing related samples to learn complex queries in interactive concept-based video search.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Exploring large scale data for multimedia QA: an initial study.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Exploring domain-specific term weight in archived question search.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

2009
Toward a higher-level visual representation for object-based image retrieval.
Vis. Comput., 2009

A syntactic tree matching approach to finding similar questions in community-based QA services.
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2009

Adaptive Model for Integrating Different Types of Associated Texts for Automated Annotation of Web Images.
Proceedings of the Advances in Multimedia Modeling, 2009

Multimedia Evidence Fusion for Video Concept Detection via OWA Operator.
Proceedings of the Advances in Multimedia Modeling, 2009

Tour the world: a technical demonstration of a web-scale landmark recognition engine.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Inferring semantic concepts from community-contributed images and noisy tags.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Pornprobe: an LDA-SVM based pornography detection system.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Scalable detection of partial near-duplicate videos by visual-temporal consistency.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Automated localization of affective objects and actions in images via caption text-cum-eye gaze analysis.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

ViewFocus: explore places of interests on Google maps using photos with view direction filtering.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Label to region by bi-layer sparsity priors.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Video reference: question answering on YouTube.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Event driven summarization for web videos.
Proceedings of the first SIGMM workshop on Social media, 2009

From text question-answering to multimedia QA on web-scale media resources.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

An efficient sparse metric learning in high-dimensional space via l1-penalized log-determinant regularization.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Domain adaptation from multiple sources via auxiliary classifiers.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Tour the world: Building a web-scale landmark recognition engine.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

A revisit of Generative Model for Automatic Image Annotation using Markov Random Fields.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Hierarchical spatio-temporal context modeling for action recognition.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

VisionGo: towards true interactivity.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

NUS-WIDE: a real-world web image database from National University of Singapore.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

Exploiting internal and external semantics for the clustering of short texts using world knowledge.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

Query Segmentation Based on Eigenspace Similarity.
Proceedings of the ACL 2009, 2009

Summarizing Definition from Wikipedia.
Proceedings of the ACL 2009, 2009

2008
Editorial.
Vis. Comput., 2008

Content-based video retrieval: Three example systems from TRECVid.
Int. J. Imaging Syst. Technol., 2008

Object-Based Image Retrieval Beyond Visual Appearances.
Proceedings of the Advances in Multimedia Modeling, 2008

Exploring knowledge of sub-domain in a multi-resolution bootstrapping framework for concept detection in news video.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Integrated graph-based semi-supervised multiple/single instance learning framework for image annotation.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Word2Image: towards visual interpreting of words.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Modeling Context in Scenario Template Creation.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Pre-attentive discrimination of interestingness in images.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Markovian mixture face recognition with discriminative face alignment.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Visual Synset: Towards a higher-level visual representation.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Probabilistic optimized ranking for multimedia semantic concept detection via RVM.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Automatic image annotation via local multi-label classification.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

VisionGo: bridging users and multimedia video retrieval.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Adaptive multiple feedback strategies for interactive video search.
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

2007
Soft pattern matching models for definitional question answering.
ACM Trans. Inf. Syst., 2007

Document concept lattice for text understanding and summarization.
Inf. Process. Manag., 2007

Automatically Integrating Heterogeneous Ontologies from Structured Web Pages.
Int. J. Semantic Web Inf. Syst., 2007

TRECVID 2007 Search Tasks by NUS-ICT.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Interesting nuggets and their impact on definitional question answering.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Auto-Annotation of Paintings Using Social Annotations, Domain Ontology and Transductive Inference.
Proceedings of the Advances in Multimedia Information Processing, 2007

Ontology-Based Annotation of Paintings Using Transductive Inference Framework.
Proceedings of the Advances in Multimedia Modeling, 2007

Fusion of Region and Image-Based Techniques for Automatic Image Annotation.
Proceedings of the Advances in Multimedia Modeling, 2007

Enhancing image annotation by integrating concept ontology and text-based Bayesian learning model.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

The use of topic evolution to help users browse and find answers in news video corpus.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Segregated feedback with performance-based adaptive sampling for interactive news video retrieval.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Annotation of paintings with high-level semantic concepts using transductive inference and ontology-based concept disambiguation.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Towards the next plateau: innovative multimedia research beyond TRECVID.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Use of Generalized Pattern Model for Video Annotation.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

News Video Retrieval using Implicit Event Semantics.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Interactive Spatio-Temporal Visual Map Model for Web Video Retrieval.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

The use of temporal, semantic and visual partitioning model for efficient near-duplicate keyframe detection in large scale news corpus.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

Active learning approach to interactive spatio-temporal news video retrieval.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

A Multi-resolution Framework for Information Extraction from Free Text.
Proceedings of the ACL 2007, 2007

2006
Guest editors' introduction: multimedia modelling.
Vis. Comput., 2006

Fusion of AV features and external information sources for event detection in team sports video.
ACM Trans. Multim. Comput. Commun. Appl., 2006

A maximal figure-of-merit (MFoM)-learning approach to robust classifier design for text categorization.
ACM Trans. Inf. Syst., 2006

Learning Object Models from Semistructured Web Documents.
IEEE Trans. Knowl. Data Eng., 2006

TRECVID 2006 by NUS-I2R.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Mining dependency relations for query expansion in passage retrieval.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Multi-faceted contextual model for person identification in news video.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Semi-supervised annotation of brushwork in paintings domain using serial combinations of multiple experts.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Transductive inference using multiple experts for brushwork annotation in paintings domain.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

News video search with fuzzy event clustering using high-level features.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Automatic Indexing and Retrieval of Large Broadcast News Video Collections - The TRECVID Experience.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Story Boundary Detection in News Video using Global Rule Induction Technique.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Morphable Face Reconstruction with Multiple Images.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Face Alignment with Unified Subspace Optimization of Active Statistical Models.
Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

Paraphrase Recognition via Dissimilarity Significance Classification.
Proceedings of the EMNLP 2006, 2006

Automatic Person Annotation of Family Photo Album.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

Bayesian Learning of Hierarchical Multinomial Mixture Models of Concepts for Automatic Image Annotation.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

Video Retrieval Using High Level Features: Exploiting Query Matching and Confidence-Based Weighting.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

ARE: Instance Splitting Strategies for Dependency Relation-Based Information Extraction.
Proceedings of the ACL 2006, 2006

2005
Clustering web pages about persons and organizations.
Web Intell. Agent Syst., 2005

TRECVID 2005 by NUS PRIS.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Using Syntactic and Semantic Relation Analysis in Question Answering.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Dependency relation matching for answer selection.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Question answering passage retrieval using dependency relations.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Generic soft pattern models for definitional question answering.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

A Novel Approach to Auto Image Annotation Based on Pairwise Constrained Clustering and Semi-Naïve Bayesian Model.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Retrieval of News Video Using Video Sequence Matching.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Fusion of Multiple Asynchronous Information Sources for Event Detection in Soccer Video.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Analysis and Retrieval of Paintings Using Artistic Color Concepts.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Combining text and audio-visual features in video indexing.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Evaluating Keyword Selection Methods for WEBSOM Text Archives.
IEEE Trans. Knowl. Data Eng., 2004

Global Rule Induction for Information Extraction.
Int. J. Artif. Intell. Tools, 2004

FADA: find all distinct answers.
Proceedings of the 13th international conference on World Wide Web, 2004

Unsupervised learning of soft patterns for generating definitions from online news.
Proceedings of the 13th international conference on World Wide Web, 2004

Detecting and Partitioning Data Objects in Complex Web Pages.
Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2004), 2004

TRECVID 2004 Search and Feature Extraction Task by NUS PRIS.
Proceedings of the 2004 TREC Video Retrieval Evaluation, 2004

National University of Singapore at the TREC 13 Question Answering Main Task.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

Effectiveness of web page classification on finding list answers.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

A Learning-based Approach for Annotating Large On-Line Image Collection.
Proceedings of the 10th International Multimedia Modeling Conference (MMM 2004), 2004

A semi-naïve Bayesian method incorporating clustering with pair-wise constraints for auto image annotation.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

A bootstrapping framework for annotating and retrieving WWW images.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Story boundary detection in large broadcast news video archives: techniques, experience and trends.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

Between context-aware media capture and multimedia content analysis: where do we find the promised land?
Proceedings of the 12th ACM International Conference on Multimedia, 2004

The fusion of audio-visual features and external knowledge for event detection in team sports video.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

A Public Reference Implementation of the RAP Anaphora Resolution Algorithm.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

A MFoM learning approach to robust multiclass multi-label text categorization.
Proceedings of the Machine Learning, 2004

Representation and retrieval of paintings based on art history concepts.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

A hierarchical approach to story segmentation of large broadcast news video corpus.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Web-based List Question Answering.
Proceedings of the COLING 2004, 2004

Cascading Use of Soft and Hard Matching Pattern Rules for Weakly Supervised Information Extraction.
Proceedings of the COLING 2004, 2004

An Adaptive Image Content Representation and Segmentation Approach to Automatic Image Annotation.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

2003
A Multi-Modal Approach to Story Segmentation for News Video.
World Wide Web, 2003

A cinematic-based framework for scene boundary detection in video.
Vis. Comput., 2003

Modeling Web Knowledge for Answering Event-based Questions.
Proceedings of the Twelfth International World Wide Web Conference - Posters, 2003

Querying and Clustering Web Pages about Persons and Organizations.
Proceedings of the 2003 IEEE / WIC International Conference on Web Intelligence, 2003

A Two-Level Multi-Modal Approach for Story Segmentation of Large News Video Corpus.
Proceedings of the 2003 TREC Video Retrieval Evaluation, 2003

QUALIFIER In TREC-12 QA Main Task.
Proceedings of The Twelfth Text REtrieval Conference, 2003

Structured use of external knowledge for event-based open domain question answering.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

A maximal figure-of-merit learning approach to text categorization.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

ATMRA: An Automatic Temporal Multi-resolution Analysis Framework for Shot Boundary Detection.
Proceedings of the 9th International Conference on Multi-Media Modeling, 2003

VideoQA: question answering on news video.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

A mid-level representation framework for semantic sports video analysis.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

A bootstrapping approach to annotating large image collection.
Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003

A Global Rule Induction Approach to Information Extraction.
Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2003), 2003

Face tracking in video with hybrid of Lucas-Kanade and condensation algorithm.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Fractional scaling of image and video in DCT domain.
Proceedings of the 2003 International Conference on Image Processing, 2003

An unified framework for shot boundary detection via active learning.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Hierarchical Indexing and Flexible Element Retrieval for Structured Document.
Proceedings of the Advances in Information Retrieval, 2003

QUALIFIER: Question Answering by Lexical Fabric and External Resources.
Proceedings of the EACL 2003, 2003

Automatic Tracking of Face Sequences in MPEG Video.
Proceedings of the 2003 Computer Graphics International (CGI 2003), 2003

A Framework to Customize a Face Model for Reusing Animation.
Proceedings of the 2003 Computer Graphics International (CGI 2003), 2003

Extracting Key Semantic Terms from Chinese Speech Query for Web Searches.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

2002
Detection of human faces in a compressed domain for video stratification.
Vis. Comput., 2002

Guest Editorial: Modeling Multimedia Information and Systems.
Multim. Tools Appl., 2002

Stratification Approach to Modeling Video.
Multim. Tools Appl., 2002

Comparing Keyword Extraction Techniques for WEBSOM Text Archives.
Int. J. Artif. Intell. Tools, 2002

Complementary Content.
IEEE Multim., 2002

The Segmentation and Classification of Story Boundaries in News Video.
Proceedings of the Visual and Multimedia Information Management, 2002

The Integration of Lexical Knowledge and External Resources for Question Answering.
Proceedings of The Eleventh Text REtrieval Conference, 2002

Temporal Multi-Resolution Framework for Shot Boundary Detection and Keyframe Extraction.
Proceedings of The Eleventh Text REtrieval Conference, 2002

A framework for video scene boundary detection.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Weighted graph based decision tree optimization for high accuracy acoustic modeling.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

The segmentation of news video into story units.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Grouping Web Pages about Persons and Organizations for Information Extraction.
Proceedings of the Digital Libraries: People, 2002

Retrieving News Stories from a News Integration Archive.
Proceedings of the Digital Libraries: People, 2002

An Agent-based Approach to Chinese Named Entity Recognition.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

Extracting Pronunciation-translated Names from Chinese Texts using Bootstrapping Approach.
Proceedings of the First Workshop on Chinese Language Processing, 2002

Learning Pattern Rules for Chinese Named Entity Extraction.
Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28, 2002

2001
A Replication Strategy for Reducing Wait Time in Video-On-Demand Systems.
Multim. Tools Appl., 2001

Automatic proxy-based watermarking for WWW.
Comput. Commun., 2001

A Match and Tiling Approach to Content-based Video Retrieval.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Building Semantic Perceptron Net for Topic Spotting.
Proceedings of the Association for Computational Linguistics, 2001

2000
Approximating Content-Based Object-Level Image Retrieval.
Multim. Tools Appl., 2000

Video Modeling Using Strata-Based Annotation.
IEEE Multim., 2000

Temporal multiresolution analysis for video segmentation.
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000

Detection of text captions in compressed domain video.
Proceedings of the ACM Multimedia 2000 Workshops, Los Angeles, CA, USA, October 30, 2000

1999
Relevance Feedback Techniques for Image Retrieval Using Multiple Attributes.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

1998
Fast Image Retrieval Using Color-Spatial Information.
VLDB J., 1998

Relevance Feedback Techniques for Color-based Image Retrieval.
Proceedings of the 1998 MultiMedia Modeling (MMM '98), 1998

Color-Based Relevance Feedback for Image Retrieval.
Proceedings of the International Workshop on Multi-Media Database Management Systems, 1998

An Empirical Study of Color-Spatial Retrieval Techniques for Large Image Databases.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1998

Color-Based Pseudo Object Model for Image Retrieval with Relevance Feedback.
Proceedings of the Advanced Multimedia Content Processing, First International Conference, 1998

1997
Fast Signature-Based Color-Spatial Image Retrieval.
Proceedings of the International Conference on Multimedia Computing and Systems, 1997

On Video-on-Demand Servers with Hierarchical Storage.
Proceedings of the Database Systems for Advanced Applications '97, 1997

1996
Using Domain Knowledge in Querying Image Databases.
Proceedings of the 1996 MultiMedia Modeling: Towards The Information Society Superhighway, 1996

Disk Striping Strategies for Large Video-on-Demand Servers.
Proceedings of the Fourth ACM International Conference on Multimedia '96, 1996

From Text Description to Animation Sequences.
Proceedings of the Computer Animation 1996, 1996

1995
A Video Retrieval and Sequencing System.
ACM Trans. Inf. Syst., 1995

Automatic generation and refinement of hypertext links.
New Rev. Hypermedia Multim., 1995

An Integrated Color-Spatial Approach to Content-Based Image Retrieval.
Proceedings of the Third ACM International Conference on Multimedia '95, 1995

Automatic Task Generation and View Control for 3D Graphical Manual.
Proceedings of the Computer Graphics: Developments in Virtual Environments, 1995

1994
Content-Based Retrieval of Segmented Images.
Proceedings of the Second ACM International Conference on Multimedia '94, 1994

A Concept-Based Image Retrieval System.
Proceedings of the 27th Annual Hawaii International Conference on System Sciences (HICSS-27), 1994

1992
Applying relevance feedback to a photo archival system.
J. Inf. Sci., 1992

1991
A model for integrating multimedia information around 3D graphics hierarchies.
Vis. Comput., 1991

Supporting Composition in a Hypermedia Environment.
Hypermedia, 1991

1989
On the Design of a Frame-Based Hypermedia System.
Proceedings of the Hypertext: State of the Art. Papers presented at the Hypertext 2 conference, 1989

1987
A Synthetic Instruction Mix for Evaluating Microprocessor Performance.
IEEE Micro, 1987

1984
Using microcomputers in computer education.
ACM SIGCSE Bull., 1984

1982
Mathematical software for gas transmission networks.
PhD thesis, 1982

