Yueting Zhuang

Orcid: 0000-0001-9017-2508

According to our database1, Yueting Zhuang authored at least 481 papers between 1996 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Ask Questions With Double Hints: Visual Question Generation With Answer-Awareness and Region-Reference.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Unified fair federated learning for digital healthcare.
Patterns, January, 2024

Better Together: Data-Free Multi-Student Coevolved Distillation.
Knowl. Based Syst., January, 2024

Position-aware compositional embeddings for compressed recommendation systems.
Neurocomputing, 2024

Contrastive Hawkes graph neural networks with dynamic sampling for event prediction.
Neurocomputing, 2024

GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation.
CoRR, 2024

RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection.
CoRR, 2024

Align<sup>2</sup>LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation.
CoRR, 2024

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition.
CoRR, 2024

Logic Distillation: Learning from Code Function by Function for Planning and Decision-making.
CoRR, 2024

IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization.
CoRR, 2024

From Easy to Hard: Learning Curricular Shape-aware Features for Robust Panoptic Scene Graph Generation.
CoRR, 2024

Improving Large Models with Small models: Lower Costs and Better Performance.
CoRR, 2024

Bridging Local Details and Global Context in Text-Attributed Graphs.
CoRR, 2024

Stock Movement Prediction with Multimodal Stable Fusion via Gated Cross-Attention Mechanism.
CoRR, 2024

DuetRAG: Collaborative Retrieval-Augmented Generation.
CoRR, 2024

WorldGPT: Empowering LLM as Multimodal World Model.
CoRR, 2024

LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation.
CoRR, 2024

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models.
CoRR, 2024

ProSwitch: Knowledge-Guided Language Model Fine-Tuning to Generate Professional and Non-Professional Styled Text.
CoRR, 2024

Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering.
CoRR, 2024

Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation.
CoRR, 2024

The 2nd International Workshop on Deep Multi-modal Generation and Retrieval.
Proceedings of the 2nd International Workshop on Deep Multimodal Generation and Retrieval, 2024

WorldGPT: Empowering LLM as Multimodal World Model.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

DEMON24: ACM MM24 Demonstrative Instruction Following Challenge.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fact : Teaching MLLMs with Faithful, Concise and Transferable Rationales.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Auto-Encoding Morph-Tokens for Multimodal LLM.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

InstructVid2Vid: Controllable Video Editing with Natural Language Instructions.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Bridging Local Details and Global Context in Text-Attributed Graphs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Learning Global Controller in Latent Space for Parameter-Efficient Fine-Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Data Shunt: Collaboration of Small and Large Models for Lower Costs and Better Performance.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

A knowledge-guided and traditional Chinese medicine informed approach for herb recommendation.
Frontiers Inf. Technol. Electron. Eng., October, 2023

Attribute-driven streaming edge partitioning with reconciliations for distributed graph neural network training.
Neural Networks, August, 2023

Federated unsupervised representation learning.
Frontiers Inf. Technol. Electron. Eng., August, 2023

Stable Prediction With Leveraging Seed Variable.
IEEE Trans. Knowl. Data Eng., June, 2023

Elastic Knowledge Distillation by Learning From Recollection.
IEEE Trans. Neural Networks Learn. Syst., May, 2023

Learning Decomposed Representations for Treatment Effect Estimation.
IEEE Trans. Knowl. Data Eng., May, 2023

VL-NMS: Breaking Proposal Bottlenecks in Two-stage Visual-language Matching.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Deep Residual Weight-Sharing Attention Network With Low-Rank Attention for Visual Question Answering.
IEEE Trans. Multim., 2023

Cross-Modal Data Augmentation for Tasks of Different Modalities.
IEEE Trans. Multim., 2023

Single image super-resolution based on progressive fusion of orientation-aware features.
Pattern Recognit., 2023

Graph neural networks meet with distributed graph partitioners and reconciliations.
Neurocomputing, 2023

TaskBench: Benchmarking Large Language Models for Task Automation.
CoRR, 2023

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer.
CoRR, 2023

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
CoRR, 2023

Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models.
CoRR, 2023

Improving Vision Anomaly Detection with the Guidance of Language Modality.
CoRR, 2023

Dancing Avatar: Pose and Text-Guided Human Motion Videos Synthesis with Image Diffusion Model.
CoRR, 2023

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.
CoRR, 2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking.
CoRR, 2023

ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation.
CoRR, 2023

Improving Reference-based Distinctive Image Captioning with Contrastive Rewards.
CoRR, 2023

Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow.
CoRR, 2023

Interactive Data Synthesis for Systematic Vision Adaptation via LLMs-AIGCs Collaboration.
CoRR, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
CoRR, 2023

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace.
CoRR, 2023

Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding.
CoRR, 2023

1st Place Solution for ECCV 2022 OOD-CV Challenge Object Detection Track.
CoRR, 2023

1st Place Solution for ECCV 2022 OOD-CV Challenge Image Classification Track.
CoRR, 2023

DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

FedAA: Using Non-sensitive Modalities to Improve Federated Learning while Preserving Image Privacy.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Continual Vision-Language Representation Learning with Off-Diagonal Information.
Proceedings of the International Conference on Machine Learning, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Unsupervised Prompt Tuning for Text-Driven Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PromptNER: Prompt Locating and Typing for Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

DiffusionNER: Boundary Diffusion for Named Entity Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Balance-Subsampled Stable Prediction Across Unknown Test Data.
ACM Trans. Knowl. Discov. Data, 2022

Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.
IEEE Trans. Image Process., 2022

Deep Learning for Weakly-Supervised Object Detection and Localization: A Survey.
Neurocomputing, 2022

NAP: Neural architecture search with pruning.
Neurocomputing, 2022

DBA: Efficient Transformer with Dynamic Bilinear Low-Rank Attention.
CoRR, 2022

Citation Trajectory Prediction via Publication Influence Representation Using Temporal Knowledge Graph.
CoRR, 2022

BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval.
CoRR, 2022

ReLER@ZJU-Alibaba Submission to the Ego4D Natural Language Queries Challenge 2022.
CoRR, 2022

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Fine-Grained Semantically Aligned Vision-Language Pre-Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Hybrid Behavior Patterns for Multimedia Recommendation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Self-Supervised Noisy Label Learning for Source-Free Unsupervised Domain Adaptation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Robust Meta-learning with Sampling Noise and Label Noise via Eigen-Reptile.
Proceedings of the International Conference on Machine Learning, 2022

Learning Domain Adaptive Object Detection with Probabilistic Teacher.
Proceedings of the International Conference on Machine Learning, 2022

Simulation-and-Mining: Towards Accurate Source-Free Unsupervised Domain Adaptive Object Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Transductive Clip with Class-Conditional Contrastive Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Query-based Instance Discrimination Network for Relational Triple Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

The Tenth Visual Object Tracking VOT2022 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Slimmable Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning to Learn by Jointly Optimizing Neural Architecture and Weights.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Label Matching Semi-Supervised Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Parallel Instance Query Network for Named Entity Recognition.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
End-to-End Video Saliency Detection via a Deep Contextual Spatiotemporal Network.
IEEE Trans. Neural Networks Learn. Syst., 2021

Explore Video Clip Order With Self-Supervised and Curriculum Learning for Video Applications.
IEEE Trans. Multim., 2021

Mining Fraudsters and Fraudulent Strategies in Large-Scale Mobile Social Networks.
IEEE Trans. Knowl. Data Eng., 2021

Adaptive Spatio-Temporal Graph Enhanced Vision-Language Representation for Video QA.
IEEE Trans. Image Process., 2021

Tell and guess: cooperative learning for natural image caption generation with hierarchical refined attention.
Multim. Tools Appl., 2021

Visual knowledge: an attempt to explore machine creativity.
Frontiers Inf. Technol. Electron. Eng., 2021

Multiple knowledge representation for big data artificial intelligence: framework, applications, and case studies.
Frontiers Inf. Technol. Electron. Eng., 2021

Self-Supervised Class Incremental Learning.
CoRR, 2021

Federated Self-Supervised Contrastive Learning via Ensemble Similarity Distillation.
CoRR, 2021

Alleviate Representation Overlapping in Class Incremental Learning by Contrastive Class Concentration.
CoRR, 2021

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey.
CoRR, 2021

VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching.
CoRR, 2021

Self-Supervised Noisy Label Learning for Source-Free Unsupervised Domain Adaptation.
CoRR, 2021

Box Re-Ranking: Unsupervised False Positive Suppression for Domain Adaptive Pedestrian Detection.
CoRR, 2021

Hierarchical Cross-Modal Graph Consistency Learning for Video-Text Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Learning to Generate Visual Questions with Noisy Supervision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

WAB'21: 1st Workshop on Multimodal Product Identification in Livestreaming and WAB Challenge.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Sequence-to-Set Network for Nested Named Entity Recognition.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Natural Language Video Localization with Learnable Moment Proposals.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Consensus Graph Representation Learning for Better Grounded Image Captioning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Disentangled Motif-aware Graph Learning for Phrase Grounding.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Multichannel Attention Refinement for Video Question Answering.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Frame Augmented Alternating Attention Network for Video Question Answering.
IEEE Trans. Multim., 2020

MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution.
IEEE Trans. Multim., 2020

Open-Ended Video Question Answering via Multi-Modal Conditional Adversarial Networks.
IEEE Trans. Image Process., 2020

Context-Aware Graph Label Propagation Network for Saliency Detection.
IEEE Trans. Image Process., 2020

Human-Centric Clothing Segmentation via Deformable Semantic Locality-Preserving Network.
IEEE Trans. Circuits Syst. Video Technol., 2020

Towards a new generation of artificial intelligence in China.
Nat. Mach. Intell., 2020

Learning embeddings of a heterogeneous behavior network for potential behavior prediction.
Frontiers Inf. Technol. Electron. Eng., 2020

Bi-Decoder Augmented Network for Neural Machine Translation.
Neurocomputing, 2020

Run Away From your Teacher: Understanding BYOL by a Novel Self-Supervised Approach.
CoRR, 2020

Federated Unsupervised Representation Learning.
CoRR, 2020

Learning Decomposed Representation for Counterfactual Inference.
CoRR, 2020

Stable Prediction via Leveraging Seed Variable.
CoRR, 2020

Balance-Subsampled Stable Prediction.
CoRR, 2020

Real-Time Driving Scene Semantic Segmentation.
IEEE Access, 2020

Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Relational Graph Learning for Grounded Video Description Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Photo Stream Question Answer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

De-Biased Court's View Generation with Causality.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Counterfactual Samples Synthesizing for Robust Visual Question Answering.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural-DINF: A Neural Network based Framework for Measuring Document Influence.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Time2Graph: Revisiting Time Series Modeling with Dynamic Shapelets.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
VPModel: High-Fidelity Product Simulation in a Virtual-Physical Environment.
IEEE Trans. Vis. Comput. Graph., 2019

Video Question Answering via Knowledge-based Progressive Spatial-Temporal Attention Network.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Deep Group-Wise Fully Convolutional Network for Co-Saliency Detection With Graph Propagation.
IEEE Trans. Image Process., 2019

A Bilinear Ranking SVM for Knowledge Based Relation Prediction and Classification.
IEEE Trans. Big Data, 2019

Deep Neural Network for Fast and Accurate Single Image Super-Resolution via Channel-Attention-based Fusion of Orientation-aware Features.
CoRR, 2019

What Makes a Good Team? A Large-scale Study on the Effect of Team Composition in Honor of Kings.
Proceedings of the World Wide Web Conference, 2019

The ZJU-EDL System for Entity Discovery and Linking at TAC KBP 2019.
Proceedings of the 2019 Text Analysis Conference, 2019

Video Dialog via Multi-Grained Convolutional Self-Attention Context Networks.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Posterior-regularized REINFORCE for Instance Selection in Distant Supervision.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Video Relation Detection with Spatio-Temporal Graph.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Walking with MIND: Mental Imagery eNhanceD Embodied QA.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Informative Visual Storytelling with Cross-modal Rules.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multi-interaction Network with Object Relation for Video Question Answering.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Weak Supervision Enhanced Generative Network for Question Generation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Learning Dynamic Context Augmentation for Global Entity Linking.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Video Dialog via Progressive Inference and Cross-Transformer.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Understanding Default Behavior in Online Lending.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

KCAT: A Knowledge-Constraint Typing Annotation Tool.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Cross-Relation Cross-Bag Attention for Distantly-Supervised Relation Extraction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Heterogeneous Attributed Network Embedding with Graph Convolutional Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Identifying Objective and Subjective Words via Topic Modeling.
IEEE Trans. Neural Networks Learn. Syst., 2018

Social-Aware Movie Recommendation via Multimodal Network Learning.
IEEE Trans. Multim., 2018

Fusing Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks.
IEEE Trans. Multim., 2018

Multimodal Deep Embedding via Hierarchical Grounded Compositional Semantics.
IEEE Trans. Circuits Syst. Video Technol., 2018

Active instance matching with pairwise constraints and its application to Chinese knowledge base construction.
Knowl. Inf. Syst., 2018

Temporality-enhanced knowledgememory network for factoid question answering.
Frontiers Inf. Technol. Electron. Eng., 2018

Entity mention aware document representation.
Inf. Sci., 2018

To Stay or to Leave: Churn Prediction for Urban Migrants in the Initial Period.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Two Step Joint Model for Drug Drug Interaction Extraction.
Proceedings of the 2018 Text Analysis Conference, 2018

Intra-view and Inter-view Attention for Multi-view Network Embedding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

MacNet: Transferring Knowledge from Machine Comprehension to Sequence-to-Sequence Models.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Open-Ended Long-form Video Question Answering via Adaptive Hierarchical Reinforced Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Attentional Image Retweet Modeling via Multi-Faceted Ranking Network Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Deep Convolutional Neural Networks with Merge-and-Run Mappings.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Feature Enhancement in Attention for Visual Question Answering.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Semantic Locality-Aware Deformable Network for Clothing Segmentation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Video question answering via multi-granularity temporal attention network learning.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Discourse Marker Augmented Network with Reinforcement Learning for Natural Language Inference.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Dynamic Network Embedding by Modeling Triadic Closure Process.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Urban Dreams of Migrants: A Case Study of Migrant Integration in Shanghai.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Multi-Label Community-Based Question Classification via Personalized Sequence Memory Network Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Bag-of-Discriminative-Words (BoDW) Representation via Topic Modeling.
IEEE Trans. Knowl. Data Eng., 2017

Temporal Interaction and Causal Influence in Community-Based Question Answering.
IEEE Trans. Knowl. Data Eng., 2017

Data-Dependent Label Distribution Learning for Age Estimation.
IEEE Trans. Image Process., 2017

Regularized Deep Belief Network for Image Attribute Detection.
IEEE Trans. Circuits Syst. Video Technol., 2017

Flickr group recommendation with auxiliary information in heterogeneous information networks.
Multim. Syst., 2017

A human motion feature based on semi-supervised learning of GMM.
Multim. Syst., 2017

Challenges and opportunities: from big data to knowledge in AI 2.0.
Frontiers Inf. Technol. Electron. Eng., 2017

Disambiguating named entities with deep supervised learning via crowd labels.
Frontiers Inf. Technol. Electron. Eng., 2017

Representation Learning for Scale-free Networks.
CoRR, 2017

Task-driven Visual Saliency and Attention-based Visual Question Answering.
CoRR, 2017

The Y_dcd_zju Slot Filling System for TAC KBP 2017.
Proceedings of the 2017 Text Analysis Conference, 2017

The ZHI-EDL System for Entity Discovery and Linking at TAC KBP 2017.
Proceedings of the 2017 Text Analysis Conference, 2017

Learning Max-Margin GeoSocial Multimedia Network Representations for Point-of-Interest Suggestion.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Video Question Answering via Attribute-Augmented Attention Network Learning.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

ENCORE: External Neural Constraints Regularized Distant Supervision for Relation Extraction.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Panel: Cross-media Intelligence.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Video Question Answering via Hierarchical Dual-Level Attention Network Learning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Video Question Answering via Gradually Refined Attention over Appearance and Motion.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning Deep Contextual Attention Network for Narrative Photo Stream Captioning.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Detecting Temporal Proposal for Action Localization with Tree-structured Search Policy.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Video Question Answering via Hierarchical Spatio-Temporal Attention Networks.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Microblog Sentiment Classification via Recurrent Random Walk Network Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Link Prediction via Ranking Metric Dual-Level Attention Network Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Deeply-Learned Part-Aligned Representations for Person Re-identification.
Proceedings of the IEEE International Conference on Computer Vision, 2017

NITE: A Neural Inductive Teaching Framework for Domain Specific NER.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Integrating Side Information for Boosting Machine Comprehension.
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017

Community-Based Question Answering via Asymmetric Multi-Faceted Ranking Network Learning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Effective deep learning-based multi-modal retrieval.
VLDB J., 2016

Scalable Linear Visual Feature Learning via Online Parallel Nonnegative Matrix Factorization.
IEEE Trans. Neural Networks Learn. Syst., 2016

User Preference Learning for Online Social Recommendation.
IEEE Trans. Knowl. Data Eng., 2016

Graph Regularized Feature Selection with Data Reconstruction.
IEEE Trans. Knowl. Data Eng., 2016

Learning of Multimodal Representations With Random Walks on the Click Graph.
IEEE Trans. Image Process., 2016

Joint Multilabel Classification With Community-Aware Label Graph Learning.
IEEE Trans. Image Process., 2016

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection.
IEEE Trans. Image Process., 2016

Deep Learning Driven Visual Path Prediction From a Single Image.
IEEE Trans. Image Process., 2016

Aspect Learning for Multimedia Summarization via Nonparametric Bayesian.
IEEE Trans. Circuits Syst. Video Technol., 2016

Fast view-based 3D model retrieval via unsupervised multiple feature fusion and online projection learning.
Signal Process., 2016

Online Metric-Weighted Linear Representations for Robust Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Recognizing an Action Using Its Name: A Knowledge-Based Approach.
Int. J. Comput. Vis., 2016

D-Ocean: an unstructured data management system for data ocean environment.
Frontiers Comput. Sci., 2016

ZJU Participation in TAC 2016 EDL task.
Proceedings of the 2016 Text Analysis Conference, 2016

Partial Multi-Modal Sparse Coding via Adaptive Similarity Structure Regularization.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Ad Recommendation for Sponsored Search Engine via Composite Long-Short Term Memory.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Expert Finding for Community-Based Question Answering via Ranking Metric Network Learning.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Diverse Image Captioning via GroupTalk.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Self-Paced Boost Learning for Classification.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Relational Knowledge Transfer for Zero-Shot Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Community-Based Question Answering via Heterogeneous Social Network Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Structured Visual Feature Learning for Classification via Supervised Probabilistic Tensor Factorization.
IEEE Trans. Multim., 2015

Probabilistic Word Selection via Topic Modeling.
IEEE Trans. Knowl. Data Eng., 2015

Cross-Modal Learning to Rank via Latent Joint Representation.
IEEE Trans. Image Process., 2015

Mining Spatial-Temporal Patterns and Structural Sparsity for Human Motion Data Denoising.
IEEE Trans. Cybern., 2015

Weakly Semi-Supervised Deep Learning for Multi-Label Image Annotation.
IEEE Trans. Big Data, 2015

Sparse motion bases selection for human motion denoising.
Signal Process., 2015

The classification of multi-modal data with hidden conditional random field.
Pattern Recognit. Lett., 2015

Efficient semi-supervised multiple feature fusion with out-of-sample extension for 3D model retrieval.
Neurocomputing, 2015

A locally weighted sparse graph regularized Non-Negative Matrix Factorization method.
Neurocomputing, 2015

Topic aspect-oriented summarization via group selection.
Neurocomputing, 2015

The ZJU-EDL System for Entity Discovery and Linking at TAC KBP 2015.
Proceedings of the 2015 Text Analysis Conference, 2015

Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Multi-modal Retrieval via Deep Textual-Visual Correlation Learning.
Proceedings of the Intelligence Science and Big Data Engineering. Image and Video Data Engineering, 2015

Mobile Query Recommendation via Tensor Function Learning.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Sketch the Storyline with CHARCOAL: A Non-Parametric Approach.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

HTMVS: Visualizing hierarchical topics and their evolution.
Proceedings of the 10th IEEE Conference on Visual Analytics Science and Technology, 2015

Flickr group recommendation via heterogeneous information networks.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

RAISE: A Whole Process Modeling Method for Unstructured Data Management.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Metric Learning Driven Multi-Task Structured Output Optimization for Robust Keypoint Tracking.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Structured Embedding via Pairwise Relations and Long-Range Interactions in Knowledge Base.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Exploring Semantic Inter-Class Relationships (SIR) for Zero-Shot Action Recognition.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Sparse Multi-Modal Hashing.
IEEE Trans. Multim., 2014

Effective Multi-Modal Retrieval based on Stacked Auto-Encoders.
Proc. VLDB Endow., 2014

Multiple kernel learning with NOn-conVex group spArsity.
J. Vis. Commun. Image Represent., 2014

Real-time motion data annotation via action string.
Comput. Animat. Virtual Worlds, 2014

A GPU-accelerated non-negative sparse latent semantic analysis algorithm for social tagging data.
Inf. Sci., 2014

Exploiting temporal stability and low-rank structure for motion capture data refinement.
Inf. Sci., 2014

Editorial of the special issue on cross-media analysis.
Int. J. Multim. Inf. Retr., 2014

Special section on learning from multiple evidences for large scale multimedia analysis.
Comput. Vis. Image Underst., 2014

Hashing with List-Wise learning to rank.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Discriminative coupled dictionary hashing for fast cross-media retrieval.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Cross-Media Hashing with Neural Networks.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Learning Multimodal Neural Network with Ranking Examples.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Geo-informative discriminative image representation by semi-supervised hierarchical topic modeling.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Attribute prediction with long-range interactions via path coding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Retrieval-based cartoon gesture recognition and applications via semi-supervised heterogeneous classifiers learning.
Pattern Recognit., 2013

Image annotation by semi-supervised cross-domain learning with group sparsity.
J. Vis. Commun. Image Represent., 2013

A semantic feature for human motion retrieval.
Comput. Animat. Virtual Worlds, 2013

Learning for scalable multimedia representation.
Neurocomputing, 2013

Hypergraph Spectral Hashing for image retrieval with heterogeneous social contexts.
Neurocomputing, 2013

A low rank structural large margin method for cross-modal ranking.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Cross-media semantic representation via bi-directional learning to rank.
Proceedings of the ACM Multimedia Conference, 2013

πLDA: document clustering with selective structural constraints.
Proceedings of the ACM Multimedia Conference, 2013

Supervised Coupled Dictionary Learning with Group Structures for Multi-modal Retrieval.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Supervised Nonnegative Tensor Factorization with Maximum-Margin Constraint.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Digital Library Engine: Adapting Digital Library for Cloud Computing.
Proceedings of the 2013 IEEE Sixth International Conference on Cloud Computing, Santa Clara, CA, USA, June 28, 2013

2012
Web and Personal Image Annotation by Mining Label Correlation With Relaxed Visual Graph Embedding.
IEEE Trans. Image Process., 2012

Spline Regression Hashing for Fast Image Search.
IEEE Trans. Image Process., 2012

Image Annotation by Input-Output Structural Grouping Sparsity.
IEEE Trans. Image Process., 2012

Sparse Unsupervised Dimensionality Reduction for Multiple View Data.
IEEE Trans. Circuits Syst. Video Technol., 2012

Dynamic Time Warping for Chinese calligraphic character matching and recognizing.
Pattern Recognit. Lett., 2012

A unified framework for web video topic discovery and visualization.
Pattern Recognit. Lett., 2012

A Multimedia Retrieval Framework Based on Semi-Supervised Ranking and Relevance Feedback.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Societally connected multimedia across cultures.
J. Zhejiang Univ. Sci. C, 2012

Synthesizing style-preserving cartoons via non-negative style factorization.
J. Zhejiang Univ. Sci. C, 2012

Guest Editors' Introduction: Special Section on Connected Multimedia.
J. Multim., 2012

The heterogeneous feature selection with structural sparsity for multimedia annotation and hashing: a survey.
Int. J. Multim. Inf. Retr., 2012

Annotating web images using NOVA: NOn-conVex group spArsity.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Correlated attribute transfer with multi-task graph-guided fusion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Supervised cross-collection topic modeling.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Logistic Tensor Regression for Classification.
Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012

Nonnegative Matrix Factorization for Multimodality Data from Multi-source Domain.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Graph-guided sparse reconstruction for region tagging.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Adaptive Unsupervised Multi-view Feature Selection for Visual Concept Recognition.
Proceedings of the Computer Vision - ACCV 2012, 2012

2011
Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor.
IEEE Trans. Vis. Comput. Graph., 2011

Cartoon synthesis using constrained spreading activation network.
Multim. Tools Appl., 2011

A hybrid brain-computer interface control strategy in a virtual environment.
J. Zhejiang Univ. Sci. C, 2011

Efficient shape matching for Chinese calligraphic character retrieval.
J. Zhejiang Univ. Sci. C, 2011

Stable multi-label boosting for image annotation with structural feature selection.
Sci. China Inf. Sci., 2011

Group sparse representation for image categorization and semantic video retrieval.
Sci. China Inf. Sci., 2011

Hypergraph spectral hashing for similarity search of social image.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Image annotation by composite kernel learning with group structure.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Inverse-degree Sampling for Spectral Clustering.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Tag Clustering and Refinement on Semantic Unity Graph.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Multi-label Image Annotation by Structural Grouping Sparsity.
Proceedings of the Social Media Modeling and Computing., 2011

2010
Image Clustering Using Local Discriminant Models and Global Integration.
IEEE Trans. Image Process., 2010

Recognizing Cartoon Image Gestures for Retrieval and Interactive Cartoon Clip Synthesis.
IEEE Trans. Circuits Syst. Video Technol., 2010

Multi-Label Transfer Learning With Sparse Representation.
IEEE Trans. Circuits Syst. Video Technol., 2010

Cross-media retrieval using query dependent search methods.
Pattern Recognit., 2010

A group of novel approaches and a toolkit for motion capture data reusing.
Multim. Tools Appl., 2010

Javelin: an access and manipulation interface for large displays.
J. Zhejiang Univ. Sci. C, 2010

CMSOF: a structured data organization framework for scanned Chinese medicine books in digital libraries.
J. Zhejiang Univ. Sci. C, 2010

Multiple Hypergraph Clustering of Web Images by MiningWord2Image Correlations.
J. Comput. Sci. Technol., 2010

Silhouette representation and matching for 3D pose discrimination - A comparative study.
Image Vis. Comput., 2010

Classification by semi-supervised discriminative regularization.
Neurocomputing, 2010

Overview of ACM international workshop on connected multimedia.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Heterogeneous feature selection by group lasso with logistic regression.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multi-label boosting for image annotation by structural grouping sparsity.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Topic discovery of web video using star-structured K-partite graph.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Automatic annotation of geo-information in panoramic street view by image retrieval.
Proceedings of the International Conference on Image Processing, 2010

Sparse representation using nonnegative curds and whey.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Local and Global Regressive Mapping for Manifold Learning with Out-of-Sample Extrapolation.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

Multi-Task Sparse Discriminant Analysis (MtSDA) with Overlapping Categories.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Tensor-Based Transductive Learning for Multimodality Video Semantic Concept Detection.
IEEE Trans. Multim., 2009

Discovering calligraphy style relationships by Supervised Learning Weighted Random Walk Model.
Multim. Syst., 2009

Latent Style Model: Discovering writing styles for calligraphy works.
J. Vis. Commun. Image Represent., 2009

Competitive motion synthesis based on hybrid control.
Comput. Animat. Virtual Worlds, 2009

Perceptual 3D pose distance estimation by boosting relational geometric features.
Comput. Animat. Virtual Worlds, 2009

Local and global approaches of affinity propagation clustering for large scale data
CoRR, 2009

Applying probabilistic latent semantic analysis to multi-criteria recommender system.
AI Commun., 2009

Retrieval based interactive cartoon synthesis via unsupervised bi-distance metric learning.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Ranking with local regression and global alignment for cross media retrieval.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Review-oriented metadata enrichment: a case study.
Proceedings of the 2009 Joint International Conference on Digital Libraries, 2009

Style-consistency calligraphy synthesis system in digital library.
Proceedings of the 2009 Joint International Conference on Digital Libraries, 2009

Face Inpainting by Feature Guidance.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Web image interpretation: semi-supervised mining annotated words.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Chinese Brush Calligraphy Character Retrieval and Learning.
Proceedings of the Methods and Applications for Advancing Distance Education Technologies, 2009

2008
Mining Semantic Correlation of Heterogeneous Multimedia Data for Cross-Media Retrieval.
IEEE Trans. Multim., 2008

Harmonizing Hierarchical Manifolds for Multimedia Document Semantics Understanding and Cross-Media Retrieval.
IEEE Trans. Multim., 2008

Perspective-aware cartoon clips synthesis.
Comput. Animat. Virtual Worlds, 2008

Quasi-Facial Communication for Online Learning Using 3D Modeling Techniques.
Int. J. Distance Educ. Technol., 2008

An encoding-based dual distance tree high-dimensional index.
Sci. China Ser. F Inf. Sci., 2008

Personalized Multimedia Retrieval in CADAL Digital Library.
Proceedings of the Advances in Multimedia Information Processing, 2008

Skeleton-Based Recognition of Chinese Calligraphic Character Image.
Proceedings of the Advances in Multimedia Information Processing, 2008

Search-Based Automatic Web Image Annotation Using Latent Visual and Semantic Analysis.
Proceedings of the Advances in Multimedia Information Processing, 2008

Heterogeneous multimedia data semantics mining using content and location context.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Active post-refined multimodality video semantic concept detection with tensor representation.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Clustering by evidence accumulation on affinity propagation.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Indexing high-dimensional data in dual distance spaces: a symmetrical encoding approach.
Proceedings of the EDBT 2008, 2008

Adaptive and compact shape descriptor by progressive feature combination and selection with boosting.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Manifold Learning Based Cross-media Retrieval: A Solution to Media Object Complementary Nature.
J. VLSI Signal Process., 2007

Interactive high-dimensional index for large Chinese calligraphic character databases.
ACM Trans. Asian Lang. Inf. Process., 2007

Hallucinating faces: LPH super-resolution and neighbor reconstruction for residue compensation.
Pattern Recognit., 2007

Content-based retrieval of Flash<sup>TM</sup> movies: research issues, generic framework, and future directions.
Multim. Tools Appl., 2007

Adaptive control in cartoon data reusing.
Comput. Animat. Virtual Worlds, 2007

Composite Distance Transformation for Indexing and <i>k</i> -Nearest-Neighbor Searching in High-Dimensional Spaces.
J. Comput. Sci. Technol., 2007

Hierarchical Approximate Matching for Retrieval of Chinese Historical Calligraphy Character.
J. Comput. Sci. Technol., 2007

Chinese Brush Calligraphy Character Retrieval and Learning.
Int. J. Distance Educ. Technol., 2007

Using CONDENSATION Tracking to Recover Stroke Order of Chinese Calligraphic Handwritings with CCM.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

View-Independent Human Action Recognition by Action Hypersphere in Nonlinear Subspace.
Proceedings of the Advances in Multimedia Information Processing, 2007

Boosting Cross-Media Retrieval by Learning with Positive and Negative Examples.
Proceedings of the Advances in Multimedia Modeling, 2007

Visual Verification of Historical Chinese Calligraphy Works.
Proceedings of the Advances in Multimedia Modeling, 2007

3D Facial Modeling for Animation: A Nonlinear Approach.
Proceedings of the Advances in Multimedia Modeling, 2007

Cross-modal correlation learning for clustering on image-audio dataset.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Research of 3D Chinese Calligraphic Handwriting Recur System and Its Key Algorithm.
Proceedings of the Computer Vision/Computer Graphics Collaboration Techniques, 2007

A Prediction Error Compression Method with Tensor-PCA in Video Coding.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

A Program Plagiarism Detection Model Based on Information Distance and Clustering.
Proceedings of the 2007 International Conference on Intelligent Pervasive Computing, 2007

A Piece-Wise Learning Approach to 3D Facial Animation.
Proceedings of the Advances in Web Based Learning, 2007

Adaptive Weight Selection for Incremental Eigen-Background Modeling.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Video Motion Capture by Silhouette Analysis and Pose Optimization.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Efficient Silhouette Extraction with Dynamic Viewpoint.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

A Novel Scalable Texture Video Coding Scheme with GPCA.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speeding Up Similarity Queries over Large Chinese Calligraphic Character Databases Using Data Grid.
Proceedings of the Grid and Cooperative Computing, 2007

2006
k Nearest Neighbor Queries Based on Data Grid.
J. Comput. Res. Dev., 2006

A hierarchical clustering algorithm based on fuzzy graph connectedness.
Fuzzy Sets Syst., 2006

Zhejiang University at TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Data-driven Generation of Decision Tree based on Ensemble Multiple-instance Learning for Motion Retrieval.
Proceedings of the IEEE International Conference on Systems, 2006

Filling Holes in Meshes and Recovering Sharp Edges.
Proceedings of the IEEE International Conference on Systems, 2006

An approach for cross-media retrieval with cross-reference graph and PageRank.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Secure Byzantine Fault Tolerant LDAP System.
Proceedings of the Interdisciplinary and Multidisciplinary Research in Computer Science, 2006

Facial Expression Hallucination Through Eigen-Associative Learning.
Proceedings of the Advances in Web Based Learning, 2006

Web based Chinese Calligraphy Learning with 3-D Visualization Method.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Learning Semantic Correlations for Cross-Media Retrieval.
Proceedings of the International Conference on Image Processing, 2006

A New Reconstruction Algorithm in Spline Signal Spaces.
Proceedings of the Computational Science, 2006

Towards Robust 3D Reconstruction of Human Motion from Monocular Video.
Proceedings of the Advances in Artificial Reality and Tele-Existence, 2006

A Web-Based Examination and Evaluation System for Computer Education.
Proceedings of the 6th IEEE International Conference on Advanced Learning Technologies, 2006

A Scalable Byzantine Fault Tolerant Service in Grid System.
Proceedings of the 2006 International Conference on Grid Computing & Applications, 2006

Towards interactive indexing for large Chinese calligraphic character databases.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

An Efficient Keyframe Extraction from Motion Capture Data.
Proceedings of the Advances in Computer Graphics, 2006

Video-Based Facial Expression Hallucination: A Two- Level Hierarchical Fusion Approach.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2006

2005
Searching for Flash Movies on the Web: A Content and Context Based Framework.
World Wide Web, 2005

Steerable pyramid-based face hallucination.
Pattern Recognit., 2005

Automatic generation of human animation based on motion programming.
Comput. Animat. Virtual Worlds, 2005

Understanding Multimedia Document Semantics for Cross-Media Retrieval.
Proceedings of the Advances in Multimedia Information Processing, 2005

Error-Bounded Solid Voxelization for Polygonal Model Based on Heuristic Seed Filling.
Proceedings of the Advances in Visual Computing, First International Symposium, 2005

Sketch-based retrieval on Flash movies via primary scene.
Proceedings of the Seventh IEEE International Symposium on Multimedia (ISM 2005), 2005

Web-Based Chinese Calligraphy Retrieval and Learning System.
Proceedings of the Advances in Web-Based Learning - ICWL 2005, 4th International Conference, Hong Kong, China, July 31, 2005

Segmenting Layers in Automated Visual Surveillance.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Research on Grid-Aware Mechanisms and Issues for CADAL Project.
Proceedings of the Digital Libraries: Implementing Strategies and Sharing Experiences, 2005

Automatic Video Knowledge Mining for Summary Generation Based on Un-supervised Statistical Learning.
Proceedings of the Fuzzy Systems and Knowledge Discovery, Second International Conference, 2005

On Identity-Discrepancy-Contrary Connection Degree in SPA and Its Applications.
Proceedings of the Fuzzy Systems and Knowledge Discovery, Second International Conference, 2005

Media-Based Presentation with Personalization in a Web-Based eLearning System.
Proceedings of the Advances in Computer Science, 2005

Multi-Modal Information Retrieval with a Semantic View Mechanism.
Proceedings of the 19th International Conference on Advanced Information Networking and Applications (AINA 2005), 2005

2004
Towards Data-Adaptive and User-Adaptive Image Retrieval by Peer Indexing.
Int. J. Comput. Vis., 2004

Towards Comprehensive 3D Enabled Web-Based Learning.
Int. J. Comput. Process. Orient. Lang., 2004

Retrieval of Chinese Calligraphic Character Image.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

ImprovingWeb-Based Learning: Automatic Annotation of Multimedia Semantics and Cross-Media Indexing.
Proceedings of the Advances in Web-Based Learning, 2004

A New Iris Recognition Approach for Embedded System.
Proceedings of the Embedded Software and Systems, First International Conference, 2004

A Novel Watermarking Scheme Based on Video Content.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004

Multi-document Summarization Based on Link Analysis and Text Classification.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004

A Two-Step Approach to Multiple Facial Feature Tracking: Temporal Particle Filter and Spatial Belief Propagation.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

2003
Translating EXPRESS language model into C language model.
ACM SIGPLAN Notices, 2003

3D motion retrieval with motion index tree.
Comput. Vis. Image Underst., 2003

Music Information Retrieval by Detecting Mood via Computational Media Aesthetics.
Proceedings of the 2003 IEEE / WIC International Conference on Web Intelligence, 2003

Popular music retrieval by detecting mood.
Proceedings of the SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28, 2003

Modeling Data and User Characteristics by Peer Indexing in Content-based Image Retrieval.
Proceedings of the 9th International Conference on Multi-Media Modeling, 2003

Subdivision Feedback Based 3D Facial Modeling for E-learning.
Proceedings of the Advances in Web-Based Learning, 2003

3D Model and Motion Retrieval: The Extended Dimensions for Web-Based Learning.
Proceedings of the Advances in Web-Based Learning, 2003

2002
Accommodating hybrid retrieval in a comprehensive video database management system.
IEEE Trans. Multim., 2002

Multiple animated characters motion fusion.
Comput. Animat. Virtual Worlds, 2002

OCTOPUS: aggressive search of multi-modality data using multifaceted knowledge base.
Proceedings of the Eleventh International World Wide Web Conference, 2002

Search for Flash<sup>TM</sup> Movies on the Web.
Proceedings of the 3rd International Conference on Web Information Systems Engineering Workshops, 2002

A hierarchical approach: query large music database by acoustic input.
Proceedings of the SIGIR 2002: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2002

Audio Retrieval with Fast Relevance Feedback Based on Constrained Fuzzy Clustering and Stored Index Table.
Proceedings of the Advances in Multimedia Information Processing, 2002

A Robust Algorithm for Video Based Human Motion Tracking.
Proceedings of the Advances in Multimedia Information Processing, 2002

A Hybrid Motion Data Manipulation: Wavelet Based Motion Processing and Spacetime Rectification.
Proceedings of the Advances in Multimedia Information Processing, 2002

MediaView: A Semantic View Mechanism for Multimedia Modeling.
Proceedings of the Advances in Multimedia Information Processing, 2002

Popular Song Retrieval Based on Singing Matching.
Proceedings of the Advances in Multimedia Information Processing, 2002

Popular Music Retrieval by Independent Component Analysis.
Proceedings of the ISMIR 2002, 2002

Multimedia Knowledge Exploitation for E-Learning: Some Enabling Techniques.
Proceedings of the Advances in Web-Based Learning, First International Conference, 2002

Image retrieval and relevance feedback using peer indexing.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

A graphic-theoretic model for incremental relevance feedback in image retrieval.
Proceedings of the 2002 International Conference on Image Processing, 2002

Incomplete motion feature tracking algorithm in video sequences.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
Web-Based Multimedia Retrieval: Balancing Out between Common Knowledge and Personalized Views.
Proceedings of the 2nd International Conference on Web Information Systems Engineering, 2001

Content-based video retrieval integrating human perception.
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001

Search for Multi-modality Data in Digital Libraries.
Proceedings of the Advances in Multimedia Information Processing, 2001

Query Similar Music by Correlation Degree.
Proceedings of the Advances in Multimedia Information Processing, 2001

Multi-Modal Retrieval for Multimedia Digital Libraries: Issues, Architecture, and Mechanisms.
Proceedings of the MIS '01, 2001

Thesaurus-Aided Approach For Image Browsing And Retrieval.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

A Hybrid Approach to Video Retrieval in a Generic Video Management and Application Processing Framework.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Web-Based Image Retrieval: A Hybrid Approach.
Proceedings of the Computer Graphics International 2001 (CGI'01), 2001

2000
Apply semantic template to support content-based image retrieval.
Proceedings of the Storage and Retrieval for Media Databases 2000, 2000

Content-based video similarity model.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Hierarchical Model Based Human Motion Tracking.
Proceedings of the 2000 International Conference on Image Processing, 2000

Image Retrieval System for Web: Webscope-CBIR.
Proceedings of the 11th International Workshop on Database and Expert Systems Applications (DEXA'00), 2000

A Framework for Garment Shopping over the Internet.
Proceedings of the Handbook on Electronic Commerce, 2000

1999
Video key frame extraction by unsupervised clustering and feedback adjustment.
J. Comput. Sci. Technol., 1999

Video based human motion capture.
Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999

A new approach to retrieve video by example video clip.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Video based human animation technique.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Video Motion Capture Using Feature Tracking and Skeleton Reconstruction.
Proceedings of the 1999 International Conference on Image Processing, 1999

Content-based Video Retrieval by Example Clip on WWW.
Proceedings of the Eurographics Multimedia Workshop 1999, 1999

1998
Adaptive Key Frame Extraction using Unsupervised Clustering.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

1996
OOADS: An object-oriented design model for advertising CAD system.
J. Comput. Sci. Technol., 1996


  Loading...