Fei Wu

ORCID: 0000-0003-2139-8807

Affiliations:
  • Zhejiang University, College of Computer Science and Technology, Hangzhou, China


According to our database, Fei Wu authored at least 527 papers between 2002 and 2024.

Bibliography

2024
Unleash the Power of Inconsistency-Based Semi-Supervised Active Learning by Dynamic Programming of Curriculum Learning.
IEEE Trans. Knowl. Data Eng., November, 2024

Learning Individual Treatment Effects under Heterogeneous Interference in Networks.
ACM Trans. Knowl. Discov. Data, September, 2024

Transferring Causal Mechanism over Meta-representations for Target-Unknown Cross-domain Recommendation.
ACM Trans. Inf. Syst., July, 2024

Video Moment Retrieval With Noisy Labels.
IEEE Trans. Neural Networks Learn. Syst., May, 2024

IEEE Transactions on Neural Networks and Learning Systems Special Issue on Causal Discovery and Causality-Inspired Machine Learning.
IEEE Trans. Neural Networks Learn. Syst., April, 2024

Out-of-Distribution Generalization With Causal Feature Separation.
IEEE Trans. Knowl. Data Eng., April, 2024

SLED: Structure Learning based Denoising for Recommendation.
ACM Trans. Inf. Syst., March, 2024

MgSvF: Multi-Grained Slow versus Fast Framework for Few-Shot Class-Incremental Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Dual Attention Graph Convolutional Network for Relation Extraction.
IEEE Trans. Knowl. Data Eng., February, 2024

Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems.
IEEE Trans. Knowl. Data Eng., February, 2024

Unified fair federated learning for digital healthcare.
Patterns, January, 2024

Simulating doctors' thinking logic for chest X-ray report generation via Transformer-based Semantic Query learning.
Medical Image Anal., January, 2024

Deconfounded hierarchical multi-granularity classification.
Comput. Vis. Image Underst., 2024

Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering.
CoRR, 2024

AutoGeo: Automating Geometric Image Dataset Creation for Enhanced Geometry Understanding.
CoRR, 2024

RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU.
CoRR, 2024

LuWu: An End-to-End In-Network Out-of-Core Optimizer for 100B-Scale Model-in-Network Data-Parallel Training on Distributed GPUs.
CoRR, 2024

Causal Agent based on Large Language Model.
CoRR, 2024

Generalized Encouragement-Based Instrumental Variables for Counterfactual Regression.
CoRR, 2024

Causal Inference with Complex Treatments: A Survey.
CoRR, 2024

Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning.
CoRR, 2024

DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models.
CoRR, 2024

More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs.
CoRR, 2024

Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration.
CoRR, 2024

NieR: Normal-Based Lighting Scene Rendering.
CoRR, 2024

MOSS: Motion-based 3D Clothed Human Synthesis from Monocular Video.
CoRR, 2024

RemoCap: Disentangled Representation Learning for Motion Capture.
CoRR, 2024

NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction.
CoRR, 2024

Gaussian Control with Hierarchical Semantic Graphs in 3D Human Recovery.
CoRR, 2024

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback.
CoRR, 2024

MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities.
CoRR, 2024

Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU.
CoRR, 2024

Pareto-Optimal Estimation and Policy Learning on Short-term and Long-term Treatment Effects.
CoRR, 2024

ModelGPT: Unleashing LLM's Capabilities for Tailored Model Generation.
CoRR, 2024

Enabling Collaborative Clinical Diagnosis of Infectious Keratitis by Integrating Expert Knowledge and Interpretable Data-driven Intelligence.
CoRR, 2024

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks.
CoRR, 2024

Leveraging Print Debugging to Improve Code Generation in Large Language Models.
CoRR, 2024

Unleashing the Power of LLMs in Court View Generation by Stimulating Internal Knowledge and Incorporating External Knowledge.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Semantic Alignment for Multimodal Large Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Semantic Codebook Learning for Dynamic Recommendation Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Adapting Pre-trained Generative Model to Medical Image for Data Augmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Physical-Priors-Guided Aortic Dissection Detection Using Non-Contrast-Enhanced CT Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Learning Causal Relations from Subsampled Time Series with Two Time-Slices.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Non-confusing Generation of Customized Concepts in Diffusion Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Learning Shadow Variable Representation for Treatment Effect Estimation under Collider Bias.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Active Retrosynthetic Planning Aware of Route Quality.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Domaindiff: Boost out-of-Distribution Generalization with Synthetic Data.
Proceedings of the IEEE International Conference on Acoustics, 2024

CCL-BTree: A Crash-Consistent Locality-Aware B+-Tree for Reducing XPBuffer-Induced Write Amplification in Persistent Memory.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LLMCO4MR: LLMs-Aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang.
Proceedings of the Computer Vision - ECCV 2024, 2024

MPOD123: One Image to 3D Content Generation Using Mask-Enhanced Progressive Outline-to-Detail Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Evolving Knowledge Distillation with Large Language Models and Active Learning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Enhancing Court View Generation with Knowledge Injection and Guidance.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

From Graph to Word Bag: Introducing Domain Knowledge to Confusing Charge Prediction.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

De-biased Attention Supervision for Text Classification with Causality.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Learning to Reweight for Generalizable Graph Neural Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Collaborative Semantic Aggregation and Calibration for Federated Domain Generalization.
IEEE Trans. Knowl. Data Eng., December, 2023

Source-free and black-box domain adaptation via distributionally adversarial training.
Pattern Recognit., November, 2023

Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Federated mutual learning: a collaborative machine learning method for heterogeneous data, models, and objectives.
Frontiers Inf. Technol. Electron. Eng., October, 2023

Personalized Latent Structure Learning for Recommendation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Federated unsupervised representation learning.
Frontiers Inf. Technol. Electron. Eng., August, 2023

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI.
IEEE Trans. Knowl. Data Eng., July, 2023

DeepAlgPro: an interpretable deep neural network model for predicting allergenic proteins.
Briefings Bioinform., July, 2023

Stable Prediction With Leveraging Seed Variable.
IEEE Trans. Knowl. Data Eng., June, 2023

Physics-Informed Time-Aware Neural Networks for Industrial Nonintrusive Load Monitoring.
IEEE Trans. Ind. Informatics, June, 2023

Learning Decomposed Representations for Treatment Effect Estimation.
IEEE Trans. Knowl. Data Eng., May, 2023

U-Turn: Crafting Adversarial Queries with Opposite-Direction Features.
Int. J. Comput. Vis., April, 2023

A Differentiable Parallel Sampler for Efficient Video Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Instrumental Variable-Driven Domain Generalization with Unobserved Confounders.
ACM Trans. Knowl. Discov. Data, 2023

Domain-Specific Bias Filtering for Single Labeled Domain Generalization.
Int. J. Comput. Vis., 2023

Differentiated matching for individual and average treatment effect estimation.
Data Min. Knowl. Discov., 2023

Learning to Reweight for Graph Neural Network.
CoRR, 2023

Sim-GPT: Text Similarity via GPT Annotated Data.
CoRR, 2023

Sentiment Analysis through LLM Negotiations.
CoRR, 2023

Helios: An Efficient Out-of-core GNN Training System on Terabyte-scale Graphs with In-memory Performance.
CoRR, 2023

A Chinese Prompt Attack Dataset for LLMs with Evil Content.
CoRR, 2023

Instruction Tuning for Large Language Models: A Survey.
CoRR, 2023

MEDOE: A Multi-Expert Decoder and Output Ensemble Framework for Long-tailed Semantic Segmentation.
CoRR, 2023

Hierarchical Topological Ordering with Conditional Independence Test for Limited Time Series.
CoRR, 2023

Pushing the Limits of ChatGPT on NLP Tasks.
CoRR, 2023

Denoising Multi-modal Sequential Recommenders with Contrastive Learning.
CoRR, 2023

GPT-NER: Named Entity Recognition via Large Language Models.
CoRR, 2023

IDEAL: Toward High-efficiency Device-Cloud Collaborative and Dynamic Recommendation System.
CoRR, 2023

DGBCT: A Scalable Distributed Gradient Boosting Causal Tree at Alipay.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

A Practical Rule Learning Framework for Risk Management.
Proceedings of the Companion Proceedings of the ACM Web Conference 2023, 2023

Enhancing Hierarchy-Aware Graph Networks with Deep Dual Clustering for Session-based Recommendation.
Proceedings of the ACM Web Conference 2023, 2023

DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device Model Generalization.
Proceedings of the ACM Web Conference 2023, 2023

Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

DisCover: Disentangled Music Representation Learning for Cover Song Identification.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

ML-LJP: Multi-Law Aware Legal Judgment Prediction.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Alleviating Matching Bias in Marketing Recommendations.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PTADisc: A Cross-Course Dataset Supporting Personalized Learning in Cold-Start Scenarios.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Two Heads are Better Than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Generalized Universal Domain Adaptation with Generative Flow Networks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Reconnecting the Broken Civilization: Patchwork Integration of Fragments from Ancient Manuscripts.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unsupervised Domain Adaptation for Referring Semantic Segmentation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CLAP: Contrastive Language-Audio Pre-training Model for Multi-modal Sentiment Analysis.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Stable Prediction on Graphs with Agnostic Distribution Shifts.
Proceedings of the KDD'23 Workshop on Causal Discovery, 2023

Quantitatively Measuring and Contrastively Exploring Heterogeneity for Domain Generalization.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

GAL-VNE: Solving the VNE Problem with Global Reinforcement Learning and Local One-Shot Neural Prediction.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Stable Estimation of Heterogeneous Treatment Effects.
Proceedings of the International Conference on Machine Learning, 2023

DCMT: A Direct Entire-Space Causal Multi-Task Framework for Post-Click Conversion Estimation.
Proceedings of the 39th IEEE International Conference on Data Engineering, 2023

VarietySound: Timbre-Controllable Video to Sound Generation Via Unsupervised Information Disentanglement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Precedent-Enhanced Legal Judgment Prediction with LLM and Domain-Model Collaboration.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Text Classification via Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

RexUIE: A Recursive Method with Explicit Schema Instructor for Universal Information Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

ART: rule bAsed futuRe-inference deducTion.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

WINNER: Weakly-supervised hIerarchical decompositioN and aligNment for spatio-tEmporal video gRounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Sequential Style Consistency Learning for Domain-Generalizable Text Recognition.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

Parameters Efficient Fine-Tuning for Long-Tailed Sequential Recommendation.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

Multi-trends Enhanced Dynamic Micro-video Recommendation.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

FairDR: Ensuring Fairness in Mixed Data of Fairly and Unfairly Treated Instances.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

Attention-Based RNA Secondary Structure Prediction.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

End-to-End Optimization of Quantization-Based Structure Learning and Interventional Next-Item Recommendation.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

Focus-aware Response Generation in Inquiry Conversation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multi-modal Action Chain Abductive Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Learning Instrumental Variable from Data Fusion for Treatment Effect Estimation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Learning Chemical Rules of Retrosynthesis with Pre-training.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Video-Audio Domain Generalization via Confounder Disentanglement.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Memory-Efficient Class-Incremental Learning for Image Classification.
IEEE Trans. Neural Networks Learn. Syst., 2022

Data-Driven Variable Decomposition for Treatment Effect Estimation.
IEEE Trans. Knowl. Data Eng., 2022

Auto IV: Counterfactual Prediction via Automatic Instrumental Variable Decomposition.
ACM Trans. Knowl. Discov. Data, 2022

Balance-Subsampled Stable Prediction Across Unknown Test Data.
ACM Trans. Knowl. Discov. Data, 2022

Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.
IEEE Trans. Image Process., 2022

Progressive Multistage Learning for Discriminative Tracking.
IEEE Trans. Cybern., 2022

Local-Global Graph Pooling via Mutual Information Maximization for Video-Paragraph Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2022

SemGloVe: Semantic Co-Occurrences for GloVe From BERT.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Sentence Similarity Based on Contexts.
Trans. Assoc. Comput. Linguistics, 2022

Estimation of Winter Wheat Tiller Number Based on Optimization of Gradient Vegetation Characteristics.
Remote. Sens., 2022

TapLab: A Fast Framework for Semantic Video Segmentation Tapping Into Compressed-Domain Knowledge.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

A full-process intelligent trial system for smart court.
Frontiers Inf. Technol. Electron. Eng., 2022

Smart grid dispatch powered by deep learning: a survey.
Frontiers Inf. Technol. Electron. Eng., 2022

Structure-conditioned adversarial learning for unsupervised domain adaptation.
Neurocomputing, 2022

Interaction augmented transformer with decoupled decoding for video captioning.
Neurocomputing, 2022

NAP: Neural architecture search with pruning.
Neurocomputing, 2022

Instrumental Variables in Causal Inference and Machine Learning: A Survey.
CoRR, 2022

Confounder Balancing for Instrumental Variable Regression with Latent Variable.
CoRR, 2022

Exploiting Contrastive Learning and Numerical Evidence for Improving Confusing Legal Judgment Prediction.
CoRR, 2022

Learning Individual Treatment Effects under Heterogeneous Interference in Networks.
CoRR, 2022

MetaNetwork: A Task-agnostic Network Parameters Generation Framework for Improving Device Model Generalization.
CoRR, 2022

Treatment Effect Estimation with Unmeasured Confounders in Data Fusion.
CoRR, 2022

Personalizing Intervened Network for Long-tailed Sequential User Behavior Modeling.
CoRR, 2022

CCL4Rec: Contrast over Contrastive Learning for Micro-video Recommendation.
CoRR, 2022

E2-AEN: End-to-End Incremental Learning with Adaptively Expandable Network.
CoRR, 2022

TRIE++: Towards End-to-End Information Extraction from Visually Rich Documents.
CoRR, 2022

Knowledge Distillation of Transformer-based Language Models Revisited.
CoRR, 2022

A Novel Architecture Slimming Method for Network Pruning and Knowledge Distillation.
CoRR, 2022

Attribute-aware interpretation learning for thyroid ultrasound diagnosis.
Artif. Intell. Medicine, 2022

WISE: Wavelet based Interpretable Stock Embedding for Risk-Averse Portfolio Management.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Re4: Learning to Re-contrast, Re-attend, Re-construct for Multi-interest Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Uncovering Causal Effects of Online Short Videos on Consumer Behaviors.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

FpgaNIC: An FPGA-based Versatile 100Gb SmartNIC for GPUs.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

GRASP: Navigating Retrosynthetic Planning with Goal-driven Policy.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ConfounderGAN: Protecting Image Data Privacy with Causal Confounder.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Triggerless Backdoor Attack for NLP Tasks with Clean Labels.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Label-Efficient Domain Generalization via Collaborative Exploration and Generalization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Set-Based Face Recognition Beyond Disentanglement: Burstiness Suppression With Variance Vocabulary.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly-supervised Disentanglement Network for Video Fingerspelling Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

TranSQ: Transformer-Based Semantic Query for Medical Report Generation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Intelligent Request Strategy Design in Recommender System.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Estimating Individualized Causal Effect with Confounded Instruments.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Separate-to-Recognize: Joint Multi-target Speech Separation and Speech Recognition for Speaker-attributed ASR.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

RoSA: A Robust Self-Aligned Framework for Node-Node Graph Contrastive Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Instrumental Variable Regression with Confounder Balancing.
Proceedings of the International Conference on Machine Learning, 2022

Deconfounded Value Decomposition for Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

The Role of Deconfounding in Meta-learning.
Proceedings of the International Conference on Machine Learning, 2022

GNN-LM: Language Modeling based on Global Contexts via GNN.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Disentangled Sequential Autoencoder with Local Consistency for Infectious Keratitis Diagnosis.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Towards Interactivity and Interpretability: A Rationale-based Legal Judgment Prediction Framework.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Investigating the Robustness of Natural Language Generation from Logical Forms via Counterfactual Samples.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Paraphrase Generation as Unsupervised Machine Translation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Similar Case Based Prison Term Prediction.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Connecting Patients with Pre-diagnosis: A Multiple Graph Regularized Method for Mental Disorder Diagnosis.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Multi-objective Meta-return Reinforcement Learning for Sequential Recommendation.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Fast Nearest Neighbor Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Dependency Parsing as MRC-based Span-Span Prediction.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
End-to-End Video Saliency Detection via a Deep Contextual Spatiotemporal Network.
IEEE Trans. Neural Networks Learn. Syst., 2021

Local-Global Memory Neural Network for Medication Prediction.
IEEE Trans. Neural Networks Learn. Syst., 2021

Mining Fraudsters and Fraudulent Strategies in Large-Scale Mobile Social Networks.
IEEE Trans. Knowl. Data Eng., 2021

Training Robust Object Detectors From Noisy Category Labels and Imprecise Bounding Boxes.
IEEE Trans. Image Process., 2021

Learning to Anticipate Egocentric Actions by Imagination.
IEEE Trans. Image Process., 2021

Interaction-Integrated Network for Natural Language Moment Localization.
IEEE Trans. Image Process., 2021

ResKD: Residual-Guided Knowledge Distillation.
IEEE Trans. Image Process., 2021

Graph-Based Multi-Interaction Network for Video Question Answering.
IEEE Trans. Image Process., 2021

FREE: A Fast and Robust End-to-End Video Text Spotter.
IEEE Trans. Image Process., 2021

Continuous treatment effect estimation via generative adversarial de-confounding.
Data Min. Knowl. Discov., 2021

Multi-agent Communication with Graph Information Bottleneck under Limited Bandwidth.
CoRR, 2021

A General Framework for Defending Against Backdoor Attacks via Influence Graph.
CoRR, 2021

Triggerless Backdoor Attack for NLP Tasks with Clean Labels.
CoRR, 2021

Edge-Cloud Polarization and Collaboration: A Comprehensive Survey.
CoRR, 2021

Unified Group Fairness on Federated Learning.
CoRR, 2021

Dialogue Inspectional Summarization with Factual Inconsistency Awareness.
CoRR, 2021

Do We Need to Directly Access the Source Datasets for Domain Generalization?
CoRR, 2021

Multi-trends Enhanced Dynamic Micro-video Recommendation.
CoRR, 2021

Stable Prediction on Graphs with Agnostic Distribution Shift.
CoRR, 2021

Layer-wise Model Pruning based on Mutual Information.
CoRR, 2021

Defending against Backdoor Attacks in Natural Language Generation.
CoRR, 2021

Parameter Estimation for the SEIR Model Using Recurrent Nets.
CoRR, 2021

Modeling Text-visual Mutual Dependency for Multi-modal Dialog Generation.
CoRR, 2021

Federated Graph Learning - A Position Paper.
CoRR, 2021

BertGCN: Transductive Text Classification by Combining GCN and BERT.
CoRR, 2021

Modeling High-order Interactions across Multi-interests for Micro-video Recommendation.
CoRR, 2021

Unsupervised Domain Adaptation for Image Classification via Structure-Conditioned Adversarial Learning.
CoRR, 2021

VersatileGait: A Large-Scale Synthetic Gait Dataset with Fine-Grained Attributes and Complicated Scenarios.
CoRR, 2021

AI+X micro-program fosters interdisciplinary skills in China.
Commun. ACM, 2021

Future-Aware Diverse Trends Framework for Recommendation.
Proceedings of the WWW '21: The Web Conference 2021, 2021

CauseRec: Counterfactual User Sequence Synthesis for Sequential Recommendation.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

MGD-GAN: Text-to-Pedestrian Generation Through Multi-grained Discrimination.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Why Do We Click: Visual Impression-aware News Recommendation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Analysis and Applications of Class-wise Robustness in Adversarial Training.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

KD3A: Unsupervised Multi-Source Decentralized Domain Adaptation via Knowledge Distillation.
Proceedings of the 38th International Conference on Machine Learning, 2021

VSR: A Unified Framework for Document Layout Analysis Combining Vision, Semantics and Relations.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text Recognition.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

ICDAR 2021 Competition on Scene Video Text Spotting.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

FcaNet: Frequency Channel Attention Networks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semi-supervised Active Learning for Semi-supervised Models: Exploit Adversarial Examples with Graph-based Virtual Labels.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

ConRPG: Paraphrase Generation using Contexts as Regularizer.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

kFolden: k-Fold Ensemble for Out-Of-Distribution Detection.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Layer-wise Model Pruning based on Mutual Information.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

DeVLBert: Out-of-Distribution Visio-Linguistic Pretraining With Causality.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Grounded, Controllable and Debiased Image Completion With Lexical Semantics.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Estimating Treatment Effect via Differentiated Confounder Matching.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

BertGCN: Transductive Text Classification by Combining GNN and BERT.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Modeling High-order Interactions across Multi-interests for Micro-video Reommendation (Student Abstract).
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Web-based Platform for K-12 AI Education in China.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

MANGO: A Mask Attention Guided One-Stage Scene Text Spotter.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Judgment Prediction via Injecting Legal Knowledge into Neural Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Recurrent Attention Network with Reinforced Generator for Visual Dialog.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Frame Augmented Alternating Attention Network for Video Question Answering.
IEEE Trans. Multim., 2020

An Attentive Sequence to Sequence Translator for Localizing Video Clips by Natural Language.
IEEE Trans. Multim., 2020

Treatment Effect Estimation via Differentiated Confounder Balancing and Regression.
ACM Trans. Knowl. Discov. Data, 2020

Adaptive Graph Representation Learning for Video Person Re-Identification.
IEEE Trans. Image Process., 2020

Context-Aware Graph Label Propagation Network for Saliency Detection.
IEEE Trans. Image Process., 2020

Context-Aware Deep Spatiotemporal Network for Hand Pose Estimation From Depth Images.
IEEE Trans. Cybern., 2020

Convolutional Reconstruction-to-Sequence for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Human-Centric Clothing Segmentation via Deformable Semantic Locality-Preserving Network.
IEEE Trans. Circuits Syst. Video Technol., 2020

Movie Question Answering via Textual Memory and Plot Graph.
IEEE Trans. Circuits Syst. Video Technol., 2020

Video Dialog via Multi-Grained Convolutional Self-Attention Context Multi-Modal Networks.
IEEE Trans. Circuits Syst. Video Technol., 2020

Towards a new generation of artificial intelligence in China.
Nat. Mach. Intell., 2020

OpenViDial: A Large-Scale, Open-Domain Dialogue Dataset with Visual Contexts.
CoRR, 2020

Self-Explaining Structures Improve NLP Models.
CoRR, 2020

Neural Semi-supervised Learning for Text Classification Under Large-Scale Pretraining.
CoRR, 2020

Summarize, Outline, and Elaborate: Long-Text Generation via Hierarchical Supervision from Extractive Summaries.
CoRR, 2020

Pair the Dots: Jointly Examining Training History and Test Stimuli for Model Interpretability.
CoRR, 2020

MGD-GAN: Text-to-Pedestrian generation through Multi-Grained Discrimination.
CoRR, 2020

Improving Robustness and Generality of NLP Models Using Disentangled Representations.
CoRR, 2020

Learning a Domain Classifier Bank for Unsupervised Adaptive Object Detection.
CoRR, 2020

Federated Mutual Learning.
CoRR, 2020

Learning Decomposed Representation for Counterfactual Inference.
CoRR, 2020

Rethinking Localization Map: Towards Accurate Object Perception with Self-Enhancement Maps.
CoRR, 2020

Stable Prediction via Leveraging Seed Variable.
CoRR, 2020

Balance-Subsampled Stable Prediction.
CoRR, 2020

Deep Sequential Feature Learning in Clinical Image Classification of Infectious Keratitis.
CoRR, 2020

Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions.
CoRR, 2020

Object-QA: Towards High Reliable Object Quality Assessment.
CoRR, 2020

Progressive Multi-Stage Learning for Discriminative Tracking.
CoRR, 2020

SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection.
CoRR, 2020

Grounded and Controllable Image Completion by Incorporating Lexical Semantics.
CoRR, 2020

Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units.
CoRR, 2020

Non-Autoregressive Neural Dialogue Generation.
CoRR, 2020

LAVA NAT: A Non-Autoregressive Translation Model with Look-Around Decoding and Vocabulary Attention.
CoRR, 2020

Legal Intelligence: Algorithmic, Data, and Social Challenges.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

TRIE: End-to-End Text Reading and Information Extraction for Document Understanding.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Poet: Product-oriented Video Captioner for E-commerce.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Photo Stream Question Answer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ADHD Intelligent Auxiliary Diagnosis System Based on Multimodal Information Fusion.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

DeVLBert: Learning Deconfounded Visio-Linguistic Representations.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Concept-based Explanation for Fine-grained Images and Its Application in Infectious Keratitis Classification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Comprehensive Information Integration Modeling Framework for Video Titling.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Dress like an Internet Celebrity: Fashion Retrieval in Videos.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Polar Relative Positional Encoding for Video-Language Segmentation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Description Based Text Classification with Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

De-Biased Court's View Generation with Causality.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

IB-M: A Flexible Framework to Align an Interpretable Model and a Black-box Model.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

CorefQA: Coreference Resolution as Query-based Span Prediction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Dice Loss for Data-imbalanced NLP Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Unified MRC Framework for Named Entity Recognition.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Text Perceptron: Towards End-to-End Arbitrary-Shaped Text Spotting.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Deep Q Learning Driven CT Pancreas Segmentation With Geometry-Aware U-Net.
IEEE Trans. Medical Imaging, 2019

Long-Form Video Question Answering via Dynamic Hierarchical Reinforced Networks.
IEEE Trans. Image Process., 2019

Deep Group-Wise Fully Convolutional Network for Co-Saliency Detection With Graph Propagation.
IEEE Trans. Image Process., 2019

User-Ranking Video Summarization With Multi-Stage Spatio-Temporal Representation.
IEEE Trans. Image Process., 2019

A Bilinear Ranking SVM for Knowledge Based Relation Prediction and Classification.
IEEE Trans. Big Data, 2019

Multi-Task Structure-Aware Context Modeling for Robust Keypoint-Based Object Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

State Distribution-Aware Sampling for Deep Q-Learning.
Neural Process. Lett., 2019

Coreference Resolution as Query-based Span Prediction.
CoRR, 2019

Large-scale Pretraining for Neural Machine Translation with Tens of Billions of Sentence Pairs.
CoRR, 2019

Adaptive Graph Representation Learning for Video Person Re-identification.
CoRR, 2019

Galaxy Learning - A Position Paper.
CoRR, 2019

Efficient Video Scene Text Spotting: Unifying Detection, Tracking, and Recognition.
CoRR, 2019

How Do Your Neighbors Disclose Your Information: Social-Aware Time Series Imputation.
Proceedings of the World Wide Web Conference, 2019

The ZJU-EDL System for Entity Discovery and Linking at TAC KBP 2019.
Proceedings of the 2019 Text Analysis Conference, 2019

Posterior-regularized REINFORCE for Instance Selection in Distant Supervision.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Walking with MIND: Mental Imagery eNhanceD Embodied QA.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Informative Visual Storytelling with Cross-modal Rules.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

You Only Recognize Once: Towards Fast Video Text Spotting.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning Dynamic Context Augmentation for Global Entity Linking.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Legal Summarization for Multi-role Debate Dialogue via Controversy Focus Mining and Multi-task Learning.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Understanding Default Behavior in Online Lending.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Distributed Modelling Approaches for Data Privacy Preserving.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

KCAT: A Knowledge-Constraint Typing Annotation Tool.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Cross-Relation Cross-Bag Attention for Distantly-Supervised Relation Extraction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Heterogeneous Attributed Network Embedding with Graph Convolutional Networks.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Spatio-Temporal Graph Routing for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Identifying Objective and Subjective Words via Topic Modeling.
IEEE Trans. Neural Networks Learn. Syst., 2018

Deep Context-Sensitive Facial Landmark Detection With Tree-Structured Modeling.
IEEE Trans. Image Process., 2018

Body Structure Aware Deep Crowd Counting.
IEEE Trans. Image Process., 2018

Transductive Zero-Shot Learning With a Self-Training Dictionary Approach.
IEEE Trans. Cybern., 2018

Multimodal Deep Embedding via Hierarchical Grounded Compositional Semantics.
IEEE Trans. Circuits Syst. Video Technol., 2018

Guest Editorial: Spatio-temporal Feature Learning for Unconstrained Video Analysis.
Multim. Tools Appl., 2018

Temporality-enhanced knowledge memory network for factoid question answering.
Frontiers Inf. Technol. Electron. Eng., 2018

Entity mention aware document representation.
Inf. Sci., 2018

Bi-Adversarial Auto-Encoder for Zero-Shot Learning.
CoRR, 2018

Textually Guided Ranking Network for Attentional Image Retweet Modeling.
CoRR, 2018

Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images.
CoRR, 2018

Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions.
CoRR, 2018

Find the Conversation Killers: A Predictive Study of Thread-ending Posts.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

To Stay or to Leave: Churn Prediction for Urban Migrants in the Initial Period.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

Two Step Joint Model for Drug Drug Interaction Extraction.
Proceedings of the 2018 Text Analysis Conference, 2018

Identify Shifts of Word Semantics through Bayesian Surprise.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Text-to-Image Synthesis via Visual-Memory Creative Adversarial Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Intra-view and Inter-view Attention for Multi-view Network Embedding.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Multi-modal Sequence to Sequence Learning with Content Attention for Hotspot Traffic Speed Prediction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Visual Dialog with Multi-turn Attentional Memory Network.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Reading Document and Answering Question via Global Attentional Inference.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Sequence Learning with Auxiliary Information for Traffic Prediction.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

Open-Ended Long-form Video Question Answering via Adaptive Hierarchical Reinforced Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Attentional Image Retweet Modeling via Multi-Faceted Ranking Network Learning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

HST-LSTM: A Hierarchical Spatial-Temporal Long-Short Term Memory Network for Location Prediction.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Video question answering via multi-granularity temporal attention network learning.
Proceedings of the 10th International Conference on Internet Multimedia Computing and Service, 2018

Dynamic Network Embedding by Modeling Triadic Closure Process.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Urban Dreams of Migrants: A Case Study of Migrant Integration in Shanghai.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Representation Learning for Scale-Free Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Multi-Label Community-Based Question Classification via Personalized Sequence Memory Network Learning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Bag-of-Discriminative-Words (BoDW) Representation via Topic Modeling.
IEEE Trans. Knowl. Data Eng., 2017

Temporal Interaction and Causal Influence in Community-Based Question Answering.
IEEE Trans. Knowl. Data Eng., 2017

Hierarchical Contextual Attention Recurrent Neural Network for Map Query Suggestion.
IEEE Trans. Knowl. Data Eng., 2017

Data-Dependent Label Distribution Learning for Age Estimation.
IEEE Trans. Image Process., 2017

Regularized Deep Belief Network for Image Attribute Detection.
IEEE Trans. Circuits Syst. Video Technol., 2017

Flickr group recommendation with auxiliary information in heterogeneous information networks.
Multim. Syst., 2017

Challenges and opportunities: from big data to knowledge in AI 2.0.
Frontiers Inf. Technol. Electron. Eng., 2017

Disambiguating named entities with deep supervised learning via crowd labels.
Frontiers Inf. Technol. Electron. Eng., 2017

Representation Learning for Scale-free Networks.
CoRR, 2017

The Y_dcd_zju Slot Filling System for TAC KBP 2017.
Proceedings of the 2017 Text Analysis Conference, 2017

The ZHI-EDL System for Entity Discovery and Linking at TAC KBP 2017.
Proceedings of the 2017 Text Analysis Conference, 2017

Learning Max-Margin GeoSocial Multimedia Network Representations for Point-of-Interest Suggestion.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

ENCORE: External Neural Constraints Regularized Distant Supervision for Relation Extraction.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Video Question Answering via Gradually Refined Attention over Appearance and Motion.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Learning Deep Contextual Attention Network for Narrative Photo Stream Captioning.
Proceedings of the Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Detecting Temporal Proposal for Action Localization with Tree-structured Search Policy.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Group-wise Deep Co-saliency Detection.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Top attention in line with time: A light-weight strategy.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Graph-theoretic spatiotemporal context modeling for video saliency detection.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

NITE: A Neural Inductive Teaching Framework for Domain Specific NER.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

2016
Learning of Multimodal Representations With Random Walks on the Click Graph.
IEEE Trans. Image Process., 2016

Joint Multilabel Classification With Community-Aware Label Graph Learning.
IEEE Trans. Image Process., 2016

DeepSaliency: Multi-Task Deep Neural Network Model for Salient Object Detection.
IEEE Trans. Image Process., 2016

Deep Learning Driven Visual Path Prediction From a Single Image.
IEEE Trans. Image Process., 2016

Aspect Learning for Multimedia Summarization via Nonparametric Bayesian.
IEEE Trans. Circuits Syst. Video Technol., 2016

Special issue on distributed computing and artificial intelligence.
Frontiers Inf. Technol. Electron. Eng., 2016

Kernelized sparse hashing for scalable image retrieval.
Neurocomputing, 2016

Concept over time: the combination of probabilistic topic model with wikipedia knowledge.
Expert Syst. Appl., 2016

LSTM-in-LSTM for generating long descriptions of images.
Comput. Vis. Media, 2016

Sentences Embedding for Slot Filling via Convolutional Neural Networks.
Proceedings of the 2016 Text Analysis Conference, 2016

ZJU Participation in TAC 2016 EDL task.
Proceedings of the 2016 Text Analysis Conference, 2016

The ijk System for EAL at TAC KBP 2016 Event Track.
Proceedings of the 2016 Text Analysis Conference, 2016

Ad Recommendation for Sponsored Search Engine via Composite Long-Short Term Memory.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Diverse Image Captioning via GroupTalk.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Self-Paced Boost Learning for Classification.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Community-Based Question Answering via Heterogeneous Social Network Learning.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Structured Visual Feature Learning for Classification via Supervised Probabilistic Tensor Factorization.
IEEE Trans. Multim., 2015

Probabilistic Word Selection via Topic Modeling.
IEEE Trans. Knowl. Data Eng., 2015

Cross-Modal Learning to Rank via Latent Joint Representation.
IEEE Trans. Image Process., 2015

Compact and Discriminative Descriptor Inference Using Multi-Cues.
IEEE Trans. Image Process., 2015

Weakly Semi-Supervised Deep Learning for Multi-Label Image Annotation.
IEEE Trans. Big Data, 2015

The classification of multi-modal data with hidden conditional random field.
Pattern Recognit. Lett., 2015

Deep learning driven blockwise moving object detection with binary scene modeling.
Neurocomputing, 2015

Tracking news article evolution by dense subgraph learning.
Neurocomputing, 2015

Topic aspect-oriented summarization via group selection.
Neurocomputing, 2015

Combining MIML and Distant Supervision for KBP Slot Filling.
Proceedings of the 2015 Text Analysis Conference, 2015

The ZJU-EDL System for Entity Discovery and Linking at TAC KBP 2015.
Proceedings of the 2015 Text Analysis Conference, 2015

Deep Compositional Cross-modal Learning to Rank via Local-Global Alignment.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Multi-modal Retrieval via Deep Textual-Visual Correlation Learning.
Proceedings of the Intelligence Science and Big Data Engineering. Image and Video Data Engineering, 2015

Sketch the Storyline with CHARCOAL: A Non-Parametric Approach.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

HTMVS: Visualizing hierarchical topics and their evolution.
Proceedings of the 10th IEEE Conference on Visual Analytics Science and Technology, 2015

Flickr group recommendation via heterogeneous information networks.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

RAISE: A Whole Process Modeling Method for Unstructured Data Management.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Metric Learning Driven Multi-Task Structured Output Optimization for Robust Keypoint Tracking.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Structured Embedding via Pairwise Relations and Long-Range Interactions in Knowledge Base.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Sparse Multi-Modal Hashing.
IEEE Trans. Multim., 2014

Multiple kernel learning with NOn-conVex group spArsity.
J. Vis. Commun. Image Represent., 2014

Hashing with List-Wise learning to rank.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Discriminative coupled dictionary hashing for fast cross-media retrieval.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Cross-Media Hashing with Neural Networks.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Learning Multimodal Neural Network with Ranking Examples.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Attribute prediction with long-range interactions via path coding.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Image annotation by semi-supervised cross-domain learning with group sparsity.
J. Vis. Commun. Image Represent., 2013

Hypergraph Spectral Hashing for image retrieval with heterogeneous social contexts.
Neurocomputing, 2013

A low rank structural large margin method for cross-modal ranking.
Proceedings of the 36th International ACM SIGIR conference on research and development in Information Retrieval, 2013

Cross-media semantic representation via bi-directional learning to rank.
Proceedings of the ACM Multimedia Conference, 2013

Cross-media topic mining on wikipedia.
Proceedings of the ACM Multimedia Conference, 2013

πLDA: document clustering with selective structural constraints.
Proceedings of the ACM Multimedia Conference, 2013

Multiple feature fusion for face recognition.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Supervised Coupled Dictionary Learning with Group Structures for Multi-modal Retrieval.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

Supervised Nonnegative Tensor Factorization with Maximum-Margin Constraint.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Web and Personal Image Annotation by Mining Label Correlation With Relaxed Visual Graph Embedding.
IEEE Trans. Image Process., 2012

Spline Regression Hashing for Fast Image Search.
IEEE Trans. Image Process., 2012

Image Annotation by Input-Output Structural Grouping Sparsity.
IEEE Trans. Image Process., 2012

Sparse Unsupervised Dimensionality Reduction for Multiple View Data.
IEEE Trans. Circuits Syst. Video Technol., 2012

Sparse spectral hashing.
Pattern Recognit. Lett., 2012

The heterogeneous feature selection with structural sparsity for multimedia annotation and hashing: a survey.
Int. J. Multim. Inf. Retr., 2012

LuSH: A Generic High-Dimensional Index Framework.
Proceedings of the Web-Age Information Management, 2012

Annotating web images using NOVA: NOn-conVex group spArsity.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Correlated attribute transfer with multi-task graph-guided fusion.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Supervised cross-collection topic modeling.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Logistic Tensor Regression for Classification.
Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012

Nonnegative Matrix Factorization for Multimodality Data from Multi-source Domain.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Graph-guided sparse reconstruction for region tagging.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor.
IEEE Trans. Vis. Comput. Graph., 2011

Stable multi-label boosting for image annotation with structural feature selection.
Sci. China Inf. Sci., 2011

Group sparse representation for image categorization and semantic video retrieval.
Sci. China Inf. Sci., 2011

Hypergraph spectral hashing for similarity search of social image.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Image annotation by composite kernel learning with group structure.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Inverse-degree Sampling for Spectral Clustering.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Tag Clustering and Refinement on Semantic Unity Graph.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Multi-label Image Annotation by Structural Grouping Sparsity.
Proceedings of the Social Media Modeling and Computing, 2011

2010
Multi-Label Transfer Learning With Sparse Representation.
IEEE Trans. Circuits Syst. Video Technol., 2010

Cross-media retrieval using query dependent search methods.
Pattern Recognit., 2010

A group of novel approaches and a toolkit for motion capture data reusing.
Multim. Tools Appl., 2010

Multiple hypergraph ranking for video concept detection.
J. Zhejiang Univ. Sci. C, 2010

Multiple Hypergraph Clustering of Web Images by Mining Word2Image Correlations.
J. Comput. Sci. Technol., 2010

Classification by semi-supervised discriminative regularization.
Neurocomputing, 2010

Heterogeneous feature selection by group lasso with logistic regression.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multi-label boosting for image annotation by structural grouping sparsity.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Automatic annotation of geo-information in panoramic street view by image retrieval.
Proceedings of the International Conference on Image Processing, 2010

Sparse representation using nonnegative curds and whey.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Multi-Task Sparse Discriminant Analysis (MtSDA) with Overlapping Categories.
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, 2010

2009
Tensor-Based Transductive Learning for Multimodality Video Semantic Concept Detection.
IEEE Trans. Multim., 2009

Multi-modality video shot clustering with tensor representation.
Multim. Tools Appl., 2009

Local and global approaches of affinity propagation clustering for large scale data.
CoRR, 2009

Web image interpretation: semi-supervised mining annotated words.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Chinese Brush Calligraphy Character Retrieval and Learning.
Proceedings of the Methods and Applications for Advancing Distance Education Technologies, 2009

2008
Mining Semantic Correlation of Heterogeneous Multimedia Data for Cross-Media Retrieval.
IEEE Trans. Multim., 2008

Harmonizing Hierarchical Manifolds for Multimedia Document Semantics Understanding and Cross-Media Retrieval.
IEEE Trans. Multim., 2008

An encoding-based dual distance tree high-dimensional index.
Sci. China Ser. F Inf. Sci., 2008

Search-Based Automatic Web Image Annotation Using Latent Visual and Semantic Analysis.
Proceedings of the Advances in Multimedia Information Processing, 2008

Active post-refined multimodality video semantic concept detection with tensor representation.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Automatic Video Summarization by Affinity Propagation Clustering and Semantic Content Mining.
Proceedings of The International Symposium on Electronic Commerce and Security, 2008

Clustering by evidence accumulation on affinity propagation.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Adaptive and compact shape descriptor by progressive feature combination and selection with boosting.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Manifold Learning Based Cross-media Retrieval: A Solution to Media Object Complementary Nature.
J. VLSI Signal Process., 2007

Hallucinating faces: LPH super-resolution and neighbor reconstruction for residue compensation.
Pattern Recognit., 2007

Composite Distance Transformation for Indexing and k-Nearest-Neighbor Searching in High-Dimensional Spaces.
J. Comput. Sci. Technol., 2007

Hierarchical Approximate Matching for Retrieval of Chinese Historical Calligraphy Character.
J. Comput. Sci. Technol., 2007

Chinese Brush Calligraphy Character Retrieval and Learning.
Int. J. Distance Educ. Technol., 2007

Using CONDENSATION Tracking to Recover Stroke Order of Chinese Calligraphic Handwritings with CCM.
Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services, 2007

Fast Answering k-Nearest-Neighbor Queries over Large Image Databases Using Dual Distance Transformation.
Proceedings of the Advances in Multimedia Modeling, 2007

Bridging the Gap Between Visual and Auditory Feature Spaces for Cross-Media Retrieval.
Proceedings of the Advances in Multimedia Modeling, 2007

Video Semantic Concept Detection Using Multi-modality Subspace Correlation Propagation.
Proceedings of the Advances in Multimedia Modeling, 2007

Cross-modal correlation learning for clustering on image-audio dataset.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Research of 3D Chinese Calligraphic Handwriting Recur System and Its Key Algorithm.
Proceedings of the Computer Vision/Computer Graphics Collaboration Techniques, 2007

A Prediction Error Compression Method with Tensor-PCA in Video Coding.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

A Piece-Wise Learning Approach to 3D Facial Animation.
Proceedings of the Advances in Web Based Learning, 2007

A Novel Scalable Texture Video Coding Scheme with GPCA.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speeding Up Similarity Queries over Large Chinese Calligraphic Character Databases Using Data Grid.
Proceedings of the Grid and Cooperative Computing, 2007

2006
k Nearest Neighbor Queries Based on Data Grid.
J. Comput. Res. Dev., 2006

Zhejiang University at TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Data-driven Generation of Decision Tree based on Ensemble Multiple-instance Learning for Motion Retrieval.
Proceedings of the IEEE International Conference on Systems, 2006

An approach for cross-media retrieval with cross-reference graph and PageRank.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Learning Semantic Correlations for Cross-Media Retrieval.
Proceedings of the International Conference on Image Processing, 2006

An Efficient Keyframe Extraction from Motion Capture Data.
Proceedings of the Advances in Computer Graphics, 2006

Video-Based Facial Expression Hallucination: A Two-Level Hierarchical Fusion Approach.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2006

2005
Steerable pyramid-based face hallucination.
Pattern Recognit., 2005

Automatic generation of human animation based on motion programming.
Comput. Animat. Virtual Worlds, 2005

Understanding Multimedia Document Semantics for Cross-Media Retrieval.
Proceedings of the Advances in Multimedia Information Processing, 2005

Selection of Optimal Features for Iris Recognition.
Proceedings of the Advances in Neural Networks - ISNN 2005, Second International Symposium on Neural Networks, Chongqing, China, May 30, 2005

Web-Based Chinese Calligraphy Retrieval and Learning System.
Proceedings of the Advances in Web-Based Learning - ICWL 2005, 4th International Conference, Hong Kong, China, July 31, 2005

Segmenting Layers in Automated Visual Surveillance.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Research on Grid-Aware Mechanisms and Issues for CADAL Project.
Proceedings of the Digital Libraries: Implementing Strategies and Sharing Experiences, 2005

2004
Towards Comprehensive 3D Enabled Web-Based Learning.
Int. J. Comput. Process. Orient. Lang., 2004

Improving Web-Based Learning: Automatic Annotation of Multimedia Semantics and Cross-Media Indexing.
Proceedings of the Advances in Web-Based Learning, 2004

A Novel Watermarking Scheme Based on Video Content.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004

A Two-Step Approach to Multiple Facial Feature Tracking: Temporal Particle Filter and Spatial Belief Propagation.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

2003
3D motion retrieval with motion index tree.
Comput. Vis. Image Underst., 2003

Subdivision Feedback Based 3D Facial Modeling for E-learning.
Proceedings of the Advances in Web-Based Learning, 2003

3D Model and Motion Retrieval: The Extended Dimensions for Web-Based Learning.
Proceedings of the Advances in Web-Based Learning, 2003

2002
Audio Retrieval with Fast Relevance Feedback Based on Constrained Fuzzy Clustering and Stored Index Table.
Proceedings of the Advances in Multimedia Information Processing, 2002

