2025
Bidirectional Mask Selection for Zero-Shot Referring Image Segmentation.
IEEE Trans. Circuits Syst. Video Technol., January, 2025
T2TD: Text-3D Generation Model Based on Prior Knowledge Guidance.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2025
Multi-level semantics probability embedding for image-text matching.
Inf. Process. Manag., 2025
2024
Privacy-preserving Multi-source Cross-domain Recommendation Based on Knowledge Graph.
ACM Trans. Multim. Comput. Commun. Appl., May, 2024
Cross-domain correlation representation for new fault categories discovery in rolling bearings.
Inf. Process. Manag., March, 2024
Strong robust copy-move forgery detection network based on layer-by-layer decoupling refinement.
Inf. Process. Manag., March, 2024
Temporal-Spatial Correlation Attention Network for Clinical Data Analysis in Intensive Care Unit.
IEEE Trans. Biomed. Eng., February, 2024
Adaptive semantic transfer network for unsupervised 2D image-based 3D model retrieval.
Comput. Vis. Image Underst., January, 2024
CDCM: ChatGPT-Aided Diversity-Aware Causal Model for Interactive Recommendation.
IEEE Trans. Multim., 2024
Knowledge-Enhanced Causal Reinforcement Learning Model for Interactive Recommendation.
IEEE Trans. Multim., 2024
Long Dialogue Emotion Detection Based on Commonsense Knowledge Graph Guidance.
IEEE Trans. Multim., 2024
Event-Aware Retrospective Learning for Knowledge-Based Image Captioning.
IEEE Trans. Multim., 2024
Inter- and Intra-Domain Potential User Preferences for Cross-Domain Recommendation.
IEEE Trans. Multim., 2024
Dual-Stage Uncertainty Modeling for Unsupervised Cross-Domain 3D Model Retrieval.
IEEE Trans. Multim., 2024
Multi-modal fusion network guided by prior knowledge for 3D CAD model recognition.
Neurocomputing, 2024
3D shape knowledge graph for cross-domain 3D shape retrieval.
CAAI Trans. Intell. Technol., 2024
Image-Centered Pseudo Label Generation for Weakly Supervised Text-Based Person Re-Identification.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024
Causal Intervention for Brain Tumor Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024
Beyond Users: Denoising Behavior-based Contrastive Learning for Disentangled Cross-Domain Recommendation.
Proceedings of the Database Systems for Advanced Applications, 2024
CAT-DM: Controllable Accelerated Virtual Try-On with Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
AnyScene: Customized Image Synthesis with Composited Foreground.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Graph Disentangled Contrastive Learning with Personalized Transfer for Cross-Domain Recommendation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
A comprehensive survey on deep-learning-based visual captioning.
Multim. Syst., December, 2023
Brain tumor image segmentation based on prior knowledge via transformer.
Int. J. Imaging Syst. Technol., November, 2023
MRFT: Multiscale Recurrent Fusion Transformer Based Prior Knowledge for Bit-Depth Enhancement.
IEEE Trans. Circuits Syst. Video Technol., October, 2023
Deep reinforcement learning framework for thoracic diseases classification via prior knowledge guidance.
Comput. Medical Imaging Graph., September, 2023
Principal views selection based on growing graph convolution network for multi-view 3D model recognition.
Appl. Intell., March, 2023
Self-supervised Image-based 3D Model Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2023
Cross-Domain Recommendation Via User-Clustering and Multidimensional Information Fusion.
IEEE Trans. Multim., 2023
CPG3D: Cross-Modal Priors Guided 3D Object Reconstruction.
IEEE Trans. Multim., 2023
Unsupervised Cross-Media Graph Convolutional Network for 2D Image-Based 3D Model Retrieval.
IEEE Trans. Multim., 2023
A Multiscale Graph Convolutional Neural Network Framework for Fault Diagnosis of Rolling Bearing.
IEEE Trans. Instrum. Meas., 2023
MV-CLIP: Multi-View CLIP for Zero-shot 3D Shape Recognition.
CoRR, 2023
Image-Based Virtual Try-On: A Survey.
CoRR, 2023
Dynamic Causal Disentanglement Model for Dialogue Emotion Detection.
CoRR, 2023
Reinforcement Learning Based Multi-modal Feature Fusion Network for Novel Class Discovery.
CoRR, 2023
Causal Disentanglement Hidden Markov Model for Fault Diagnosis.
CoRR, 2023
Point-PC: Point Cloud Completion Guided by Prior Knowledge via Causal Inference.
CoRR, 2023
Chest X-ray Image Classification: A Causal Perspective.
CoRR, 2023
Multi-Level Residual Feature Fusion Network for Thoracic Disease Classification in Chest X-Ray Images.
IEEE Access, 2023
Point Cloud Adversarial Perturbation Generation for Adversarial Attacks.
IEEE Access, 2023
Instrumental Variable Learning for Chest X-ray Classification.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023
My Brother Helps Me: Node Injection Based Adversarial Attack on Social Bot Detection.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Efficient Spatio-Temporal Video Grounding with Semantic-Guided Feature Decomposition.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Progressive Positive Association Framework for Image and Text Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Chest X-ray Image Classification: A Causal Perspective.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023
Unknown Fault Detection of Rolling Bearing Based on Similarity Mining of Stationary and Non-stationary Features.
Proceedings of the 4th International Workshop on Human-centric Multimedia Analysis, 2023
2022
Toward Region-Aware Attention Learning for Scene Graph Generation.
IEEE Trans. Neural Networks Learn. Syst., 2022
LR-GCN: Latent Relation-Aware Graph Convolutional Network for Conversational Emotion Recognition.
IEEE Trans. Multim., 2022
I-GCN: Incremental Graph Convolution Network for Conversation Emotion Detection.
IEEE Trans. Multim., 2022
Monocular Image-Based 3-D Model Retrieval: A Benchmark.
IEEE Trans. Cybern., 2022
Deep Correlated Joint Network for 2-D Image-Based 3-D Model Retrieval.
IEEE Trans. Cybern., 2022
CLN: Cross-Domain Learning Network for 2D Image-Based 3D Shape Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2022
Region-Aware Image Captioning via Interaction Learning.
IEEE Trans. Circuits Syst. Video Technol., 2022
Residual-Guided Multiscale Fusion Network for Bit-Depth Enhancement.
IEEE Trans. Circuits Syst. Video Technol., 2022
Joint Local Correlation and Global Contextual Information for Unsupervised 3D Model Retrieval and Classification.
IEEE Trans. Circuits Syst. Video Technol., 2022
Iterative Residual Feature Refinement Network for Bit-Depth Enhancement.
IEEE Signal Process. Lett., 2022
LD-GAN: Learning perturbations for adversarial defense based on GAN structure.
Signal Process. Image Commun., 2022
PAGN: perturbation adaption generation network for point cloud adversarial defense.
Multim. Syst., 2022
JFLN: Joint Feature Learning Network for 2D sketch based 3D shape retrieval.
J. Vis. Commun. Image Represent., 2022
LIMAN: Local Information-Based Multiattention Network for 3D Shape Recognition.
IEEE Multim., 2022
3D Shape Knowledge Graph for Cross-domain and Cross-modal 3D Shape Retrieval.
CoRR, 2022
SHREC'22 track: Open-Set 3D Object Retrieval.
Comput. Graph., 2022
HMTN: Hierarchical Multi-scale Transformer Network for 3D Shape Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Multigranular Visual-Semantic Embedding for Cloth-Changing Person Re-identification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Cross-Domain 3D Model Retrieval Based On Contrastive Learning And Label Propagation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
2021
MV-LFN: Multi-view based local information fusion network for 3D shape recognition.
Vis. Informatics, 2021
Image Captioning with multi-level similarity-guided semantic matching.
Vis. Informatics, 2021
PGNet: Progressive Feature Guide Learning Network for Three-dimensional Shape Recognition.
ACM Trans. Multim. Comput. Commun. Appl., 2021
MMFN: Multimodal Information Fusion Networks for 3D Model Classification and Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2021
Universal Cross-Domain 3D Model Retrieval.
IEEE Trans. Multim., 2021
C-GCN: Correlation Based Graph Convolutional Network for Audio-Video Emotion Recognition.
IEEE Trans. Multim., 2021
M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval.
IEEE Trans. Multim., 2021
3D Pose Estimation Based on Reinforce Learning for 2D Image-Based 3D Model Retrieval.
IEEE Trans. Multim., 2021
Adaptively Clustering-Driven Learning for Visual Relationship Detection.
IEEE Trans. Multim., 2021
DAN: Deep-Attention Network for 3D Shape Recognition.
IEEE Trans. Image Process., 2021
Scene Graph Inference via Multi-Scale Context Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2021
Interactive Multimodal Attention Network for Emotion Recognition in Conversation.
IEEE Signal Process. Lett., 2021
MHFP: Multi-view based hierarchical fusion pooling method for 3D shape recognition.
Pattern Recognit. Lett., 2021
Coupled-dynamic learning for vision and language: Exploring Interaction between different tasks.
Pattern Recognit., 2021
Exposing DeepFake Videos Using Attention Based Convolutional LSTM Network.
Neural Process. Lett., 2021
Multi-modal feature fusion based on multi-layers LSTM for video emotion recognition.
Multim. Tools Appl., 2021
Multi-type decision fusion network for visual Q&A.
Image Vis. Comput., 2021
Hierarchical multi-view context modelling for 3D object classification and retrieval.
Inf. Sci., 2021
Emotion Detection for Conversations Based on Reinforcement Learning Framework.
IEEE Multim., 2021
SVHAN: Sequential View Based Hierarchical Attention Network for 3D Shape Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Triangle-Reward Reinforcement Learning: A Visual-Linguistic Semantic Alignment for Image Captioning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
2020
Multi-View Graph Matching for 3D Model Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2020
HGAN: Holistic Generative Adversarial Networks for Two-dimensional Image-based Three-dimensional Object Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2020
Multi-View Saliency Guided Deep Neural Network for 3-D Object Retrieval and Classification.
IEEE Trans. Multim., 2020
Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning.
IEEE Trans. Multim., 2020
Sequential Saliency Guided Deep Neural Network for Joint Mitosis Identification and Localization in Time-Lapse Phase Contrast Microscopy Images.
IEEE J. Biomed. Health Informatics, 2020
Joint Heterogeneous Feature Learning and Distribution Alignment for 2D Image-Based 3D Object Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2020
Subgraph learning for graph matching.
Pattern Recognit. Lett., 2020
An End-to-End Perceptual Quality Assessment Method via Score Distribution Prediction.
Neural Process. Lett., 2020
3D model retrieval based on multi-view attentional convolutional neural network.
Multim. Tools Appl., 2020
Joint deep feature learning and unsupervised visual domain adaptation for cross-domain 3D object retrieval.
Inf. Process. Manag., 2020
3D Model Retrieval Based on a 3D Shape Knowledge Graph.
IEEE Access, 2020
Two-Stream Network Based on Visual Saliency Sharing for 3D Model Recognition.
IEEE Access, 2020
MVCLN: Multi-View Convolutional LSTM Network for Cross-Media 3D Shape Recognition.
IEEE Access, 2020
MPAN: Multi-Part Attention Network for Point Cloud Based 3D Shape Retrieval.
IEEE Access, 2020
View-Based 3D Model Retrieval by Joint Subgraph Learning and Matching.
IEEE Access, 2020
Pairwise View Weighted Graph Network for View-based 3D Model Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020
Semantic Consistency Guided Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Multi-graph Convolutional Network for Unsupervised 3D Shape Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Hierarchical Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Consistent Domain Structure Learning and Domain Alignment for 2D Image-Based 3D Objects Retrieval.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
SHREC 2020 Track: Extended Monocular Image Based 3D Model Retrieval.
Proceedings of the 13th Eurographics Workshop on 3D Object Retrieval, 2020
2019
Multi-modal Correlated Network for emotion recognition in speech.
Vis. Informatics, 2019
Multi-Domain and Multi-Task Learning for Human Action Recognition.
IEEE Trans. Image Process., 2019
Dual-Stream Recurrent Neural Network for Video Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2019
Hyper-Clique Graph Matching and Applications.
IEEE Trans. Circuits Syst. Video Technol., 2019
3D Object Retrieval Based on Multi-View Latent Variable Model.
IEEE Trans. Circuits Syst. Video Technol., 2019
Location emotion recognition for travel recommendation based on social network.
Signal Image Video Process., 2019
Visual attribute detection for pedestrian detection.
Multim. Tools Appl., 2019
The assessment of 3D model representation for retrieval with CNN-RNN networks.
Multim. Tools Appl., 2019
Multi-guiding long short-term memory for video captioning.
Multim. Syst., 2019
Pooled time series representation for mitosis event recognition.
Multim. Syst., 2019
Panorama based on multi-channel-attention CNN for 3D model recognition.
Multim. Syst., 2019
Scene graph captioner: Image captioning based on structural visual representation.
J. Vis. Commun. Image Represent., 2019
Unsupervised Feature Learning With Graph Embedding for View-Based 3D Model Retrieval.
IEEE Access, 2019
SRNet: Structured Relevance Feature Learning Network From Skeleton Data for Human Action Recognition.
IEEE Access, 2019
End-to-End Visual Domain Adaptation Network for Cross-Domain 3D CPS Data Retrieval.
IEEE Access, 2019
Frequency Estimator Based on Spectrum Correction and Remainder Sifting for Undersampled Real-Valued Waveforms.
IEEE Access, 2019
Dual-level Embedding Alignment Network for 2D Image-Based 3D Object Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
MMJN: Multi-Modal Joint Networks for 3D Shape Recognition.
Proceedings of the 27th ACM International Conference on Multimedia, 2019
Characteristic Views Extraction Modal Based-on Deep Reinforcement Learning for 3D Model Retrieval.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Monocular Image Based 3D Model Retrieval.
Proceedings of the 12th Eurographics Workshop on 3D Object Retrieval, 2019
2018
View-Based 3-D Model Retrieval: A Benchmark.
IEEE Trans. Cybern., 2018
View-Based 3D Model Retrieval via Multi-graph Matching.
Neural Process. Lett., 2018
Mitosis event recognition and detection based on evolution of feature in time domain.
Mach. Vis. Appl., 2018
Multi-scale CNNs for 3D model retrieval.
Multim. Tools Appl., 2018
View-based 3D model retrieval via supervised multi-view feature learning.
Multim. Tools Appl., 2018
3D model retrieval via single image based on feature mapping.
Multim. Tools Appl., 2018
Attention-in-Attention Networks for Surveillance Video Understanding in Internet of Things.
IEEE Internet Things J., 2018
Human Action Recognition Based on Selected Spatio-Temporal Features via Bidirectional LSTM.
IEEE Access, 2018
PANORAMA-Based Multi-Scale and Multi-Channel CNN for 3D Model Retrieval.
Proceedings of the IEEE Visual Communications and Image Processing, 2018
Hierarchical Graph Structure Learning for Multi-View 3D Model Retrieval.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Multi-Level Policy and Reward Reinforcement Learning for Image Captioning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Cross-Domain 3D Model Retrieval via Visual Domain Adaption.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
2D Scene Sketch-Based 3D Scene Retrieval.
Proceedings of the 11th Eurographics Workshop on 3D Object Retrieval, 2018
RGB-D Object-to-CAD Retrieval.
Proceedings of the 11th Eurographics Workshop on 3D Object Retrieval, 2018
2D Image-Based 3D Scene Retrieval.
Proceedings of the 11th Eurographics Workshop on 3D Object Retrieval, 2018
2017
Multi-Grained Random Fields for Mitosis Identification in Time-Lapse Phase Contrast Microscopy Image Sequences.
IEEE Trans. Medical Imaging, 2017
Benchmarking a Multimodal and Multiview and Interactive Dataset for Human Action Recognition.
IEEE Trans. Cybern., 2017
Modeling Temporal Information of Mitotic for Mitotic Event Detection.
IEEE Trans. Big Data, 2017
Hierarchical Clustering Multi-Task Learning for Joint Human Action Grouping and Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2017
Automatic report generation based on multi-modal information.
Multim. Tools Appl., 2017
3D object retrieval based on Spatial+LDA model.
Multim. Tools Appl., 2017
Convolutional deep learning for 3D object retrieval.
Multim. Syst., 2017
Multimedia venue semantic modeling based on multimodal data.
J. Vis. Commun. Image Represent., 2017
Multi-view feature extraction based on slow feature analysis.
Neurocomputing, 2017
3D models retrieval algorithm based on multimodal data.
Neurocomputing, 2017
Multi-layers CNNs for 3D Model Retrieval.
Proceedings of the Internet Multimedia Computing and Service, 2017
2016
Multi-Modal Clique-Graph Matching for View-Based 3D Model Retrieval.
IEEE Trans. Image Process., 2016
Quality models for venue recommendation in location-based social network.
Multim. Tools Appl., 2016
Geo-location driven image tagging via cross-domain learning.
Multim. Syst., 2016
Cross-domain semantic transfer from large-scale social media.
Multim. Syst., 2016
3D object retrieval based on sparse coding in weak supervision.
J. Vis. Commun. Image Represent., 2016
Cross-view action recognition by cross-domain learning.
Image Vis. Comput., 2016
Effective 3D object detection based on detector and tracker.
Neurocomputing, 2016
HEp-2 cells Classification via clustered multi-task learning.
Neurocomputing, 2016
Evaluation of local spatial-temporal features for cross-view action recognition.
Neurocomputing, 2016
3D Convolutional Networks-Based Mitotic Event Detection in Time-Lapse Phase Contrast Microscopy Image Sequences of Stem Cell Populations.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016
3D Object Retrieval with Multimodal Views.
Proceedings of the 9th Eurographics Workshop on 3D Object Retrieval, 2016
2015
Semantic-Based Location Recommendation With Multimodal Venue Semantics.
IEEE Trans. Multim., 2015
Coupled hidden conditional random fields for RGB-D human action recognition.
Signal Process., 2015
Multiple Person Tracking based on Spatial-temporal Information by Global Graph Clustering.
KSII Trans. Internet Inf. Syst., 2015
Graph-based characteristic view set extraction and matching for 3D model retrieval.
Inf. Sci., 2015
3D Model Retrieval with Weighted Locality-constrained Group Sparse Coding.
Neurocomputing, 2015
TJU-TJUT@TRECVID 2015: Surveillance Event Detection.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Multi-modal & Multi-view & Interactive Benchmark Dataset for Human Action Recognition.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Clique-graph matching by preserving global & local structure.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
3D Object Retrieval with Multimodal Views.
Proceedings of the 8th Eurographics Workshop on 3D Object Retrieval, 2015
2014
Single/cross-camera multiple-person tracking by graph matching.
Neurocomputing, 2014
Multi-view action recognition by cross-domain learning.
Proceedings of the IEEE 16th International Workshop on Multimedia Signal Processing, 2014
2013
Venue Semantics: Multimedia Topic Modeling of Social Media Contents.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013
An Effective Tracking System for Multiple Object Tracking in Occlusion Scenes.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013
2012
Multiple Person Tracking by Spatiotemporal Tracklet Association.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012