Zan Gao

Orcid: 0000-0001-6970-491X

According to our database1, Zan Gao authored at least 122 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Talking Face Generation With Audio-Deduced Emotional Landmarks.
IEEE Trans. Neural Networks Learn. Syst., October, 2024

A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Universal Relocalizer for Weakly Supervised Referring Expression Grounding.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Semantic-Aware Contrastive Learning With Proposal Suppression for Video Semantic Role Grounding.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

A Semantic Perception and CNN-Transformer Hybrid Network for Occluded Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

SCREAM: Knowledge sharing and compact representation for class incremental learning.
Inf. Process. Manag., March, 2024

GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning.
IEEE Trans. Inf. Forensics Secur., 2024

Fine-Grained Temporal-Enhanced Transformer for Dynamic Facial Expression Recognition.
IEEE Signal Process. Lett., 2024

Adaptive knowledge transfer for class incremental learning.
Pattern Recognit. Lett., 2024

Illumination-aware divide-and-conquer network for improperly-exposed image enhancement.
Neural Networks, 2024

Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification.
CoRR, 2024

MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection.
CoRR, 2024

A Coarse to Fine Detection Method for Prohibited Object in X-ray Images Based on Progressive Transformer Decoder.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MS-YOLOv5s: An Improved YOLOv5s for the Detection of Imperceptible Defects on Steel Surfaces.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

2023
Review Polarity-Wise Recommender.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Attribute-Guided Collaborative Learning for Partial Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Disentangled Graph Neural Networks for Session-Based Recommendation.
IEEE Trans. Knowl. Data Eng., August, 2023

TBNet: A Two-Stream Boundary-Aware Network for Generic Image Manipulation Localization.
IEEE Trans. Knowl. Data Eng., July, 2023

A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Multim., 2023

Toward Fine-Grained Talking Face Generation.
IEEE Trans. Image Process., 2023

A Multitemporal Scale and Spatial-Temporal Transformer Network for Temporal Action Localization.
IEEE Trans. Hum. Mach. Syst., 2023

Continual Learning with Strong Experience Replay.
CoRR, 2023

Identity-Guided Collaborative Learning for Cloth-Changing Person Reidentification.
CoRR, 2023

Multi-Behavior Recommendation with Cascading Graph Convolution Networks.
Proceedings of the ACM Web Conference 2023, 2023

A Novel Temporal Channel Enhancement and Contextual Excavation Network for Temporal Action Localization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

2022
Pairwise Two-Stream ConvNets for Cross-Domain Action Recognition With Small Data.
IEEE Trans. Neural Networks Learn. Syst., 2022

Frame-Wise Cross-Modal Matching for Video Moment Retrieval.
IEEE Trans. Multim., 2022

A Temporal-Aware Relation and Attention Network for Temporal Action Localization.
IEEE Trans. Image Process., 2022

Domain-Adversarial-Guided Siamese Network for Unsupervised Cross-Domain 3-D Object Retrieval.
IEEE Trans. Cybern., 2022

A Novel Multiple-View Adversarial Learning Network for Unsupervised Domain Adaptation Action Recognition.
IEEE Trans. Cybern., 2022

Joint Local Correlation and Global Contextual Information for Unsupervised 3D Model Retrieval and Classification.
IEEE Trans. Circuits Syst. Video Technol., 2022

Multi-Level View Associative Convolution Network for View-Based 3D Model Retrieval.
IEEE Trans. Circuits Syst. Video Technol., 2022

Editorial paper for Pattern Recognition Letters VSI on cross model understanding for visual question answering.
Pattern Recognit. Lett., 2022

Multiview clustering via consistent and specific nonnegative matrix factorization with graph regularization.
Multim. Syst., 2022

Semantically guided projection for zero-shot 3D model classification and retrieval.
Multim. Syst., 2022

Temporal Action Localization with Multi-temporal Scales.
CoRR, 2022

A Semantic-aware Attention and Visual Shielding Network for Cloth-changing Person Re-identification.
CoRR, 2022

HMTN: Hierarchical Multi-scale Transformer Network for 3D Shape Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Multigranular Visual-Semantic Embedding for Cloth-Changing Person Re-identification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Generic Image Manipulation Localization through the Lens of Multi-scale Spatial Inconsistence.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Introduction to the Special Issue on Fine-grained Visual Computing.
ACM Trans. Multim. Comput. Commun. Appl., 2021

DCR: A Unified Framework for Holistic/Partial Person ReID.
IEEE Trans. Multim., 2021

Video Moment Localization via Deep Cross-Modal Hashing.
IEEE Trans. Image Process., 2021

A Pairwise Attentive Adversarial Spatiotemporal Network for Cross-Domain Few-Shot Action Recognition-R2.
IEEE Trans. Image Process., 2021

A bus passenger re-identification dataset and a deep learning baseline using triplet embedding.
Multim. Tools Appl., 2021

Pairwise attention network for cross-domain image recognition.
Neurocomputing, 2021

Class consistent and joint group sparse representation model for image classification in Internet of Medical Things.
Comput. Commun., 2021

Interest-aware Message-Passing GCN for Recommendation.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Dynamic Modality Interaction Modeling for Image-Text Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

SVHAN: Sequential View Based Hierarchical Attention Network for 3D Shape Recognition.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

A Novel Patch Convolutional Neural Network for View-based 3D Model Retrieval.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Exploring Deep Learning for View-Based 3D Model Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Pairwise Generalization Network for Cross-Domain Image Recognition.
Neural Process. Lett., 2020

Multiple Discrimination and Pairwise CNN for view-based 3D object retrieval.
Neural Networks, 2020

3D Object retrieval based on non-local graph neural networks.
Multim. Tools Appl., 2020

Parsing human image by fusing semantic and spatial features: A deep learning approach.
Inf. Process. Manag., 2020

Unsupervised Deep Cross-modal Hashing with Virtual Label Regression.
Neurocomputing, 2020

Frame-wise Cross-modal Match for Video Moment Retrieval.
CoRR, 2020

An Incremental Learning Based Edge Caching System: From Modeling to Evaluation.
IEEE Access, 2020

A Multimodal Pairwise Discrimination Network for Cross-Domain Action Recognition.
IEEE Access, 2020

Pairwise View Weighted Graph Network for View-based 3D Model Retrieval.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Integrating aspect-aware interactive attention and emotional position-aware for multi-aspect sentiment analysis.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Attention feature matching for weakly-supervised video relocalization.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Multi-graph Convolutional Network for Unsupervised 3D Shape Retrieval.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

What Aspect Do You Like: Multi-scale Time-aware User Interest Modeling for Micro-video Recommendation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Texture Semantically Aligned with Visibility-aware for Partial Person Re-identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Attention Stereo Matching Network.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

2019
Multi-view and multivariate gaussian descriptor for 3D object retrieval.
Multim. Tools Appl., 2019

基于残差网络的三维模型检索算法 (3-D Model Retrieval Algorithm Based on Residual Network).
计算机科学, 2019

Adaptive Fusion and Category-Level Dictionary Learning Model for Multiview Human Action Recognition.
IEEE Internet Things J., 2019

Cognitive-inspired class-statistic matching with triple-constrain for camera free 3D object retrieval.
Future Gener. Comput. Syst., 2019

Deep Spatial Pyramid Features Collaborative Reconstruction for Partial Person ReID.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Multi-classfication Sentiment Analysis Based on the Fused Model.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

2018
MMA: a multi-view and multi-modality benchmark dataset for human action recognition.
Multim. Tools Appl., 2018

3D object recognition based on pairwise Multi-view Convolutional Neural Networks.
J. Vis. Commun. Image Represent., 2018

Exploring the Cross-Domain Action Recognition Problem by Deep Feature Learning and Cross-Domain Learning.
IEEE Access, 2018

Stereo Matching Based on Density Segmentation and Non-Local Cost Aggregation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Group-Pair Convolutional Neural Networks for Multi-View Based 3D Object Retrieval.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Evaluation of regularized multi-task leaning algorithms for single/multi-view human action recognition.
Multim. Tools Appl., 2017

Collaborative sparse representation leaning model for RGBD action recognition.
J. Vis. Commun. Image Represent., 2017

3D human action recognition model based on image set and regularized multi-task leaning.
Neurocomputing, 2017

Local Shrunk Discriminant Analysis (LSDA).
CoRR, 2017

Acute effect of active video games on older children's mood change.
Comput. Hum. Behav., 2017

Acute Effect of Virtual Reality Exercise Bike Games on College Students' Physiological and Psychological Outcomes.
Cyberpsychology Behav. Soc. Netw., 2017

Segment-tree based cost aggregation for stereo matching with enhanced segmentation advantage.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Human action recognition on depth dataset.
Neural Comput. Appl., 2016

Multi-dimensional human action recognition model based on image set and group sparisty.
Neurocomputing, 2016

Evaluation of local spatial-temporal features for cross-view action recognition.
Neurocomputing, 2016

Object segmentation of indoor scenes using perceptual organization on RGB-D images.
Proceedings of the 8th International Conference on Wireless Communications & Signal Processing, 2016

Reverse Testing Image Set Model Based Multi-view Human Action Recognition.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

A Fast 3D Retrieval Algorithm via Class-Statistic and Pair-Constraint Model.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Iterative color-depth MST cost aggregation for stereo matching.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Registration of depth maps based on IRLS-ICP-TPS.
Proceedings of the 9th International Congress on Image and Signal Processing, 2016

3D Object Retrieval with Multimodal Views.
Proceedings of the 9th Eurographics Workshop on 3D Object Retrieval, 2016

2015
Multipe/Single-View Human Action Recognition via Part-Induced Multitask Structural Learning.
IEEE Trans. Cybern., 2015

Multi-view discriminative and structured dictionary learning with group sparsity for human action recognition.
Signal Process., 2015

Multi-perspective and multi-modality joint representation and recognition model for 3D action recognition.
Neurocomputing, 2015

User cooperation in OFDM-based cognitive radio networks with simultaneous wireless information and power transfer.
Proceedings of the International Conference on Wireless Communications & Signal Processing, 2015

TJU-TJUT@TRECVID 2015: Surveillance Event Detection.
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Single Face Image Super-Resolution via Multi-dictionary Bayesian Non-parametric Learning.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Clique-graph matching by preserving global & local structure.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015


2014
Enhanced and hierarchical structure algorithm for data imbalance problem in semantic extraction under massive video dataset.
Multim. Tools Appl., 2014

Human Action Recognition Using Pyramid Histograms of Oriented Gradients and Collaborative Multi-task Learning.
KSII Trans. Internet Inf. Syst., 2014

Cell type-independent mitosis event detection via hidden-state conditional neural fields.
Proceedings of the IEEE 11th International Symposium on Biomedical Imaging, 2014

2013
Nonnegative Mixed-Norm Convex Optimization for Mitotic Cell Detection in Phase Contrast Microscopy.
Comput. Math. Methods Medicine, 2013

A Kinect-based 3D sensing and human action recognition solution for urban search and rescue environments.
Proceedings of the IEEE International Symposium on Robot and Human Interactive Communication, 2013

Online Boosting Tracking with Fragmented Model.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

An Effective Tracking System for Multiple Object Tracking in Occlusion Scenes.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

2012
Human action recognition based on sparse representation induced by L1/L2 regulations.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

2011
TJUT-TJU@TRECVID 2011: Surveillance Event Detection.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

2010
The Application of Spatio-temporal Feature and Multi-Sensor in Home Medical Devices.
J. Digit. Content Technol. its Appl., 2010

MMM-TJU at TRECVID 2010.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Informedia @ TRECVID2010.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Multi-camera Monitoring of Infusion Pump Use.
Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), 2010

Comparing Evaluation Protocols on the KTH Dataset.
Proceedings of the Human Behavior Understanding, First International Workshop, 2010

2009
BUPT-MCPRL at TRECVID 2009.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

2008
BUPT at TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

2006
Palmprint Recognition Based on Improved 2DPCA.
Proceedings of the Agent Computing and Multi-Agent Systems, 2006

Palmprint Recognition Based on 2-Dimension PCA.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006


  Loading...