Hehe Fan
Orcid: 0000-0001-9572-2345
According to our database, Hehe Fan authored at least 60 papers between 2016 and 2024.
Bibliography
2024
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024
DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition.
IEEE Trans. Multim., 2024
IEEE Trans. Multim., 2024
Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models.
CoRR, 2024
TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment.
CoRR, 2024
Progressive Point Cloud Denoising with Cross-Stage Cross-Coder Adaptive Edge Graph Convolution Network.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
DocMSU: A Comprehensive Benchmark for Document-Level Multimodal Sarcasm Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Pattern Anal. Mach. Intell., 2023
A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization.
CoRR, 2023
CoRR, 2023
CoRR, 2023
A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023.
CoRR, 2023
STPrivacy: Spatio-Temporal Tubelet Sparsification and Anonymization for Privacy-preserving Action Recognition.
CoRR, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE Trans. Multim., 2022
Unsupervised Visual Representation Learning via Dual-Level Progressive Similar Instance Selection.
IEEE Trans. Cybern., 2022
IEEE Trans. Circuits Syst. Video Technol., 2022
Pattern Recognit., 2022
Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
IEEE Trans. Image Process., 2021
Motion = Video - Content: Towards Unsupervised Learning of Motion Representation from Videos.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
From Video Classification to Video Prediction: Deep Learning Approaches to Video Modelling.
PhD thesis, 2020
ACM Trans. Multim. Comput. Commun. Appl., 2020
ACM Trans. Multim. Comput. Commun. Appl., 2020
IEEE Trans. Circuits Syst. Video Technol., 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
CoRR, 2019
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
ACM Trans. Multim. Comput. Commun. Appl., 2018
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
2016
Multim. Tools Appl., 2016
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016