Hehe Fan

Orcid: 0000-0001-9572-2345

According to our database1, Hehe Fan authored at least 60 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation.
ACM Trans. Multim. Comput. Commun. Appl., July, 2024

DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition.
IEEE Trans. Multim., 2024

Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering.
IEEE Trans. Multim., 2024

CktGen: Specification-Conditioned Analog Circuit Generation.
CoRR, 2024

ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning.
CoRR, 2024

Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models.
CoRR, 2024

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment.
CoRR, 2024

EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing.
CoRR, 2024

ProtChatGPT: Towards Understanding Proteins with Large Language Models.
CoRR, 2024

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.
CoRR, 2024

Progressive Point Cloud Denoising with Cross-Stage Cross-Coder Adaptive Edge Graph Convolution Network.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Prototype Learning for Micro-gesture Classification.
Proceedings of IJCAI 2024 Workshop&Challenge on Micro-gesture Analysis for Hidden Emotion Understanding (MiGA 2024) co-located with 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024), 2024

Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.
Proceedings of the Computer Vision - ECCV 2024, 2024

Clustering for Protein Representation Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Uncovering what, why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DocMSU: A Comprehensive Benchmark for Document-Level Multimodal Sarcasm Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization.
CoRR, 2023

FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax.
CoRR, 2023

Prior-Free Continual Learning with Unlabeled Data in the Wild.
CoRR, 2023

DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation.
CoRR, 2023

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023.
CoRR, 2023

STPrivacy: Spatio-Temporal Tubelet Sparsification and Anonymization for Privacy-preserving Action Recognition.
CoRR, 2023

Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PointListNet: Deep Learning on 3D Point Lists.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Text to Point Cloud Localization with Relation-Enhanced Transformer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

SEFormer: Structure Embedding Transformer for 3D Object Detection.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Temporal Cross-Layer Correlation Mining for Action Recognition.
IEEE Trans. Multim., 2022

Unsupervised Visual Representation Learning via Dual-Level Progressive Similar Instance Selection.
IEEE Trans. Cybern., 2022

Understanding Atomic Hand-Object Interaction With Human Intention.
IEEE Trans. Circuits Syst. Video Technol., 2022

Entropy guided attention network for weakly-supervised action localization.
Pattern Recognit., 2022

Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?
CoRR, 2022

Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction.
Proceedings of the Computer Vision - ECCV 2022, 2022

Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Few-Shot Common-Object Reasoning Using Common-Centric Localization Network.
IEEE Trans. Image Process., 2021

Motion = Video - Content: Towards Unsupervised Learning of Motion Representation from Videos.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences.
Proceedings of the 9th International Conference on Learning Representations, 2021

Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
From Video Classification to Video Prediction: Deep Learning Approaches to Video Modelling
PhD thesis, 2020

Recurrent Attention Network with Reinforced Generator for Visual Dialog.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Adaptive Exploration for Unsupervised Person Re-identification.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Cascaded Revision Network for Novel Object Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2020

Person Tube Retrieval via Language Description.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
PointRNN: Point Recurrent Neural Network for Moving Point Cloud Processing.
CoRR, 2019

Cascaded Revision Network for Novel Object Captioning.
CoRR, 2019

Attract or Distract: Exploit the Margin of Open Set.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cubic LSTMs for Video Prediction.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Unsupervised Person Re-identification: Clustering and Fine-tuning.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017
Unsupervised Person Re-identification: Clustering and Fine-tuning.
CoRR, 2017

Complex Event Detection by Identifying Reliable Shots from Untrimmed Videos.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Multiple kernel visual-auditory representation learning for retrieval.
Multim. Tools Appl., 2016

Informedia @ TRECVID 2016.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016


  Loading...