Hehe Fan

IEEE Trans. Pattern Anal. Mach. Intell., 2023

A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization.

[BibT_eX]

[DOI]

CoRR, 2023

FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax.

[BibT_eX]

[DOI]

CoRR, 2023

Prior-Free Continual Learning with Unlabeled Data in the Wild.

[BibT_eX]

[DOI]

CoRR, 2023

DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023.

[BibT_eX]

[DOI]

CoRR, 2023

STPrivacy: Spatio-Temporal Tubelet Sparsification and Anonymization for Privacy-preserving Action Recognition.

[BibT_eX]

[DOI]

CoRR, 2023

Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PointListNet: Deep Learning on 3D Point Lists.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Text to Point Cloud Localization with Relation-Enhanced Transformer.

[BibT_eX]

[DOI]

Guangzhi Wang

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

SEFormer: Structure Embedding Transformer for 3D Object Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Temporal Cross-Layer Correlation Mining for Action Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2022

Unsupervised Visual Representation Learning via Dual-Level Progressive Similar Instance Selection.

[BibT_eX]

[DOI]

IEEE Trans. Cybern., 2022

Understanding Atomic Hand-Object Interaction With Human Intention.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2022

Entropy guided attention network for weakly-supervised action localization.

[BibT_eX]

[DOI]

Pattern Recognit., 2022

Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?

[BibT_eX]

[DOI]

CoRR, 2022

Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Few-Shot Common-Object Reasoning Using Common-Centric Localization Network.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Motion = Video - Content: Towards Unsupervised Learning of Motion Representation from Videos.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

From Video Classification to Video Prediction: Deep Learning Approaches to Video Modelling

[BibT_eX]

[DOI]

PhD thesis, 2020

Recurrent Attention Network with Reinforced Generator for Visual Dialog.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2020

Adaptive Exploration for Unsupervised Person Re-identification.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2020

Cascaded Revision Network for Novel Object Captioning.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

Person Tube Retrieval via Language Description.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

PointRNN: Point Recurrent Neural Network for Moving Point Cloud Processing.

[BibT_eX]

[DOI]

CoRR, 2019

Cascaded Revision Network for Novel Object Captioning.

[BibT_eX]

[DOI]

CoRR, 2019

Attract or Distract: Exploit the Margin of Open Set.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cubic LSTMs for Video Prediction.

[BibT_eX]

[DOI]

Linchao Zhu

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Unsupervised Person Re-identification: Clustering and Fine-tuning.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2018

Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

2017

Unsupervised Person Re-identification: Clustering and Fine-tuning.

[BibT_eX]

[DOI]

Liang Zheng