Linchao Zhu

Yadong Mu

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Few-Shot Common-Object Reasoning Using Common-Centric Localization Network.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Training Robust Object Detectors From Noisy Category Labels and Imprecise Bounding Boxes.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Learning to Anticipate Egocentric Actions by Imagination.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Holistic LSTM for Pedestrian Trajectory Prediction.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2021

Visual commonsense reasoning with directional visual connections.

[BibT_eX]

[DOI]

Frontiers Inf. Technol. Electron. Eng., 2021

Less is More: Sparse Sampling for Dense Reaction Predictions.

[BibT_eX]

[DOI]

CoRR, 2021

OR-Net: Pointwise Relational Inference for Data Completion under Partial Observation.

[BibT_eX]

[DOI]

CoRR, 2021

Universal-Prototype Augmentation for Few-Shot Object Detection.

[BibT_eX]

[DOI]

CoRR, 2021

PoseGate-Former: Transformer Encoder with Trainable Gate for 3D Human Pose Estimation Using Weakly Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 28th International Conference, 2021

Vector-Decomposed Disentanglement for Domain-Invariant Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Universal-Prototype Enhancing for Few-Shot Object Detection.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Interactive Prototype Learning for Egocentric Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Multi-Mode Modulator for Multi-Domain Few-Shot Classification.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Adaptive Hierarchical Graph Reasoning with Semantic Coherence for Video-and-Language Inference.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

OpenMix: Reviving Known Knowledge for Discovering Novel Visual Categories in an Open World.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Faster Meta Update Strategy for Noise-Robust Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval.

[BibT_eX]

[DOI]

Xiaohan Wang

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Recurrent Attention Network with Reinforced Generator for Visual Dialog.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2020

Feature Robust Optimal Transport for High-dimensional Data.

[BibT_eX]

[DOI]

CoRR, 2020

UTS Submission at the TRECVID 2020 Disaster Scene Description and Indexing Task.

[BibT_eX]

[DOI]

Qi Rao

Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

SF-Net: Single-Frame Supervision for Temporal Action Localization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

ActBERT: Learning Global-Local Video-Text Representations.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Gated Channel Transformation for Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semantic Correspondence as an Optimal Transport Problem.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FASTER Recurrent Networks for Efficient Video Classification.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Symbiotic Attention with Privileged Information for Egocentric Action Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Video representation learning with deep neural networks

[BibT_eX]

[DOI]

PhD thesis, 2019

Learning to Transfer Learn.

[BibT_eX]

[DOI]

CoRR, 2019

Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019.

[BibT_eX]

[DOI]

CoRR, 2019

FASTER Recurrent Networks for Video Classification.

[BibT_eX]

[DOI]

CoRR, 2019

Meta Filter Pruning to Accelerate Deep Convolutional Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Connective Cognition Network for Directional Visual Commonsense Reasoning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Dual Attention Matching for Audio-Visual Event Localization.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Auto-ReID: Searching for a Part-Aware ConvNet for Person Re-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Entangled Transformer for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation.

[BibT_eX]

[DOI]

Fengda Zhu

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Cubic LSTMs for Video Prediction.

[BibT_eX]

[DOI]

Hehe Fan

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions.

[BibT_eX]

[DOI]

CoRR, 2018

UTS_CAI submission at TRECVID 2018 Ad-hoc Video Search Task.

[BibT_eX]

[DOI]

Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Activities in Extended Video.

[BibT_eX]

[DOI]

Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Decoupled Novel Object Captioner.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification.

[BibT_eX]

[DOI]

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Compound Memory Networks for Few-Shot Video Classification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

Uncovering the Temporal Context for Video Question Answering.

[BibT_eX]

[DOI]

Alexander G. Hauptmann

Int. J. Comput. Vis., 2017

UTS submission to Google YouTube-8M Challenge 2017.

[BibT_eX]

[DOI]

Yanbin Liu

CoRR, 2017

Bidirectional Multirate Reconstruction for Temporal Modeling in Videos.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Few-Shot Object Recognition from Machine-Labeled Web Images.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Recognizing an Action Using Its Name: A Knowledge-Based Approach.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2016

UTS-CMU-D2DCRC Submission at TRECVID 2016 Video Localization.

[BibT_eX]

[DOI]

Xuanyi Dong

Alexander G. Hauptmann

Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

2015

Uncovering Temporal Context for Video Question and Answering.

[BibT_eX]

[DOI]