Du Tran

Orcid: 0000-0001-9673-7194

According to our database, Du Tran authored at least 44 papers between 2008 and 2024.

Bibliography

2024
Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
FLAVR: flow-free architecture for fast video frame interpolation.
Mach. Vis. Appl., September, 2023

Learning Space-Time Semantic Correspondences.
CoRR, 2023

MINOTAUR: Multi-task Video Grounding From Multimodal Queries.
CoRR, 2023

FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Relational Space-Time Query in Long-Form Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Long-Short Temporal Contrastive Learning of Video Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Self-Supervised Learning by Cross-Modal Audio-Video Clustering.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Video Modeling With Correlation Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

What Makes Training Multi-Modal Classification Networks Hard?
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FASTER Recurrent Networks for Efficient Video Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Self-Supervised Learning by Cross-Modal Audio-Video Clustering.
CoRR, 2019

FASTER Recurrent Networks for Video Classification.
CoRR, 2019

UniDual: A Unified Model for Image and Video Understanding.
CoRR, 2019

What Makes Training Multi-Modal Networks Hard?
CoRR, 2019

Large-scale weakly-supervised pre-training for video action recognition.
CoRR, 2019

Learning Temporal Pose Estimation from Sparsely-Labeled Videos.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Video Classification With Channel-Separated Convolutional Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SCSampler: Sampling Salient Clips From Video for Efficient Action Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DistInit: Learning Video Representations Without a Single Labeled Video.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Leveraging the Present to Anticipate the Future in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Large-Scale Weakly-Supervised Pre-Training for Video Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Learning Discriminative Motion Features Through Detection.
CoRR, 2018

Co-Training of Audio and Video Representations from Self-Supervised Temporal Synchronization.
CoRR, 2018

Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scenes-Objects-Actions: A Multi-task, Multi-label Video Dataset.
Proceedings of the Computer Vision - ECCV 2018, 2018

A Closer Look at Spatiotemporal Convolutions for Action Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Detect-and-Track: Efficient Pose Estimation in Videos.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
ConvNet Architecture Search for Spatiotemporal Feature Learning.
CoRR, 2017

Transformation-Based Models of Video Sequences.
CoRR, 2017

Deciphering Severely Degraded License Plates.
Proceedings of the Media Watermarking, Security, and Forensics 2017, Burlingame, CA, USA, 29 January 2017, 2017

2016
Representations and Models for Large-Scale Video Understanding.
PhD thesis, 2016

EXMOVES: Mid-level Features for Efficient Action Recognition and Video Analysis.
Int. J. Comput. Vis., 2016

ViCom: Benchmark and Methods for Video Comprehension.
CoRR, 2016

Deep End2End Voxel2Voxel Prediction.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015
Learning Spatiotemporal Features with 3D Convolutional Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

EXMOVES: Classifier-based Features for Scalable Action Recognition.
Proceedings of the 2nd International Conference on Learning Representations, 2014

C3D: Generic Features for Video Analysis.
CoRR, 2014

2012
Max-Margin Structured Output Regression for Spatio-Temporal Action Localization.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011
Optimal spatio-temporal path discovery for video event detection.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2008
Human Activity Recognition with Metric Learning.
Proceedings of the Computer Vision - ECCV 2008, 2008

