We stand with Ukraine

Du Tran

Orcid: 0000-0001-9673-7194

According to our database¹, Du Tran authored at least 45 papers between 2008 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

[Chart: publications per year, 2008 to 2024, by type: book, in proceedings, article, PhD thesis, dataset, other.]

Bibliography

2024

SEAL: Semantic Attention Learning for Long Video Representation.

[…], […], […], Vishnu Naresh Boddeti, […]

CoRR, 2024

Open-world Instance Segmentation: Top-down Learning with Bottom-up Supervision.

[…], […], […], Manmohan Chandraker, Lorenzo Torresani, […]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

FLAVR: flow-free architecture for fast video frame interpolation.

[…], […], Manmohan Chandraker, […]

Mach. Vis. Appl., September, 2023

Learning Space-Time Semantic Correspondences.

[…], […]

CoRR, 2023

MINOTAUR: Multi-task Video Grounding From Multimodal Queries.

[…], Effrosyni Mavroudi, […], Sainbayar Sukhbaatar, […], […], Lorenzo Torresani, […]

CoRR, 2023

FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation.

[…], […], Manmohan Chandraker, […]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Relational Space-Time Query in Long-Form Videos.

[…], […], […], […], Lorenzo Torresani, […]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Long-Short Temporal Contrastive Learning of Video Transformers.

[…], Gedas Bertasius, […], Lorenzo Torresani

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity.

[…], […], […], […], […]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation.

[…], […], […], […]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Self-Supervised Learning by Cross-Modal Audio-Video Clustering.

[…], […], […], Lorenzo Torresani, […], […]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Video Modeling With Correlation Networks.

[…], […], Lorenzo Torresani, […]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

What Makes Training Multi-Modal Classification Networks Hard?

[…], […], […]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

FASTER Recurrent Networks for Efficient Video Classification.

[…], […], Laura Sevilla-Lara, […], […], […]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Self-Supervised Learning by Cross-Modal Audio-Video Clustering.

[…], […], Lorenzo Torresani, […], […]

CoRR, 2019

FASTER Recurrent Networks for Video Classification.

[…], Laura Sevilla-Lara, […], […], […], […]

CoRR, 2019

UniDual: A Unified Model for Image and Video Understanding.

[…], […], Lorenzo Torresani

CoRR, 2019

What Makes Training Multi-Modal Networks Hard?

[…], […], […]

CoRR, 2019

Large-scale weakly-supervised pre-training for video action recognition.

Deepti Ghadiyaram, […], […], […], […], […]

CoRR, 2019

Learning Temporal Pose Estimation from Sparsely-Labeled Videos.

Gedas Bertasius, Christoph Feichtenhofer, […], […], Lorenzo Torresani

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Video Classification With Channel-Separated Convolutional Networks.

[…], […], […], Lorenzo Torresani

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

SCSampler: Sampling Salient Clips From Video for Efficient Action Recognition.

[…], […], Lorenzo Torresani

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DistInit: Learning Video Representations Without a Single Labeled Video.

[…], […], Lorenzo Torresani, […]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Leveraging the Present to Anticipate the Future in Videos.

[…], […], […], […], Lorenzo Torresani, […]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Large-Scale Weakly-Supervised Pre-Training for Video Action Recognition.

Deepti Ghadiyaram, […], […]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Learning Discriminative Motion Features Through Detection.

Gedas Bertasius, Christoph Feichtenhofer, […], […], Lorenzo Torresani

CoRR, 2018

Co-Training of Audio and Video Representations from Self-Supervised Temporal Synchronization.

[…], […], Lorenzo Torresani

CoRR, 2018

Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization.

[…], […], Lorenzo Torresani

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Scenes-Objects-Actions: A Multi-task, Multi-label Video Dataset.

[…], […], […], […], […], Lorenzo Torresani, […]

Proceedings of the Computer Vision - ECCV 2018, 2018

A Closer Look at Spatiotemporal Convolutions for Action Recognition.

[…], […], Lorenzo Torresani, […], […], […]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Detect-and-Track: Efficient Pose Estimation in Videos.

[…], Georgia Gkioxari, Lorenzo Torresani, […], […]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

ConvNet Architecture Search for Spatiotemporal Feature Learning.

[…], […], […], […], […]

CoRR, 2017

Transformation-Based Models of Video Sequences.

Joost R. van Amersfoort, […], Marc'Aurelio Ranzato, […], […], Soumith Chintala

CoRR, 2017

Deciphering Severely Degraded License Plates.

[…], […], Lorenzo Torresani, […]

Proceedings of the Media Watermarking, Security, and Forensics 2017, Burlingame, CA, USA, 29 January 2017, 2017

2016

Representations and Models for Large-Scale Video Understanding.

PhD thesis, 2016

EXMOVES: Mid-level Features for Efficient Action Recognition and Video Analysis.

[…], Lorenzo Torresani

Int. J. Comput. Vis., 2016

ViCom: Benchmark and Methods for Video Comprehension.

[…], […], Lorenzo Torresani

CoRR, 2016

Deep End2End Voxel2Voxel Prediction.

[…], Lubomir D. Bourdev, […], Lorenzo Torresani, […]

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

2015

Learning Spatiotemporal Features with 3D Convolutional Networks.

[…], Lubomir D. Bourdev, […], Lorenzo Torresani, […]

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014

Video Event Detection: From Subvolume Localization to Spatiotemporal Path Search.

[…], […], David A. Forsyth

IEEE Trans. Pattern Anal. Mach. Intell., 2014

EXMOVES: Classifier-based Features for Scalable Action Recognition.

[…], Lorenzo Torresani

Proceedings of the 2nd International Conference on Learning Representations, 2014

C3D: Generic Features for Video Analysis.

[…], Lubomir D. Bourdev, […], Lorenzo Torresani, […]

CoRR, 2014

2012

Max-Margin Structured Output Regression for Spatio-Temporal Action Localization.

[…], […]

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011

Optimal spatio-temporal path discovery for video event detection.

[…], […]

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2008

Human Activity Recognition with Metric Learning.

[…], Alexander Sorokin

Proceedings of the Computer Vision, 2008
