Liangzhe Yuan

ORCID: 0000-0001-9206-1908

According to our database, Liangzhe Yuan authored at least 23 papers between 2018 and 2024.

Bibliography

2024
PolyMaX: General Dense Prediction with Mask Transformer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

VideoPrism: A Foundational Visual Encoder for Video Understanding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Distilling Vision-Language Models on Millions of Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
VideoGLUE: Video General Understanding Evaluation of Foundation Models.
CoRR, 2023

Spatiotemporally Discriminative Video-Language Pre-Training with Text Grounding.
CoRR, 2023

Unified Visual Relationship Detection with Vision and Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose.
Int. J. Comput. Vis., 2022

Surrogate Gap Minimization Improves Sharpness-Aware Training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

On Temporal Granularity in Self-Supervised Video Representation Learning.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Exploring Temporal Granularity in Self-Supervised Video Representation Learning.
CoRR, 2021

DeepLab2: A TensorFlow Library for Deep Labeling.
CoRR, 2021

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MoViNets: Mobile Video Networks for Efficient Video Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2019
Human Gaze-Driven Spatial Tasking of an Autonomous MAV.
IEEE Robotics Autom. Lett., 2019

Live Demonstration: Unsupervised Event-Based Learning of Optical Flow, Depth and Egomotion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Unsupervised Event-Based Learning of Optical Flow, Depth, and Egomotion.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Zoom-In-To-Check: Boosting Video Interpolation via Instance-Level Discrimination.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
EV-FlowNet: Self-Supervised Optical Flow Estimation for Event-based Cameras.
Proceedings of the Robotics: Science and Systems XIV, 2018

Unsupervised Event-Based Optical Flow Using Motion Compensation.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

