Zhaoyang Liu

Affiliations:

SenseTime Research, Shanghai, China
Hong Kong University of Science and Technology, Hong Kong, SAR, China

According to our database¹, Zhaoyang Liu authored at least 20 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

VLG: General Video Recognition with Web Textual Knowledge.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., October, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions.

[BibT_eX]

[DOI]

CoRR, 2024

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks.

[BibT_eX]

[DOI]

CoRR, 2024

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling.

[BibT_eX]

[DOI]

CoRR, 2024

LLMs Meet Multimodal Generation and Editing: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

ControlLLM: Augment Language Models with Tools by Searching on Graphs.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

ControlLLM: Augment Language Models with Tools by Searching on Graphs.

[BibT_eX]

[DOI]

CoRR, 2023

InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language.

[BibT_eX]

[DOI]

CoRR, 2023

Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

MotionBERT: A Unified Perspective on Learning Human Motion Representations.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

MotionBERT: Unified Pretraining for Human Motion Analysis.

[BibT_eX]

[DOI]

CoRR, 2022

Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach.

[BibT_eX]

[DOI]

CoRR, 2022

Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results.

[BibT_eX]

[DOI]

CoRR, 2021

TAM: Temporal Adaptive Module for Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Dynamic Sampling Networks for Efficient Action Recognition in Videos.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2020

TEINet: Towards an Efficient Architecture for Video Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2018

Context and Temporal Aware Attention Model for Flood Prediction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Context-Aware Attention LSTM Network for Flood Prediction.

[BibT_eX]

[DOI]

Shivakumara Palaiahnakote

Tong Lu

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Zhaoyang Liu

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...