Zhaoyang Liu

Affiliations:
  • SenseTime Research, Shanghai, China
  • Hong Kong University of Science and Technology, Hong Kong, SAR, China


According to our database1, Zhaoyang Liu authored at least 20 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
VLG: General Video Recognition with Web Textual Knowledge.
Int. J. Comput. Vis., October, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions.
CoRR, 2024

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks.
CoRR, 2024

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling.
CoRR, 2024

LLMs Meet Multimodal Generation and Editing: A Survey.
CoRR, 2024

ControlLLM: Augment Language Models with Tools by Searching on Graphs.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
ControlLLM: Augment Language Models with Tools by Searching on Graphs.
CoRR, 2023

InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language.
CoRR, 2023

Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

MotionBERT: A Unified Perspective on Learning Human Motion Representations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
MotionBERT: Unified Pretraining for Human Motion Analysis.
CoRR, 2022

Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach.
CoRR, 2022

Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing.
Proceedings of the Computer Vision - ECCV 2022, 2022

Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
DeeperForensics Challenge 2020 on Real-World Face Forgery Detection: Methods and Results.
CoRR, 2021

TAM: Temporal Adaptive Module for Video Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Dynamic Sampling Networks for Efficient Action Recognition in Videos.
IEEE Trans. Image Process., 2020

TEINet: Towards an Efficient Architecture for Video Recognition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2018
Context and Temporal Aware Attention Model for Flood Prediction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Context-Aware Attention LSTM Network for Flood Prediction.
Proceedings of the 24th International Conference on Pattern Recognition, 2018


  Loading...