Yaosi Hu

Orcid: 0000-0003-2784-6738

According to our database1, Yaosi Hu authored at least 21 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Benchmark for Controllable Text -Image-to-Video Generation.
IEEE Trans. Multim., 2024

Memory-guided representation matching for unsupervised video anomaly detection.
J. Vis. Commun. Image Represent., 2024

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control.
CoRR, 2024

2023
Multiple visual relationship forecasting and arrangement in videos.
Neurocomputing, July, 2023

LaMD: Latent Motion Diffusion for Video Generation.
CoRR, 2023

A Lightweight No-reference Video Quality Assessment Method.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2023

2022
Subjective Evaluation of Visual Quality and Simulator Sickness of Short 360$^\circ$ Videos: ITU-T Rec. P.919.
IEEE Trans. Multim., 2022

Predicate Correlation Learning for Scene Graph Generation.
IEEE Trans. Image Process., 2022

Decomposing style, content, and motion for videos.
J. Vis. Commun. Image Represent., 2022

Learning Human Cognitive Appraisal Through Reinforcement Memory Unit.
CoRR, 2022

Video Quality Assessment based on Quality Aggregation Networks.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Make It Move: Controllable Image-to-Video Generation with Text Descriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Subjective Quality Assessment of One-to-One Video-Telephony Services.
Proceedings of the IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, 2022

2021
MAPS: Joint Multimodal Attention and POS Sequence Generation for Video Captioning.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

Learn to Look Around: Deep Reinforcement Learning Agent for Video Saliency Prediction.
Proceedings of the International Conference on Visual Communications and Image Processing, 2021

2020
Exploiting the local temporal information for video captioning.
J. Vis. Commun. Image Represent., 2020

A Multimodal Variational Encoder-Decoder Framework for Micro-video Popularity Prediction.
Proceedings of the WWW '20: The Web Conference 2020, Taipei, Taiwan, April 20-24, 2020, 2020

Subjective Study of Perceptual Quality for Micro-Video Applications.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

2019
Hierarchical Global-Local Temporal Modeling for Video Captioning.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Two-Stream Refinement Network for RGB-D Saliency Detection.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

2018
RGB-D Semantic Segmentation: A Review.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018


  Loading...