Fengyun Rao

Orcid: 0000-0002-2868-2088

According to our database1, Fengyun Rao authored at least 20 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling.
CoRR, 2024

Advancing Video Quality Assessment for AIGC.
CoRR, 2024

Revisiting Video Quality Assessment from the Perspective of Generalization.
CoRR, 2024

EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model.
CoRR, 2024

Visual Perception by Large Language Model's Weights.
CoRR, 2024

Multi-Modal Generative Embedding Model.
CoRR, 2024

ReGenNet: Towards Human Action-Reaction Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inter-X: Towards Versatile Human-Human Interaction Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Task Navigator: Decomposing Complex Tasks for Multimodal Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Spatial-Semantic Collaborative Cropping for User Generated Content.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Image Captioning with Multi-Context Synthetic Data.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Text-Only Image Captioning with Multi-Context Data Generation.
CoRR, 2023

A Similarity Alignment Model for Video Copy Segment Matching.
CoRR, 2023

A Dual-level Detection Method for Video Copy Detection.
CoRR, 2023

2022
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
CaSP: Class-agnostic Semi-Supervised Pretraining for Detection and Segmentation.
CoRR, 2021

CLIP4Caption ++: Multi-CLIP for Video Caption.
CoRR, 2021

CLIP4Caption: CLIP for Video Caption.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2019
Multi-Task Multi-Head Attention Memory Network for Fine-Grained Sentiment Analysis.
Proceedings of the Natural Language Processing and Chinese Computing, 2019


  Loading...