Linli Yao

ORCID: 0000-0002-9809-8864

According to our database, Linli Yao authored at least 12 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Timeline

[Chart: publications per year, 2019 to 2024, broken down by type (book, in proceedings, article, PhD thesis, dataset, other).]

Bibliography

2024
Temporal Reasoning Transfer from Text to Video.
CoRR, 2024

DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models.
CoRR, 2024

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Edit As You Wish: Video Caption Editing with Multi-grained User Control.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Edit As You Wish: Video Description Editing with Multi-grained Commands.
CoRR, 2023

CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge.
Proceedings of the ACM Web Conference 2023, 2023

Rethinking Benchmarks for Cross-modal Image-text Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

2022
Image Difference Captioning with Pre-training and Contrastive Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2020
YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos.
CoRR, 2020

2019
RUC at MediaEval 2019: Video Memorability Prediction Based on Visual Textual and Concept Related Features.
Working Notes Proceedings of the MediaEval 2019 Workshop, 2019

