Yuan Yao

Orcid: 0000-0002-8276-3620

Affiliations:
  • Tsinghua University, Institute for Artificial Intelligence, Department of Computer Science and Technology, Beijing, China (former)
  • National University of Singapore, Queenstown, Singapore


According to our database1, Yuan Yao authored at least 59 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024

GUICourse: From General Vision Language Models to Versatile GUI Agents.
CoRR, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness.
CoRR, 2024

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
CoRR, 2024

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images.
CoRR, 2024

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data.
CoRR, 2024

CPT: Colorful Prompt Tuning for pre-trained vision-language models.
AI Open, 2024

Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature.
AI Open, 2024

Fact : Teaching MLLMs with Faithful, Concise and Transferable Rationales.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

NExT-Chat: An LMM for Chat, Detection and Segmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

LLaVA-UHD: An LMM Perceiving Any Aspect Ratio and High-Resolution Images.
Proceedings of the Computer Vision - ECCV 2024, 2024

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
UPRec: User-aware Pre-training for sequential Recommendation.
AI Open, January, 2023

DreaMoving: A Human Video Generation Framework based on Diffusion Models.
CoRR, 2023

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
CoRR, 2023

Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.
CoRR, 2023

Visually Grounded Commonsense Knowledge Acquisition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
DCT-net: domain-calibrated translation for portrait stylization.
ACM Trans. Graph., 2022

PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Fine-Grained Scene Graph Generation with Data Transfer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Structure-Aware Flow Generation for Human Body Reshaping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Unpaired Cartoon Image Synthesis via Gated Cycle Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Prompt Tuning for Discriminative Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Knowledge Transfer via Pre-training for Recommendation: A Review and Prospect.
Frontiers Big Data, 2021

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.
CoRR, 2021

UPRec: User-Aware Pre-training for Recommender Systems.
CoRR, 2021

CPM-2: Large-scale cost-effective pre-trained language models.
AI Open, 2021

Pre-trained models: Past, present and future.
AI Open, 2021

Open Hierarchical Relation Extraction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

ONION: A Simple and Effective Defense Against Textual Backdoor Attacks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Adversarial Language Games for Advanced Natural Language Intelligence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Integrating Image-Based and Knowledge-Based Representation Learning.
IEEE Trans. Cogn. Dev. Syst., 2020

Denoising Relation Extraction from Document-level Distant Supervision.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Boosting Semantic Human Matting With Coarse Annotations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Meta-Information Guided Meta-Learning for Few-Shot Relation Classification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Adversarial Language Games for Advanced Natural Language Intelligence.
CoRR, 2019

An Acceleration Framework for High Resolution Image Synthesis.
CoRR, 2019

Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Attention-Aware Multi-Stroke Style Transfer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
CAMF: Context Aware Matrix Factorization for Social Recommendation.
Web Intell., 2018

FewRel: A Large-Scale Supervised Few-shot Relation Classification Dataset with State-of-the-Art Evaluation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Long short-term memory based recurrent neural networks for collaborative filtering.
Proceedings of the 2017 IEEE SmartWorld, 2017

HDNN-CF: A hybrid deep neural networks collaborative filtering architecture for event recommendation.
Proceedings of the 2017 IEEE SmartWorld, 2017

2016
Context Aware Matrix Factorization for Event Recommendation in Event-Based Social Networks.
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016

Towards Accurate Relation Extraction from Wikipedia.
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016

Grouped Text Clustering Using Non-Parametric Gaussian Mixture Experts.
Proceedings of the PRICAI 2016: Trends in Artificial Intelligence, 2016

We Know Where You Are: Home Location Identification in Location-Based Social Networks.
Proceedings of the 25th International Conference on Computer Communication and Networks, 2016

We Know What You Are Doing or Going to Do: Towards Accurate Human Activities Sensing.
Proceedings of the 25th International Conference on Computer Communication and Networks, 2016

2015
A Mixture Distribution Based System in BitTorrent-Like P2P Networks.
Proceedings of the 21st IEEE International Conference on Parallel and Distributed Systems, 2015

2014
Fast Routing in Location-Based Social Networks Leveraging Check-in Data.
Proceedings of the 2014 IEEE International Conference on Internet of Things, 2014


  Loading...