2025
RLPR: Extrapolating RLVR to General Domains without Verifiers.
CoRR, June, 2025
Thoroughly Modeling Multi-domain Pre-trained Recommendation as Language.
ACM Trans. Inf. Syst., March, 2025
Process Reinforcement through Implicit Rewards.
CoRR, February, 2025
Dynamic Guided and Domain Applicable Safeguards for Enhanced Security in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
MiniCPM-V: A GPT-4V Level MLLM on Your Phone.
CoRR, 2024
GUICourse: From General Vision Language Models to Versatile GUI Agents.
CoRR, 2024
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness.
CoRR, 2024
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies.
CoRR, 2024
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images.
CoRR, 2024
CPT: Colorful Prompt Tuning for pre-trained vision-language models.
AI Open, 2024
Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature.
AI Open, 2024
ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
NExT-Chat: An LMM for Chat, Detection and Segmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024
LLaVA-UHD: An LMM Perceiving Any Aspect Ratio and High-Resolution Images.
Proceedings of the Computer Vision - ECCV 2024, 2024
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-Grained Correctional Human Feedback.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
UPRec: User-aware Pre-training for sequential Recommendation.
AI Open, January, 2023
DreaMoving: A Human Video Generation Framework based on Diffusion Models.
CoRR, 2023
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
CoRR, 2023
Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants.
CoRR, 2023
Visually Grounded Commonsense Knowledge Acquisition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
DCT-net: domain-calibrated translation for portrait stylization.
ACM Trans. Graph., 2022
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Fine-Grained Scene Graph Generation with Data Transfer.
Proceedings of the Computer Vision - ECCV 2022, 2022
Structure-Aware Flow Generation for Human Body Reshaping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Unpaired Cartoon Image Synthesis via Gated Cycle Mapping.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Prompt Tuning for Discriminative Pre-trained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
Knowledge Transfer via Pre-training for Recommendation: A Review and Prospect.
Frontiers Big Data, 2021
CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark.
CoRR, 2021
UPRec: User-Aware Pre-training for Recommender Systems.
CoRR, 2021
CPM-2: Large-scale cost-effective pre-trained language models.
AI Open, 2021
Pre-trained models: Past, present and future.
AI Open, 2021
Open Hierarchical Relation Extraction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
CodRED: A Cross-Document Relation Extraction Dataset for Acquiring Knowledge in the Wild.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
ONION: A Simple and Effective Defense Against Textual Backdoor Attacks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Adversarial Language Games for Advanced Natural Language Intelligence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Integrating Image-Based and Knowledge-Based Representation Learning.
IEEE Trans. Cogn. Dev. Syst., 2020
Denoising Relation Extraction from Document-level Distant Supervision.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Boosting Semantic Human Matting With Coarse Annotations.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Meta-Information Guided Meta-Learning for Few-Shot Relation Classification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
2019
Adversarial Language Games for Advanced Natural Language Intelligence.
CoRR, 2019
An Acceleration Framework for High Resolution Image Synthesis.
CoRR, 2019
Open Relation Extraction: Relational Knowledge Transfer from Supervised Data to Unsupervised Data.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Attention-Aware Multi-Stroke Style Transfer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
DocRED: A Large-Scale Document-Level Relation Extraction Dataset.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
CAMF: Context Aware Matrix Factorization for Social Recommendation.
Web Intell., 2018
FewRel: A Large-Scale Supervised Few-shot Relation Classification Dataset with State-of-the-Art Evaluation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
2017
Long short-term memory based recurrent neural networks for collaborative filtering.
Proceedings of the 2017 IEEE SmartWorld, 2017
HDNN-CF: A hybrid deep neural networks collaborative filtering architecture for event recommendation.
Proceedings of the 2017 IEEE SmartWorld, 2017
2016
Context Aware Matrix Factorization for Event Recommendation in Event-Based Social Networks.
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016
Towards Accurate Relation Extraction from Wikipedia.
Proceedings of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, 2016
Grouped Text Clustering Using Non-Parametric Gaussian Mixture Experts.
Proceedings of the PRICAI 2016: Trends in Artificial Intelligence, 2016
We Know Where You Are: Home Location Identification in Location-Based Social Networks.
Proceedings of the 25th International Conference on Computer Communication and Networks, 2016
We Know What You Are Doing or Going to Do: Towards Accurate Human Activities Sensing.
Proceedings of the 25th International Conference on Computer Communication and Networks, 2016
2015
A Mixture Distribution Based System in BitTorrent-Like P2P Networks.
Proceedings of the 21st IEEE International Conference on Parallel and Distributed Systems, 2015
2014
Fast Routing in Location-Based Social Networks Leveraging Check-in Data.
Proceedings of the 2014 IEEE International Conference on Internet of Things, 2014