2025
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective.
CoRR, February, 2025

Position: We Need An Adaptive Interpretation of Helpful, Honest, and Harmless Principles.
CoRR, February, 2025

Breaking Focus: Contextual Distraction Curse in Large Language Models.
CoRR, February, 2025

DataGen: Unified Synthetic Dataset Generation via Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
CpG Island Definition and Methylation Mapping of the T2T-YAO Genome.
Genom. Proteom. Bioinform., 2024

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models.
CoRR, 2024

GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents.
CoRR, 2024

The Best of Both Worlds: Toward an Honest and Helpful Large Language Model.
CoRR, 2024

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models.
CoRR, 2024

LLM-as-a-Coauthor: The Challenges of Detecting LLM-Human Mixcase.
CoRR, 2024

TrustLLM: Trustworthiness in Large Language Models.
CoRR, 2024

HonestLLM: Toward an Honest and Helpful Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Position: TrustLLM: Trustworthiness in Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024