2025

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning.

[DOI]

Xiangru Tang

Tianyu Hu

CoRR, January, 2025

2024

DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models.

[DOI]

CoRR, 2024

Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models.

[DOI]

CoRR, 2024

COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement.

[DOI]

CoRR, 2024

Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models.

[DOI]

CoRR, 2024

Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement.

[DOI]

CoRR, 2024

Themis: Towards Flexible and Interpretable NLG Evaluation.

[DOI]

CoRR, 2024

MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency.

[DOI]

CoRR, 2024

ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions.

[DOI]

Xu Zhang

Xunjian Yin

Xiaojun Wan

CoRR, 2024

Benchmarking Knowledge Boundary for Large Language Model: A Different Perspective on Model Evaluation.

[DOI]

CoRR, 2024

Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability.

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Error-Robust Retrieval for Chinese Spelling Check.

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Contextual Modeling for Document-level ASR Error Correction.

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation.

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

History Matters: Temporal Knowledge Editing in Large Language Model.

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

A Comprehensive Evaluation and Analysis Study for Chinese Spelling Check.

[DOI]

Xunjian Yin

Xiaojun Wan

CoRR, 2023

Human-like Summarization Evaluation with ChatGPT.

[DOI]

CoRR, 2023

Overview of the NLPCC 2023 Shared Task: Chinese Spelling Check.

[DOI]

Proceedings of the Natural Language Processing and Chinese Computing, 2023

ALCUNA: Large Language Models Meet New Knowledge.

[DOI]

Xunjian Yin

Baizhou Huang

Xiaojun Wan

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Context-Aware Evaluation Metrics for Machine Translation.

[DOI]

Xinyu Hu

Xunjian Yin

Xiaojun Wan

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022

Chinese Spelling Check with Nearest Neighbors.

[DOI]

Xunjian Yin

Xinyu Hu

Xiaojun Wan

CoRR, 2022

How Do Seq2Seq Models Perform on End-to-End Data-to-Text Generation?

[DOI]

Xunjian Yin

Xiaojun Wan

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022