ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning.
,
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
DSGram: Dynamic Weighting Sub-Metrics for Grammatical Error Correction in the Era of Large Language Models.
CoRR, 2024
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models.
CoRR, 2024
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement.
CoRR, 2024
Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models.
CoRR, 2024
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement.
CoRR, 2024
Themis: Towards Flexible and Interpretable NLG Evaluation.
CoRR, 2024
MC-MKE: A Fine-Grained Multimodal Knowledge Editing Benchmark Emphasizing Modality Consistency.
CoRR, 2024
ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions.
CoRR, 2024
Benchmarking Knowledge Boundary for Large Language Model: A Different Perspective on Model Evaluation.
CoRR, 2024
Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Error-Robust Retrieval for Chinese Spelling Check.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Contextual Modeling for Document-level ASR Error Correction.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
History Matters: Temporal Knowledge Editing in Large Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
A Comprehensive Evaluation and Analysis Study for Chinese Spelling Check.
CoRR, 2023
Human-like Summarization Evaluation with ChatGPT.
CoRR, 2023
Overview of the NLPCC 2023 Shared Task: Chinese Spelling Check.
Proceedings of the Natural Language Processing and Chinese Computing, 2023
ALCUNA: Large Language Models Meet New Knowledge.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Exploring Context-Aware Evaluation Metrics for Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Chinese Spelling Check with Nearest Neighbors.
CoRR, 2022
How Do Seq2Seq Models Perform on End-to-End Data-to-Text Generation?
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022