Self-Training Elicits Concise Reasoning in Large Language Models.
CoRR, February, 2025
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Block Transformer: Global-to-Local Language Modeling for Fast Inference.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Carpe diem: On the Evaluation of World Knowledge in Lifelong Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Cross-Modal Retrieval Meets Inference: Improving Zero-Shot Classification with Cross-Modal Retrieval.
CoRR, 2023
HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Large Language Models Are Reasoning Teachers.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Benchmark Dataset for Precipitation Forecasting by Post-Processing the Numerical Weather Prediction.
CoRR, 2022
Understanding Cross-Domain Few-Shot Learning: An Experimental Study.
CoRR, 2022
Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
ReFine: Re-randomization before Fine-tuning for Cross-domain Few-shot Learning.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022