2025

Self-Training Elicits Concise Reasoning in Large Language Models.

[DOI]

Tergel Munkhbat

,

,

,

,

,

CoRR, February, 2025

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models.

[DOI]

,

,

,

,

,

,

,

,

Sheikh Shafayat

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Bill Yuchen Lin

,

,

,

,

,

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

2024

Block Transformer: Global-to-Local Language Modeling for Fast Inference.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Carpe diem: On the Evaluation of World Knowledge in Lifelong Language Models.

[DOI]

,

,

,

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023

Cross-Modal Retrieval Meets Inference: Improving Zero-Shot Classification with Cross-Modal Retrieval.

[DOI]

,

,

,

CoRR, 2023

HARE: Explainable Hate Speech Detection with Step-by-Step Reasoning.

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Large Language Models Are Reasoning Teachers.

[DOI]

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Benchmark Dataset for Precipitation Forecasting by Post-Processing the Numerical Weather Prediction.

[DOI]

,

,

,

CoRR, 2022

Understanding Cross-Domain Few-Shot Learning: An Experimental Study.

[DOI]

,

,

,

,

,

CoRR, 2022

Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty.

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ReFine: Re-randomization before Fine-tuning for Cross-domain Few-shot Learning.

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022