2025
In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents.
CoRR, March, 2025

Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation.
CoRR, March, 2025

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving.
CoRR, February, 2025

Reverse Thinking Makes LLMs Stronger Reasoners.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling.
CoRR, 2024

CodecLM: Aligning Language Models with Tailored Synthetic Data.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2021
A prospective evaluation of AI-augmented epidemiology to forecast COVID-19 in the USA and Japan.
npj Digit. Medicine, 2021

2020
Interpretable Sequence Learning for COVID-19 Forecasting.
CoRR, 2020

Interpretable Sequence Learning for Covid-19 Forecasting.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
Assessing the quality of answers autonomously in community question-answering.
Int. J. Digit. Libr., 2019

2018
Retrieving people: Identifying potential answerers in Community Question-Answering.
J. Assoc. Inf. Sci. Technol., 2018

2017
Discerning the Quality of Questions in Educational Q&Ausing Textual Features.
Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval, 2017

Bad Users or Bad Content?: Breaking the Vicious Cycle by Finding Struggling Students in Community Question-Answering.
Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval, 2017

2016
Evaluating the Quality of Educational Answers in Community Question-Answering.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

Retrieving Rising Stars in Focused Community Question-Answering.
Proceedings of the Intelligent Information and Database Systems - 8th Asian Conference, 2016

2015
MET: A Fast Algorithm for Minimizing Propagation in Large Graphs with Small Eigen-Gaps.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

2013
Hyperlocal: inferring location of IP addresses in real-time bid requests for mobile ads.
Proceedings of the 6th ACM SIGSPATIAL International Workshop on Location-Based Social Networks, 2013