GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge.
CoRR, January, 2025
You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling.
CoRR, 2024
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems.
CoRR, 2024
FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models.
CoRR, 2024
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications.
CoRR, 2023
CALYPSO: LLMs as Dungeon Masters' Assistants.
CoRR, 2023
CALYPSO: LLMs as Dungeon Master's Assistants.
Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, 2023
FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
An AI Dungeon Master's Guide: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons.
CoRR, 2022
DuelGAN: A Duel Between Two Discriminators Stabilizes the GAN Training.
Proceedings of the Computer Vision - ECCV 2022, 2022