2025

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

POTEC: Off-Policy Contextual Bandits for Large Action Spaces via Policy Decomposition.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

The Art of Refusal: A Survey of Abstention in Large Language Models.

CoRR, 2024

Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop.

CoRR, 2024

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition.

CoRR, 2024