Do Multilingual LLMs Think In English?
CoRR, February, 2025
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs.
CoRR, 2024
Explaining Explainability: Understanding Concept Activation Vectors.
CoRR, 2024
Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero.
CoRR, 2023
Diversifying AI: Towards Creative Chess with AlphaZero.
CoRR, 2023
DeDUCE: Generating Counterfactual Explanations Efficiently.
CoRR, 2021
Speedy Performance Estimation for Neural Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Generating Interpretable Counterfactual Explanations By Implicit Minimisation of Epistemic and Aleatoric Uncertainties.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021
A Bayesian Perspective on Training Speed and Model Selection.
CoRR, 2020
Revisiting the Train Loss: an Efficient Performance Estimator for Neural Architecture Search.
CoRR, 2020
Capsule Networks - A Probabilistic Perspective.
CoRR, 2020
A Bayesian Perspective on Training Speed and Model Selection.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020