Nandi Schoots

According to our database, Nandi Schoots authored at least 13 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Hidden in Plain Text: Emergence & Mitigation of Steganographic Collusion in LLMs.
CoRR, 2024

Training Neural Networks for Modularity aids Interpretability.
CoRR, 2024

Extending Activation Steering to Broad Skills and Multiple Behaviours.
CoRR, 2024

Dissecting Language Models: Machine Unlearning via Selective Pruning.
CoRR, 2024

The Propensity for Density in Feed-Forward Models.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Channel Randomisation Methods for Zero-Shot Communication.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

2023
Improving Activation Steering in Language Models with Mean-Centring.
CoRR, 2023

Comparing Optimization Targets for Contrast-Consistent Search.
CoRR, 2023

Any Deep ReLU Network is Shallow.
CoRR, 2023

Low-Entropy Latent Variables Hurt Out-of-Distribution Performance.
CoRR, 2023

A theory of representation learning gives a deep generalisation of kernel methods.
Proceedings of the International Conference on Machine Learning, 2023

2021
Learning to Communicate with Strangers via Channel Randomisation Methods.
CoRR, 2021

Safety Properties of Inductive Logic Programming.
Proceedings of the Workshop on Artificial Intelligence Safety 2021 (SafeAI 2021) co-located with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021

