Tackling Polysemanticity with Neuron Embeddings.

[DOI]

Alex Foote

CoRR, 2024

Neuron to Graph: Interpreting Language Model Neurons at Scale.

[DOI]

,

,

,

,

,

CoRR, 2023

N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models.

[DOI]

,

,

,

,

CoRR, 2023

REET: robustness evaluation and enhancement toolbox for computational pathology.

[DOI]

,

,

,

Bioinform., 2022

Now You See It, Now You Dont: Adversarial Vulnerabilities in Computational Pathology.

[DOI]

,

,

,

,

,

CoRR, 2021