Tackling Polysemanticity with Neuron Embeddings.
CoRR, 2024
Neuron to Graph: Interpreting Language Model Neurons at Scale.
CoRR, 2023
N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models.
CoRR, 2023
REET: robustness evaluation and enhancement toolbox for computational pathology.
Bioinform., 2022
Now You See It, Now You Dont: Adversarial Vulnerabilities in Computational Pathology.
CoRR, 2021