Shauli Ravfogel

Orcid: 0000-0001-8442-9311

According to our database1, Shauli Ravfogel authored at least 40 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GRADE: Quantifying Sample Diversity in Text-to-Image Models.
CoRR, 2024

Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces.
CoRR, 2024

On Affine Homotopy between Language Encoders.
CoRR, 2024

Language Imbalance Can Boost Cross-lingual Generalisation.
CoRR, 2024

What Changed? Converting Representational Interventions to Natural Language.
CoRR, 2024

MiMiC: Minimally Modified Counterfactuals in the Representation Space.
CoRR, 2024

Representation Surgery: Theory and Practice of Affine Steering.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Language Concept Erasure for Language-invariant Dense Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Visual Comparison of Language Model Adaptation.
IEEE Trans. Vis. Comput. Graph., 2023

The Curious Case of Hallucinatory Unanswerablity: Finding Truths in the Hidden States of Over-Confident Large Language Models.
CoRR, 2023

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations.
CoRR, 2023

Retrieving Texts based on Abstract Descriptions.
CoRR, 2023

Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LEACE: Perfect linear concept erasure in closed form.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

The Curious Case of Hallucinatory (Un)answerability: Finding Truths in the Hidden States of Over-Confident Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Guiding LLM to Fool Itself: Automatically Manipulating Machine Reading Comprehension Shortcut Triggers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Conformal Nucleus Sampling.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Linear Guardedness and its Implications.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Few-shot Fine-tuning vs. In-context Learning: A Fair Comparison and Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions.
CoRR, 2022

Analyzing Gender Representation in Multilingual Models.
Proceedings of the 7th Workshop on Representation Learning for NLP, 2022

Linear Adversarial Concept Erasure.
Proceedings of the International Conference on Machine Learning, 2022

Adversarial Concept Erasure in Kernel Space.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

2021
Amnesic Probing: Behavioral Explanation With Amnesic Counterfactuals.
Trans. Assoc. Comput. Linguistics, 2021

Erratum: Measuring and Improving Consistency in Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2021

Measuring and Improving Consistency in Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2021

Ab Antiquo: Neural Proto-language Reconstruction.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Contrastive Explanations for Model Interpretability.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Counterfactual Interventions Reveal the Causal Effect of Relative Clause Representations on Agreement Prediction.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Neural Extractive Search.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
When Bert Forgets How To POS: Amnesic Probing of Linguistic Properties and MLM Predictions.
CoRR, 2020

Unsupervised Distillation of Syntactic Information from Contextualized Word Representations.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

The Extraordinary Failure of Complement Coercion Crowdsourcing.
Proceedings of the First Workshop on Insights from Negative Results in NLP, 2020

2019
Ab Antiquo: Proto-language Reconstruction with RNNs.
CoRR, 2019

Studying the Inductive Biases of RNNs with Synthetic Variations of Natural Languages.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Can LSTM Learn to Capture Agreement? The Case of Basque.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018


  Loading...