Yonatan Belinkov

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

ReFACT: Updating Text-to-Image Models by Editing the Text Encoder.

[BibT_eX]

[DOI]

Dana Arad

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Linearity of Relation Decoding in Transformer Language Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Fast Forwarding Low-Rank Training.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Backward Lens: Projecting Language Model Gradients into the Vocabulary Space.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry.

[BibT_eX]

[DOI]

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Generating Benchmarks for Factuality Evaluation of Language Models.

[BibT_eX]

[DOI]

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Concept-Best-Matching: Evaluating Compositionality In Emergent Communication.

[BibT_eX]

[DOI]

Boaz Carmeli

Ron Meir

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Accelerating the Global Aggregation of Local Explanations.

[BibT_eX]

[DOI]

Alon Mor

Giambattista Parascandolo

Benny Kimelfeld

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.

[BibT_eX]

[DOI]

Bartlomiej Bojanowski

Christopher D. Manning

Daniel Moseguí González

Eunice Engefu Manyasi

Evgenii Zheltonozhskii

Fanyue Xia

Fatemeh Siar

Fernando Martínez-Plumed

Giorgio Mariani

Gloria Wang

Gonzalo Jaimovitch-López

Jaime Fernández Fisac

Jascha Sohl-Dickstein

José Hernández-Orallo

Karthik Gopalakrishnan

Lidia Contreras Ochando

Louis-Philippe Morency

María José Ramírez-Quintana

Michael I. Ivanitskiy

Neta Gur-Ari Krakover

Nitish Shirish Keskar

Pablo Antonio Moreno Casares

Pegah Alipoormolabashi

Shyamolima (Shammie) Debnath

Sneha Priscilla Makini

Yadollah Yaghoobzadeh

Trans. Mach. Learn. Res., 2023

Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias.

[BibT_eX]

[DOI]

CoRR, 2023

Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis.

[BibT_eX]

[DOI]

Alessandro Stolfo

Mrinmaya Sachan

CoRR, 2023

Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT.

[BibT_eX]

[DOI]

Shahar Katz

CoRR, 2023

ContraSim - A Similarity Measure Based on Contrastive Learning.

[BibT_eX]

[DOI]

Adir Rahamim

CoRR, 2023

Mass-Editing Memory in a Transformer.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Multiple sequence alignment as a sequence-to-sequence learning problem.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Editing Implicit Assumptions in Text-to-Image Diffusion Models.

[BibT_eX]

[DOI]

Bahjat Kawar

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis.

[BibT_eX]

[DOI]

Alessandro Stolfo

Mrinmaya Sachan

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers.

[BibT_eX]

[DOI]

Shahar Katz

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

When Language Models Fall in Love: Animacy Processing in Transformer Language Models.

[BibT_eX]

[DOI]

Michael Hanna

Sandro Pezzelle

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

FigureOut - Automatic Detection of Metaphors in Hebrew Across the Eras.

[BibT_eX]

[DOI]

Proceedings of the Annual International Conference of the Alliance of Digital Humanities Organizations, 2023

Parallel Context Windows for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

BLIND: Bias Removal With No Demographics.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection.

[BibT_eX]

[DOI]

Shadi Iskander

Kira Radinsky

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Emergent Quantized Communication.

[BibT_eX]

[DOI]

Boaz Carmeli

Ron Meir

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Parallel Context Windows Improve In-Context Learning of Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

Debiasing NLP Models Without Demographic Information.

[BibT_eX]

[DOI]

CoRR, 2022

Choose Your Lenses: Flaws in Gender Bias Evaluation.

[BibT_eX]

[DOI]

CoRR, 2022

Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions.

[BibT_eX]

[DOI]

Abhilasha Ravichander

CoRR, 2022

IDANI: Inference-time Domain Adaptation via Neuron-level Interventions.

[BibT_eX]

[DOI]

Omer Antverg

Eyal Ben-David

CoRR, 2022

MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning.

[BibT_eX]

[DOI]

CoRR, 2022

Locating and Editing Factual Knowledge in GPT.

[BibT_eX]

[DOI]

CoRR, 2022

Probing Classifiers: Promises, Shortcomings, and Advances.

[BibT_eX]

[DOI]

Comput. Linguistics, 2022

A Generative Approach for Mitigating Structural Biases in Natural Language Inference.

[BibT_eX]

[DOI]

Dimion Asael

Zachary M. Ziegler

Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, 2022

Locating and Editing Factual Associations in GPT.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Measures of Information Reflect Memorization Patterns.

[BibT_eX]

[DOI]

Rachit Bansal

Danish Pruthi

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

How Gender Debiasing Affects Internal Model Representations, and Why It Matters.

[BibT_eX]

[DOI]

Seraphina Goldfarb-Tarrant

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

On the Pitfalls of Analyzing Individual Neurons in Language Models.

[BibT_eX]

[DOI]

Omer Antverg

Proceedings of the Tenth International Conference on Learning Representations, 2022

A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference.

[BibT_eX]

[DOI]

Kerem Zaman

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Supervising Model Attention with Human Explanations for Robust Natural Language Inference.

[BibT_eX]

[DOI]

Joe Stacey

Marek Rei

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Natural Language Inference with a Human Touch: Using Human Explanations to Guide Model Attention.

[BibT_eX]

[DOI]

Joe Stacey

Marek Rei

CoRR, 2021

Probing Classifiers: Promises, Shortcomings, and Alternatives.

[BibT_eX]

[DOI]

CoRR, 2021

IRM - when it works and when it doesn't: A test case of natural language inference.

[BibT_eX]

[DOI]

Yana Dranker

He He

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning from others' mistakes: Avoiding dataset biases without modeling them.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Variational Information Bottleneck for Effective Low-Resource Fine-Tuning.

[BibT_eX]

[DOI]

Rabeeh Karimi Mahabadi

James Henderson

Proceedings of the 9th International Conference on Learning Representations, 2021

Similarity Analysis of Self-Supervised Speech Representations.

[BibT_eX]

[DOI]

Yu-An Chung

Proceedings of the IEEE International Conference on Acoustics, 2021

Debiasing Methods in Natural Language Understanding Make Bias More Accessible.

[BibT_eX]

[DOI]

Michael Mendelson

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance?

[BibT_eX]

[DOI]

Abhilasha Ravichander

Eduard H. Hovy

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Probing Neural Dialog Models for Conversational Understanding.

[BibT_eX]

[DOI]

CoRR, 2020

Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias.

[BibT_eX]

[DOI]

CoRR, 2020

Exploiting Redundancy in Pre-trained Language Models for Efficient Transfer Learning.

[BibT_eX]

[DOI]

CoRR, 2020

On the Linguistic Representational Power of Neural Machine Translation Models.

[BibT_eX]

[DOI]

Comput. Linguistics, 2020

Findings of the WMT 2020 Shared Task on Machine Translation Robustness.

[BibT_eX]

[DOI]

Proceedings of the Fifth Conference on Machine Translation, 2020

Investigating Gender Bias in Language Models Using Causal Mediation Analysis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Constructive Prediction of the Generalization Error Across Scales.

[BibT_eX]

[DOI]

Jonathan S. Rosenfeld

Amir Rosenfeld

Nir Shavit

Proceedings of the 8th International Conference on Learning Representations, 2020

Analyzing Individual Neurons in Pre-trained Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Analyzing Redundancy in Pretrained Transformer Models.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Similarity Analysis of Contextual Word Representation Models.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

End-to-End Bias Mitigation by Modelling Biases in Corpora.

[BibT_eX]

[DOI]

Rabeeh Karimi Mahabadi

James Henderson

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Interpretability and Analysis in Neural NLP.

[BibT_eX]

[DOI]

Sebastian Gehrmann

Ellie Pavlick

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Analysis Methods in Neural Language Processing: A Survey.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2019

Studying the history of the Arabic language: language technology and a large-scale historical corpus.

[BibT_eX]

[DOI]

Alexander Magidow

Alberto Barrón-Cedeño

Avi Shmidman

Maxim Romanov

Lang. Resour. Evaluation, 2019

Language processing and learning models for community question answering in Arabic.

[BibT_eX]

[DOI]

Salvatore Romeo

Giovanni Da San Martino

Alberto Barrón-Cedeño

Inf. Process. Manag., 2019

Memory-Augmented Recurrent Neural Networks Can Learn Generalized Dyck Languages.

[BibT_eX]

[DOI]

CoRR, 2019

Adversarial Regularization for Visual Question Answering: Strengths, Shortcomings, and Side Effects.

[BibT_eX]

[DOI]

Gabriel Grand

CoRR, 2019

LSTM Networks Can Perform Dynamic Counting.

[BibT_eX]

[DOI]

CoRR, 2019

Character-based Surprisal as a Model of Human Reading in the Presence of Errors.

[BibT_eX]

[DOI]

CoRR, 2019

Findings of the First Shared Task on Machine Translation Robustness.

[BibT_eX]

[DOI]

Xian Li

Paul Michel

Antonios Anastasopoulos

Proceedings of the Fourth Conference on Machine Translation, 2019

On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference.

[BibT_eX]

[DOI]

Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

Linguistic Knowledge and Transferability of Contextual Representations.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

One Size Does Not Fit All: Comparing NMT Representations of Different Granularities.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition.

[BibT_eX]

[DOI]

Ahmed Ali

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Identifying and Controlling Important Neurons in Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Learning Representations, 2019

Character-based Surprisal as a Model of Reading Difficulty in the Presence of Errors.

[BibT_eX]

[DOI]

Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019

Analyzing the Structure of Attention in a Transformer Language Model.

[BibT_eX]

[DOI]

Jesse Vig

Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019

Improving Neural Language Models by Segmenting, Attending, and Predicting the Future.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Don't Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference.

[BibT_eX]

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

On internal language representations in deep learning: an analysis of machine translation and speech recognition.

[BibT_eX]

[DOI]

PhD thesis, 2018

On Evaluating the Generalization of LSTM Models in Formal Languages.

[BibT_eX]

[DOI]

Mirac Suzgun

Stuart M. Shieber

CoRR, 2018

On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Synthetic and Natural Noise Both Break Neural Machine Translation.

[BibT_eX]

[DOI]

Yonatan Bisk

Proceedings of the 6th International Conference on Learning Representations, 2018

2017

Analysis of sentence embedding models using prediction tasks in natural language processing.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2017

Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Neural Machine Translation Training in a Multi-Domain Scenario.

[BibT_eX]

[DOI]

Proceedings of the 14th International Conference on Spoken Language Translation, 2017

QMDIS: QCRI-MIT Advanced Dialect Identification System.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Understanding and Improving Morphological Learning in the Neural Machine Translation Decoder.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Learning Representations, 2017

Challenging Language-Dependent Segmentation for Arabic: An Application to Machine Translation and Part-of-Speech Tagging.

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

What do Neural Machine Translation Models Learn about Morphology?

[BibT_eX]

[DOI]

Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016

Large-Scale Machine Translation between Arabic and Hebrew: Available Corpora and Initial Results.

[BibT_eX]

[DOI]

CoRR, 2016

A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects.

[BibT_eX]

[DOI]

Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

Improving Sequence to Sequence Learning for Morphological Inflection Generation: The BIU-MIT Systems for the SIGMORPHON 2016 Shared Task for Morphological Reinflection.

[BibT_eX]

[DOI]

Roee Aharoni

Yoav Goldberg

Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, 2016

SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering.

[BibT_eX]

[DOI]

Mitra Mohtarami