Yonatan Belinkov
According to our database1,
Yonatan Belinkov
authored at least 127 papers
between 2013 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
CoRR, 2024
CoRR, 2024
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations.
CoRR, 2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability.
CoRR, 2024
Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions.
CoRR, 2024
REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space.
CoRR, 2024
CoRR, 2024
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models.
CoRR, 2024
Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms.
CoRR, 2024
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
Trans. Mach. Learn. Res., 2023
Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias.
CoRR, 2023
Understanding Arithmetic Reasoning in Language Models using Causal Mediation Analysis.
CoRR, 2023
Interpreting Transformer's Attention Dynamic Memory and Visualizing the Semantic Information Flow of GPT.
CoRR, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the Eleventh International Conference on Learning Representations, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
When Language Models Fall in Love: Animacy Processing in Transformer Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Annual International Conference of the Alliance of Digital Humanities Organizations, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Shielded Representations: Protecting Sensitive Attributes Through Iterative Gradient-Based Projection.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
CoRR, 2022
Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions.
CoRR, 2022
MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning.
CoRR, 2022
A Generative Approach for Mitigating Structural Biases in Natural Language Inference.
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Supervising Model Attention with Human Explanations for Robust Natural Language Inference.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Natural Language Inference with a Human Touch: Using Human Explanations to Guide Model Attention.
CoRR, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
CoRR, 2020
Exploiting Redundancy in Pre-trained Language Models for Efficient Transfer Learning.
CoRR, 2020
Comput. Linguistics, 2020
Proceedings of the Fifth Conference on Machine Translation, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Trans. Assoc. Comput. Linguistics, 2019
Studying the history of the Arabic language: language technology and a large-scale historical corpus.
Lang. Resour. Evaluation, 2019
Inf. Process. Manag., 2019
CoRR, 2019
Adversarial Regularization for Visual Question Answering: Strengths, Shortcomings, and Side Effects.
CoRR, 2019
CoRR, 2019
Proceedings of the Fourth Conference on Machine Translation, 2019
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 7th International Conference on Learning Representations, 2019
Character-based Surprisal as a Model of Reading Difficulty in the Presence of Errors.
Proceedings of the 41th Annual Meeting of the Cognitive Science Society, 2019
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Don't Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
On internal language representations in deep learning: an analysis of machine translation and speech recognition.
PhD thesis, 2018
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language Inference.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Proceedings of the 6th International Conference on Learning Representations, 2018
2017
Analysis of sentence embedding models using prediction tasks in natural language processing.
IBM J. Res. Dev., 2017
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017
Proceedings of the 14th International Conference on Spoken Language Translation, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Understanding and Improving Morphological Learning in the Neural Machine Translation Decoder.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Evaluating Layers of Representation in Neural Machine Translation on Part-of-Speech and Semantic Tagging Tasks.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017
Proceedings of the 5th International Conference on Learning Representations, 2017
Challenging Language-Dependent Segmentation for Arabic: An Application to Machine Translation and Part-of-Speech Tagging.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Large-Scale Machine Translation between Arabic and Hebrew: Available Corpora and Initial Results.
CoRR, 2016
A Character-level Convolutional Neural Network for Distinguishing Similar Languages and Dialects.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016
Improving Sequence to Sequence Learning for Morphological Inflection Generation: The BIU-MIT Systems for the SIGMORPHON 2016 Shared Task for Morphological Reinflection.
Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, 2016
SLS at SemEval-2016 Task 3: Neural-based Approaches for Ranking in Community Question Answering.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016
Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities, 2016
Proceedings of the COLING 2016, 2016
2015
Erratum: "Exploring Compositional Architectures and Word Vector Representations for Prepositional Phrase Attachment".
Trans. Assoc. Comput. Linguistics, 2015
Proceedings of the Second Workshop on Arabic Natural Language Processing, 2015
VectorSLU: A Continuous Word Vector Approach to Answer Selection in Community Question Answering Systems.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
2014
Exploring Compositional Architectures and Word Vector Representations for Prepositional Phrase Attachment.
Trans. Assoc. Comput. Linguistics, 2014
J. King Saud Univ. Comput. Inf. Sci., 2014
2013
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013