Marco Baroni

ORCID: 0000-0001-5066-3580

Affiliations:
  • University of Trento


According to our database, Marco Baroni authored at least 137 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Emergence of a High-Dimensional Abstraction Phase in Language Transformers.
CoRR, 2024

Linearly Controlled Language Generation with Performative Guarantees.
CoRR, 2024

MemoryPrompt: A Light Wrapper to Improve Context Tracking in Pre-trained Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Human-like systematic generalization through a meta-learning neural network.
Nat., 2023

Can discrete information extraction prompts generalize across language models?
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Unnatural language processing: How do language models handle machine-generated prompts?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Bridging Information-Theoretic and Geometric Compression in Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Cross-Domain Image Captioning with Discriminative Finetuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Referential Communication in Heterogeneous Communities of Pre-trained Visual Deep Networks.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Communication breakdown: On the low mutual intelligibility between human and neural captioning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
How BPE Affects Memorization in Transformers.
CoRR, 2021

On the proper role of linguistically-oriented deep net analysis in linguistic theorizing.
CoRR, 2021

Interpretable agent communication from scratch (with a generic visual processor emerging on the side).
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Controlled tasks for model analysis: Retrieving discrete information from sequences.
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

2020
Exploring Processing of Nested Dependencies in Neural-Network Language Models and Humans.
CoRR, 2020

Emergent Multi-Agent Communication in the Deep Learning Era.
CoRR, 2020

Syntactic Structure from Deep Learning.
CoRR, 2020

Rat big, cat eaten! Ideas for a useful deep-agent protolanguage.
CoRR, 2020

A Benchmark for Systematic Generalization in Grounded Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Entropy Minimization In Emergent Languages.
Proceedings of the 37th International Conference on Machine Learning, 2020

Permutation Equivariant Models for Compositional Generalization in Language.
Proceedings of the 8th International Conference on Learning Representations, 2020

Emergent Language Generalization and Acquisition Speed are not tied to Compositionality.
Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2020

Compositionality and Generalization In Emergent Languages.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Tabula nearly rasa: Probing the linguistic knowledge of character-level neural language models trained on unsegmented text.
Trans. Assoc. Comput. Linguistics, 2019

Focus on What's Informative and Ignore What's not: Communication Strategies in a Referential Game.
CoRR, 2019

Information Minimization In Emergent Languages.
CoRR, 2019

Linguistic generalization and compositionality in modern artificial neural networks.
CoRR, 2019

Anti-efficient encoding in emergent communication.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

The emergence of number and syntax units in LSTM language models.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

EGG: a toolkit for research on Emergence of lanGuage in Games.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Human few-shot learning of compositional instructions.
Proceedings of the 41st Annual Meeting of the Cognitive Science Society, 2019

CNNs found to jump around more skillfully than RNNs: Compositional Generalization in Seq2seq Convolutional Networks.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Word-order Biases in Deep-agent Emergent Communication.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Miss Tools and Mr Fruit: Emergent Communication in Agents Learning about Object Affordances.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

On the Distribution of Deep Clausal Embeddings: A Large Cross-linguistic Study.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Memorize or generalize? Searching for a compositional RNN in a haystack.
CoRR, 2018

Colorless Green Recurrent Networks Dream Hierarchically.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Generalization without Systematicity: On the Compositional Skills of Sequence-to-Sequence Recurrent Networks.
Proceedings of the 35th International Conference on Machine Learning, 2018

Causal Discovery Using Proxy Variables.
Proceedings of the 6th International Conference on Learning Representations, 2018

Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

How agents see things: On visual representations in an emergent language game.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Jump to better conclusions: SCAN both left and right.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Solar Decathlon ME18 competition as a "learning by doing" experience for students: The case of the team HAAB.
Proceedings of the 2018 IEEE Global Engineering Education Conference, 2018

What you can cram into a single $&!#* vector: Probing sentence embeddings for linguistic properties.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Still not systematic after all these years: On the compositional skills of sequence-to-sequence recurrent networks.
CoRR, 2017

Living a discrete life in a continuous world: Reference with distributed representations.
CoRR, 2017

Spicy Adjectives and Nominal Donkeys: Capturing Semantic Deviance Using Compositionality in Distributional Spaces.
Cogn. Sci., 2017

A New AI Evaluation Cosmos: Ready to Play the Game?
AI Mag., 2017

Living a discrete life in a continuous world: Reference in cross-modal entity tracking.
Proceedings of the IWCS 2017 - 12th International Conference on Computational Semantics - Short papers, Montpellier, France, September 19, 2017

Multi-Agent Cooperation and the Emergence of (Natural) Language.
Proceedings of the 5th International Conference on Learning Representations, 2017

CommAI: Evaluating the first steps towards a useful general AI.
Proceedings of the 5th International Conference on Learning Representations, 2017

High-risk learning: acquiring new word vectors from tiny data.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

"Show Me the Cup": Reference with Continuous Representations.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017

2016
Neural sensitivity to syllable frequency and mutual information in speech perception and production.
NeuroImage, 2016

SICK through the SemEval glasses. Lesson learned from the evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment.
Lang. Resour. Evaluation, 2016

Grounding Distributional Semantics in the Visual World.
Lang. Linguistics Compass, 2016

Towards Multi-Agent Communication-Based Language Learning.
CoRR, 2016

The red one!: On learning to refer to things based on their discriminative properties.
CoRR, 2016

When the Whole Is Less Than the Sum of Its Parts: How Composition Affects PMI Values in Distributional Semantic Vectors.
Comput. Linguistics, 2016

There Is No Logical Negation Here, But There Are Alternatives: Modeling Conversational Negation with Distributional Semantics.
Comput. Linguistics, 2016

Multimodal Semantic Learning from Child-Directed Input.
Proceedings of the NAACL HLT 2016, 2016

A Roadmap Towards Machine Intelligence.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2016

The LAMBADA dataset: Word prediction requiring a broad discourse context.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

The red one!: On learning to refer to things based on discriminative properties.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
From Visual Attributes to Adjectives through Decompositional Distributional Semantics.
Trans. Assoc. Comput. Linguistics, 2015

Deriving Boolean Structures from Distributional Vectors.
Trans. Assoc. Comput. Linguistics, 2015

Reading visually embodied meaning from the brain: Visually grounded computational models decode visual-object mental imagery induced by written text.
NeuroImage, 2015

Unveiling the Dreams of Word Embeddings: Towards Language-Driven Image Generation.
CoRR, 2015

Improving zero-shot learning by mitigating the hubness problem.
Proceedings of the 3rd International Conference on Learning Representations, 2015

When the Whole Is Not Greater Than the Combination of Its Parts: A "Decompositional" Look at Compositional Distributional Semantics.
Comput. Linguistics, 2015

Leveraging Preposition Ambiguity to Assess Compositional Distributional Models of Semantics.
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, 2015

Combining Language and Vision with a Multimodal Skip-gram Model.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

So similar and yet incompatible: Toward the automated identification of semantically compatible words.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Distributional vectors encode referential attributes.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

A Multitask Objective to Inject Lexical Contrast into Distributional Semantics.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Jointly optimizing word representations for lexical and sentential tasks with the C-PHRASE model.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Hubness and Pollution: Delving into Cross-Space Mapping for Zero-Shot Learning.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Do Distributed Semantic Models Dream of Electric Sheep? Visualizing Word Representations through Image Synthesis.
Proceedings of the Fourth Workshop on Vision and Language, 2015

2014
Predication Drives Verb Cortical Signatures.
J. Cogn. Neurosci., 2014

Multimodal Distributional Semantics.
J. Artif. Intell. Res., 2014

Dead parrots make bad pets: Exploring modifier effects in noun phrases.
Proceedings of the Third Joint Conference on Lexical and Computational Semantics, 2014

SemEval-2014 Task 1: Evaluation of Compositional Distributional Semantic Models on Full Sentences through Semantic Relatedness and Textual Entailment.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

A SICK cure for the evaluation of compositional distributional semantic models.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Improving the Lexical Function Composition Model with Pathwise Optimized Elastic-Net Regression.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

A practical and linguistically-motivated approach to compositional distributional semantics.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Is this a wampimuk? Cross-modal mapping between distributional semantics and the visual world.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

How to make words with vectors: Phrase generation in distributional semantics.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Processing of speech and non-speech sounds in the supratemporal plane: Auditory input preference does not predict sensitivity to statistical structure.
NeuroImage, 2013

Composition in Distributional Semantics.
Lang. Linguistics Compass, 2013

Recent advancements in human language technology in Italy.
Intelligenza Artificiale, 2013

SHALOM - Space-borne hyperspectral applicative land and ocean mission.
Proceedings of the 5th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing, 2013

Multi-Step Regression Learning for Compositional Distributional Semantics.
Proceedings of the 10th International Conference on Computational Semantics, 2013

Intensionality was only alleged: On adjective-noun composition in distributional semantics.
Proceedings of the 10th International Conference on Computational Semantics, 2013

Studying the Recursive Behaviour of Adjectival Modification with Compositional Distributional Semantics.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Fish Transporters and Miracle Homes: How Compositional Distributional Semantics can Help NP Parsing.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Of Words, Eyes and Brains: Correlating Image-Based Distributional Semantic Models with Neural Representations of Concepts.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Compositional-ly Derived Representations of Morphologically Complex Words in Distributional Semantics.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

General estimation and evaluation of compositional distributional semantic models.
Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, 2013

DISSECT - DIStributional SEmantics Composition Toolkit.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Visual Features for Linguists: Basic image analysis techniques for multimodally-curious NLPers.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

A relatedness benchmark to test the role of determiners in compositional distributional semantics.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Bootstrapping a Game with a Purpose for Commonsense Collection.
ACM Trans. Intell. Syst. Technol., 2012

Distributional semantics with eyes: using image analysis to improve computational representations of word meaning.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Compositionality in (high-dimensional) space.
Proceedings of the 11th Conference on Natural Language Processing, 2012

Entailment above the word level in distributional semantics.
Proceedings of the EACL 2012, 2012

Distributional Semantics in Technicolor.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Stereotypical gender actions can be extracted from web text.
J. Assoc. Inf. Sci. Technol., 2011

A distributional similarity approach to the detection of semantic change in the Google Books Ngram corpus.
Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics, 2011

Distributional semantics from text and images.
Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics, 2011

How we BLESSed distributional semantic evaluation.
Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics, 2011

2010
Distributional Memory: A General Framework for Corpus-Based Semantics.
Comput. Linguistics, 2010

Strudel: A Corpus-Based Semantic Model Based on Properties and Types.
Cogn. Sci., 2010

BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Nouns are Vectors, Adjectives are Matrices: Representing Adjective-Noun Constructions in Semantic Space.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Predicting Cognitively Salient Modifiers of the Constitutive Parts of Concepts.
Proceedings of the 2010 Workshop on Cognitive Modeling and Computational Linguistics, 2010

The Concept Game: Better Commonsense Knowledge Extraction by Combining Text Mining and a Game with a Purpose.
Proceedings of the Commonsense Knowledge, 2010

2009
The WaCky wide web: a collection of very large linguistically processed web-crawled corpora.
Lang. Resour. Evaluation, 2009

BagPack: A general framework to represent semantic relations.
CoRR, 2009

Measuring semantic relatedness with vector space models and random walks.
Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing, 2009

EEG responds to conceptual stimuli and corpus semantics.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Analyzing Interactive QA Dialogues Using Logistic Regression Models.
Proceedings of the AI*IA 2009: Emergent Perspectives in Artificial Intelligence, 2009

2008
Cleaneval: a Competition for Cleaning Web Pages.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007
Words and Echoes: Assessing and Mitigating the Non-Randomness Problem in Word Frequency Distribution Modeling.
Proceedings of the ACL 2007, 2007

zipfR: Word Frequency Modeling in R.
Proceedings of the ACL 2007, 2007

2006
A New Approach to the Study of Translationese: Machine-learning the Difference between Original and Translated Text.
Lit. Linguistic Comput., 2006

WebBootCaT. Instant Domain-Specific Corpora to Support Human Translators.
Proceedings of the 11th Annual conference of the European Association for Machine Translation, 2006

A Figure of Merit for the Evaluation of Web-Corpus Randomness.
Proceedings of the EACL 2006, 2006

Large Linguistically-Processed Web Corpora for Multiple Languages.
Proceedings of the EACL 2006, 2006

2005
The Language Component of the Fasty Text Prediction System.
Appl. Artif. Intell., 2005

2004
Introducing the La Repubblica Corpus: A Large, Annotated, TEI(XML)-compliant Corpus of Newspaper Italian.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

BootCaT: Bootstrapping Corpora and Terms from the Web.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Using Cooccurrence Statistics and the Web to Discover Synonyms in a Technical Language.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2002
Unsupervised discovery of morphologically related words based on orthographic and semantic similarity.
Proceedings of the ACL-02 Workshop on Morphological and Phonological Learning, 2002

FASTY - A Multi-lingual Approach to Text Prediction.
Proceedings of the Computers Helping People with Special Needs, 2002

Predicting the Components of German Nominal Compounds.
Proceedings of the 15th European Conference on Artificial Intelligence, 2002

Wordform- and Class-based Prediction of the Components of German Nominal Compounds in an AAC System.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

