Dirk Hovy

Orcid: 0000-0002-4618-3127

Affiliations:
  • Bocconi University, Milan, Italy


According to our database, Dirk Hovy authored at least 128 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
What Can Natural Language Processing Do for Peer Review?
CoRR, 2024

The Call for Socially Aware Language Technologies.
CoRR, 2024

Conversations as a Source for Teaching Scientific Concepts at Different Education Levels.
CoRR, 2024

SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety.
CoRR, 2024

Multilingual Speech Models for Automatic Speech Recognition Exhibit Gender Performance Gaps.
CoRR, 2024

Comparing Human-Centered Language Modeling: Is it Better to Model Groups, Individual Traits, or Both?
CoRR, 2024

Comparing Pre-trained Human Language Models: Is it Better with Human Context as Groups, Individual Traits, or Both?
Proceedings of the 14th Workshop on Computational Approaches to Subjectivity, 2024

XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Impoverished Language Technology: The Lack of (Social) Class in NLP.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts.
Proceedings of the 19th Workshop on Innovative Use of NLP for Building Educational Applications, 2024

Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Compromesso! Italian Many-Shot Jailbreaks undermine the safety of Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Classist Tools: Social Class Correlates with Performance in NLP.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Viewpoint: Artificial Intelligence Accidents Waiting to Happen?
J. Artif. Intell. Res., 2023

Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?
CoRR, 2023

How to Use Large Language Models for Text Coding: The Case of Fatherhood Roles in Public Policy Documents.
CoRR, 2023

Leveraging Label Variation in Large Language Models for Zero-Shot Text Classification.
CoRR, 2023

The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond Sociodemographics.
CoRR, 2023

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP.
CoRR, 2023

Leveraging Social Interactions to Detect Misinformation on Social Media.
CoRR, 2023

Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement.
CoRR, 2023

Beyond Digital "Echo Chambers": The Role of Viewpoint Diversity in Political Discussion.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

MilaNLP at SemEval-2023 Task 10: Ensembling Domain-Adapted and Regularized Pretrained Language Models for Robust Sexism Detection.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Top-Down Influence? Predicting CEO Personality and Risk Impact from Speech Transcripts.
Proceedings of the Seventeenth International AAAI Conference on Web and Social Media, 2023

Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

The Ecological Fallacy in Annotation: Modeling Human Label Variation goes beyond Sociodemographics.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

The State of Profanity Obfuscation in Natural Language Processing Scientific Publications.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

What about "em"? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Gender and Age Bias in Commercial Machine Translation.
Proceedings of the Towards Responsible Machine Translation, 2023

2022
ProSiT! Latent Variable Discovery with PROgressive SImilarity Thresholds.
CoRR, 2022

The State of Profanity Obfuscation in Natural Language Processing.
CoRR, 2022

Is It Worth the (Environmental) Cost? Limited Evidence for the Benefits of Diachronic Continuous Training.
CoRR, 2022

On the Limitations of Sociodemographic Adaptation with Transformers.
CoRR, 2022

XLM-EMO: Multilingual Emotion Prediction in Social Media Text.
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

Guiding the Release of Safer E2E Conversational AI through Value Sensitive Design.
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals.
Proceedings of the Second Workshop on Language Technology for Equality, 2022

Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SocioProbe: What, When, and Where Language Models Learn about Sociodemographics.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Bridging Fairness and Environmental Sustainability in Natural Language Processing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

"It's Not Just Hate": A Multi-Dimensional Perspective on Detecting Harmful Speech Online.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

SafetyKit: First Aid for Measuring Safety in Open-domain Conversational Systems.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Five sources of bias in natural language processing.
Lang. Linguistics Compass, 2021

Learning from Disagreement: A Survey.
J. Artif. Intell. Res., 2021

Language Invariant Properties in Natural Language Processing.
CoRR, 2021

Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling.
CoRR, 2021

Universal Joy: A Data Set and Results for Classifying Emotions Across Languages.
Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

MilaNLP @ WASSA: Does BERT Feel Sad When You Cry?
Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

FEEL-IT: Emotion and Sentiment Classification for the Italian Language.
Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

HONEST: Measuring Hurtful Sentence Completion in Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

The Importance of Modeling Social Factors of Language: Theory and Practice.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

BERTective: Language Models and Contextual Information for Deception Detection.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Cross-lingual Contextualized Topic Models with Zero-shot Learning.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

"We will Reduce Taxes" - Identifying Election Pledges with Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

On the Gap between Adoption and Understanding in NLP.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
What the [MASK]? Making Sense of Language-Specific BERT Models.
CoRR, 2020

A Report on the VarDial Evaluation Campaign 2020.
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 2020

A Case for Soft Loss Functions.
Proceedings of the Eighth AAAI Conference on Human Computation and Crowdsourcing, 2020

Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on Success.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

"You Sound Just Like Your Father" Commercial Machine Translation Systems Include Stylistic Biases.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Integrating Ethics into the NLP Curriculum.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

2019
Recognizing and Reducing Bias in NLP Applications.
Proceedings of the Sixth Italian Conference on Computational Linguistics, 2019

Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Identifying Linguistic Areas for Geolocation.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Dense Node Representation for Geolocation.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Geolocation with Attention-Based Multitask Learning Models.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

2018
Comparing Bayesian Models of Annotation.
Trans. Assoc. Comput. Linguistics, 2018

Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Improving Author Attribute Prediction by Retrofitting Linguistic Representations with Homophily.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

The Social and the Neural Network: How to Make Natural Language Processing about People again.
Proceedings of the Second Workshop on Computational Modeling of People's Opinions, 2018

2017
Multi-Task Learning for Mental Health using Social Media Text.
CoRR, 2017

End-to-End Information Extraction without Token-Level Supervision.
Proceedings of the Workshop on Speech-Centric Natural Language Processing, 2017

Multitask Learning for Mental Health Conditions with Limited Social Media Data.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Huntsville, hospitals, and hockey teams: Names can reveal your location.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2016
SemEval-2016 Task 10: Detecting Minimal Semantic Units and their Meanings (DiMSUM).
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter.
Proceedings of the Student Research Workshop, 2016

Learning a POS tagger for AAVE-like language.
Proceedings of the NAACL HLT 2016, 2016

Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The Social Impact of Natural Language Processing.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

The Enemy in Your Own Camp: How Well Can We Detect Statistically-Generated Fake Reviews - An Adversarial Study.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations.
Proceedings of the ACL 2016 Student Research Workshop, Berlin, Germany, August 7-12, 2016

2015
User Review Sites as a Resource for Large-Scale Sociolinguistic Studies.
Proceedings of the 24th International Conference on World Wide Web, 2015

Personality Traits on Twitter - or - How to Get 1,500 Personality Tests in a Week.
Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, 2015

Mining for unambiguous instances to adapt part-of-speech taggers to new domains.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

The Rating Game: Sentiment Rating Reproducibility from Text.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Cross-lingual syntactic variation over age and gender.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Challenges of studying and processing dialects in social media.
Proceedings of the Workshop on Noisy User-generated Text, 2015

Tagging Performance Correlates with Author Age.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Demographic Factors Improve Classification Performance.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

If all you have is a bit of the Bible: Learning POS taggers for truly low-resource languages.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Robust Cross-Domain Sentiment Analysis for Low-Resource Languages.
Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, 2014

More or less supervised supersense tagging of Twitter.
Proceedings of the Third Joint Conference on Lexical and Computational Semantics, 2014

Copenhagen-Malmö: Tree Approximations of Semantic Parsing Problems.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Augmenting English Adjective Senses with Supersenses.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

When POS data sets don't add up: Combatting sample bias.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Crowdsourcing and annotating NER for Twitter #drift.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Learning part-of-speech taggers with inter-annotator agreement loss.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

What's in a p-value in NLP?
Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014

Selection Bias, Label Bias, and Bias in Ground Truth.
Proceedings of the COLING 2014, 2014

Adapting taggers to Twitter with not-so-distant supervision.
Proceedings of the COLING 2014, 2014

Linguistically debatable or just plain wrong?
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Experiments with crowdsourced re-annotation of a POS tagging data set.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

How Well can We Learn Interpretable Entity Types from Text?
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Solving electrical networks to incorporate supervision in random walks.
Proceedings of the 22nd International World Wide Web Conference, 2013

Learning Whom to Trust with MACE.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Analysis and modeling of "focus" in context.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A Walk-Based Semantically Enriched Tree Kernel Over Distributed Word Representations.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
When Did that Happen? - Linking Events and Relations to Timestamps.
Proceedings of the EACL 2012, 2012

2011
Unsupervised Discovery of Domain-Specific Knowledge from Text.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Models and Training for Unsupervised Preposition Sense Disambiguation.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, Oregon, USA, 19-24 June, 2011

2010
What's in a Preposition? Dimensions of Sense Disambiguation for an Interesting Word Class.
Proceedings of the COLING 2010, 2010

2009
Disambiguation of Preposition Sense Using Linguistically Motivated Features.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

