Anders Søgaard

Orcid: 0000-0001-5250-4276

According to our database, Anders Søgaard authored at least 266 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Is Unsupervised Clustering Somehow Truer?
Minds Mach., December, 2024

Do Vision and Language Models Share Concepts? A Vector Space Alignment Study.
Trans. Assoc. Comput. Linguistics, 2024

CreoleVal: Multilingual Multitask Benchmarks for Creoles.
Trans. Assoc. Comput. Linguistics, 2024

From Words to Worlds: Compositionality for Cognitive Architectures.
CoRR, 2024

Vision-Language Models under Cultural and Inclusive Considerations.
CoRR, 2024

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture.
CoRR, 2024

Comprehensive Reassessment of Large-Scale Evaluation Outcomes in LLMs: A Multifaceted Statistical Approach.
CoRR, 2024

Word Order and World Knowledge.
CoRR, 2024

Group Fairness in Multilingual Speech Recognition Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

The Impact of Differential Privacy on Group Disparity Mitigation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

MuLan: A Study of Fact Mutability in Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

On Mitigating Performance Disparities in Multilingual Speech Recognition.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Defining Knowledge: Bridging Epistemology and Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Concept Space Alignment in Multilingual LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Evaluating Webcam-based Gaze Data as an Alternative for Human Rationale Annotations.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Rawlsian AI fairness loopholes.
AI Ethics, November, 2023

Grounding the Vector Space of an Octopus: Word Meaning from Raw Text.
Minds Mach., March, 2023

CreoleVal: Multilingual Multitask Benchmarks for Creoles.
CoRR, 2023

Large language models converge toward human-like concept organization.
CoRR, 2023

Large Language Models Converge on Brain-Like Word Representations.
CoRR, 2023

WebQAmGaze: A Multilingual Webcam Eye-Tracking-While-Reading Dataset.
CoRR, 2023

Implications of the Convergence of Language and Vision Model Geometries.
CoRR, 2023

Private Meeting Summarization Without Performance Loss.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Re-Framing Case Law Citation Prediction from a Paragraph Perspective.
Proceedings of the Legal Knowledge and Information Systems, 2023

Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language Models.
Proceedings of the International Conference on Machine Learning, 2023

On the Independence of Association Bias and Empirical Fairness in Language Models.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

Copyright Violations and Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Two-Sided Discussion of Preregistration of NLP Research.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Grammatical Error Correction through Round-Trip Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Mapping Brains with Language Models: A Survey.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Being Right for Whose Right Reasons?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Multi hash embeddings in spaCy.
CoRR, 2022

Exploring the Unfairness of DP-SGD Across Settings.
CoRR, 2022

QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

What a Creole Wants, What a Creole Needs.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Evaluating Deep Taylor Decomposition for Reliability Assessment in the Wild.
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, 2022

Date Recognition in Historical Parish Records.
Proceedings of the Frontiers in Handwriting Recognition - 18th International Conference, 2022

Should We Ban English NLP for a Year?
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Are Pretrained Multilingual Models Equally Fair across Languages?
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Shortcomings of Interpretability Taxonomies for Deep Neural Networks.
Proceedings of the CIKM 2022 Workshops co-located with 31st ACM International Conference on Information and Knowledge Management (CIKM 2022), 2022

Are Multilingual Sentiment Models Equally Right for the Right Reasons?
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Challenges and Strategies in Cross-Cultural NLP.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Factual Consistency of Multilingual Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Improved Multi-label Classification under Temporal Concept Drift: Rethinking Group-Robust Algorithms in a Label-Wise Setting.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text Processing.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Word Order Does Matter and Shuffled Language Models Know It.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Ancestor-to-Creole Transfer is Not a Walk in the Park.
Proceedings of the Third Workshop on Insights from Negative Results in NLP, 2022

2021
Explainable Natural Language Processing
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02180-0, 2021

A Global-Local Attentive Relation Detection Model for Knowledge-Based Question Answering.
IEEE Trans. Artif. Intell., 2021

Do We Still Need Automatic Speech Recognition for Spoken Language Understanding?
CoRR, 2021

Revisiting Methods for Finding Influential Examples.
CoRR, 2021

Evaluation of Summarization Systems across Gender, Age, and Race.
CoRR, 2021

John praised Mary because he? Implicit Causality Bias and Its Interaction with Explicit Cues in LMs.
CoRR, 2021

Does injecting linguistic structure into language models lead to better alignment with brain recordings?
CoRR, 2021

Spurious Correlations in Cross-Topic Argument Mining.
Proceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics, 2021

Locke's Holiday: Belief Bias in Machine Reading.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

The Impact of Positional Encodings on Multilingual Compression.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Dynamic Forecasting of Conversation Derailment.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

The Effect of Round-Trip Translation on Fairness in Sentiment Analysis.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Sociolectal Analysis of Pretrained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

We Need To Talk About Random Splits.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Attention Can Reflect Syntactic Structure (If You Let It).
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Error Analysis and the Role of Morphology.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Ellipsis Resolution as Question Answering: An Evaluation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

On Language Models for Creoles.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

A Multilingual Benchmark for Probing Negation-Awareness with Minimal Pairs.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Can Language Models Encode Perceptual Structure Without Grounding? A Case Study in Color.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Do Language Models Know the Way to Rome?
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

Itihasa: A large-scale corpus for Sanskrit to English translation.
Proceedings of the 8th Workshop on Asian Translation, 2021

How far can we get with one GPU in 100 hours? CoAStaL at MultiIndicMT Shared Task.
Proceedings of the 8th Workshop on Asian Translation, 2021

Common Sense Bias in Semantic Role Labeling.
Proceedings of the Seventh Workshop on Noisy User-generated Text, 2021

Minimax and Neyman-Pearson Meta-Learning for Outlier Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

John praised Mary because _he_? Implicit Causality Bias and Its Interaction with Explicit Cues in LMs.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Is the Lottery Fair? Evaluating Winning Tickets Across Demographics.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

On the Interaction of Belief Bias and Explanations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Replicating and Extending "Because Their Treebanks Leak": Graph Isomorphism, Covariants, and Parser Performance.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Multilingual Negation Scope Resolution for Clinical Text.
Proceedings of the 12th International Workshop on Health Text Mining and Information Analysis, 2021

Analogy Training Multilingual Encoders.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Joint Semantic Analysis with Document-Level Cross-Task Coherence Rewards.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Worst-Case-Aware Curriculum Learning for Zero and Few Shot Transfer.
CoRR, 2020

Weakly Supervised POS Taggers Perform Poorly on Truly Low-Resource Languages.
CoRR, 2020

WikiBank: Using Wikidata to Improve Multilingual Frame-Semantic Parsing.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

DaNE: A Named Entity Resource for Danish.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Model-based Annotation of Coreference.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Do End-to-End Speech Recognition Models Care About Context?
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Are All Good Word Vector Spaces Isomorphic?
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Some Languages Seem Easier to Parse Because Their Treebanks Leak.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Neural Speed Reading Audited.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Type B Reflexivization as an Unambiguous Testbed for Multilingual Multi-Task Gender Bias.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Grammatical Error Correction in Low Error Density Domains: A New Benchmark and Analyses.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Parsing as Pretraining.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Weakly Supervised POS Taggers Perform Poorly on Truly Low-Resource Languages.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

What Do You Mean 'Why?': Resolving Sluices in Conversations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Cross-Lingual Word Embeddings
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02171-8, 2019

A Survey of Cross-lingual Word Embedding Models.
J. Artif. Intell. Res., 2019

Retrieval-based Goal-Oriented Dialogue Generation.
CoRR, 2019

Domain Transfer in Dialogue Systems without Turn-Level Supervision.
CoRR, 2019

Ellipsis and Coreference Resolution as Question Answering.
CoRR, 2019

CoAStaL at SemEval-2019 Task 3: Affect Classification in Dialogue using Attentive BiLSTMs.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Naive Regularizers for Low-Resource Neural Machine Translation.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Comparing Unsupervised Word Translation Methods Step by Step.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Better, Faster, Stronger Sequence Tagging Constituent Parsers.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Issue Framing in Online Discussion Fora.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A Simple and Robust Approach to Detecting Subject-Verb Agreement Errors.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Lost in Evaluation: Misleading Benchmarks for Bilingual Dictionary Induction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Adversarial Removal of Demographic Attributes Revisited.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Rewarding Coreference Resolvers for Being Consistent with World Knowledge.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Higher-order Comparisons of Sentence Encoder Representations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Noisy Channel for Low Resource Grammatical Error Correction.
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019

Unsupervised Cross-Lingual Representation Learning.
Proceedings of the 57th Conference of the Association for Computational Linguistics: Tutorial Abstracts, 2019

Multi-Task Semantic Dependency Parsing with Policy Gradient for Learning Easy-First Strategies.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Historical Text Normalization with Delayed Rewards.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Few-Shot and Zero-Shot Learning for Historical Text Normalization.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

X-WikiRE: A Large, Multilingual Resource for Relation Extraction as Machine Comprehension.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

Latent Multi-Task Architecture Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Jointly Learning to Label Sentences and Tokens.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Predicting Concrete and Abstract Entities in Modern Poetry.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Sentiment analysis under temporal shift.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

What I think when I think about treebanks.
Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, 2018

Learning Language-Independent Representations of Verbs and Adjectives from Multimodal Retrieval.
Proceedings of the 14th International Conference on Signal-Image Technology & Internet-Based Systems, 2018

Limitations of Cross-Lingual Learning from Image Search.
Proceedings of The Third Workshop on Representation Learning for NLP, 2018

Sluice Resolution without Hand-Crafted Features over Brittle Syntax Trees.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Zero-Shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Unsupervised Induction of Linguistic Categories with Records of Reading, Speaking, and Writing.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Multi-Task Learning of Pairwise Sequence Classification Tasks over Disparate Label Spaces.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Nightmare at test time: How punctuation prevents parsers from generalizing.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

A Discriminative Latent-Variable Model for Bilingual Lexicon Induction.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Linguistic representations in multi-task neural networks for ellipsis resolution.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Parameter sharing between dependency parsers for related languages.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

When does deep multi-task learning work for loosely related document classification tasks?
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Why is unsupervised alignment of English embeddings from different algorithms so hard?
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

A strong baseline for question relevancy ranking.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Sequence Classification with Human Attention.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Lexi: A tool for adaptive, personalized text simplification.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

On the Limitations of Unsupervised Bilingual Dictionary Induction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Character-level Supervision for Low-resource POS Tagging.
Proceedings of the Workshop on Deep Learning Approaches for Low-Resource NLP, 2018

Multi-task learning for historical text normalization: Size matters.
Proceedings of the Workshop on Deep Learning Approaches for Low-Resource NLP, 2018

Learning to Predict Readability Using Eye-Movement Data From Natives and Learners.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Sluice networks: Learning what to share between loosely related tasks.
CoRR, 2017

Is writing style predictive of scientific fraud?
CoRR, 2017

Using hyperlinks to improve multilingual partial parsers.
Proceedings of the 15th International Conference on Parsing Technologies, 2017

Spikes as regularizers.
Proceedings of the 25th European Symposium on Artificial Neural Networks, 2017

Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Does syntax help discourse segmentation? Not so much.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Parsing Universal Dependencies without training.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Cross-lingual tagger evaluation without test data.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

A Strong Baseline for Learning Cross-Lingual Word Embeddings from Sentence Alignments.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Cross-lingual RST Discourse Parsing.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Identifying beneficial task relations for multi-task learning in deep neural networks.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Using Gaze to Predict Text Readability.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Evaluating hypotheses in geolocation on a very large sample of Twitter.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

Huntsville, hospitals, and hockey teams: Names can reveal your location.
Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

Cross-lingual and cross-domain discourse segmentation of entire documents.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Learning attention for historical text normalization by learning to pronounce.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Multi-Task Learning of Keyphrase Boundary Classification.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Multilingual Projection for Parsing Truly Low-Resource Languages.
Trans. Assoc. Comput. Linguistics, 2016

Empirical Gaussian priors for cross-lingual transfer learning.
CoRR, 2016

Reconsidering Cross-lingual Word Embeddings.
CoRR, 2016

A Test Suite for Evaluating POS Taggers across Varieties of English.
Proceedings of the 25th International Conference on World Wide Web, 2016

Evaluating word embeddings with fMRI and eye-tracking.
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016

Improving sentence compression by learning to predict gaze.
Proceedings of the NAACL HLT 2016, 2016

Learning a POS tagger for AAVE-like language.
Proceedings of the NAACL HLT 2016, 2016

The SemDaX Corpus - Sense Annotations with Scalable Sense Inventories.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Multi-view and multi-task training of RST discourse parsers.
Proceedings of the COLING 2016, 2016

Improving historical spelling normalization with bi-directional LSTMs and multi-task learning.
Proceedings of the COLING 2016, 2016

Cross-lingual Transfer of Correlations between Parts of Speech and Gaze Features.
Proceedings of the COLING 2016, 2016

Deep multi-task learning with low level tasks supervised at lower layers.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Joint part-of-speech and dependency projection from multiple sources.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Text Simplification as Tree Labeling.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Extracting token-level signals of syntactic processing from fMRI - with an application to PoS induction.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Weakly Supervised Part-of-speech Tagging Using Eye-tracking Data.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
User Review Sites as a Resource for Large-Scale Sociolinguistic Studies.
Proceedings of the 24th International Conference on World Wide Web, 2015

Looking hard: Eye tracking for detecting grammaticality of automatically compressed sentences.
Proceedings of the 20th Nordic Conference of Computational Linguistics, 2015

Active learning for sense annotation.
Proceedings of the 20th Nordic Conference of Computational Linguistics, 2015

Supersense tagging for Danish.
Proceedings of the 20th Nordic Conference of Computational Linguistics, 2015

Mining for unambiguous instances to adapt part-of-speech taggers to new domains.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Simple task-specific bilingual word embeddings.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Learning to parse with IAA-weighted loss.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Any-language frame-semantic parsing.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Do dependency parsing metrics correlate with human judgments?
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Cross-lingual syntactic variation over age and gender.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Reading behavior predicts syntactic categories.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Learning finite state word representations for unsupervised Twitter adaptation of POS taggers.
Proceedings of the Workshop on Noisy User-generated Text, 2015

Challenges of studying and processing dialects in social media.
Proceedings of the Workshop on Noisy User-generated Text, 2015

Non-canonical language is not harder to annotate than canonical language.
Proceedings of The 9th Linguistic Annotation Workshop, 2015

Inverted indexing for cross-lingual NLP.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Unsupervised extractive summarization via coverage maximization with syntactic and semantic concepts.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Tagging Performance Correlates with Author Age.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

If all you have is a bit of the Bible: Learning POS taggers for truly low-resource languages.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Using Frame Semantics for Knowledge Extraction from Twitter.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Modeling Eye Movements when Reading Microblogs.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
NER in Tweets Using Bagging and a Small Crowdsourced Dataset.
Proceedings of the Advances in Natural Language Processing, 2014

More or less supervised supersense tagging of Twitter.
Proceedings of the Third Joint Conference on Lexical and Computational Semantics, 2014

Copenhagen-Malmö: Tree Approximations of Semantic Parsing Problems.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

When POS data sets don't add up: Combatting sample bias.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Crowdsourcing and annotating NER for Twitter #drift.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Importance weighting and unsupervised domain adaptation of POS taggers: a negative result.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Learning part-of-speech taggers with inter-annotator agreement loss.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

What's in a p-value in NLP?
Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014

Selection Bias, Label Bias, and Bias in Ground Truth.
Proceedings of the COLING 2014, 2014

Adapting taggers to Twitter with not-so-distant supervision.
Proceedings of the COLING 2014, 2014

Linguistically debatable or just plain wrong?
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Experiments with crowdsourced re-annotation of a POS tagging data set.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Semi-Supervised Learning and Domain Adaptation in Natural Language Processing
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02149-7, 2013

6,909 Reasons to Mess Up Your Data.
Proceedings of the 19th Nordic Conference of Computational Linguistics, 2013

Zipfian corruptions for robust POS tagging.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Estimating effect size across datasets.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Down-stream effects of tree-to-dependency conversions.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Disambiguating Explicit Discourse Connectives without Oracles.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Cross-Domain Answer Ranking using Importance Sampling.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Using Crowdsourcing to get Representations based on Regular Expressions.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

With Blinkers on: Robust Prediction of Eye Movements across Readers.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

An Empirical Study of Differences between Conversion Schemes and Annotation Guidelines.
Proceedings of the Second International Conference on Dependency Linguistics, 2013

Part-of-speech tagging with antagonistic adversaries.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Simple, readable sub-sentences.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Unsupervised dependency parsing without training.
Nat. Lang. Eng., 2012

EMNLP@CPH: Is frequency all there is to simplicity?
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

DSim, a Danish Parallel Corpus for Text Simplification.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

An Empirical Study of Non-Lexical Extensions to Delexicalized Transfer.
Proceedings of the COLING 2012, 2012

Robust Learning in Random Subspaces: Equipping NLP for OOV Effects.
Proceedings of the COLING 2012, 2012

Mining wisdom.
Proceedings of the Workshop on Computational Linguistics for Literature, 2012

2011
Keith Stenning and Michiel van Lambalgen, Human reasoning and cognitive science.
Stud Logica, 2011

An O(|G|n^6) time extension of inversion transduction grammars.
Mach. Transl., 2011

Factored Translation with Unsupervised Word Clusters.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

From ranked words to dependency trees: two-stage unsupervised non-projective dependency parsing.
Proceedings of the 2011 Workshop on Graph-based Methods for Natural Language Processing, 2011

Using graphical models for PP attachment.
Proceedings of the 18th Nordic Conference of Computational Linguistics, 2011

Sentence-Level Instance-Weighting for Graph-Based and Transition-Based Dependency Parsing.
Proceedings of the 12th International Conference on Parsing Technologies, 2011

Experiments in Newswire-to-Law Adaptation of Graph-Based Dependency Parsers.
Proceedings of the Evaluation of Natural Language and Speech Tools for Italian, 2011

Data point selection for cross-language adaptation of dependency parsers.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Semi-supervised condensed nearest neighbor for part-of-speech tagging.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
The Effect of Semi-supervised Learning on Parsing Long Distance Dependencies in German and Swedish.
Proceedings of the Advances in Natural Language Processing, 2010

Robust Semi-supervised and Ensemble-Based Methods in Word Sense Disambiguation.
Proceedings of the Advances in Natural Language Processing, 2010

Can inversion transduction grammars generate hand alignments?
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

Semi-supervised dependency parsing using generalized tri-training.
Proceedings of the COLING 2010, 2010

Simple Semi-Supervised Training of Part-Of-Speech Taggers.
Proceedings of the ACL 2010, 2010

2009
Polyadic Dynamic Logics for HPSG Parsing.
J. Log. Lang. Inf., 2009

Empirical Lower Bounds on Alignment Error Rates in Syntax-Based Machine Translation.
Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation, 2009

On the Complexity of Alignment Problems in Two Synchronous Grammar Formalisms.
Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation, 2009

Verifying context-sensitive treebanks and heuristic parses in polynomial time.
Proceedings of the 17th Nordic Conference of Computational Linguistics, 2009

A linear time extension of deterministic pushdown automata.
Proceedings of the 17th Nordic Conference of Computational Linguistics, 2009

Empirical lower bounds on translation unit error rate for the full class of inversion transduction grammars.
Proceedings of the 11th International Workshop on Parsing Technologies (IWPT-2009), 2009

Using a maximum entropy-based tagger to improve a very fast vine parser.
Proceedings of the 11th International Workshop on Parsing Technologies (IWPT-2009), 2009

2008
Learning context-sensitive synchronous rules.
Proceedings of the 12th Annual conference of the European Association for Machine Translation, 2008

Range Concatenation Grammars for Translation.
Proceedings of the COLING 2008, 2008

On the Weak Generative Capacity of Weighted Context-free Grammars.
Proceedings of the COLING 2008, 2008

2007
Dov M. Gabbay, Sergei S. Goncharov and Michael Zakharyaschev (eds.), Mathematical Problems from Applied Logic I.
Stud Logica, 2007

Patrick Blackburn and Johan Bos, Representation and Inference for Natural Language.
Stud Logica, 2007

Polynomial Charts For Totally Unordered Languages.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007

2006
Logical investigations on the adequacy of certain feature-based theories of natural language.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

2005
Functionality in grammar design.
Proceedings of the 15th Nordic Conference of Computational Linguistics, 2005

Model Generation in a Dynamic Environment.
Proceedings of the New Frontiers in Artificial Intelligence, 2005

