Magnus Sahlgren

Orcid: 0000-0001-5100-0535

According to our database1, Magnus Sahlgren authored at least 88 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SWEb: A Large Web Dataset for the Scandinavian Languages.
CoRR, 2024

Branch-GAN: Improving Text Generation with (not so) Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ELOQUENT CLEF Shared Tasks for Evaluation of Generative Language Model Quality.
Proceedings of the Advances in Information Retrieval, 2024

GPT-SW3: An Autoregressive Language Model for the Scandinavian Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

ELOQUENT 2024 - Robustness Task.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

Overview of ELOQUENT 2024 - Shared Tasks for Evaluating Generative Language Model Quality.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2024

2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages.
CoRR, 2023

The Nordic Pile: A 1.2TB Nordic Dataset for Language Modeling.
CoRR, 2023

Superlim: A Swedish Language Understanding Evaluation Benchmark.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Effect of self-managed lifestyle treatment on glycemic control in patients with type 2 diabetes.
npj Digit. Medicine, 2022

A comparative evaluation and analysis of three generations of Distributional Semantic Models.
Lang. Resour. Evaluation, 2022

Cross-lingual Transfer of Monolingual Models.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Lessons Learned from GPT-SW3: Building the First Large-Scale Generative Language Model for Swedish.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Cross-lingual and Multilingual CLIP.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Fine-Grained Controllable Text Generation Using Non-Residual Prompting.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
The Singleton Fallacy: Why Current Critiques of Language Models Miss the Point.
Frontiers Artif. Intell., 2021

We Need to Talk About Data: The Importance of Data Readiness in Natural Language Processing.
CoRR, 2021

A comprehensive comparative evaluation and analysis of Distributional Semantic Models.
CoRR, 2021

Federated Word2Vec: Leveraging Federated Learning to Encourage Collaborative Representation Learning.
CoRR, 2021

It's Basically the Same Language Anyway: the Case for a Nordic Language Model.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead?
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Decentralized Word2Vec Using Gossip Learning.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Semantic Re-tuning with Contrastive Tension.
Proceedings of the 9th International Conference on Learning Representations, 2021

Predicting Treatment Outcome from Patient Texts: The Case of Internet-Based Cognitive Behavioural Therapy.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
GDTM: Graph-based Dynamic Topic Models.
Prog. Artif. Intell., 2020

Why Not Simply Translate? A First Swedish Evaluation Benchmark for Semantic Similarity.
CoRR, 2020

Data Readiness for Natural Language Processing.
CoRR, 2020

SenseCluster at SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Rethinking Topic Modelling: From Document-Space to Term-Space.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

2019
Learning machines in Internet-delivered psychological treatment.
Prog. Artif. Intell., 2019

Gender Bias in Pretrained Swedish Embeddings.
Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa 2019, Turku, Finland, September 30, 2019

Enriching Word Embeddings with a Regressor Instead of Labeled Corpora.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
R-grams: Unsupervised Learning of Semantic Units in Natural Language.
CoRR, 2018

Monitoring Targeted Hate in Online Environments.
CoRR, 2018

Measuring Issue Ownership using Word Embeddings.
Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, 2018

Distributional Term Set Expansion.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Analysis of Open Answers to Survey Questions through Interactive Clustering and Theme Extraction.
Proceedings of the 2018 Conference on Human Information Interaction and Retrieval, 2018

Learning Representations for Detecting Abusive Language.
Proceedings of the 2nd Workshop on Abusive Language Online, 2018

The Smart Data Layer.
Proceedings of the 2018 AAAI Spring Symposia, 2018

Design Methods to Investigate User Experiences of Artificial Intelligence.
Proceedings of the 2018 AAAI Spring Symposia, 2018

Learning Machines.
Proceedings of the 2018 AAAI Spring Symposia, 2018

2017
Active Learning and Visual Analytics for Stance Classification with ALVA.
ACM Trans. Interact. Intell. Syst., 2017

Random indexing of multidimensional data.
Knowl. Inf. Syst., 2017

Visual Analysis of Relationships between Heterogeneous Networks and Texts: An Application on the IEEE VIS Publication Dataset.
Informatics, 2017

2016
Visual analysis of online social media to open up the investigation of stance phenomena.
Inf. Vis., 2016

Visual Analysis of Text Annotations for Stance Classification with ALVA.
Proceedings of the 18th Eurographics Conference on Visualization, 2016

Parameterized context windows in Random Indexing.
Proceedings of the 1st Workshop on Representation Learning for NLP, 2016

The Gavagai Living Lexicon.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The Effects of Data Size and Frequency Range on Distributional Semantic Models.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Active learning for detection of stance components.
Proceedings of the Workshop on Computational Modeling of People's Opinions, 2016

Unshared task: (Dis)agreement in online debates.
Proceedings of the Third Workshop on Argument Mining, 2016

2015
Encoding Sequential Information in Semantic Space Models: Comparing Holographic Reduced Representation and Random Permutation.
Comput. Intell. Neurosci., 2015

Detecting speculations, contrasts and conditionals in consumer reviews.
Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, 2015

Factorization of Latent Variables in Distributional Semantic Models.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Navigating the Semantic Horizon using Relative Neighborhood Graphs.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Evaluating Learning Language Representations.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015

2014
Visual analysis of stance markers in online social media.
Proceedings of the 9th IEEE Conference on Visual Analytics Science and Technology, 2014

STAVICTA Group Report for RepLab 2014 Reputation Dimension Task.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

2012
Usefulness of Sentiment Analysis.
Proceedings of the Advances in Information Retrieval, 2012

Profiling Reputation of Corporate Entities in Semantic Space.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

2011
Incremental dimension reduction of tensors with random index
CoRR, 2011

2010
Between Bags and Trees - Constructional Patterns in Text Used for Attitude Identification.
Proceedings of the Advances in Information Retrieval, 2010

2009
Terminology mining in social media.
Proceedings of the 18th ACM Conference on Information and Knowledge Management, 2009

2008
Buzz Monitoring in Word Space.
Proceedings of the Intelligence and Security Informatics, 2008

Filaments of Meaning in Word Space.
Proceedings of the Advances in Information Retrieval , 2008

2007
SICS: Valence annotation based on seeds in word space.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

2006
The Word-Space Model: using distributional analysis to represent syntagmatic and paradigmatic relations between words in high-dimensional vector spaces.
PhD thesis, 2006

Towards pertinent evaluation methodologies for word-space models.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Consensus and Opinions; Quality and Churn.
Proceedings of the Computational Approaches to Analyzing Weblogs, 2006

2005
Automatic bilingual lexicon acquisition using random indexing of parallel corpora.
Nat. Lang. Eng., 2005

Counting Lumps in Word Space: Density as a Measure of Corpus Homogeneity.
Proceedings of the String Processing and Information Retrieval, 2005

Creating bilingual lexica using reference wordlists for alignment of monolingual semantic vector spaces.
Proceedings of the 15th Nordic Conference of Computational Linguistics, 2005

Principled Query Processing.
Proceedings of the Working Notes for CLEF 2005 Workshop co-located with the 9th European Conference on Digital Libraries (ECDL 2005), 2005

Weighting Query Terms Based on Distributional Statistics.
Proceedings of the Accessing Multilingual Information Repositories, 2005

Dictionary-Based Amharic-French Information Retrieval.
Proceedings of the Accessing Multilingual Information Repositories, 2005

Unsupervised Evaluation of Parser Robustness.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2005

2004
Automatic Bilingual Lexicon Acquisition Using Random Indexing of Aligned Bilingual Data.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Using Bag-of-Concepts to Improve the Performance of Support Vector Machines in Text Categorization.
Proceedings of the COLING 2004, 2004

Dynamic Lexica for Query Translation.
Proceedings of the Working Notes for CLEF 2004 Workshop co-located with the 8th European Conference on Digital Libraries (ECDL 2004), 2004

Bookmarking, Thesaurus, and Cooperation in Bilingual Question Answering.
Proceedings of the Working Notes for CLEF 2004 Workshop co-located with the 8th European Conference on Digital Libraries (ECDL 2004), 2004

Cooperation, Bookmarking, and Thesaurus in Interactive Bilingual Question Answering.
Proceedings of the Multilingual Information Access for Text, 2004

2003
Selective Compound Splitting of Swedish Queries for Boolean Combinations of Truncated Terms.
Proceedings of the Comparative Evaluation of Multilingual Information Access Systems, 2003

2002
English-Japanese Cross-lingual Query Expansion Using Random Indexing of Aligned Bilingual Text Data.
Proceedings of the Third NTCIR Workshop on Research in Information Retrieval, 2002

SICS at CLEF 2002: Automatic Query Expansion using Random Indexing.
Proceedings of the Working Notes for CLEF 2002 Workshop co-located with the 6th European Conference on Digital Libraries (ECDL 2002), 2002

Automatic Query Expansion Using Random Indexing.
Proceedings of the Advances in Cross-Language Information Retrieval, 2002

2001
Using Linguistic Information to Improve the Performance of Vector-Based Semantic Analysis.
Proceedings of the 13th Nordic Conference of Computational Linguistics, 2001

Vector-Based Semantic Analysis Using Random Indexing for Cross-Lingual Query Expansion.
Proceedings of the Evaluation of Cross-Language Information Retrieval Systems, 2001

Vector-based Semantic Analysis using Random Indexing and Morphological Analysis for Cross-Lingual Information Retrieval.
Proceedings of the Working Notes for CLEF 2001 Workshop co-located with the 5th European Conference on Digital Libraries (ECDL 2001), 2001


  Loading...