Stefan Larson

According to our database1, Stefan Larson authored at least 17 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Document Type Classification using File Names.
CoRR, 2024

De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Generating Hard-Negative Out-of-Scope Data with ChatGPT for Intent Classification.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
On Evaluation of Document Classification using RVL-CDIP.
CoRR, 2023

ShabbyPages: A Reproducible Document Denoising and Binarization Dataset.
CoRR, 2023

Augraphy: A Data Augmentation Library for Document Images.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

On Evaluation of Document Classifiers using RVL-CDIP.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022
A Survey of Intent Classification and Slot-Filling Datasets for Task-Oriented Dialog.
CoRR, 2022

Redwood: Using Collision Detection to Grow a Large-Scale Intent Classification Dataset.
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

Evaluating Out-of-Distribution Performance on Document Image Classifiers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Exploring Out-of-Distribution Generalization in Text Classifiers Trained on Tobacco-3482 and RVL-CDIP.
Proceedings of the Document Analysis and Recognition, 2021

LSOIE: A Large-Scale Dataset for Supervised Open Information Extraction.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

2020
Data Query Language and Corpus Tools for Slot-Filling and Intent Classification Data.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Iterative Feature Mining for Constraint-Based Data Collection to Increase Data Diversity and Model Robustness.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Inconsistencies in Crowdsourced Slot-Filling Annotations: A Typology and Identification Methods.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Outlier Detection for Improved Data Quality and Diversity in Dialog Systems.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019


  Loading...