Benjamin Minixhofer

According to our database, Benjamin Minixhofer authored at least 8 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of five.

Bibliography

2024
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation.
CoRR, 2024

Zero-Shot Tokenizer Transfer.
CoRR, 2024

2023
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response.
CoRR, 2022

WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crises Response.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
