Benjamin Minixhofer
According to our database1,
Benjamin Minixhofer
authored at least 8 papers
between 2021 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation.
CoRR, 2024
2023
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response.
CoRR, 2022
WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crises Response.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
2021
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021