Malte Ostendorff

According to our database1, Malte Ostendorff authored at least 31 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs.
CoRR, 2024

Investigating Gender Bias in Turkish Language Models.
CoRR, 2024

Occiglot at WMT24: European Open-source Large Language Models Evaluated on Translation.
Proceedings of the Ninth Conference on Machine Translation, 2024


A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Symmetric Dot-Product Attention for Efficient Training of BERT Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Aspect-based Document Similarity for Literature Recommender Systems.
PhD thesis, 2023

Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning.
CoRR, 2023

AspectCSE: Sentence Embeddings for Aspect-Based Semantic Textual Similarity Using Contrastive Learning and Structured Knowledge.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

Integration of a Semantic Storytelling Recommender System in Speech Assistants.
Proceedings of Text2Story, 2023

2022
Identification of Relations between Text Segments for Semantic Storytelling.
Proceedings of the Third Conference on Digital Curation Technologies (Qurator 2022), 2022

Semantic Relations between Text Segments for Semantic Storytelling: Annotation Tool - Dataset - Evaluation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Claim Extraction and Law Matching for COVID-19-related Legislation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Generating Extended and Multilingual Summaries with Pre-trained Transformers.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Specialized document embeddings for aspect-based similarity of research papers.
Proceedings of the JCDL '22: The ACM/IEEE Joint Conference on Digital Libraries in 2022, Cologne, Germany, June 20, 2022

Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure Information.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
DFKI SLT at GermEval 2021: Multilingual Pre-training and Data Augmentation for the Classification of Toxicity in Social Media Comments.
Proceedings of the GermEval 2021 Shared Task on the Identification of Toxic, 2021

Evaluating document representations for content-based legal literature recommendations.
Proceedings of the ICAIL '21: Eighteenth International Conference for Artificial Intelligence and Law, São Paulo Brazil, June 21, 2021

A Qualitative Evaluation of User Preference for Link-Based vs. Text-Based Recommendations of Wikipedia Articles.
Proceedings of the Towards Open and Trustworthy Digital Societies, 2021

Ordering sentences and paragraphs with pre-trained encoder-decoder transformers and pointer ensembles.
Proceedings of the DocEng '21: ACM Symposium on Document Engineering 2021, 2021

2020
Contextual Document Similarity for Content-based Literature Recommender Systems.
CoRR, 2020

Towards Discourse Parsing-inspired Semantic Storytelling.
CoRR, 2020

Towards Discourse Parsing-inspired Semantic Storytelling.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2020), Berlin, Germany, January 20th, 2020


Named Entities in Medical Case Reports: Corpus and Experiments.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

Towards an Open Platform for Legal Information.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

Aspect-based Document Similarity for Research Papers.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

ARQMath Lab: An Incubator for Semantic Formula Search in zbMATH Open?
Proceedings of the Working Notes of CLEF 2020, 2020

2019
Enriching BERT with Knowledge Graph Embeddings for Document Classification.
Proceedings of the 15th Conference on Natural Language Processing, 2019


  Loading...