Georg Rehm
Orcid: 0000-0002-7800-1893Affiliations:
- DFKI GmbH, Speech and Language Technology Lab, Berlin, Germany
According to our database1,
Georg Rehm
authored at least 123 papers
between 2000 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on linkedin.com
-
on twitter.com
-
on orcid.org
-
on georg-re.hm
On csauthors.net:
Bibliography
2024
Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition.
CoRR, 2024
Toward FAIR Semantic Publishing of Research Dataset Metadata in the Open Research Knowledge Graph.
CoRR, 2024
Occiglot at WMT24: European Open-source Large Language Models Evaluated on Translation.
Proceedings of the Ninth Conference on Machine Translation, 2024
Proceedings of the Natural Scientific Language Processing and Research Knowledge Graphs, 2024
FoRC@NSLP2024: Overview and Insights from the Field of Research Classification Shared Task.
Proceedings of the Natural Scientific Language Processing and Research Knowledge Graphs, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
FoRC4CL: A Fine-grained Field of Research Classification and Annotated Dataset of NLP Articles.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Strategic Research, Innovation and Implementation Agenda for Digital Language Equality in Europe by 2030.
Proceedings of the European Language Equality, 2023
Proceedings of the European Language Equality, 2023
Proceedings of the European Language Equality, 2023
Proceedings of the European Language Equality, 2023
Proceedings of the European Language Equality, 2023
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning.
CoRR, 2023
Towards FAIR Semantic Publishing of Research Dataset Metadata in the Open Research Knowledge Graph.
Proceedings of the Joint Proceedings of the Onto4FAIR 2023 Workshops, 2023
Knowledge Storage Ecosystem: an Open Source Tool for NLP Results Management (Documents and Semantic Information).
Proceedings of the 4th Conference on Language, Data and Knowledge, 2023
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023
Proceedings of the 53. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2023, Designing Future, 2023
Proceedings of Text2Story, 2023
Proceedings of the 1st Conference on Research Data Infrastructure - Connecting Communities, 2023
2022
Proceedings of the European Language Equality, 2022
Proceedings of the European Language Equality, 2022
Lynx: A knowledge-based AI service platform for content processing, enrichment and analysis for the legal domain.
Inf. Syst., 2022
Proceedings of the Third Conference on Digital Curation Technologies (Qurator 2022), 2022
Automatic Assessment of Online Content Credibility by Measuring the Adherence to Journalistic Standards.
Proceedings of the Third Conference on Digital Curation Technologies (Qurator 2022), 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Semantic Relations between Text Segments for Semantic Storytelling: Annotation Tool - Dataset - Evaluation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), 2022
Proceedings of the JCDL '22: The ACM/IEEE Joint Conference on Digital Libraries in 2022, Cologne, Germany, June 20, 2022
Learning Ontology Classes from Text by Clustering Lexical Substitutes Derived from Language Models.
Proceedings of the Towards a Knowledge-Aware AI - SEMANTiCS 2022, 2022
Proceedings of the Towards a Knowledge-Aware AI - SEMANTiCS 2022, 2022
Plow: A Novel Approach to Interlinking Modular Ontologies Based on Software Package Management.
Proceedings of the Towards a Knowledge-Aware AI - SEMANTiCS 2022, 2022
User Experience Design for Automatic Credibility Assessment of News Content About COVID-19.
Proceedings of the HCI International 2022 - Late Breaking Papers. Interaction in New Media, Learning and Games, 2022
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2nd Workshop Reducing Online Misinformation through Credible Information Retrieval 2022 co-located with The 44th European Conference on Information Retrieval (ECIR 2022), 2022
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022
Was sehe ich? Visualisierungsstrategien für Datentransparenz in der Historischen Netzwerkanalyse.
Proceedings of the 8. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2022
HiStruct+: Improving Extractive Text Summarization with Hierarchical Structure Information.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
Combining Knowledge about Text Types and Document Structures for Enhanced Content Curation.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021
Parsing Discourse Structures for Semantic Storytelling: Evaluating an efficient RST Parser.
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021
Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), Berlin, Germany, February 8th - to, 2021
Proceedings of the 3rd Conference on Language, Data and Knowledge, 2021
DFKI SLT at GermEval 2021: Multilingual Pre-training and Data Augmentation for the Classification of Toxicity in Social Media Comments.
Proceedings of the GermEval 2021 Shared Task on the Identification of Toxic, 2021
Evaluating document representations for content-based legal literature recommendations.
Proceedings of the ICAIL '21: Eighteenth International Conference for Artificial Intelligence and Law, São Paulo Brazil, June 21, 2021
European Language Grid: A Joint Platform for the European Language Technology Community.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021
Ordering sentences and paragraphs with pre-trained encoder-decoder transformers and pointer ensembles.
Proceedings of the DocEng '21: ACM Symposium on Document Engineering 2021, 2021
2020
Proceedings of the Joint Proceedings of Workshops AI4LEGAL2020, 2020
Proceedings of the Conference on Digital Curation Technologies (Qurator 2020), Berlin, Germany, January 20th, 2020
Proceedings of the Conference on Digital Curation Technologies (Qurator 2020), Berlin, Germany, January 20th, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020
The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Towards an Interoperable Ecosystem of AI and LT Platforms: A Roadmap for the Implementation of Different Levels of Interoperability.
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Making Metadata Fit for Next Generation Language Technology Platforms: The Metadata Schema of the European Language Grid.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Abstractive Text Summarization based on Language Model Conditioning and Locality Modeling.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Pairwise Multi-Class Document Classification for Semantic Relations between Wikipedia Articles.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020
Graph Technologies for the Analysis of Historical Social Networks Using Heterogeneous Data Sources 124.
Proceedings of the Graph Technologies in the Humanities, 2020
Proceedings of the 7. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2020
Proceedings of the 28th International Conference on Computational Linguistics, 2020
2019
Proceedings of the 15th Conference on Natural Language Processing, 2019
Proceedings of the Semantic Systems. The Power of AI and Knowledge Graphs, 2019
Semantic Storytelling: Towards Identifying Storylines in Large Amounts of Text Content.
Proceedings of Text2Story, 2019
Curation Technologies for Cultural Heritage Archives: Analysing and transforming a heterogeneous data set into an interactive curation workbench.
Proceedings of the 3rd International Conference on Digital Access to Textual Cultural Heritage, 2019
2018
Automatic and Manual Web Annotations in an Infrastructure to handle Fake News and other Online Media Phenomena.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Language Technology for Multilingual Europe: An Analysis of a Large-Scale Survey regarding Challenges, Demands, Gaps and Needs.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
2017
Prague Bull. Math. Linguistics, 2017
DFKI-DKT at SemEval-2017 Task 8: Rumour Detection and Classification using Cascading Heuristics.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017
Proceedings of the Human Interface and the Management of Information: Supporting Learning, Decision-Making and Collaboration, 2017
Proceedings of the Human Interface and the Management of Information: Information, Knowledge and Interaction Design, 2017
Different German and English Coreference Resolution Models for Multi-domain Content Curation Scenarios.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017
Different Types of Automated and Semi-automated Semantic Storytelling: Curation Technologies for Different Sectors.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017
An Infrastructure for Empowering Internet Users to Handle Fake News and Other Online Media Phenomena.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017
Automatic Classification of Abusive Language and Personal Attacks in Various Forms of Online Communication.
Proceedings of the Language Technologies for the Challenges of the Digital Age, 2017
Semantic Storytelling, Cross-lingual Event Detection and other Semantic Services for a Newsroom Content Curation Dashboard.
Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, 2017
From Clickbait to Fake News Detection: An Approach based on Detecting the Stance of Headlines to Articles.
Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, 2017
CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.
Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2017
Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters.
Proceedings of the Events and Stories in the News Workshop@ACL 2017, 2017
2016
Lang. Resour. Evaluation, 2016
Fostering the Next Generation of European Language Technology: Recent Developments ― Emerging Initiatives ― Challenges and Opportunities.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
The Language Resource Life Cycle: Towards a Generic Model for Creating, Maintaining, Using and Distributing Language Resources.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer.
Proceedings of the Semantic Web - ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29, 2016
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016
Processing Document Collections to Automatically Extract Linked Data: Semantic Storytelling Technologies for Smart Curation Workflows.
Proceedings of the 2nd International Workshop on Natural Language Generation and the Semantic Web, 2016
2015
Digitale Kuratierungstechnologien: Verfahren für die effiziente Verarbeitung, Erstellung und Verteilung qualitativ hochwertiger Medieninhalte.
Proceedings of the International Conference of the German Society for Computational Linguistics and Language Technology, 2015
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015
2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
2013
The State of Computational Morphology for Europe's Languages and the META-NET Strategic Research Agenda.
Proceedings of the Systems and Frameworks for Computational Morphology, 2013
Proceedings of Machine Translation Summit XIV: European projects, 2013
MATECAT: Machine Translation Enhanced Computer Assisted Translation META - Multilingual Europe Technology Alliance.
Proceedings of Machine Translation Summit XIV: European projects, 2013
Strategic Research Agenda for Multilingual Europe 2020 - presented by the META Technology Council.
White Paper Series, Springer, ISBN: 978-3-642-36349-8, 2013
2012
White Paper Series, Springer, ISBN: 978-3-642-27166-3, 2012
2009
Lit. Linguistic Comput., 2009
Sustainability of annotated resources in linguistics: A web-platform for exploring, querying, and distributing linguistic corpora and other resources.
Lit. Linguistic Comput., 2009
2008
A Web-Platform for Preserving, Exploring, Visualising, and Querying Linguistic Corpora and other Resources.
Proces. del Leng. Natural, 2008
Digital Text Collections, Linguistic Research Data, and Mashups: Notes on the Legal Situation.
Libr. Trends, 2008
The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources.
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2008
Ontology-Based XQuery'ing of XML-Encoded Language Resources on Multiple Annotation Layers.
Proceedings of the International Conference on Language Resources and Evaluation, 2008
2006
2005
Language-Independent Text Parsing of Arbitrary HTML-Documents. Towards A Foundation For Web Genre Identification.
LDV Forum, 2005
2002
Proceedings of the 35th Hawaii International Conference on System Sciences (HICSS-35 2002), 2002
2001
Die Chronik der Chronik - Erfahrungen über die Konvertierung und Weiterverarbeitung proprietär annotierter Daten.
Proceedings of the Proceedings der GLDV-Frühjahrstagung 2001, 2001
korpus.html - Zur Sammlung, Datenbankbasierten Erfassung, Annotation und Auswertung von HTML-Dokumenten.
Proceedings of the Proceedings der GLDV-Frühjahrstagung 2001, 2001
2000
From Open Source to Open Information: Collaborative Methods in Creating XML-Based Markup Languages.
Proceedings of the Electronic Publishing 2000, Electronic Publishing in the Third Millenium: 4th ICCC/IFIP conference held at Kaliningrad/Svetlogorsk, 2000