Automatic Genre Identification for Robust Enrichment of Massive Text Collections: Investigation of Classification Methods in the Era of Large Language Models.

[BibT_eX]

[DOI]

Taja Kuzman

Igor Mozetic

Nikola Ljubesic

Mach. Learn. Knowl. Extr., June, 2023

ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification.

[BibT_eX]

[DOI]

Taja Kuzman

Igor Mozetic

Nikola Ljubesic

CoRR, 2023

BENCHić-lang: A Benchmark for Discriminating between Bosnian, Croatian, Montenegrin and Serbian.

[BibT_eX]

[DOI]

Peter Rupnik

Taja Kuzman

Nikola Ljubesic

Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

Get to Know Your Parallel Data: Performing English Variety and Genre Classification over MaCoCu Corpora.

[BibT_eX]

[DOI]

Taja Kuzman

Peter Rupnik

Nikola Ljubesic

Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages.

[BibT_eX]

[DOI]

Aarón Galiano Jiménez

Jaume Zaragoza-Bernabeu

Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

2022

The GINCO Training Dataset for Web Genre Identification of Documents Out in the Wild.

[BibT_eX]

[DOI]

Taja Kuzman

Peter Rupnik

Nikola Ljubesic

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

MaCoCu: Massive collection and curation of monolingual and bilingual data: focus on under-resourced languages.

[BibT_eX]

[DOI]

Marta Bañón

Miquel Esplà-Gomis

Mikel L. Forcada

Cristian García-Romero

Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

2017

Verbal Multiword Expressions in Slovene.

[BibT_eX]

[DOI]

Polona Gantar

Simon Krek

Taja Kuzman

Proceedings of the Computational and Corpus-Based Phraseology, 2017

Taja Kuzman

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...