Adrien Barbaresi
Orcid: 0000-0002-8079-8694
According to our database1,
Adrien Barbaresi
authored at least 25 papers
between 2011 and 2021.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2021
Trafilatura: A Web Scraping Library and Command-Line Tool for Text Discovery and Extraction.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
J. Open Source Softw., 2020
Bien choisir son outil d'extraction de contenu à partir du Web (Choosing the appropriate tool for Web Content Extraction ).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020
Que recèlent les données textuelles issues du web ? (What do text data from the Web have to hide ?).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020
Out-of-the-Box and into the Ditch? Multilingual Evaluation of Generic Text Extraction Tools.
Proceedings of the 12th Web as Corpus Workshop, 2020
2019
Proceedings of the 15th Conference on Natural Language Processing, 2019
2018
Computationally efficient discrimination between language varieties with large feature vectors and regularized classifiers.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Proceedings of the GI-Workshop: Im Spannungsfeld zwischen Tool-Building und Forschung auf Augenhöhe, 2018
2017
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017
Proceedings of the Text, Speech, and Dialogue - 20th International Conference, 2017
Proceedings of the 11th Workshop on Geographic Information Retrieval, 2017
Proceedings of the 12th Annual International Conference of the Alliance of Digital Humanities Organizations, 2017
2016
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016
Proceedings of the 13th Conference on Natural Language Processing, 2016
Proceedings of the 11th Annual International Conference of the Alliance of Digital Humanities Organizations, 2016
Proceedings of the 11th Annual International Conference of the Alliance of Digital Humanities Organizations, 2016
Proceedings of the 3. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2016
Proceedings of the 10th Web as Corpus Workshop, 2016
2015
Ad hoc and general-purpose corpus construction from web sources. (Construction de corpus généraux et spécialisés à partir du Web).
PhD thesis, 2015
2014
Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources.
Proceedings of the 9th Web as Corpus Workshop, 2014
2013
Crawling microblogging services to gather language-classified URLs. Workflow and case study.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
2011
Proceedings of the Actes de la 18e conférence sur le Traitement Automatique des Langues Naturelles. REncontres jeunes Chercheurs en Informatique pour le Traitement Automatique des Langues (articles courts), 2011