Filip Klubicka

Orcid: 0000-0001-9712-6141

According to our database1, Filip Klubicka authored at least 25 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP.
CoRR, 2023

Queer In AI: A Case Study in Community-Led Participatory AI.
CoRR, 2023

Probing Taxonomic and Thematic Embeddings for Taxonomic Information.
Proceedings of the 12th Global Wordnet Conference, 2023

Idioms, Probing and Dangerous Things: Towards Structural Probing for Idiomaticity in Vector Space.
Proceedings of the 19th Workshop on Multiword Expressions, 2023


2022
Shapley Idioms: Analysing BERT Sentence Embeddings for General Idiom Token Identification.
Frontiers Artif. Intell., 2022

Probing with Noise: Unpicking the Warp and Weft of Embeddings.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

2020
Semantic Relatedness and Taxonomic Word Embeddings.
CoRR, 2020

English WordNet Random Walk Pseudo-Corpora.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Size Matters: The Impact of Training Size in Taxonomically-Enriched Word Embeddings.
Open Comput. Sci., 2019

Synthetic, yet natural: Properties of WordNet random walk corpora and the impact of rare words on embedding performance.
Proceedings of the 10th Global Wordnet Conference, 2019

2018
Quantitative fine-grained human evaluation of machine translation systems: a case study on English to Croatian.
Mach. Transl., 2018

Examining a hate speech corpus for hate speech detection and popularity prediction.
CoRR, 2018

ADAPT at SemEval-2018 Task 9: Skip-Gram Word Embeddings for Unsupervised Hypernym Discovery in Specialised Corpora.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

Is it worth it? Budget-related evaluation metrics for model selection.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017
Fine-Grained Human Evaluation of Neural Versus Phrase-Based Machine Translation.
Prague Bull. Math. Linguistics, 2017

Crawl and crowd to bring machine translation to under-resourced languages.
Lang. Resour. Evaluation, 2017

2016
Language Related Issues for Machine Translation between Closely Related South Slavic Languages.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

New Inflectional Lexicons and Training Corpora for Improved Morphosyntactic Annotation of Croatian and Serbian.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Dealing with Data Sparseness in SMT with Factured Models and Morphological Expansion: a Case Study on Croatian.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation, 2016

Collaborative Development of a Rule-Based Machine Translator between Croatian and Serbian.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation, 2016

2015
Predicting Inflectional Paradigms and Lemmata of Unknown Words for Semi-automatic Expansion of Morphological Lexicons.
Proceedings of the Recent Advances in Natural Language Processing, 2015

2014
Comparing two acquisition systems for automatically building an English-Croatian parallel corpus from multilingual websites.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

{bs, hr, sr}WaC - Web Corpora of Bosnian, Croatian and Serbian.
Proceedings of the 9th Web as Corpus Workshop, 2014


  Loading...