Vilém Zouhar

Orcid: 0000-0001-9874-2069

According to our database1, Vilém Zouhar authored at least 41 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Preliminary WMT24 Ranking of General MT Systems and LLMs.
CoRR, 2024

AI-Assisted Human Evaluation of Machine Translation.
CoRR, 2024

Interactive Analysis of LLMs using Meaningful Counterfactuals.
CoRR, 2024

Scaling the Authoring of AutoTutors with Large Language Models.
CoRR, 2024

Stolen Subwords: Importance of Vocabularies for Machine Translation Model Stealing.
CoRR, 2024

Quality and Quantity of Machine Translation References for Automated Metrics.
CoRR, 2024

Pitfalls and Outlooks in Using COMET.
Proceedings of the Ninth Conference on Machine Translation, 2024

Error Span Annotation: A Balanced Approach for Human Evaluation of Machine Translation.
Proceedings of the Ninth Conference on Machine Translation, 2024


AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails.
Proceedings of the Eleventh ACM Conference on Learning @ Scale, 2024

Distributional Properties of Subword Regularization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Two Counterexamples to Tokenization and the Noiseless Channel.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

RELIC: Investigating Large Language Model Responses using Self-Consistency.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

Fine-Tuned Machine Translation Metrics Struggle in Unseen Domains.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

How to Engage your Readers? Generating Guiding Questions to Promote Active Reading.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Evaluating Optimal Reference Translations.
CoRR, 2023

A Decade of Scholarly Research on Open Knowledge Graphs.
CoRR, 2023

Re-visiting Automated Topic Model Evaluation with Large Language Models.
CoRR, 2023

PWESuite: Phonetic Word Embeddings and Tasks They Facilitate.
CoRR, 2023

Multimodal Shannon Game with Images.
CoRR, 2023

Findings of the WMT 2023 Shared Task on Machine Translation with Terminologies.
Proceedings of the Eighth Conference on Machine Translation, 2023

Revisiting Automated Topic Model Evaluation with Large Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Textbooks with Visuals from the Web for Improved Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Diachronic Perspective on User Trust in AI under Uncertainty.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

A Formal Perspective on Byte-Pair Encoding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Tokenization and the Noiseless Channel.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Fusing Sentence Embeddings Into LSTM-based Autoregressive Language Models.
CoRR, 2022

Knowledge Base Index Compression via Dimensionality and Precision Reduction.
CoRR, 2022

EMMT: A simultaneous eye-tracking, 4-electrode EEG and audio corpus for multi-modal reading and translation scenarios.
CoRR, 2022

Artefact Retrieval: Overview of NLP Models with Knowledge Base Access.
CoRR, 2022

Sentence Ambiguity, Grammaticality and Complexity Probes.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

2021
Leveraging Neural Machine Translation for Word Alignment.
Prague Bull. Math. Linguistics, 2021

Backtranslation Feedback Improves User Confidence in MT, Not Quality.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Sampling and Filtering of Neural Machine Translation Distillation Data.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 2021

Neural Machine Translation Quality and Post-Editing Performance.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Extending Ptakopět for Machine Translation User Interaction Experiments.
Prague Bull. Math. Linguistics, 2020

WMT20 Document-Level Markable Error Exploration.
Proceedings of the Fifth Conference on Machine Translation, 2020

Outbound Translation User Interface Ptakopet: A Pilot Study.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


  Loading...