Ondrej Bojar

Orcid: 0000-0002-0606-0050

According to our database1, Ondrej Bojar authored at least 261 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Revamping the SLTev Tool for Evaluation of Spoken Language Translation.
Prague Bull. Math. Linguistics, June, 2024

Adversarial Testing as a Tool for Interpretability: Length-based Overfitting of Elementary Functions in Transformers.
CoRR, 2024

Preliminary WMT24 Ranking of General MT Systems and LLMs.
CoRR, 2024

Understanding the role of FFNs in driving multilingual behaviour in LLMs.
CoRR, 2024

Quality and Quantity of Machine Translation References for Automated Metrics.
CoRR, 2024

Findings of WMT2024 English-to-Low Resource Multimodal Translation Task.
Proceedings of the Ninth Conference on Machine Translation, 2024


CUNI at WMT24 General Translation Task: LLMs, (Q)LoRA, CPO and Model Merging.
Proceedings of the Ninth Conference on Machine Translation, 2024

InsBERT: Word Importance from Artificial Insertions.
Proceedings of the 24th Conference Information Technologies, 2024

On Difficulties of Attention Factorization through Shared Memory.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024

Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

GAATME: A Genetic Algorithm for Adversarial Translation Metrics Evaluation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Khan Academy Corpus: A Multilingual Corpus of Khan Academy Lectures.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Evaluating Optimal Reference Translations.
CoRR, 2023

Boosting Unsupervised Machine Translation with Pseudo-Parallel Data.
CoRR, 2023

Long-Form End-to-End Speech Translation via Latent Alignment Segmentation.
CoRR, 2023

Minuteman: Machine and Human Joining Forces in Meeting Summarization.
CoRR, 2023

Character-level NMT and language similarity.
CoRR, 2023

Negative Lexical Constraints in Neural Machine Translation.
CoRR, 2023

Turning Whisper into Real-Time Transcription System.
CoRR, 2023

Multimodal Shannon Game with Images.
CoRR, 2023

Low-Resource Machine Translation Systems for Indic Languages.
Proceedings of the Eighth Conference on Machine Translation, 2023

Findings of the 2023 Conference on Machine Translation (WMT23): LLMs Are Here but Not Quite There Yet.
Proceedings of the Eighth Conference on Machine Translation, 2023

CUNI at WMT23 General Translation Task: MT and a Genetic Algorithm.
Proceedings of the Eighth Conference on Machine Translation, 2023

Searching for Reasons of Transformers' Success: Memorization vs Generalization.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Towards Efficient Simultaneous Speech Translation: CUNI-KIT System for Simultaneous Track at IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023


Shortening of the Results of Machine Translation using Paraphrasing Dataset.
Proceedings of the 23rd Conference Information Technologies, 2023

Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Team Iterate @ AutoMin 2023 - Experiments with Iterative Minuting.
Proceedings of the 16th International Natural Language Generation Conference, 2023

Overview of the Second Shared Task on Automatic Minuting (AutoMin) at INLG 2023.
Proceedings of the 16th International Natural Language Generation Conference, 2023

Unveiling Multilinguality in Transformer Models: Exploring Language Specificity in Feed-Forward Networks.
Proceedings of the 6th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, 2023

HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Robustness of Multi-Source MT to Transcription Errors.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Breeding Machine Translations: Evolutionary approach to survive and thrive in the world of automated evaluation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Assessing Word Importance Using Models Trained for Semantic Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
CUNI Submission in WMT22 General Task.
CoRR, 2022

Simultaneous Translation for Unsegmented Input: A Sliding Window Approach.
CoRR, 2022

CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022.
CoRR, 2022

Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models.
CoRR, 2022

EMMT: A simultaneous eye-tracking, 4-electrode EEG and audio corpus for multi-modal reading and translation scenarios.
CoRR, 2022

Short-Term Word-Learning in a Dynamically Changing Environment.
CoRR, 2022

Comprehension of Subtitles from Re-Translating Simultaneous Speech Translation.
CoRR, 2022

The Reality of Multi-Lingual Machine Translation.
CoRR, 2022

Automated Evaluation Metric for Terminology Consistency in MT.
Proceedings of the Seventh Conference on Machine Translation, 2022


CUNI-Bergamot Submission at WMT22 General Translation Task.
Proceedings of the Seventh Conference on Machine Translation, 2022

Continuous Rating as Reliable Human Evaluation of Simultaneous Speech Translation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Automatic Minuting: A Pipeline Method for Generating Minutes from Multi-Party Meeting Proceedings.
Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation, 2022

ALIGNMEET: A Comprehensive Tool for Meeting Annotation, Alignment, and Evaluation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

ELITR Minuting Corpus: A Novel Dataset for Automatic Minuting from Multi-Party Meetings in English and Czech.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022


Genre Transfer in NMT: Creating Synthetic Spoken Parallel Sentences using Written Parallel Data.
Proceedings of the 19th International Conference on Natural Language Processing, 2022

Sentence Ambiguity, Grammaticality and Complexity Probes.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

Overview of the 9th Workshop on Asian Translation.
Proceedings of the 9th Workshop on Asian Translation, 2022

2021
Report on the SIGDial 2021 special session on summarization of dialogues and multi-party meetings (SummDial).
SIGIR Forum, 2021

CUNI Systems for WMT21: Terminology Translation Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

A Fine-Grained Analysis of BERTScore.
Proceedings of the Sixth Conference on Machine Translation, 2021

CUNI Systems in WMT21: Revisiting Backtranslation Techniques for English-Czech NMT.
Proceedings of the Sixth Conference on Machine Translation, 2021

Results of the WMT21 Metrics Shared Task: Evaluating Metrics with Expert-based Human Evaluations on TED and News Domain.
Proceedings of the Sixth Conference on Machine Translation, 2021


ParCzech 3.0: A Large Czech Speech Corpus with Rich Metadata.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

An Empirical Performance Analysis of State-of-the-Art Summarization Models for Automatic Minuting.
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021

Backtranslation Feedback Improves User Confidence in MT, Not Quality.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Operating a Complex SLT System with Speakers and Human Interpreters.
Proceedings of the 1st Workshop on Automatic Spoken Language Translation in Real-World Settings, 2021


Coarse-To-Fine And Cross-Lingual ASR Transfer.
Proceedings of the 21st Conference Information Technologies, 2021

Machine Translation of Covid-19 Information Resources via Multilingual Transfer.
Proceedings of the 21st Conference Information Technologies, 2021

When a Robot Writes a Play: Automatically Generating a Theatre Play Script.
Proceedings of the 2021 Conference on Artificial Life, 2021

Lost in Interpreting: Speech Translation from Source or Interpreter?
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Constrained Decoding for Technical Term Retention in English-Hindi MT.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

Bengali Visual Genome: A Multimodal Dataset for Machine Translation and Image Captioning.
Proceedings of the Intelligent Data Engineering and Analytics, 2021

Explainable Quality Estimation: CUNI Eval4NLP Submission.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Neural Machine Translation Quality and Post-Editing Performance.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Sequence Length is a Domain: Length-based Overfitting in Transformer Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021


ELITR Multilingual Live Subtitling: Demo and Strategy.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

SLTEV: Comprehensive Evaluation of Spoken Language Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

NLPHut's Participation at WAT2021.
Proceedings of the 8th Workshop on Asian Translation, 2021


End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Inferring Highly-dense Representations for Clustering Broadcast Media Content.
Prague Bull. Math. Linguistics, 2020

Announcing CzEng 2.0 Parallel Corpus with over 2 Gigawords.
CoRR, 2020

Automating Text Naturalness Evaluation of NLG Systems.
CoRR, 2020

Human or Machine: Automating Human Likeliness Evaluation of NLG Texts.
CoRR, 2020

WMT20 Document-Level Markable Error Exploration.
Proceedings of the Fifth Conference on Machine Translation, 2020

Results of the WMT20 Metrics Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

CUNI Systems for the Unsupervised and Very Low Resource Translation Task in WMT20.
Proceedings of the Fifth Conference on Machine Translation, 2020


Costra 1.1: An Inquiry into Geometric Properties of Sentence Spaces.
Proceedings of the Text, Speech, and Dialogue, 2020

How Many Pages?: Paper Length Prediction from the Metadata.
Proceedings of the NLPIR 2020: 4th International Conference on Natural Language Processing and Information Retrieval, 2020

Outbound Translation User Interface Ptakopet: A Pilot Study.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Large Corpus of Czech Parliament Plenary Hearings.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Removing European Language Barriers with Innovative Machine Translation Technology.
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020

Two Huge Title and Keyword Generation Corpora of Research Articles.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

COSTRA 1.0: A Dataset of Complex Sentence Transformations.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

CUNI Neural ASR with Phoneme-Level Intermediate Step for~Non-Native~SLT at IWSLT 2020.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

ELITR Non-Native Speech Translation at IWSLT 2020.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020


Presenting Simultaneous Translation in Limited Space.
Proceedings of the 20th Conference Information Technologies, 2020


Efficiently Reusing Old Models Across Languages via Transfer Learning.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

ELITR: European Live Translator.
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, 2020

ODIANLP's Participation in WAT2020.
Proceedings of the 7th Workshop on Asian Translation, 2020

Overview of the 7th Workshop on Asian Translation.
Proceedings of the 7th Workshop on Asian Translation, 2020

Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

2019
Replacing Linguists with Dummies: A Serious Need for Trivial Baselines in Multi-Task Neural Machine Translation.
Prague Bull. Math. Linguistics, 2019

Representation of sentence meaning (A JNLE Special Issue).
Nat. Lang. Eng., 2019

Promoting the Knowledge of Source Syntax in Transformer NMT Is Not Needed.
Computación y Sistemas, 2019

Hindi Visual Genome: A Dataset for Multi-Modal English to Hindi Machine Translation.
Computación y Sistemas, 2019

Transfer Learning across Languages from Someone Else's NMT Model.
CoRR, 2019

Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine Translation.
CoRR, 2019

SAO WMT19 Test Suite: Machine Translation of Audit Reports.
Proceedings of the Fourth Conference on Machine Translation, 2019

A Test Suite and Manual Evaluation of Document-Level NMT at WMT19.
Proceedings of the Fourth Conference on Machine Translation, 2019

English-Czech Systems in WMT19: Document-Level Transformer.
Proceedings of the Fourth Conference on Machine Translation, 2019

Results of the WMT19 Metrics Shared Task: Segment-Level and Strong MT Systems Pose Big Challenges.
Proceedings of the Fourth Conference on Machine Translation, 2019

CUNI Systems for the Unsupervised News Translation Task in WMT 2019.
Proceedings of the Fourth Conference on Machine Translation, 2019

CUNI Submission for Low-Resource Languages in WMT News 2019.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the 2019 Conference on Machine Translation (WMT19).
Proceedings of the Fourth Conference on Machine Translation, 2019

A Speech Test Set of Practice Business Presentations with Additional Relevant Texts.
Proceedings of the Statistical Language and Speech Processing, 2019

Keyphrase Generation: A Text Summarization Struggle.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Towards Automatic Minuting of the Meetings.
Proceedings of the 19th Conference Information Technologies, 2019

In Search for Linear Relations in Sentence Embedding Spaces.
Proceedings of the 19th Conference Information Technologies, 2019

Efficiency Metrics for Data-Driven Models: A Text Summarization Case Study.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

Sentiment Analysis of Czech Texts: An Algorithmic Survey.
Proceedings of the 11th International Conference on Agents and Artificial Intelligence, 2019

Keyphrase Generation: A Multi-Aspect Survey.
Proceedings of the 25th Conference of Open Innovations Association, 2019

Idiap NMT System for WAT 2019 Multimodal Translation Task.
Proceedings of the 6th Workshop on Asian Translation, 2019

Overview of the 6th Workshop on Asian Translation.
Proceedings of the 6th Workshop on Asian Translation, 2019

Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Training Tips for the Transformer Model.
Prague Bull. Math. Linguistics, 2018

Results of the WMT18 Metrics Shared Task: Both characters and embeddings achieve good performance.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

CUNI Submissions in WMT18.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Trivial Transfer Learning for Low-Resource Neural Machine Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

Testsuite on Czech-English Grammatical Contrasts.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

The WMT'18 Morpheval test suites for English-Czech, English-German, English-Finnish and Turkish-English.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

EvalD Reference-Less Discourse Evaluation for WMT18.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Findings of the 2018 Conference on Machine Translation (WMT18).
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Morphological and Language-Agnostic Word Segmentation for NMT.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

CUNI Basque-to-English Submission in IWSLT18.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

Translating Short Segments with NMT: A Case Study in English-to-Hindi.
Proceedings of the 21st Annual Conference of the European Association for Machine Translation, 2018

Neural Monkey: The Current State and Beyond.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

CUNI NMT System for WAT 2018 Translation Tasks.
Proceedings of the 32nd Pacific Asia Conference on Language, 2018

Are BLEU and Meaning Representation in Opposition?
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Visualizing Neural Machine Translation Attention and Confidence.
Prague Bull. Math. Linguistics, 2017

Extracting Parallel Paragraphs from Common Crawl.
Prague Bull. Math. Linguistics, 2017

LanideNN: Multilingual Language Identification on Character Window.
CoRR, 2017

CUNI System for WMT17 Automatic Post-Editing Task.
Proceedings of the Second Conference on Machine Translation, 2017

CUNI submission in WMT17: Chimera goes neural.
Proceedings of the Second Conference on Machine Translation, 2017

The QT21 Combined Machine Translation System for English to Latvian.
Proceedings of the Second Conference on Machine Translation, 2017

CUNI Experiments for WMT17 Metrics Task.
Proceedings of the Second Conference on Machine Translation, 2017

Findings of the WMT 2017 Biomedical Translation Shared Task.
Proceedings of the Second Conference on Machine Translation, 2017

Results of the WMT17 Neural MT Training Task.
Proceedings of the Second Conference on Machine Translation, 2017

Results of the WMT17 Metrics Shared Task.
Proceedings of the Second Conference on Machine Translation, 2017

Findings of the 2017 Conference on Machine Translation (WMT17).
Proceedings of the Second Conference on Machine Translation, 2017

Variable Mini-Batch Sizing and Pre-Trained Embeddings.
Proceedings of the Second Conference on Machine Translation, 2017

Curriculum Learning and Minibatch Bucketing in Neural Machine Translation.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

Paying Attention to Multi-Word Expressions in Neural Machine Translation.
Proceedings of Machine Translation Summit XVI, Volume 1: Research Track, 2017

An Exploration of Word Embedding Initialization in Deep-Learning Tasks.
Proceedings of the 14th International Conference on Natural Language Processing, 2017

Producing Unseen Morphological Variants in Statistical Machine Translation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

LanideNN: Multilingual Language Identification on Text Stream.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

CUNI NMT System for WAT 2017 Translation Tasks.
Proceedings of the 4th Workshop on Asian Translation, 2017

2016
Enriching Source for English-to-Urdu Machine Translation.
Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Edinburgh's Statistical Machine Translation Systems for WMT16.
Proceedings of the First Conference on Machine Translation, 2016

CUNI-LMU Submissions in WMT2016: Chimera Constrained and Beaten.
Proceedings of the First Conference on Machine Translation, 2016

Dictionary-based Domain Adaptation of MT Systems without Retraining.
Proceedings of the First Conference on Machine Translation, 2016


CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks.
Proceedings of the First Conference on Machine Translation, 2016

Using Term Position Similarity and Language Modeling for Bilingual Document Alignment.
Proceedings of the First Conference on Machine Translation, 2016

Particle Swarm Optimization Submission for WMT16 Tuning Task.
Proceedings of the First Conference on Machine Translation, 2016

Results of the WMT16 Tuning Shared Task.
Proceedings of the First Conference on Machine Translation, 2016

Results of the WMT16 Metrics Shared Task.
Proceedings of the First Conference on Machine Translation, 2016


Bilingual Embeddings and Word Alignments for Translation Quality Estimation.
Proceedings of the First Conference on Machine Translation, 2016

SubGram: Extending Skip-Gram Word Representation with Substrings.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

CzEng 1.6: Enlarged Czech-English Parallel Corpus with Processing Tools Dockered.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

UFAL Submissions to the IWSLT 2016 MT Track.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Verb sense disambiguation in Machine Translation.
Proceedings of the Sixth Workshop on Hybrid Approaches to Translation, 2016

HUME: Human UCCA-Based Evaluation of Machine Translation.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

TectoMT - a deep linguistic core of the combined Cimera MT system.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016

Pivoting Methods and Data for Czech-Vietnamese Translation via English.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation, 2016

Target-Side Context for Discriminative Models in Statistical Machine Translation.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Moses & Treex Hybrid MT Systems Bestiary.
Proceedings of the 2nd Deep Machine Translation Workshop, 2016

2015
Evaluating Machine Translation Quality Using Short Segments Annotations.
Prague Bull. Math. Linguistics, 2015

TmTriangulate: A Tool for Phrase Table Triangulation.
Prague Bull. Math. Linguistics, 2015

Resources for Indonesian Sentiment Analysis.
Prague Bull. Math. Linguistics, 2015

Results of the WMT15 Metrics Shared Task.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Results of the WMT15 Tuning Shared Task.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

CUNI in WMT15: Chimera Strikes Again.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Findings of the 2015 Workshop on Statistical Machine Translation.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

TeamUFAL: WSD+EL as Document Retrieval.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Giving a Sense: A Pilot Study in Concept Annotation from Multiple Resources.
Proceedings of the Proceedings ITAT 2015: Information Technologies, 2015

What a Transfer-Based System Brings to the Combination with PBMT.
Proceedings of the Fourth Workshop on Hybrid Approaches to Translation, 2015

2014
Czech Machine Translation in the project CzechMATE.
Prague Bull. Math. Linguistics, 2014

English to Urdu Statistical Machine Translation: Establishing a Baseline.
Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing, 2014

CUNI in WMT14: Chimera Still Awaits Bellerophon.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Results of the WMT14 Metrics Shared Task.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Findings of the 2014 Workshop on Statistical Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Tagged Corpus and a Tagger for Urdu.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Two-Step Machine Translation with Lattices.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Japonsko-český strojový překlad.
Proceedings of the main track of the 14th Conference on Information Technologies - Applications and Theory (ITAT 2014), with selected papers from Znalosti 2014 collocated with Znalosti 2014, Demanovska Dolina - Jasna, Slovakia, September 25, 2014

Comparing Czech and English AMRs.
Proceedings of Workshop on Lexical and Grammatical Resources for Language Processing, 2014

2013
The Design of Eman, an Experiment Manager.
Prague Bull. Math. Linguistics, 2013

Results of the WMT13 Metrics Shared Task.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

PhraseFix: Statistical Post-Editing of TectoMT.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Chimera - Three Heads for English-to-Czech Translation.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Findings of the 2013 Workshop on Statistical Machine Translation.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Scratching the Surface of Possible Translations.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

No Free Lunch in Factored Phrase-Based Machine Translation.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

2012
TrTok: A Fast and Trainable Tokenizer for Natural Languages.
Prague Bull. Math. Linguistics, 2012

Tagger Voting for Urdu.
Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing, 2012

Selecting Data for English-to-Czech Machine Translation.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

TerrorCat: a Translation Error Categorization-based MT Quality Metric.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Probes in a Taxonomy of Factored Phrase-Based Models.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Towards a Predicate-Argument Evaluation for MT.
Proceedings of the Sixth Workshop on Syntax, 2012

Announcing Prague Czech-English Dependency Treebank 2.0.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Terra: a Collection of Translation Error-Annotated Corpora.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The Joy of Parallelism with CzEng 1.0.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Automatic MT Error Analysis: Hjerson Helping Addicter.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Improving SMT by Using Parallel Data of a Closely Related Language.
Proceedings of the Human Language Technologies - The Baltic Perspective, 2012

2011
Addicter: What Is Wrong with My Translations?
Prague Bull. Math. Linguistics, 2011

eppex: Epochal Phrase Table Extraction for Statistical Machine Translation.
Prague Bull. Math. Linguistics, 2011

Analyzing Error Types in English-Czech Machine Translation.
Prague Bull. Math. Linguistics, 2011

Quiz-Based Evaluation of Machine Translation.
Prague Bull. Math. Linguistics, 2011

Two-step translation with grammatical post-processing.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Approximating a Deep-Syntactic Metric for MT Evaluation and Tuning.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Improving Translation Model by Monolingual Data.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

A Grain of Salt for the WMT Manual Evaluation.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Automatic Translation Error Analysis.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Named entities from Wikipedia for machine translation.
Proceedings of the Conference on Theory and Practice of Information Technologies, 2011

2010
2010 Failures in English-Czech Phrase-Based MT.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Building a Bilingual ValLex Using Treebank Token Alignment: First Observations.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Data Issues in English-to-Hindi Machine Translation.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Automatic source code reduction.
Proceedings of the Conference on Theory and Practice of Information Technologies, 2010

Tackling Sparse Data Issue in Machine Translation Evaluation.
Proceedings of the ACL 2010, 2010

2009
Evaluation of Machine Translation Metrics for Czech as the Target Language.
Prague Bull. Math. Linguistics, 2009

CzEng 0.9: Large Parallel Treebank with Rich Annotation.
Prague Bull. Math. Linguistics, 2009

English-Czech MT in 2008.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Statistical Machine Translation Between Related and Unrelated Languages.
Proceedings of the Conference on Theory and Practice of Information Technologies, 2009

2008
Towards English-to-Czech MT via Tectogrammatical Layer.
Prague Bull. Math. Linguistics, 2008

Phrase-Based and Deep Syntactic English-to-Czech Statistical Machine Translation.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

CzEng 0.7: Parallel Corpus with Community-Supplied Translations.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007
English-to-Czech Factored Machine Translation.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007


2006
CzEng: Czech-English Parallel Corpus release version 0.5.
Prague Bull. Math. Linguistics, 2006

Czech Verbs of Communication and the Extraction of Their Frames.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Czech-English Word Alignment.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Czech-English Phrase-Based Machine Translation.
Proceedings of the Advances in Natural Language Processing, 2006

2005
VALEVAL: Testing Vallex Consistency and Experimenting with Word-Frame Disambiguation.
Prague Bull. Math. Linguistics, 2005

Valency Lexicon of Czech Verbs VALLEX: Recent Experiments with Frame Disambiguation.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

Problems of Reusing an Existing MT System.
Proceedings of the Natural Language Processing - IJCNLP 2005, Second International Joint Conference, Jeju Island, Republic of Korea, October 11-13, 2005, 2005

2004
Czech Syntactic Analysis Constraint-based - XDG: One Possible Start.
Prague Bull. Math. Linguistics, 2004

Problems of Inducing Large Coverage Constraint-Based Dependency Grammar for Czech.
Proceedings of the Constraint Solving and Language Processing, 2004

2003
Towards Automatic Extraction of Verb Frames.
Prague Bull. Math. Linguistics, 2003


  Loading...