Mark Fishel

Orcid: 0000-0003-1932-2600

Affiliations:
  • University of Tartu, Estonia


According to our database1, Mark Fishel authored at least 82 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
LLMs for Extremely Low-Resource Finno-Ugric Languages.
CoRR, 2024

Preliminary WMT24 Ranking of General MT Systems and LLMs.
CoRR, 2024

Autocorrect for Estonian texts: final report from project EKTB25.
CoRR, 2024


Adapting Audiovisual Speech Synthesis to Estonian.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

To Err Is Human, but Llamas Can Learn It Too.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

SMUGRI-MT - Machine Translation System for Low-Resource Finno-Ugric Languages.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), 2024

Estonian-Centric Machine Translation: Data, Models, and Challenges.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

No Error Left Behind: Multilingual Grammatical Error Correction with Pre-trained Translation Models.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Multilinguality or Back-translation? A Case Study with Estonian.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Findings of the 2023 Conference on Machine Translation (WMT23): LLMs Are Here but Not Quite There Yet.
Proceedings of the Eighth Conference on Machine Translation, 2023

True Detective: A Deep Abductive Reasoning Benchmark Undoable for GPT-3 and Challenging for GPT-4.
Proceedings of the The 12th Joint Conference on Lexical and Computational Semantics, 2023

Machine Translation for Low-resource Finno-Ugric Languages.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Neural Text-to-Speech Synthesis for Võro.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Automatic Transcription for Estonian Children's Speech.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Translated Benchmarks Can Be Misleading: the Case of Estonian Question Answering.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Distilling Estonian Text Domains for Production-Oriented Machine Translation.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

2022
True Detective: A Challenging Benchmark for Deep Abductive Reasoning in Foundation Models.
CoRR, 2022

Open and Competitive Multilingual Neural Machine Translation in Production.
Balt. J. Mod. Comput., 2022

Cross-lingual Transfer from Large Multilingual Translation Models to Unseen Under-resourced Languages.
Balt. J. Mod. Comput., 2022

Estonian Text-to-Speech Synthesis with Non-autoregressive Transformers.
Balt. J. Mod. Comput., 2022

Estonian Language Understanding: a Case Study on the COPA Task.
Balt. J. Mod. Comput., 2022

Similarity of Sentence Representations in Multilingual LMs: Resolving Conflicting Literature and a Case Study of Baltic Languages.
Balt. J. Mod. Comput., 2022


Cross-lingual Similarity of Multilingual Representations Revisited.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

National Language Technology Platform (NLTP): overall view.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

MTee: Open Machine Translation Platform for Estonian Government.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Machine Translation for Livonian: Catering to 20 Speakers.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

2021
Establishing Interlingua in Multilingual Language Models.
CoRR, 2021

Direct Exploitation of Attention Weights for Translation Quality Estimation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Translation Transformers Rediscover Inherent Data Domains.
Proceedings of the Sixth Conference on Machine Translation, 2021

Extremely low-resource machine translation for closely related languages.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

2020
Unsupervised Quality Estimation for Neural Machine Translation.
Trans. Assoc. Comput. Linguistics, 2020

Neural Speech Synthesis for Estonian.
CoRR, 2020

BERGAMOT-LATTE Submissions for the WMT20 Quality Estimation Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

2019
Multimodal Sequential Fashion Attribute Prediction.
Inf., 2019

Grammatical Error Correction and Style Transfer via Zero-shot Monolingual Translation.
CoRR, 2019

Quality Estimation and Translation Metrics via Pre-trained Word and Sentence Embeddings.
Proceedings of the Fourth Conference on Machine Translation, 2019

University of Tartu's Multilingual Multi-domain WMT19 News Translation Shared Task Submission.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the WMT 2019 Shared Tasks on Quality Estimation.
Proceedings of the Fourth Conference on Machine Translation, 2019

Findings of the 2019 Conference on Machine Translation (WMT19).
Proceedings of the Fourth Conference on Machine Translation, 2019

2018
Integrating Vision and Language for First-Impression Personality Analysis.
IEEE Multim., 2018

Monolingual and Cross-lingual Zero-shot Style Transfer.
CoRR, 2018

Doubly Attentive Transformer Machine Translation.
CoRR, 2018

Quality Estimation with Force-Decoded Attention and Cross-lingual Embeddings.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Phrase-based Unsupervised Machine Translation with Compositional Phrase Embeddings.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Findings of the 2018 Conference on Machine Translation (WMT18).
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Low-Resource Translation Quality Estimation for Estonian.
Proceedings of the Human Language Technologies - The Baltic Perspective, 2018

Multi-Domain Neural Machine Translation.
Proceedings of the 21st Annual Conference of the European Association for Machine Translation, 2018

2017
Open-Source Neural Machine Translation API Server.
Prague Bull. Math. Linguistics, 2017

Visualizing Neural Machine Translation Attention and Confidence.
Prague Bull. Math. Linguistics, 2017

bleu2vec: the Painfully Familiar Metric on Continuous Vector Space Steroids.
Proceedings of the Second Conference on Machine Translation, 2017

C-3MA: Tartu-Riga-Zurich Translation Systems for WMT17.
Proceedings of the Second Conference on Machine Translation, 2017

Confidence through Attention.
Proceedings of Machine Translation Summit XVI, Volume 1: Research Track, 2017

2016
Whether to English More Alike Reordered Suits Estonian Input for Statistical Machine Translation Better?
Proceedings of the Human Language Technologies - The Baltic Perspective, 2016

MODERN: modelling discourse entities and relations for coherent machine translation.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016

2015
Detecting Document-level Context Triggers to Resolve Translation Ambiguity.
Proceedings of the Second Workshop on Discourse in Machine Translation, 2015

Leveraging Compounds to Improve Noun Phrase Translation from Chinese and German.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Machine Translation for Subtitling: A Large-Scale Evaluation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Enforcing Consistent Translation of German Compound Coreferences.
Proceedings of the 12th Edition of the Konvens Conference, 2014

Handling technical OOVs in SMT.
Proceedings of the 17th Annual conference of the European Association for Machine Translation, 2014

2013
Ranking Translations using Error Analysis and Quality Estimation.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Combining Statistical Machine Translation and Translation Memories with Domain Adaptation.
Proceedings of the 19th Nordic Conference of Computational Linguistics, 2013

Statistical Machine Translation for Automobile Marketing Texts.
Proceedings of Machine Translation Summit XIV: Posters, 2013

Assessing post-editing efficiency in a realistic translation environment.
Proceedings of the 2nd Workshop on Post-editing Technology and Practice, 2013

SUMAT: An Online Service for Subtitling by Machine Translation.
Proceedings of Machine Translation Summit XIV: European projects, 2013

2012
TerrorCat: a Translation Error Categorization-based MT Quality Metric.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Terra: a Collection of Translation Error-Annotated Corpora.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Automatic MT Error Analysis: Hjerson Helping Addicter.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

In-domain Data FTW.
Proceedings of the Human Language Technologies - The Baltic Perspective, 2012

From Subtitles to Parallel Corpora.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012

2011
Addicter: What Is Wrong with My Translations?
Prague Bull. Math. Linguistics, 2011

Automatic Translation Error Analysis.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

2010
CorporAl: a Method and Tool for Handling Overlapping Parallel Corpora.
Prague Bull. Math. Linguistics, 2010

Simpler Is Better: Re-evaluation of Default Word Alignment Models in Statistical MT.
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, 2010

Linguistically Motivated Unsupervised Segmentation for Machine Translation.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009
Voting and Stacking in Data-Driven Dependency Parsing.
Proceedings of the 17th Nordic Conference of Computational Linguistics, 2009

2008
Experiments on Processing Overlapping Parallel Corpora.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Mixing and Blending Syntactic and Semantic Dependencies.
Proceedings of the Twelfth Conference on Computational Natural Language Learning, 2008

2007
Estonian-English Statistical Machine Translation: the First Results.
Proceedings of the 16th Nordic Conference of Computational Linguistics, 2007


  Loading...