Mauro Cettolo

According to our database1, Mauro Cettolo authored at least 104 papers between 1993 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
CoRR, 2024

MOSEL: 950, 000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Evaluating Automatic Subtitling: Correlating Post-editing Effort and Automatic Metrics.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Direct Speech Translation for Automatic Subtitling.
Trans. Assoc. Comput. Linguistics, 2023


Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

No Pitch Left Behind: Addressing Gender Unbalance In Automatic Speech Recognition Through Pitch Manipulation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Evaluating Subtitle Segmentation for End-to-end Generation Systems.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Towards a methodology for evaluating automatic subtitling.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Post-editing in Automatic Subtitling: A Subtitlers' perspective.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Extending the MuST-C Corpus for a Comparative Evaluation of Speech Translation Technology.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

2021
Lexical Modeling of ASR Errors for Robust Speech Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation.
Proceedings of the 4th International Conference on Natural Language and Speech Processing, 2021

CTC-based Compression for Direct Speech Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

On the Development of Customized Neural Machine Translation Models.
Proceedings of the Eighth Italian Conference on Computational Linguistics, 2021

Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Contextualized Translation of Automatically Segmented Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018
Neural versus phrase-based MT quality: An in-depth analysis on English-German and English-French.
Comput. Speech Lang., 2018

The IWSLT 2018 Evaluation Campaign.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

Machine Translation Human Evaluation: an investigation of evaluation based on Post-Editing and its relation with Direct Assessment.
Proceedings of the 15th International Conference on Spoken Language Translation, 2018

A Comparison of Transformer and Recurrent Neural Networks on Multilingual Neural Machine Translation.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

2017
Overview of the IWSLT 2017 Evaluation Campaign.
Proceedings of the 14th International Conference on Spoken Language Translation, 2017

Findings of the 2017 DiscoMT Shared Task on Cross-lingual Pronoun Prediction.
Proceedings of the Third Workshop on Discourse in Machine Translation, 2017

2016
On the Evaluation of Adaptive Machine Translation for Human Post-Editing.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Unsupervised Clustering of Commercial Domains for Adaptive Machine Translation.
CoRR, 2016

An Arabic-Hebrew parallel corpus of TED talks.
CoRR, 2016

Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction.
Proceedings of the First Conference on Machine Translation, 2016

WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation on Rare Words.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The IWSLT 2016 Evaluation Campaign.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Neural versus Phrase-Based Machine Translation Quality: a Case Study.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Modern MT: a new open-source machine translation platform for the translation industry.
Proceedings of the 19th Annual Conference of the European Association for Machine Translation: Projects/Products, 2016

2015
The IWSLT 2015 Evaluation Campaign.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Pronoun-Focused MT and Cross-Lingual Pronoun Prediction: Findings of the 2015 DiscoMT Shared Task on Pronoun Translation.
Proceedings of the Second Workshop on Discourse in Machine Translation, 2015

2014
Translation project adaptation for MT-enhanced computer assisted translation.
Mach. Transl., 2014

Online adaptation to post-edits for phrase-based statistical machine translation.
Mach. Transl., 2014

Combined spoken language translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Report on the 11th IWSLT evaluation campaign.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014


Online multi-user adaptive statistical machine translation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

Optimized MT online learning in computer assisted translation.
Proceedings of the Workshop on Interactive and Adaptive Machine Translation, 2014

The repetition rate of text as a predictor of the effectiveness of machine translation adaptation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

2013
Online Learning Approaches in Computer Assisted Translation.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Issues in incremental adaptation of statistical MT from human post-edits.
Proceedings of the 2nd Workshop on Post-editing Technology and Practice, 2013

Project Adaptation for MT-Enhanced Computer Assisted Translation.
Proceedings of Machine Translation Summit XIV: Papers, 2013

Cache-based Online Adaptation for Machine Translation Enhanced Computer Assisted Translation.
Proceedings of Machine Translation Summit XIV: Papers, 2013

EU-BRIDGE MT: text translation of talks in the EU-BRIDGE project.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Report on the 10th IWSLT evaluation campaign.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

2012
Evaluating the Learning Curve of Domain Adaptive Statistical Machine Translation Systems.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Overview of the IWSLT 2012 evaluation campaign.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

WIT3: Web Inventory of Transcribed and Translated Talks.
Proceedings of the 16th Annual conference of the European Association for Machine Translation, 2012

2011
Methods for Smoothing the Optimizer Instability in SMT.
Proceedings of Machine Translation Summit XIII: Papers, 2011

Bootstrapping Arabic-Italian SMT through Comparable Texts and Pivot Translation.
Proceedings of the 15th Annual conference of the European Association for Machine Translation, 2011

2010
Statistical Machine Translation of Texts with Misspelled Words.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Mining parallel fragments from comparable texts.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

FBK @ IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Online Language Model adaptation via N-gram Mixtures for Statistical Machine Translation.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

2009
Online language model adaptation for spoken dialog translation.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

FBK @ IWSLT 2009.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

2008
IRSTLM: an open source toolkit for handling large scale language models.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Shallow-Syntax Phrase-Based Translation: Joint versus Factored String-to-Chunk Models.
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, 2008

2007
Efficient Handling of N-gram Language Models for Statistical Machine Translation.
Proceedings of the Second Workshop on Statistical Machine Translation, 2007

POS-based reordering models for statistical machine translation.
Proceedings of Machine Translation Summit XI: Papers, 2007

Better n-best translations through generative n-gram language models.
Proceedings of Machine Translation Summit XI: Papers, 2007

FBK@IWSLT 2007.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

The IRST English-Spanish translation system for european parliament speeches.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Reordering rules for phrase-based statistical machine translation.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

The ITC-irst SMT system for IWSLT 2006.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

Text Segmentation Criteria for Statistical Machine Translation.
Proceedings of the Advances in Natural Language Processing, 2006

Maximum Entropy Tagging with Binary and Real-Valued Features.
Proceedings of the Workshop on Learning Structured Information in Natural Language Applications@EACL 2006, 2006

A Web-based Demonstrator of a Multi-lingual Phrase-based Translation System.
Proceedings of the EACL 2006, 2006

2005
Evaluation of BIC-based algorithms for audio segmentation.
Comput. Speech Lang., 2005

The ITC-irst SMT system for IWSLT-2005.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Integrated n-best re-ranking for spoken language translation.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Minimum error training of log-linear translation models.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

The ITC-irst statistical machine translation system for IWSLT-2004.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Advances in the automatic transcription of lectures.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A DP algorithm for speaker change detection.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Language modeling and transcription of the TED corpus lectures.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Efficient audio segmentation algorithms based on the BIC.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

The ITC-irst News on Demand Platform.
Proceedings of the Advances in Information Retrieval, 2003

2002
Cross-task portability of a broadcast news speech recognition system.
Speech Commun., 2002

Issues in automatic transcription of historical audio data.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Porting an audio partitioner across domains.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Speaker tracking in a broadcast news corpus.
Proceedings of the 2001: A Speaker Odyssey, 2001

From broadcast news to spontaneous dialogue transcription: portability issues.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Segmentation and Classification of an Italian Broadcast News Corpus.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

A system for the segmentation and transcription of Italian Audio News.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Advances in automatic transcription of Italian broadcast news.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A baseline for the transcription of Italian broadcast news.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Semantic boundaries in multiple languages.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Language portability of a speech understanding system.
Comput. Speech Lang., 1998

Automatic detection of semantic boundaries based on acoustic and lexical knowledge.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Automatic recognition of spontaneous speech dialogues.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1997
Automatic detection of semantic boundaries.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Multilingual person to person communication at IRST.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
A mixed approach to speech understanding.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Language modelling for efficient beam-search.
Comput. Speech Lang., 1995

Improvements in tree-based language model representation.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Language model representations for beam-search decoding.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Language model estimations and representations for real-time continuous speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Radiological reporting by speech recognition: the a.re.s. system.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993
Techniques for robust recognition in restricted domains.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993


  Loading...