Marco Gaido

Orcid: 0000-0003-4217-1396

According to our database, Marco Gaido authored at least 45 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Speech Foundation Models and Crowdsourcing for Efficient, High-Quality Data Collection.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation.
CoRR, 2024

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
CoRR, 2024

How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not.
CoRR, 2024

Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond.
CoRR, 2024

SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation.
CoRR, 2024

MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Direct Speech Translation Toward High-Quality, Inclusive, and Augmented Systems.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

How Do Hyenas Deal with Human Speech? Speech Recognition and Translation with ConfHyena.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Explainability for Speech Models: On the Challenges of Acoustic Feature Selection.
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), 2024

MAGNET - MAchines GeNErating Translations: A CALAMITA Challenge.
Proceedings of the Tenth Italian Conference on Computational Linguistics (CLiC-it 2024), 2024

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Direct Speech Translation for Automatic Subtitling.
Trans. Assoc. Comput. Linguistics, 2023

Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP.
CoRR, 2023

Distributed Silhouette Algorithm: Evaluating Clustering on Big Data.
CoRR, 2023

Test Suites Task: Evaluation of Gender Fairness in MT with MuST-SHE and INES.
Proceedings of the Eighth Conference on Machine Translation, 2023

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Joint Speech Translation and Named Entity Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Named Entity Detection and Injection for Direct Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation.
Proceedings of the 9th Italian Conference on Computational Linguistics, Venice, Italy, November 30, 2023

No Pitch Left Behind: Addressing Gender Unbalance In Automatic Speech Recognition Through Pitch Manipulation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation.
CoRR, 2022

Efficient yet Competitive Speech Translation: FBK@IWSLT2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Who Are We Talking About? Handling Person Names in Speech Translation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Does Simultaneous Speech Translation need Simultaneous Models?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Extending the MuST-C Corpus for a Comparative Evaluation of Speech Translation Technology.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Gender Bias in Machine Translation.
Trans. Assoc. Comput. Linguistics, 2021

Dealing with training and test segmentation mismatch: FBK@IWSLT2021.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Between Flexibility and Consistency: Joint Generation of Captions and Subtitles.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation.
Proceedings of the 4th International Conference on Natural Language and Speech Processing, 2021

Speechformer: Reducing Information Loss in Direct Speech Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

CTC-based Compression for Direct Speech Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Contextualized Translation of Automatically Segmented Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Breeding Gender-aware Direct Speech Translation Systems.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

On Knowledge Distillation for Direct Speech Translation.
Proceedings of the Seventh Italian Conference on Computational Linguistics, 2020

On Target Segmentation for Direct Speech Translation.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

