Sara Papi
Orcid: 0000-0002-4494-8886
According to our database1,
Sara Papi
authored at least 28 papers
between 2020 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
CoRR, 2024
How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not.
CoRR, 2024
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024
What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
MOSEL: 950, 000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
How Do Hyenas Deal with Human Speech? Speech Recognition and Translation with ConfHyena.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Trans. Assoc. Comput. Linguistics, 2023
Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP.
CoRR, 2023
Proceedings of the 20th International Conference on Spoken Language Translation, 2023
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation.
CoRR, 2022
Proceedings of the 19th International Conference on Spoken Language Translation, 2022
Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
2021
Proceedings of the 1st Workshop on Automatic Spoken Language Translation in Real-World Settings, 2021
Proceedings of the 18th International Conference on Spoken Language Translation, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Eighth Italian Conference on Computational Linguistics, 2021
2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020