Aitor Álvarez

Juan Camilo Vásquez-Correa

Harritxu Gete Ugarte

2023

When Whisper Meets TTS: Domain Adaptation Using only Synthetic Speech Data.

[BibT_eX]

[DOI]

Juan M. Martín-Doñas

Joaquín Arellano

Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

An Online Diarization Approach for Streaming Applications Based on Tree-Clustering and Bayesian Resegmentation.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

The Vicomtech Partial Deepfake Detection and Location System for the 2023 ADD Challenge.

[BibT_eX]

[DOI]

Juan Manuel Martín-Doñas

Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022

ESAN: Automating medical scribing in Spanish.

[BibT_eX]

[DOI]

Pedro de la Peña Tejada

Itziar Cuenca

Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2022) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2022), 2022

iASSIST: Low-cost, portable and embedded assistants for on-premise automated transcription and translation services.

[BibT_eX]

[DOI]

The Vicomtech Audio Deepfake Detection System Based on Wav2vec2 for the 2022 ADD Challenge.

[BibT_eX]

[DOI]

Juan M. Martín-Doñas

Proceedings of the IEEE International Conference on Acoustics, 2022

The Vicomtech Spoofing-Aware Biometric System for the SASV Challenge.

[BibT_eX]

[DOI]

Juan Manuel Martín-Doñas

Iván González Torre

Joaquín Arellano

Proceedings of the 6th International Conference, 2022

Exploring the limits of neural voice cloning: A case study on two well-known personalities.

[BibT_eX]

[DOI]

Proceedings of the 6th International Conference, 2022

The Vicomtech-UPM Speech Transcription Systems for the Albayzín-RTVE 2022 Speech to Text Transcription Challenge.

[BibT_eX]

[DOI]

Iván G. Torres

Juan Manuel Martín-Doñas

Proceedings of the 6th International Conference, 2022

2021

AutoPunct: A BERT-based Automatic Punctuation and Capitalisation System for Spanish and Basque.

[BibT_eX]

[DOI]

Aitor García Pablos

Proces. del Leng. Natural, 2021

GAMES: Generación automática de metadato y contenido para medios y archivos en euskera.

[BibT_eX]

[DOI]

Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2021) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2021), 2021

mintzai-ST: Corpus and Baselines for Basque-Spanish Speech Translation.

[BibT_eX]

[DOI]

Edson Benites Fernandez

Proceedings of the Fifth International Conference, 2021

The Vicomtech Speech Transcription Systems for the Albayzín-RTVE 2020 Speech to Text Transcription Challenge.

[BibT_eX]

[DOI]

Iván González Torre

Proceedings of the Fifth International Conference, 2021

2020

Nalytics: Natural Speech and Text Analytics.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2020

MINTZAI: Sistemas de Aprendizaje Profundo E2E para Traducción Automática del Habla.

[BibT_eX]

[DOI]

Proces. del Leng. Natural, 2020

Towards a Natural Human-Robot Interaction in an Industrial Environment.

[BibT_eX]

[DOI]

Ignacio Fernández-Hernández

Proceedings of the Conversational Dialogue Systems for the Next Decade, 2020

European GNSS Service Centre (GSC): Current Status and Future Evolutions To Deliver Added Value Services.

[BibT_eX]

[DOI]

Pedro Gömez

Emilio González

Ana Senado

Jesùs David Calle

Proceedings of the European Navigation Conference, 2020

2018

Exploring E2E speech recognition systems for new languages.

[BibT_eX]

[DOI]

Conrad Bernath

Carlos David Martínez-Hinarejos

Carlos David Martínez

Proceedings of the Fourth International Conference, 2018

The Vicomtech-PRHLT Speech Transcription Systems for the IberSPEECH-RTVE 2018 Speech to Text Transcription Challenge.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

2017

Improving the automatic segmentation of subtitles through conditional random field.

[BibT_eX]

[DOI]

Carlos D. Martínez-Hinarejos

Luis Javier Rodríguez-Fuentes

Marina Balenciaga

Arantza del Pozo

Speech Commun., 2017

2016

Probabilistic Kernels for Improved Text-to-Speech Alignment in Long Audio Tracks.

[BibT_eX]

[DOI]

Germán Bordel

Mikel Peñagarikano

Amparo Varona

IEEE Signal Process. Lett., 2016

Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech.

[BibT_eX]

[DOI]

Carlos D. Martínez-Hinarejos

Basilio Sierra

Andoni Arruti

Juan Miguel López Gil

Nestor Garay-Vitoria

Sensors, 2016

Automating live and batch subtitling of multimedia contents for several European languages.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2014

Improving a Long Audio Aligner through Phone- Relatedness Matrices for English, Spanish and Basque.

[BibT_eX]

[DOI]

Pablo Ruiz Fabo

Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling.

[BibT_eX]

[DOI]

Pablo Ruiz Fabo

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Long audio alignment for automatic subtitling using different phone-relatedness measures.

[BibT_eX]

[DOI]

Pablo Ruiz Fabo

Proceedings of the IEEE International Conference on Acoustics, 2014

Towards Customized Automatic Segmentation of Subtitles.

[BibT_eX]

[DOI]

Thierry Etchegoyhen

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Interactive Multimodal Platform for Digital Signage.

[BibT_eX]

[DOI]

Helen V. Díez

Javier Barbadillo

Sara García

Maria del Puy Carretero

Jairo R. Sánchez

David Oyarzun

Proceedings of the Articulated Motion and Deformable Objects, 2014

2013

Realistic visual speech synthesis in WebGL.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Web3D Technology, 2013

2010

APyCA: Towards the Automatic Subtitling of Television Content in Spanish.

[BibT_eX]

[DOI]

Arantza del Pozo

Andoni Arruti

Proceedings of the International Multiconference on Computer Science and Information Technology, 2010

Combining color descriptors for improved codebook modelbased image retrieval.

[BibT_eX]

[DOI]

Proceedings of the 5th European Conference on Colour in Graphics, 2010

High-Realistic and Flexible Virtual Presenters.

[BibT_eX]

[DOI]

David Oyarzun

Andoni Mujika