Aitor Álvarez

Orcid: 0000-0002-7938-4486

According to our database1, Aitor Álvarez authored at least 41 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
ASASVIcomtech: The Vicomtech-UGR Speech Deepfake Detection and SASV Systems for the ASVspoof5 Challenge.
CoRR, 2024

Stream-based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

EMPHASIS: Empowering Decision Making with Higher Productivity by Means of HyperAutomation.
Proceedings of the Seminar of the Spanish Society for Natural Language Processing: Projects and System Demonstrations (SEPLN-CEDI-PD 2024) co-located with the 7th Spanish Conference on Informatics (CEDI 2024), 2024

STREAMS: Live Streaming and Micro-batching with Automated Multilingual Services.
Proceedings of the Seminar of the Spanish Society for Natural Language Processing: Projects and System Demonstrations (SEPLN-CEDI-PD 2024) co-located with the 7th Spanish Conference on Informatics (CEDI 2024), 2024

2023
When Whisper Meets TTS: Domain Adaptation Using only Synthetic Speech Data.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

An Online Diarization Approach for Streaming Applications Based on Tree-Clustering and Bayesian Resegmentation.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

The Vicomtech Partial Deepfake Detection and Location System for the 2023 ADD Challenge.
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022
ESAN: Automating medical scribing in Spanish.
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2022) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2022), 2022

iASSIST: Low-cost, portable and embedded assistants for on-premise automated transcription and translation services.
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2022) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2022), 2022

The Vicomtech Audio Deepfake Detection System Based on Wav2vec2 for the 2022 ADD Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

The Vicomtech Spoofing-Aware Biometric System for the SASV Challenge.
Proceedings of the 6th International Conference, 2022

Exploring the limits of neural voice cloning: A case study on two well-known personalities.
Proceedings of the 6th International Conference, 2022

The Vicomtech-UPM Speech Transcription Systems for the Albayzín-RTVE 2022 Speech to Text Transcription Challenge.
Proceedings of the 6th International Conference, 2022

2021
AutoPunct: A BERT-based Automatic Punctuation and Capitalisation System for Spanish and Basque.
Proces. del Leng. Natural, 2021

GAMES: Generación automática de metadato y contenido para medios y archivos en euskera.
Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing: Projects and Demonstrations (SEPLN-PD 2021) co-located with the Conference of the Spanish Society for Natural Language Processing (SEPLN 2021), 2021

mintzai-ST: Corpus and Baselines for Basque-Spanish Speech Translation.
Proceedings of the Fifth International Conference, 2021

The Vicomtech Speech Transcription Systems for the Albayzín-RTVE 2020 Speech to Text Transcription Challenge.
Proceedings of the Fifth International Conference, 2021

2020
Nalytics: Natural Speech and Text Analytics.
Proces. del Leng. Natural, 2020

MINTZAI: Sistemas de Aprendizaje Profundo E2E para Traducción Automática del Habla.
Proces. del Leng. Natural, 2020

Towards a Natural Human-Robot Interaction in an Industrial Environment.
Proceedings of the Conversational Dialogue Systems for the Next Decade, 2020

European GNSS Service Centre (GSC): Current Status and Future Evolutions To Deliver Added Value Services.
Proceedings of the European Navigation Conference, 2020

2018
Exploring E2E speech recognition systems for new languages.
Proceedings of the Fourth International Conference, 2018

The Vicomtech-PRHLT Speech Transcription Systems for the IberSPEECH-RTVE 2018 Speech to Text Transcription Challenge.
Proceedings of the Fourth International Conference, 2018

2017
Improving the automatic segmentation of subtitles through conditional random field.
Speech Commun., 2017

2016
Probabilistic Kernels for Improved Text-to-Speech Alignment in Long Audio Tracks.
IEEE Signal Process. Lett., 2016

Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech.
Sensors, 2016

Automating live and batch subtitling of multimedia contents for several European languages.
Multim. Tools Appl., 2016

Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2014
Improving a Long Audio Aligner through Phone- Relatedness Matrices for English, Spanish and Basque.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Long audio alignment for automatic subtitling using different phone-relatedness measures.
Proceedings of the IEEE International Conference on Acoustics, 2014

Towards Customized Automatic Segmentation of Subtitles.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2014

Interactive Multimodal Platform for Digital Signage.
Proceedings of the Articulated Motion and Deformable Objects, 2014

2013
Realistic visual speech synthesis in WebGL.
Proceedings of the 18th International Conference on Web3D Technology, 2013

2010
APyCA: Towards the Automatic Subtitling of Television Content in Spanish.
Proceedings of the International Multiconference on Computer Science and Information Technology, 2010

Combining color descriptors for improved codebook modelbased image retrieval.
Proceedings of the 5th European Conference on Colour in Graphics, 2010

High-Realistic and Flexible Virtual Presenters.
Proceedings of the Articulated Motion and Deformable Objects, 6th International Conference, 2010

2007
A Comparison Using Different Speech Parameters in the Automatic Emotion Recognition Using Feature Subset Selection Based on Evolutionary Algorithms.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Application of Feature Subset Selection Based on Evolutionary Algorithms for Automatic Emotion Recognition in Speech.
Proceedings of the Advances in Nonlinear Speech Processing, 2007

2006
Feature Subset Selection Based on Evolutionary Algorithms for Automatic Emotion Recognition in Spoken Spanish and Standard Basque Language.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006


  Loading...