Carlos D. Martínez-Hinarejos

EURASIP J. Audio Speech Music. Process., December, 2024

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers.

CoRR, 2024

Reading Order Independent Metrics for Information Extraction in Handwritten Documents.

Solène Tarride

Christopher Kermorvant

Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Reading Between the Frames: Multi-modal Depression Detection in Videos from Non-verbal Cues.

Ana-Maria Bucur

Adrian Cosma

Paolo Rosso

Proceedings of the Advances in Information Retrieval, 2024

Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies.

José-M. Acosta-Triana

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

PINK at EXIST2024: A Cross-Lingual and Multi-Modal Transformer Approach for Sexism Detection in Memes.

Giulia Rizzi

Elisabetta Fersini

Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

2023

Evaluation of Different Tagging Schemes for Named Entity Recognition in Handwritten Documents.

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Consistent Nested Named Entity Recognition in Handwritten Documents via Lattice Rescoring.

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

2022

Guidelines to Develop Trustworthy Conversational Agents for Children.

Marina Escobar-Planas

Emilia Gómez

CoRR, 2022

LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild.

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Spanish Lipreading in Realistic Scenarios: the LLEER project.

Proceedings of the 6th International Conference, 2022

Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish.

Proceedings of the 6th International Conference, 2022

Enhancing the Design of a Conversational Agent for an Ethical Interaction with Children.

Marina Escobar-Planas

Emilia Gómez

Proceedings of the 6th International Conference, 2022

Evaluation of Named Entity Recognition in Handwritten Documents.

Proceedings of the Document Analysis Systems - 15th IAPR International Workshop, 2022

2021

Generation of Synthetic Sign Language Sentences.

Aitana Villaplana

Proceedings of the Fifth International Conference, 2021

Analysis of Visual Features for Continuous Lipreading in Spanish.

Proceedings of the Fifth International Conference, 2021

2020

Study of the influence of lexicon and language restrictions on computer assisted transcription of historical manuscripts.

Neurocomputing, 2020

2019

Image-speech combination for interactive computer assisted transcription of handwritten documents.

Comput. Vis. Image Underst., 2019

2018

Transcription of Spanish Historical Handwritten Documents with Deep Neural Networks.

Edgard Chammas

Laurence Likforman-Sulem

Chafic Mokbel

Bogdan-Ionut Cirstea

J. Imaging, 2018

Multimodality, interactivity, and crowdsourcing for document transcription.

Comput. Intell., 2018

Sign Language Gesture Classification using Neural Networks.

Zuzanna Parcheta

Proceedings of the Fourth International Conference, 2018

Advances on the Transcription of Historical Manuscripts based on Multimodality, Interactivity and Crowdsourcing.

Proceedings of the Fourth International Conference, 2018

Improving Transcription of Manuscripts with Multimodality and Interaction.

Proceedings of the Fourth International Conference, 2018

Exploring E2E speech recognition systems for new languages.

Conrad Bernath

Aitor Álvarez

Haritz Arzelus

Carlos David Martínez

Proceedings of the Fourth International Conference, 2018

The Vicomtech-PRHLT Speech Transcription Systems for the IberSPEECH-RTVE 2018 Speech to Text Transcription Challenge.

Proceedings of the Fourth International Conference, 2018

Comparing Different Feedback Modalities in Assisted Transcription of Manuscripts.

Emilio Granell-Romero

Verónica Romero-Gomez

Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

2017

Multimodal Crowdsourcing for Transcribing Handwritten Documents.

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Improving the automatic segmentation of subtitles through conditional random field.

Aitor Álvarez

Haritz Arzelus

Marina Balenciaga

Arantza del Pozo

Speech Commun., 2017

Spanish Sign Language Recognition with Different Topology Hidden Markov Models.

Zuzanna Parcheta

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Interactive Layout Detection.

Lorenzo Quirós

Alejandro H. Toselli

Enrique Vidal

Proceedings of the Pattern Recognition and Image Analysis - 8th Iberian Conference, 2017

Sign Language Gesture Recognition Using HMM.

Zuzanna Parcheta

Proceedings of the Pattern Recognition and Image Analysis - 8th Iberian Conference, 2017

Baseline Detection on Arabic Handwritten Documents.

Ahmed Fawzi

Moisés Pastor

Proceedings of the 2017 ACM Symposium on Document Engineering, 2017

2016

Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles.

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Collaborator Effort Optimisation in Multimodal Crowdsourcing for Transcribing Historical Manuscripts.

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

A Multimodal Crowdsourcing Framework for Transcribing Historical Handwritten Documents.

Proceedings of the 2016 ACM Symposium on Document Engineering, 2016

An Interactive Approach with Off-Line and On-Line Handwritten Text Recognition Combination for Transcribing Historical Documents.

Proceedings of the 12th IAPR Workshop on Document Analysis Systems, 2016

2015

Unsegmented Dialogue Act Annotation and Decoding With N-Gram Transducers.

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Combining handwriting and speech recognition for transcribing historical handwritten documents.

Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Multimodal Output Combination for Transcribing Historical Handwritten Documents.

Proceedings of the Computer Analysis of Images and Patterns, 2015

2014

An iterative multimodal framework for the transcription of handwritten historical documents.

Vicent Alabau

Antonio L. Lagarda

Pattern Recognit. Lett., 2014

2013

Handwriting recognition in historical documents using very large vocabularies.

Volkmar Frinken

Andreas Fischer

Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, 2013

2012

Estimating the number of segments for improving dialogue act labelling.

Nat. Lang. Eng., 2012

2011

Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers.

Proceedings of the Human Language Technology Challenges for Computer Science and Linguistics, 2011

Active Learning to Speed-Up the Training Process for Dialogue Act Labelling.

Fabrizio Ghigi

Proceedings of the Human Language Technology Challenges for Computer Science and Linguistics, 2011

A Multimodal Approach to Dictation of Handwritten Historical Documents.

Vicent Alabau

Antonio L. Lagarda

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Active Learning for Dialogue Act Labelling.

Fabrizio Ghigi

Proceedings of the Pattern Recognition and Image Analysis - 5th Iberian Conference, 2011

On the Use of N-Gram Transducers for Dialogue Annotation.

Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

2010

Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns.

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Dialogue act tagging and segmentation with a single perceptron.

Stephen G. Pulman

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Simultaneous Dialogue Act Segmentation and Labelling using Lexical and Syntactic Features.

Stephen G. Pulman

Proceedings of the SIGDIAL 2009 Conference, 2009

Improving Unsegmented Statistical Dialogue Act Labelling.

José-Miguel Benedí Ruíz

Proceedings of the Recent Advances in Natural Language Processing, 2009

Estimating the Number of Segments of a Turn in Dialogue Systems.

Proceedings of the Pattern Recognition in Information Systems, 2009

Improving Unsegmented Dialogue Turns Annotation with N-gram Transducers.

Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009

A Study of a Segmentation Technique for Dialogue Act Assignation (short paper).

Proceedings of the Eight International Conference on Computational Semantics, 2009

2008

Statistical framework for a Spanish spoken dialogue corpus.

Speech Commun., 2008

Evaluation of Different Segmentation Techniques for Dialogue Turns.

Proceedings of the International Conference on Language Resources and Evaluation, 2008

Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation.

Míriam Luján-Mares

Vicent Alabau Gonzalvo

Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007

On the Training Data Requirements for an Automatic Dialogue Annotation Technique.

Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

A Study on Bilingual Speech Recognition Involving a Minority Language.

Míriam Luján-Mares

Vicente Alabau

Proceedings of the Human Language Technology. Challenges of the Information Society, 2007

2006

Computer-assisted translation using speech recognition.

Enrique Vidal

Luis Rodríguez

Jorge Civera

IEEE Trans. Speech Audio Process., 2006

Automatic Annotation of Dialogues Using n-Grams.

Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Bilingual speech corpus in two phonetically similar languages.

Vicente Alabau

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Segmented and Unsegmented Dialogue-Act Annotation with Statistical Dialogue Models.

Proceedings of the ACL 2006, 2006

2004

Some approaches to statistical and finite-state speech-to-speech translation.

Sirko Molau

Comput. Speech Lang., 2004

2003

Median strings for k-nearest neighbour classification.

Pattern Recognit. Lett., 2003

Adaptive Learning for String Classification.

Ramón Alberto Mollineda

Enrique Vidal

Proceedings of the Pattern Recognition and Image Analysis, First Iberian Conference, 2003

Generalized k-Medians Clustering for Strings.

Proceedings of the Pattern Recognition and Image Analysis, First Iberian Conference, 2003

2002

Evaluating a Probabilistic Dialogue Model for a Railway Information Task.

Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Reducing the Computational Cost of Computing Approximated Median Strings.

Ramón Alberto Mollineda

Proceedings of the Structural, 2002

A Labelling Proposal to Annotate Dialogues.

Emilio Sanchis

Fernando García-Granada

Pablo Aibar

Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

2001

Speech-to-speech translation based on finite-state transducers.

David Llorens

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Use of Median String for Classification.
