Carlos D. Martínez-Hinarejos

Orcid: 0000-0002-6139-2891

Affiliations:
  • Polytechnic University of Valencia, Spain


According to our database1, Carlos D. Martínez-Hinarejos authored at least 73 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Continuous lipreading based on acoustic temporal alignments.
EURASIP J. Audio Speech Music. Process., December, 2024

Tailored Design of Audio-Visual Speech Recognition Models using Branchformers.
CoRR, 2024

Reading Order Independent Metrics for Information Extraction in Handwritten Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

Reading Between the Frames: Multi-modal Depression Detection in Videos from Non-verbal Cues.
Proceedings of the Advances in Information Retrieval, 2024

Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

AnnoTheia: A Semi-Automatic Annotation Toolkit for Audio-Visual Speech Technologies.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

PINK at EXIST2024: A Cross-Lingual and Multi-Modal Transformer Approach for Sexism Detection in Memes.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

2023
Evaluation of Different Tagging Schemes for Named Entity Recognition in Handwritten Documents.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Consistent Nested Named Entity Recognition in Handwritten Documents via Lattice Rescoring.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

2022
Guidelines to Develop Trustworthy Conversational Agents for Children.
CoRR, 2022

LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Spanish Lipreading in Realistic Scenarios: the LLEER project.
Proceedings of the 6th International Conference, 2022

Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish.
Proceedings of the 6th International Conference, 2022

Enhancing the Design of a Conversational Agent for an Ethical Interaction with Children.
Proceedings of the 6th International Conference, 2022

Evaluation of Named Entity Recognition in Handwritten Documents.
Proceedings of the Document Analysis Systems - 15th IAPR International Workshop, 2022

2021
Generation of Synthetic Sign Language Sentences.
Proceedings of the Fifth International Conference, 2021

Analysis of Visual Features for Continuous Lipreading in Spanish.
Proceedings of the Fifth International Conference, 2021

2020
Study of the influence of lexicon and language restrictions on computer assisted transcription of historical manuscripts.
Neurocomputing, 2020

2019
Image-speech combination for interactive computer assisted transcription of handwritten documents.
Comput. Vis. Image Underst., 2019

2018
Transcription of Spanish Historical Handwritten Documents with Deep Neural Networks.
J. Imaging, 2018

Multimodality, interactivity, and crowdsourcing for document transcription.
Comput. Intell., 2018

Sign Language Gesture Classification using Neural Networks.
Proceedings of the Fourth International Conference, 2018

Advances on the Transcription of Historical Manuscripts based on Multimodality, Interactivity and Crowdsourcing.
Proceedings of the Fourth International Conference, 2018

Improving Transcription of Manuscripts with Multimodality and Interaction.
Proceedings of the Fourth International Conference, 2018

Exploring E2E speech recognition systems for new languages.
Proceedings of the Fourth International Conference, 2018

The Vicomtech-PRHLT Speech Transcription Systems for the IberSPEECH-RTVE 2018 Speech to Text Transcription Challenge.
Proceedings of the Fourth International Conference, 2018

Comparing Different Feedback Modalities in Assisted Transcription of Manuscripts.
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems, 2018

2017
Multimodal Crowdsourcing for Transcribing Handwritten Documents.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Improving the automatic segmentation of subtitles through conditional random field.
Speech Commun., 2017

Spanish Sign Language Recognition with Different Topology Hidden Markov Models.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Interactive Layout Detection.
Proceedings of the Pattern Recognition and Image Analysis - 8th Iberian Conference, 2017

Sign Language Gesture Recognition Using HMM.
Proceedings of the Pattern Recognition and Image Analysis - 8th Iberian Conference, 2017

Baseline Detection on Arabic Handwritten Documents.
Proceedings of the 2017 ACM Symposium on Document Engineering, 2017

2016
Impact of Automatic Segmentation on the Quality, Productivity and Self-reported Post-editing Effort of Intralingual Subtitles.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Collaborator Effort Optimisation in Multimodal Crowdsourcing for Transcribing Historical Manuscripts.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

A Multimodal Crowdsourcing Framework for Transcribing Historical Handwritten Documents.
Proceedings of the 2016 ACM Symposium on Document Engineering, 2016

An Interactive Approach with Off-Line and On-Line Handwritten Text Recognition Combination for Transcribing Historical Documents.
Proceedings of the 12th IAPR Workshop on Document Analysis Systems, 2016

2015
Unsegmented Dialogue Act Annotation and Decoding With N-Gram Transducers.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Combining handwriting and speech recognition for transcribing historical handwritten documents.
Proceedings of the 13th International Conference on Document Analysis and Recognition, 2015

Multimodal Output Combination for Transcribing Historical Handwritten Documents.
Proceedings of the Computer Analysis of Images and Patterns, 2015

2014
An iterative multimodal framework for the transcription of handwritten historical documents.
Pattern Recognit. Lett., 2014

2013
Handwriting recognition in historical documents using very large vocabularies.
Proceedings of the 2nd International Workshop on Historical Document Imaging and Processing, 2013

2012
Estimating the number of segments for improving dialogue act labelling.
Nat. Lang. Eng., 2012

2011
Direct and Wordgraph-Based Confidence Measures in Dialogue Annotation with N-Gram Transducers.
Proceedings of the Human Language Technology Challenges for Computer Science and Linguistics, 2011

Active Learning to Speed-Up the Training Process for Dialogue Act Labelling.
Proceedings of the Human Language Technology Challenges for Computer Science and Linguistics, 2011

A Multimodal Approach to Dictation of Handwritten Historical Documents.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Active Learning for Dialogue Act Labelling.
Proceedings of the Pattern Recognition and Image Analysis - 5th Iberian Conference, 2011

On the Use of N-Gram Transducers for Dialogue Annotation.
Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

2010
Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Dialogue act tagging and segmentation with a single perceptron.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Simultaneous Dialogue Act Segmentation and Labelling using Lexical and Syntactic Features.
Proceedings of the SIGDIAL 2009 Conference, 2009

Improving Unsegmented Statistical Dialogue Act Labelling.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Estimating the Number of Segments of a Turn in Dialogue Systems.
Proceedings of the Pattern Recognition in Information Systems, 2009

Improving Unsegmented Dialogue Turns Annotation with N-gram Transducers.
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, 2009

A Study of a Segmentation Technique for Dialogue Act Assignation (short paper).
Proceedings of the Eight International Conference on Computational Semantics, 2009

2008
Statistical framework for a Spanish spoken dialogue corpus.
Speech Commun., 2008

Evaluation of Different Segmentation Techniques for Dialogue Turns.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007
On the Training Data Requirements for an Automatic Dialogue Annotation Technique.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

A Study on Bilingual Speech Recognition Involving a Minority Language.
Proceedings of the Human Language Technology. Challenges of the Information Society, 2007

2006
Computer-assisted translation using speech recognition.
IEEE Trans. Speech Audio Process., 2006

Automatic Annotation of Dialogues Using <i>n</i>-Grams.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Bilingual speech corpus in two phonetically similar languages.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Segmented and Unsegmented Dialogue-Act Annotation with Statistical Dialogue Models.
Proceedings of the ACL 2006, 2006

2004
Some approaches to statistical and finite-state speech-to-speech translation.
Comput. Speech Lang., 2004

2003
Median strings for k-nearest neighbour classification.
Pattern Recognit. Lett., 2003

Adaptive Learning for String Classification.
Proceedings of the Pattern Recognition and Image Analysis, First Iberian Conference, 2003

Generalized k-Medians Clustering for Strings.
Proceedings of the Pattern Recognition and Image Analysis, First Iberian Conference, 2003

2002
Evaluating a Probabilistic Dialogue Model for a Railway Information Task.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

Reducing the Computational Cost of Computing Approximated Median Strings.
Proceedings of the Structural, 2002

A Labelling Proposal to Annotate Dialogues.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

2001
Speech-to-speech translation based on finite-state transducers.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Use of Median String for Classification .
Proceedings of the 15th International Conference on Pattern Recognition, 2000


  Loading...