2023
Exploring the Impact of Pretrained Models and Web-Scraped Data for the 2022 NIST Language Recognition Evaluation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2019
Challenges in Audio Processing of Terrorist-Related Data.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019
2016
Sub-vector Extraction and Cascade Post-Processing for Speaker Verification Using MLLR Super-vectors.
CoRR, 2016
Language Recognition for Dialects and Closely Related Languages.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
A Divide-and-Conquer Approach for Language Identification Based on Recurrent Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Lexical speaker identification in TV shows.
Multim. Tools Appl., 2015
Active learning based data selection for limited resource STT and KWS.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Improving data selection for low-resource STT and KWS.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification.
IEEE Signal Process. Lett., 2014
Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast.
Int. J. Multim. Inf. Retr., 2014
Person Instance Graphs for Named Speaker Identification in TV Broadcast.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Developing STT and KWS systems using limited language resources.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Comparing decoding strategies for subword-based keyword spotting in low-resourced languages.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
2013
Unsupervised naming of speakers in broadcast TV: using written names, pronounced names or both?
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Augmenting short-term cepstral features with long-term discriminative features for speaker verification of telephone data.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the First Workshop on Speech, 2013
Lattice MLLR based m-vector system for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2013
Score normalization and system combination for improved keyword spotting.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
Transcription of Russian conversational speech.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
Incorporating MLP features in the unsupervised training process.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Fusion of Speech, Faces and Text for Person Identification in TV Broadcast.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012
2011
Speech recognition for machine translation in Quaero.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011
Comparing Multi-Stage Approaches for Cross-Show Speaker Diarization.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Improved models for Mandarin speech-to-text transcription.
Proceedings of the IEEE International Conference on Acoustics, 2011
The Vocapia Research ASR Systems for Evalita 2011.
Proceedings of the Evaluation of Natural Language and Speech Tools for Italian, 2011
2010
Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet.
Multim. Tools Appl., 2010
Recherche par le contenu dans des documents audiovisuels multilingues.
Document Numérique, 2010
On the use of GSV-SVM for Speaker Diarization and Tracking.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010
Multi-style MLP features for BN transcription.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Automatic Speech Recognition for Under-Resourced Languages: Application to Vietnamese Language.
IEEE Trans. Speech Audio Process., 2009
Mining a Comparable Text Corpus for a Vietnamese-French Statistical Machine Translation System.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009
Exploitation d'un corpus bilingue pour la création d'un système de traduction probabiliste Vietnamien - Français.
Proceedings of the Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2009
2008
Which units for acoustic and language modeling for Khmer automatic speech recognition?
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008
Recent advances in automatic speech recognition for vietnamese.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008
Word/sub-word lattices decomposition and combination for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
The LIG Arabic/English speech translation system at IWSLT07.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007
Speaker diarization using normalized cross likelihood ratio.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
2006
Reconnaissance automatique de la parole pour des langues peu dotées. (Automatic Speech Recognition for Under-Ressourced Languages).
PhD thesis, 2006
Comparison of acoustic modeling techniques for Vietnamese and Khmer ASR.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Acoustic-Phonetic Unit Similarities For Context Dependent Acoustic Model Portability.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
ASR and Translation for Under-Resourced Languages.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
First Steps in Fast Acoustic Modeling for a New Target Language: Application to Vietnamese.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Toward Acoustic Models for Languages with Limited Linguistic Resources.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2005
2004
Spoken and Written Language Resources for Vietnamese.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004
2003
Using the web for fast language model construction in minority languages.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003