Gilles Boulianne

Orcid: 0000-0001-9383-6189

According to our database1, Gilles Boulianne authored at least 67 papers between 1990 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages.
Proceedings of the Speech and Computer - 25th International Conference, 2023

2022
CRIM's Speech Recognition System for OpenASR21 Evaluation with Conformer and Voice Activity Detector Embeddings.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Progress in Multilingual Speech Recognition for Low Resource Languages Kurmanji Kurdish, Cree and Inuktut.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Phoneme transcription of endangered languages: an evaluation of recent ASR architectures in the single speaker scenario.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2020
A Study of Inductive Biases for Unsupervised Speech Representation Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Speech Transcription Challenges for Resource Constrained Indigenous Language Cree.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020


Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


2019
CRIM's Speech Transcription and Call Sign Detection System for the ATC Airbus Challenge Task.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
CRIM's System for the MGB-3 English Multi-Genre Broadcast Media Transcription.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2015
Language-independent voice passphrase verification.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

CRIM and LIUM approaches for multi-genre broadcast media transcription.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
LIUM and CRIM ASR System Combination for the REPERE Evaluation Campaign.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

2013
Large Vocabulary Speech Recognition on Parallel Architectures.
IEEE Trans. Speech Audio Process., 2013

Comparing computation in Gaussian mixture and neural network based large-vocabulary speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Unsupervised topic model for broadcast program segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
CRIM's content-based audio copy detection system for TRECVID 2009.
Multim. Tools Appl., 2012

Content-based video copy detection using nearest-neighbor mapping.
Proceedings of the 11th International Conference on Information Science, 2012

The A* speech recognition system on parallel architectures.
Proceedings of the 11th International Conference on Information Science, 2012

Generating exact lattices in the WFST framework.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Using A* for the parallelization of speech recognition systems.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
CRIM AT TRECVID-2011: Content-Based Copy Detection using Nearest-Neighbor Mapping.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

2010
Content-based advertisement detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Content-based audio copy detection using nearest-neighbor mapping.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
CRIM´s Content-Based Copy Detection System for TRECVID.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Incorporating Knowledge of Source Language Text in a System for Dictation of Document Translations.
Proceedings of Machine Translation Summit XII: Papers, 2009

Using parallel architectures in speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Real-time correction of closed-captions.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Real-time speech recognition captioning of events and meetings.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture models.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

GPU accelerated acoustic likelihood computations.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker diarization of French broadcast news.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Speaker and Session Variability in GMM-Based Speaker Verification.
IEEE Trans. Speech Audio Process., 2007

Joint Factor Analysis Versus Eigenchannels in Speaker Recognition.
IEEE Trans. Speech Audio Process., 2007

Combining Gaussianized/Non-Gaussianized Features to Improve Speaker Diarization of Telephone Conversations.
IEEE Signal Process. Lett., 2007

Multiple feature combination to improve speaker diarization of telephone conversations.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Real-Time Correction of Closed-Captions.
Proceedings of the ACL 2007, 2007

2006
The Geometry of the Channel Space in GMM-Based Speaker Recognition.
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006

Feature normalization using smoothed mixture transformations.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Computer-assisted closed-captioning of live TV broadcasts in French.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Improvements in Factor Analysis Based Speaker Verification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Dempster-Shafer Based Fusion Approach for Audio-Visual Speech Recognition with Application to Large Vocabulary French Speech.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Eigenvoice modeling with sparse training data.
IEEE Trans. Speech Audio Process., 2005

Flavors of Gaussian warping.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Segmentation of recordings based on partial transcriptions.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Factor Analysis Simplified.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Speaker adaptation using an eigenphone basis.
IEEE Trans. Speech Audio Process., 2004

2003
Discriminative training and maximum likelihood detector for speaker identification.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automated closed-captioning of live TV broadcast news in French.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic segmentation of film dialogues into phonemes and graphemes.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Disambiguation of Finite-State Transducers.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

2001
What is the best type of prior distribution for EMAP speaker adaptation?
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A transducer approach to word graph generation.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Tree-structured vector quantization for speech recognition.
Comput. Speech Lang., 2000

French large vocabulary recognition with cross-word phonology transducers.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Experiments in constrained maximum likelihood extraction of temporal features for speech recognition.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1996
Bi-directional graph search strategies for speech recognition.
Comput. Speech Lang., 1996

Optimal tying of HMM mixture densities using decision trees.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1994
Experiments in continuous speech recognition using books on tape.
Speech Commun., 1994

Books on tape as training data for continuous speech recognition.
Speech Commun., 1994

Fast match acoustic models in large vocabulary continuous speech recognition.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1992
An A* algorithm for very large vocabulary continuous speech recognition.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Harriman, 1992

Experiments in continuous speech recognition with a 60, 000 word vocabulary.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

HMM training on unconstrained speech for large vocabulary, continuous speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

1990
On modelling the phonology phonetics interface for articulatory synthesis.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990


  Loading...