Vishwa Gupta

Proceedings of the Speech and Computer - 26th International Conference, 2024

2023

Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 25th International Conference, 2023

2022

CRIM's Speech Recognition System for OpenASR21 Evaluation with Conformer and Voice Activity Detector Embeddings.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 24th International Conference, 2022

Progress in Multilingual Speech Recognition for Low Resource Languages Kurmanji Kurdish, Cree and Inuktut.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2020

Speech Transcription Challenges for Resource Constrained Indigenous Language Cree.

[BibT_eX]

[DOI]

Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language.

[BibT_eX]

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

The Indigenous Languages Technology project at NRC Canada: An empowerment-oriented approach to developing language software.

[BibT_eX]

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019

CRIM's Speech Transcription and Call Sign Detection System for the ATC Airbus Challenge Task.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

CRIM's System for the MGB-3 English Multi-Genre Broadcast Media Transcription.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Deeply Fused Speaker Embeddings for Text-Independent Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Robust video fingerprints using positions of salient regions.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Fast Audio Fingerprinting System Using GPU and a Clustering-Based Technique.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2016

A spectrogram-based audio fingerprinting system for content-based copy detection.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2016

Modelling speaker and channel variability using deep neural networks for robust speaker verification.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Compensation for phonetic nuisance variability in speaker recognition using DNNs.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Tandem Features for Text-Dependent Speaker Verification on the RedDots Corpus.

[BibT_eX]

[DOI]

Md. Jahangir Alam

Patrick Kenny

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2015

Content-Based Multimedia Copy Detection.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Efficient spectrogram-based binary image feature for audio copy detection.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker change point detection using deep neural nets.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

GPU implementation of an audio fingerprints similarity search algorithm.

[BibT_eX]

[DOI]

Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

CRIM and LIUM approaches for multi-genre broadcast media transcription.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

LIUM and CRIM ASR System Combination for the REPERE Evaluation Campaign.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Deep Neural Networks for extracting Baum-Welch statistics for Speaker Recognition.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Robust features for content-based audio copy detection.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

A robust audio fingerprinting method for content-based copy detection.

[BibT_eX]

[DOI]

Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014

2013

Comparing computation in Gaussian mixture and neural network based large-vocabulary speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Compensation for inter-frame correlations in speaker diarization and recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

CRIM's content-based audio copy detection system for TRECVID 2009.

[BibT_eX]

[DOI]

Patrick Cardinal

Multim. Tools Appl., 2012

Content-based video copy detection using nearest-neighbor mapping.

[BibT_eX]

[DOI]

Parisa Darvish Zadeh Varcheie

Langis Gagnon

Proceedings of the 11th International Conference on Information Science, 2012

2011

CRIM AT TRECVID-2011: Content-Based Copy Detection using Nearest-Neighbor Mapping.

[BibT_eX]

[DOI]

Parisa Darvish Zadeh Varcheie

Langis Gagnon

Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

2010

Content-based advertisement detection.

[BibT_eX]

[DOI]

Patrick Cardinal

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Subword-based spoken term detection in audio course lectures.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Content-based audio copy detection using nearest-neighbor mapping.

[BibT_eX]

[DOI]

Patrick Cardinal

Proceedings of the IEEE International Conference on Acoustics, 2010

A computer-vision-assisted system for Videodescription scripting.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009

CRIM´s Content-Based Copy Detection System for TRECVID.

[BibT_eX]

[DOI]

Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

2008

A Study of Interspeaker Variability in Speaker Verification.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2008

The role of speaker factors in the NIST extended data task.

[BibT_eX]

[DOI]

Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture models.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker diarization of French broadcast news.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Combining Gaussianized/Non-Gaussianized Features to Improve Speaker Diarization of Telephone Conversations.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

Multiple feature combination to improve speaker diarization of telephone conversations.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Feature normalization using smoothed mixture transformations.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2000

Automation of locality recognition in ADAS plus.

[BibT_eX]

[DOI]

Serge Robillard

Claude Pelletier

Speech Commun., 2000

1999

Application of simultaneous decoding algorithms to automatic transcription of known and unknown words.

[BibT_eX]

[DOI]

Jian-Xiong Wu

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1996

Compensated mel frequency cepstrum coefficients.

[BibT_eX]

[DOI]

Rivarol Vergin

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1993

A*-admissible heuristics for rapid lexical access.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1993

1992

Flexible vocabulary recognition of speech.

[BibT_eX]

[DOI]

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Hybrid segmental-LVQ/HMM for large vocabulary speech recognition.

[BibT_eX]

[DOI]

Yan Ming Cheng

Sarangarajan Parthasarathy

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition.

[BibT_eX]

[DOI]

IEEE Trans. Signal Process., 1991

Energy, duration and Markov models.

[BibT_eX]

[DOI]

Patrick Kenny

Sarangarajan Parthasarathy

Matthew Lennig

Paul Mermelstein

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Using phoneme duration and energy contour information to improve large vocabulary isolated-word recognition.

[BibT_eX]

[DOI]

Proceedings of the 1991 International Conference on Acoustics, 1991

1990

An 86, 000-Word Recognizer Based on Phonemic Models.

[BibT_eX]

[DOI]

Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

Acoustic recognition component of an 86000-word speech recognizer.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

A locus model of coarticulation in an HMM speech recognizer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1989

1988

Three probabilistic language models for a large-vocabulary speech recognizer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1988

Modeling acoustic-phonetic detail in an HMM-based large vocabulary speech recognizer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1988

1987

Integration of acoustic information in a large vocabulary word recognizer.

[BibT_eX]

[DOI]

Matthew Lennig

Paul Mermelstein

Proceedings of the IEEE International Conference on Acoustics, 1987

1984

Decision rules for speaker-independent isolated word recognition.

[BibT_eX]

[DOI]

Matthew Lennig

Paul Mermelstein

Proceedings of the IEEE International Conference on Acoustics, 1984

1978

Speaker-independent vowel indetification in continuous speech.

[BibT_eX]

[DOI]