Vishwa Gupta

According to our database1, Vishwa Gupta authored at least 64 papers between 1978 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Advances in OpenASR21 Evaluation with Increased Temporal Resolution for Speech Self-supervised Learning Models.
Proceedings of the Speech and Computer - 26th International Conference, 2024

2023
Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages.
Proceedings of the Speech and Computer - 25th International Conference, 2023

2022
CRIM's Speech Recognition System for OpenASR21 Evaluation with Conformer and Voice Activity Detector Embeddings.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Progress in Multilingual Speech Recognition for Low Resource Languages Kurmanji Kurdish, Cree and Inuktut.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2020
Speech Transcription Challenges for Resource Constrained Indigenous Language Cree.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


2019
CRIM's Speech Transcription and Call Sign Detection System for the ATC Airbus Challenge Task.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
CRIM's System for the MGB-3 English Multi-Genre Broadcast Media Transcription.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Deeply Fused Speaker Embeddings for Text-Independent Speaker Verification.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Robust video fingerprints using positions of salient regions.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Fast Audio Fingerprinting System Using GPU and a Clustering-Based Technique.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

A spectrogram-based audio fingerprinting system for content-based copy detection.
Multim. Tools Appl., 2016

Modelling speaker and channel variability using deep neural networks for robust speaker verification.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Compensation for phonetic nuisance variability in speaker recognition using DNNs.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Tandem Features for Text-Dependent Speaker Verification on the RedDots Corpus.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation.
EURASIP J. Adv. Signal Process., 2015

Content-Based Multimedia Copy Detection.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Efficient spectrogram-based binary image feature for audio copy detection.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Speaker change point detection using deep neural nets.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

GPU implementation of an audio fingerprints similarity search algorithm.
Proceedings of the 13th International Workshop on Content-Based Multimedia Indexing, 2015

CRIM and LIUM approaches for multi-genre broadcast media transcription.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
LIUM and CRIM ASR System Combination for the REPERE Evaluation Campaign.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Deep Neural Networks for extracting Baum-Welch statistics for Speaker Recognition.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Robust features for content-based audio copy detection.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription.
Proceedings of the IEEE International Conference on Acoustics, 2014

A robust audio fingerprinting method for content-based copy detection.
Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014

2013
Comparing computation in Gaussian mixture and neural network based large-vocabulary speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Compensation for inter-frame correlations in speaker diarization and recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
CRIM's content-based audio copy detection system for TRECVID 2009.
Multim. Tools Appl., 2012

Content-based video copy detection using nearest-neighbor mapping.
Proceedings of the 11th International Conference on Information Science, 2012

2011
CRIM AT TRECVID-2011: Content-Based Copy Detection using Nearest-Neighbor Mapping.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

2010
Content-based advertisement detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Subword-based spoken term detection in audio course lectures.
Proceedings of the IEEE International Conference on Acoustics, 2010

Content-based audio copy detection using nearest-neighbor mapping.
Proceedings of the IEEE International Conference on Acoustics, 2010

A computer-vision-assisted system for Videodescription scripting.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

2009
CRIM´s Content-Based Copy Detection System for TRECVID.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

2008
A Study of Interspeaker Variability in Speaker Verification.
IEEE Trans. Speech Audio Process., 2008

The role of speaker factors in the NIST extended data task.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture models.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker diarization of French broadcast news.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Combining Gaussianized/Non-Gaussianized Features to Improve Speaker Diarization of Telephone Conversations.
IEEE Signal Process. Lett., 2007

Multiple feature combination to improve speaker diarization of telephone conversations.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Feature normalization using smoothed mixture transformations.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2000
Automation of locality recognition in ADAS plus.
Speech Commun., 2000

1999
Application of simultaneous decoding algorithms to automatic transcription of known and unknown words.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1996
Compensated mel frequency cepstrum coefficients.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1993
A*-admissible heuristics for rapid lexical access.
IEEE Trans. Speech Audio Process., 1993

1992
Flexible vocabulary recognition of speech.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

Hybrid segmental-LVQ/HMM for large vocabulary speech recognition.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition.
IEEE Trans. Signal Process., 1991

Energy, duration and Markov models.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Using phoneme duration and energy contour information to improve large vocabulary isolated-word recognition.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
An 86, 000-Word Recognizer Based on Phonemic Models.
Proceedings of the Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, 1990

Acoustic recognition component of an 86000-word speech recognizer.
Proceedings of the 1990 International Conference on Acoustics, 1990

1989
A locus model of coarticulation in an HMM speech recognizer.
Proceedings of the IEEE International Conference on Acoustics, 1989

1988
Three probabilistic language models for a large-vocabulary speech recognizer.
Proceedings of the IEEE International Conference on Acoustics, 1988

Modeling acoustic-phonetic detail in an HMM-based large vocabulary speech recognizer.
Proceedings of the IEEE International Conference on Acoustics, 1988

1987
Integration of acoustic information in a large vocabulary word recognizer.
Proceedings of the IEEE International Conference on Acoustics, 1987

1984
Decision rules for speaker-independent isolated word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1984

1978
Speaker-independent vowel indetification in continuous speech.
Proceedings of the IEEE International Conference on Acoustics, 1978


  Loading...