Brian Kan-Wing Mak

ORCID: 0000-0001-6787-5555

According to our database¹, Brian Kan-Wing Mak authored at least 124 papers between 1991 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2025

Detecting Neurocognitive Disorders through Analyses of Topic Evolution and Cross-modal Consistency in Visual-Stimulated Narratives.

,

,

,

,

,

,

,

,

,

,

Timothy C. Y. Kwok

,

Vincent C. T. Mok

,

,

,

,

Patrick C. M. Wong

,

CoRR, January, 2025

2024

Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal.

ACM Trans. Multim. Comput. Commun. Appl., June, 2024

Towards Online Sign Language Recognition and Translation.

CoRR, 2024

Towards Online Continuous Sign Language Recognition and Translation.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars.

Proceedings of the Computer Vision - ECCV 2024, 2024

A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Bayesian Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification.

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

wav2vec 2.0 ASR for Cantonese-Speaking Older Adults in a Clinical Setting.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

On the Audio-visual Synchronization for Lip-to-Speech Synthesis.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Natural Language-Assisted Sign Language Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Hardware-Control: Instrument control and automation package.

Grant Giesbrecht

,

,

,

,

,

,

J. Open Source Softw., 2022

Two-Stream Network for Sign Language Recognition and Translation.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Local Context-aware Self-attention for Continuous Sign Language Recognition.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Synthesizing Near Native-accented Speech for a Non-native Speaker by Imitating the Pronunciation and Prosody of a Native Speaker.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

C²SLR: Consistency-enhanced Continuous Sign Language Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Access on Demand: Real-time, Multi-modal Accessibility for the Deaf and Hard-of-Hearing based on Augmented Reality.

Proceedings of the 24th International ACM SIGACCESS Conference on Computers and Accessibility, 2022

2021

Non-Parallel Many-To-Many Voice Conversion by Knowledge Transfer from a Text-To-Speech Model.

Proceedings of the IEEE International Conference on Acoustics, 2021

A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer's Disease Detection.

Proceedings of the IEEE International Conference on Acoustics, 2021

On-The-Fly Data Augmentation for Text-to-Speech Style Transfer.

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Transformer based Multilingual document Embedding model.

CoRR, 2020

Orthogonality Regularizations for End-to-End Speaker Verification.

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Multi-Lingual Multi-Speaker Text-to-Speech Synthesis for Voice Cloning with Online Speaker Enrollment.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Orthogonal Training for Text-Independent Speaker Verification.

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Stochastic Fine-Grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

Mixup Learning Strategies for Text-Independent Speaker Verification.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Recurrent Poisson Process Unit for Speech Recognition.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Denoised Senone I-Vectors for Robust Speaker Verification.

,

,

Brian Kan-Wing Mak

,

IEEE ACM Trans. Audio Speech Lang. Process., 2018

DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification.

,

,

Brian Kan-Wing Mak

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Fast derivation of neural network based document vectors with distance constraint and negative sampling.

CoRR, 2018

Domain Adaptation of End-to-end Speech Recognition in Low-Resource Settings.

Lahiru Samarakoon

,

,

Albert Y. S. Lam

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Subspace Based Sequence Discriminative Training of LSTM Acoustic Models with Feed-Forward Layers.

Lahiru Samarakoon

,

,

Albert Y. S. Lam

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

WaveNet MH-SRU: Deep and Wide Multiple-history Simple Recurrent Unit for Speech Recognition.

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Multi-Head Attention for End-to-End Neural Machine Translation.

Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Fast Derivation of Cross-lingual Document Vectors from Self-attentive Neural Machine Translation Model.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Learning Effective Factorized Hidden Layer Bases Using Student-Teacher Training for LSTM Acoustic Model Adaptation.

Lahiru Samarakoon

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

End-To-End Low-Resource Lip-Reading with Maxout CNN and LSTM.

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Learning Factorized Transforms for Unsupervised Adaptation of LSTM-RNN Acoustic Models.

Lahiru Samarakoon

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

To Improve the Robustness of LSTM-RNN Acoustic Models Using Higher-Order Feedback from Multiple Histories.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Speeding up softmax computations in DNN-based large vocabulary speech recognition by senone weight vector selection.

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An investigation into learning effective speaker subspaces for robust unsupervised DNN adaptation.

Lahiru Samarakoon

,

,

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Derivation of Document Vectors from Adaptation of LSTM Language Model.

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Unsupervised adaptation of student DNNs learned from teacher RNNs for improved ASR performance.

Lahiru Samarakoon

,

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

An investigation of adaptation techniques for building acoustic models for hearing-impaired children in a CAPT application.

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Senone I-vectors for robust speaker verification.

,

,

,

Brian Kan-Wing Mak

Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

2015

Multitask Learning of Deep Neural Networks for Low-Resource Speech Recognition.

,

Brian Kan-Wing Mak

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Distinct triphone acoustic modeling using deep neural networks.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The harp of light: a musical string projection mapping.

Proceedings of the 12th International Conference on Advances in Computer Entertainment Technology, 2015

2014

Eigentrigraphemes for under-resourced languages.

,

Brian Kan-Wing Mak

Speech Commun., 2014

Modeling inter-cluster and intra-cluster discrimination among triphones.

,

Brian Kan-Wing Mak

,

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Joint sequence training of phone and grapheme acoustic model based on multi-task learning deep neural networks.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Subspace Gaussian mixture model with state-dependent subspace dimensions.

,

,

Cheung-Chi Leung

Proceedings of the IEEE International Conference on Acoustics, 2014

Joint acoustic modeling of triphones and trigraphemes by multi-task learning deep neural networks for low-resource speech recognition.

,

,

Cheung-Chi Leung

,

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Eigentriphones for Context-Dependent Acoustic Modeling.

IEEE Trans. Speech Audio Process., 2013

Distinct triphone modeling by reference model weighting.

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Speaker-ensemble hidden Markov modeling for automatic speech recognition.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Welcome message from the technical program chairs.

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Transition probabilities are more important than we once thought.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Derivation of eigentriphones by weighted principal component analysis.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Subspace high-density discrete hidden Markov model for automatic speech recognition.

Proceedings of the 20th European Signal Processing Conference, 2012

2011

A Fully Automated Derivation of State-Based Eigentriphones for Triphone Modeling with No Tied States Using Regularization.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Eigentriphones: A basis for context-dependent acoustic modeling.

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Subvector-quantized high-density discrete hidden Markov model and its re-estimation.

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Problems of modeling phone deletion in conversational speech for speech recognition.

Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

The use of subvector quantization and discrete densities for fast GMM computation for speaker verification.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Improving speech recognition by explicit modeling of phone deletions.

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Maximum Penalized Likelihood Kernel Regression for Fast Adaptation.

Brian Kan-Wing Mak

,

,

,

James Tin-Yau Kwok

IEEE Trans. Speech Audio Process., 2009

Fast GMM computation for speaker verification using scalar quantization and discrete densities.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Automatic estimation of decoding parameters using large-margin iterative linear programming.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Min-max discriminative training of decoding parameters using iterative linear programming.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions.

Chien-Lin Huang

,

,

,

,

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Discriminative training by iterative linear programming optimization.

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Kernel Eigenspace-Based MLLR Adaptation.

Brian Kan-Wing Mak

,

Roger Wend-Huu Hsiao

IEEE Trans. Speech Audio Process., 2007

Boosting with anti-models for automatic language identification.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A model-based estimation of phonotactic language verification performance.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Robustness of several kernel-based fast adaptation methods on noisy LVCSR.

Brian Kan-Wing Mak

,

Roger Wend-Huu Hsiao

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting.

Brian Kan-Wing Mak

,

Roger Wend-Huu Hsiao

,

Simon Ka-Lung Ho

,

IEEE Trans. Speech Audio Process., 2006

Minimization of Utterance Verification Error Rate as a Constrained Optimization Problem.

IEEE Signal Process. Lett., 2006

Joint Optimization of the Frequency-Domain and Time-Domain Transformations in Deriving Generalized Static and Dynamic MFCCs.

IEEE Signal Process. Lett., 2006

Unsupervised Speaker Adaptation Using Reference Speaker Weighting.

Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Automatic Audio Indexing and Audio Playback Speed Control as Tools for Language Learning.

Proceedings of the Advances in Web Based Learning, 2006

Fast Speaker Adaption Via Maximum Penalized Likelihood Kernel Regression.

,

,

,

,

Jeffrey Junfeng Pan

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Improving Reference Speaker Weighting Adaptation by the Use of Maximum-Likelihood Reference Speakers.

,

,

Roger Wend-Huu Hsiao

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

A Comparison of Various Adaptation Methods for Speaker Verification With Limited Enrollment Data.

,

Roger Wend-Huu Hsiao

,

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Kernel Eigenvoice Speaker Adaptation.

,

James Tin-Yau Kwok

,

Simon Ka-Lung Ho

IEEE Trans. Speech Audio Process., 2005

Pruning Hidden Markov Models With Optimal Brain Surgeon.

Brian Kan-Wing Mak

,

IEEE Trans. Speech Audio Process., 2005

High-density discrete HMM with the use of scalar quantization indexing.

,

Jeff Siu-Kei Au-Yeung

,

,

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition.

Roger Wend-Huu Hsiao

,

Brian Kan-Wing Mak

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Various Reference Speakers Determination Methods for Embedded Kernel Eigenvoice Speaker Adaptation.

,

Simon Ka-Lung Ho

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Kernel Eigenspace-based MLLR Adaptation Using Multiple Regression Classes.

Roger Wend-Huu Hsiao

,

Brian Kan-Wing Mak

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Discriminative auditory-based features for robust speech recognition.

Brian Kan-Wing Mak

,

,

IEEE Trans. Speech Audio Process., 2004

An Acoustic-Phonetic and a Model-Theoretic Analysis of Subspace Distribution Clustering Hidden Markov Models.

Int. J. Speech Technol., 2004

Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA.

,

Simon Ka-Lung Ho

,

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Improving eigenspace-based MLLR adaptation by kernel PCA.

Brian Kan-Wing Mak

,

Roger Wend-Huu Hsiao

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A study of various composite kernels for kernel eigenvoice speaker adaptation.

,

,

Simon Ka-Lung Ho

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Discriminative feature transformation by guided discriminative training.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Eigenvoice Speaker Adaptation via Composite Kernel PCA.

,

,

Simon Ka-Lung Ho

Proceedings of the Advances in Neural Information Processing Systems 16, 2003

Pruning transitions in a hidden Markov model with optimal brain surgeon.

Brian Kan-Wing Mak

,

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Joint estimation of thresholds in a bi-threshold verification problem.

Simon Ka-Lung Ho

,

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Discriminative training of auditory filters of different shapes for robust speech recognition.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

A mathematical relationship between full-band and multiband mel-frequency cepstral coefficients.

IEEE Signal Process. Lett., 2002

Knowledge-based sense pruning using the HowNet: an alternative to word sense disambiguation.

Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Performance of discriminatively trained auditory features on Aurora2 and Aurora3.

Brian Kan-Wing Mak

,

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

An alternative approach of finding competing hypotheses for better minimum classification error training.

Proceedings of the IEEE International Conference on Acoustics, 2002

Discriminative auditory features for robust speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Direct training of subspace distribution clustering hidden Markov model.

Brian Kan-Wing Mak

,

Enrico Bocchieri

IEEE Trans. Speech Audio Process., 2001

Subspace distribution clustering hidden Markov model.

Enrico Bocchieri

,

Brian Kan-Wing Mak

IEEE Trans. Speech Audio Process., 2001

Rapid speaker adaptation using MLLR and subspace regression classes.

,

Brian Kan-Wing Mak

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Development of an asynchronous multi-band system for continuous speech recognition.

,

Brian Kan-Wing Mak

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Optimization of sub-band weights using simulated noisy speech in multi-band speech recognition.

,

Brian Kan-Wing Mak

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Asynchrony with trained transition probabilities improves performance in multi-band speech recognition.

Brian Kan-Wing Mak

,

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Pruning of state-tying tree using Bayesian information criterion with multiple mixtures.

,

,

Brian Kan-Wing Mak

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

MAP adaptation with subspace regression classes and tying.

Proceedings of the IEEE International Conference on Acoustics, 2000

1998

Training of context-dependent subspace distribution clustering hidden Markov model.

,

Enrico Bocchieri

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Training of subspace distribution clustering hidden Markov model.

,

Enrico Bocchieri

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

Subspace distribution clustering for continuous observation density hidden Markov models.

Enrico Bocchieri

,

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Combining ANNs to improve phone recognition.

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Phone clustering using the Bhattacharyya distance.

,

Etienne Barnard

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

The contribution of consonants versus vowels to word recognition in fluent speech.

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Tone recognition of isolated Cantonese syllables.

IEEE Trans. Speech Audio Process., 1995

1994

A robust algorithm for word boundary detection in the presence of noise.

Jean-Claude Junqua

,

,

IEEE Trans. Speech Audio Process., 1994

1992

A robust speech/non-speech detection algorithm using time and frequency-based features.

,

Jean-Claude Junqua

,

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

A study of endpoint detection algorithms in adverse conditions: incidence on a DTW and HMM recognizer.

Jean-Claude Junqua

,

,

Proceedings of the Second European Conference on Speech Communication and Technology, 1991
