Keiichi Tokuda
Orcid: 0000-0001-6143-0133
According to our database1,
Keiichi Tokuda
authored at least 273 papers
between 1990 and 2024.
Collaborative distances:
Collaborative distances:
Awards
IEEE Fellow
IEEE Fellow 2014, "For contributions to hidden Markov model-based speech synthesis".
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Proceedings of the IEEE International Conference on Acoustics, 2024
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Singing voice synthesis based on frame-level sequence-to-sequence models considering vocal timing deviation.
CoRR, 2023
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Autoregressive Variational Autoencoder with a Hidden Semi-Markov Model-Based Structured Attention for Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components.
IEEE Access, 2021
Periodnet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F<sub>0</sub> Model for Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Fast and High-Quality Singing Voice Synthesis System Based on Convolutional Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Semi-Supervised Learning Based on Hierarchical Generative Models for End-to-End Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Low computational cost speech synthesis based on deep neural networks using hidden semi-Markov model structures.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Deep neural network based real-time speech vocoder with periodic and aperiodic inputs.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Image Recognition Based on Separable Lattice Hmms Using a Deep Neural Network for Output Probability Distributions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the Blizzard Challenge 2018, Hyderabad, India, September 8, 2018, 2018
Discriminative Feature Extraction Based on Sequential Variational Autoencoder for Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Speaker Adaptation for Speech Synthesis Based on Deep Neural Networks Using Hidden Semi-Markov Model Structures.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Image Recognition Based on Convolutional Neural Networks Using Features Generated from Separable Lattice Hidden Markov Models.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Articulatory Text-to-Speech Synthesis Using the Digital Waveguide Mesh Driven by a Deep Neural Network.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Image recognition based on discriminative models using features generated from separable lattice HMMS.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the Blizzard Challenge 2017, Stockholm, Sweden, August 25, 2017, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the Human-Harmonized Information Technology, Volume 2, 2017
2016
A Bayesian Approach to Image Recognition Based on Separable Lattice Hidden Markov Models.
IEICE Trans. Inf. Syst., 2016
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Redefining the Linguistic Context Feature Set for HMM and DNN TTS Through Position and Parsing.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Directly modeling voiced and unvoiced components in speech waveforms by neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Trajectory training considering global variance for speech synthesis based on neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Blizzard Challenge 2016, Cuppertino, CA, USA, September 16, 2016, 2016
2015
Using speaker adaptive training to realize Mandarin-Tibetan cross-lingual speech synthesis.
Multim. Tools Appl., 2015
Simultaneous optimization of multiple tree structures for factor analyzed HMM-based speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Directly modeling speech waveforms by neural networks for statistical parametric speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the Blizzard Challenge 2015, 2015
2014
IEEE J. Sel. Top. Signal Process., 2014
IEEE J. Sel. Top. Signal Process., 2014
IEICE Trans. Inf. Syst., 2014
Integration of Spectral Feature Extraction and Modeling for HMM-Based Speech Synthesis.
IEICE Trans. Inf. Syst., 2014
A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the second international conference on Human-agent interaction, 2014
Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014
Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014
2013
IEICE Trans. Inf. Syst., 2013
Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis.
Comput. Speech Lang., 2013
Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Integration of acoustic modeling and mel-cepstral analysis for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013
Separable lattice 2-D HMMS introducing state duration control for recognition of images with various variations.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the Blizzard Challenge 2013, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Image recognition based on hidden Markov eigen-image models using variational Bayesian method.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
IEEE Trans. Speech Audio Process., 2012
Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping.
Speech Commun., 2012
Speech Commun., 2012
IEICE Trans. Inf. Syst., 2012
Cross-lingual Speaker Adaptation for HMM-based Speech Synthesis based on Perceptual Characteristics and Speaker Interpolation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
A Bayesian Approach to Speaker Recognition Based on GMMs Using Multiple Model Structures.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Face recognition based on separable lattice 2-D HMMS using variational bayesian method.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012
2011
IEEE Trans. Speech Audio Process., 2011
IEICE Trans. Inf. Syst., 2011
Estimation of Perceptual Spaces for Speaker Identities Based on the Cross-Lingual Discrimination Task.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
A Bayesian Approach to Voice Conversion Based on GMMs Using Multiple Model Structures.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Large-Scale Subjective Evaluations of Speech Rate Control Methods for HMM-Based Speech Synthesizers.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
An optimization algorithm of independent mean and variance parameter tying structures for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011
Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2011
An analysis of machine translation and speech synthesis in speech-to-speech translation system.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011
2010
Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora.
IEEE Trans. Speech Audio Process., 2010
IEICE Trans. Inf. Syst., 2010
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
A Deterministic Annealing-Based Training Algorithm For Statistical Machine Translation Models.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010
2009
IEEE Trans. Speech Audio Process., 2009
A Reordering Model Using a Source-Side Parse-Tree for Statistical Machine Translation.
IEICE Trans. Inf. Syst., 2009
Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation.
Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
An improved minimum generation error based model adaptation for HMM-based speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
A decision tree-based clustering approach to state definition in an excitation modeling framework for HMM-based speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Minimum generation error training by using original spectrum as reference for log spectral distortion measure.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009
The NICT Entry for the Blizzard Challenge 2009: an Enhanced HMM-based Speech Synthesis System with Trajectory Training considering Global Variance and State-Dependent Mixed Excitation.
Proceedings of the Blizzard Challenge 2009, Edinburgh, Scotland, UK, September 4, 2009, 2009
2008
Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model.
Speech Commun., 2008
IEICE Trans. Inf. Syst., 2008
IEICE Trans. Inf. Syst., 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Probabilistic answer selection based on conditional random fields for spoken dialog system.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Minimum generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Performance evaluation of the speaker-independent HMM-based speech synthesis system "HTS 2007" for the Blizzard Challenge 2007.
Proceedings of the IEEE International Conference on Acoustics, 2008
Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM.
Proceedings of the IEEE International Conference on Acoustics, 2008
Acoustic modeling with contextual additive structure for HMM-based speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008
On the state definition for a trainable excitation model in HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2008
The HTS-2008 System: Yet Another Evaluation of the Speaker-Adaptive HMM-based Speech Synthesis System in The 2008 Blizzard Challenge.
Proceedings of the Blizzard Challenge 2008, 2008
Proceedings of the Blizzard Challenge 2008, 2008
2007
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory.
IEEE Trans. Speech Audio Process., 2007
Vector quantization of mel-cepstral coefficients using distortion measure for spectral analysis.
Syst. Comput. Jpn., 2007
Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005.
IEICE Trans. Inf. Syst., 2007
IEICE Trans. Inf. Syst., 2007
IEICE Trans. Inf. Syst., 2007
A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis.
IEICE Trans. Inf. Syst., 2007
Reformulating the HMM as a trajectory model by imposing explicit relationships between static and dynamic feature vector sequences.
Comput. Speech Lang., 2007
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Improved average-voice-based speech synthesis using gender-mixed modeling and a parameter generation algorithm considering GV.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007.
Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007
Proceedings of the Evaluation of text-to-speech systems: Blizzard Challenge 2007, 2007
2006
Syst. Comput. Jpn., 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
On the Use of Phonetic Information for Mapping from Articulatory Movements to Vocal Tract Spectrum.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Developing a Test Bed of English Text-to-Speech System XIMERA for the Blizzard Challenge 2006.
Proceedings of the Blizzard Challenge 2006, Pittsburgh, PA, USA, September 16, 2006, 2006
2005
Simultaneous clustering of phonetic context, dimension, and state position for acoustic modeling using decision trees.
Syst. Comput. Jpn., 2005
Incorporating a mixed excitation model and postfilter into HMM-based text-to-speech synthesis.
Syst. Comput. Jpn., 2005
IEICE Trans. Inf. Syst., 2005
IEICE Trans. Inf. Syst., 2005
IEICE Trans. Inf. Syst., 2005
Deterministic Annealing EM Algorithm in Acoustic Modeling for Speaker and Speech Recognition.
IEICE Trans. Inf. Syst., 2005
The blizzard challenge - 2005: evaluating corpus-based speech synthesis on common datasets.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Spectral Conversion Based on Maximum Likelihood Estimation Considering Global Variance of Converted Parameter.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents.
Proceedings of the Life-like characters - tools, affective functions, and applications., 2004
Syst. Comput. Jpn., 2004
IEICE Trans. Inf. Syst., 2004
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004
Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis.
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
A Viterbi algorithm for a trajectory model derived from HMM with explicit relationship between static and dynamic features.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Parameter sharing and minimum classification error training of mixtures of factor analyzers for speaker identification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2003
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2003
Decision tree-based simultaneous clustering of phonetic contexts, dimensions, and state positions for acoustic modeling.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Towards the development of a brazilian portuguese text-to-speech system based on HMM.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
A training method for average voice model based on shared decision tree context clustering and speaker adaptive training.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Syst. Comput. Jpn., 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
A context clustering technique for average voice model in HMM-based speech synthesis.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems 2002, 2002
2001
A new approach to designing a feature extractor in speaker identification based on discriminative feature extraction.
Speech Commun., 2001
Fast convergence transversal adaptive filtering algorithm for impulsive environment based on T distribution assumption.
Proceedings of the 2001 International Symposium on Circuits and Systems, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
A robust speaker verification system against imposture using an HMM-based speech synthesis system.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distribution.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
Speaker identification using Gaussian mixture models based on multi-space probability distribution.
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Signal Process., 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Audio-visual speech recognition using MCE-based hmms and model-dependent stream weights.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Imposture using synthetic speech against speaker verification based on spectrum and pitch.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the 2000 International Conference on Image Processing, 2000
Proceedings of the IEEE International Conference on Acoustics, 2000
Proceedings of the 10th European Signal Processing Conference, 2000
1999
A New Robust Two Dimensional Spectral Estimation Based on an AR Model Excited by a t Distribution Process and its QR-Decomposition Recursive Algorithm.
J. Circuits Syst. Comput., 1999
LMS-like two dimensional adaptive filter with t-distribution assumption and nonsymmetrical half-plane support.
Proceedings of the IEEE-EURASIP Workshop on Nonlinear Signal and Image Processing (NSIP'99), 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
On the security of HMM-based speaker verification systems against imposture using synthetic speech.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Proceedings of the 1999 International Conference on Image Processing, 1999
Proceedings of the 1999 International Conference on Image Processing, 1999
Hidden Markov models based on multi-space probability distribution for pitch pattern modeling.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
Robust recursive time series modeling based on an AR model excited by a t-distribution process.
IEEE Trans. Signal Process., 1998
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
A 16 kbit/s wideband CELP coder using MEL-generalized cepstral analysis and its subjective evaluation.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
A wideband CELP speech coder at 16 kbit/s based on mel-generalized cepstral analysis.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
Visual Speech Synthesis Based on Parameter Generation From HMM: Speech-Driven and Text-And-Speech-Driven Approaches.
Proceedings of the Auditory-Visual Speech Processing, 1998
1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
HMM compensation for noisy speech recognition based on cepstral parameter generation.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997
1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Robust two dimensional spectral estimation based on AR model excited by a t-distribution process.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
1994
Proceedings of the 1994 IEEE International Symposium on Circuits and Systems, ISCAS 1994, London, England, UK, May 30, 1994
Mel-generalized cepstral analysis - a unified approach to speech spectral estimation.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
Robust recursive spectral estimation based on AR model excited by a t-distribution process.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
1992
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
Design of stable two-dimensional IIR digital filters with arbitrary magnitude function.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992
1990
Generalized cepstral analysis of speech - unified approach to LPC and cepstral method.
Proceedings of the First International Conference on Spoken Language Processing, 1990
Adaptive filtering based on cepstral representation-adaptive cepstral analysis of speech.
Proceedings of the 1990 International Conference on Acoustics, 1990