Yoshihiko Nankaku

Akinobu Lee

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The effect of neural networks in statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Contextual Additive Structure for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Shinji Takaki

IEEE J. Sel. Top. Signal Process., 2014

Image Recognition Based on Separable Lattice Trajectory 2-D HMMs.

[BibT_eX]

[DOI]

Akira Tamamori

IEICE Trans. Inf. Syst., 2014

Integration of Spectral Feature Extraction and Modeling for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2014

A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

HMM-Based singing voice synthesis and its application to Japanese and English.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Speech Synthesis Based on Hidden Markov Models.

[BibT_eX]

[DOI]

Proc. IEEE, 2013

A Bayesian Framework Using Multiple Model Structures for Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2013

Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Contextual partial additive structure for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Shinji Takaki

Proceedings of the IEEE International Conference on Acoustics, 2013

Integration of acoustic modeling and mel-cepstral analysis for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Separable lattice 2-D HMMS introducing state duration control for recognition of images with various variations.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Image recognition based on hidden Markov eigen-image models using variational Bayesian method.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Product of Experts for Statistical Parametric Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2012

An Extension of Separable Lattice 2-D HMMs for Rotational Data Variations.

[BibT_eX]

[DOI]

Akira Tamamori

Viviane de Franca Oliveira

IEICE Trans. Inf. Syst., 2012

Cross-lingual Speaker Adaptation for HMM-based Speech Synthesis based on Perceptual Characteristics and Speaker Interpolation.

[BibT_eX]

[DOI]

Sayaka Shiota

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A Bayesian Approach to Speaker Recognition Based on GMMs Using Multiple Model Structures.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A model structure integration based on a Bayesian framework for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Face recognition based on separable lattice 2-D HMMS using variational bayesian method.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Pitch adaptive training for hmm-based singing voice synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Face recognition based on extended separable lattice 2-D HMMS.

[BibT_eX]

[DOI]

Keisuke Kumaki

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Continuous Stochastic Feature Mapping Based on Trajectory HMMs.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

Bayesian Context Clustering Using Cross Validation for Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2011

GMM-Based Missing-Feature Reconstruction on Multi-Frame Windows.

[BibT_eX]

[DOI]

Ulpu Remes

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A Bayesian Approach to Voice Conversion Based on GMMs Using Multiple Model Structures.

[BibT_eX]

[DOI]

Lei Li

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Evaluation of Tree-Trellis Based Decoding in Over-Million LVCSR.

[BibT_eX]

[DOI]

Naoaki Ito

Akinobu Lee

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An optimization algorithm of independent mean and variance parameter tying structures for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

Global variance modeling on frequency domain delta LSP for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

A Covariance-Tying Technique for HMM-Based Speech Synthesis.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Spectral modeling with contextual additive structure for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Shinji Takaki

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Recent development of the HMM-based singing voice synthesis system - Sinsy.

[BibT_eX]

[DOI]

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Bayesian speech synthesis framework integrating training and synthesis processes.

[BibT_eX]

[DOI]

Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

Voice activity detection based on conditional random fields using multiple features.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

HMM-based singing voice synthesis system using pitch-shifted pseudo training data.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Speaker adaptation based on nonlinear spectral transform for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Statistical parametric speech synthesis based on product of experts.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Face recognition based on separable lattice 2-D HMM with state duration modeling.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

Factor analyzed voice models for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Kyosuke Kazumi

Proceedings of the IEEE International Conference on Acoustics, 2010

A Deterministic Annealing-Based Training Algorithm For Statistical Machine Translation Models.

[BibT_eX]

[DOI]

Pascual Martínez-Gómez

Germán Sanchis-Trilles

Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

2009

State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis.

[BibT_eX]

[DOI]

Yi-Jian Wu

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Deterministic annealing based training algorithm for Bayesian speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A Bayesian approach to Hidden Semi-Markov Model based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Stereo-based stochastic noise compensation based on trajectory GMMS.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

Voice conversion based on simultaneous modelling of spectrum and F0.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

A Bayesian approach to HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2008

Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTs Conversion Systems.

[BibT_eX]

[DOI]

Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Probabilistic feature mapping based on trajectory HMMs.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Simultaneous conversion of duration and spectrum based on statistical models including time-sequence matching.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Probabilistic answer selection based on conditional random fields for spoken dialog system.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Acoustic modeling based on model structure annealing for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker recognition based on variational Bayesian method.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Acoustic modeling with contextual additive structure for HMM-based speech recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Spectral conversion based on statistical models including time-sequence matching.

[BibT_eX]

[DOI]

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

An excitation model for HMM-based speech synthesis based on residual modeling.

[BibT_eX]

[DOI]

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Model-space MLLR for trajectory HMMs.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A trainable excitation model for HMM-based speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Face Recognition using Hidden Markov Eigenface Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Speaker adaptation of trajectory HMMs using feature-space MLLR.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Voice conversion based on mixtures of factor analyzers.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

An HMM-based singing voice synthesis system.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Reducing computation on parallel decoding using frame-wise confidence scores.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Estimating Trajectory Hmm Parameters Using Monte Carlo Em With Gibbs Sampler.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

On the Use of Phonetic Information for Mapping from Articulatory Movements to Vocal Tract Spectrum.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Face Recognition Based on Separable Lattice HMMS.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

Parameter Sharing in Mixture of Factor Analyzers for Speaker Identification.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2005

Continuous Speech Recognition Based on General Factor Dependent Acoustic Models.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2005

Applying Sparse KPCA for Feature Extraction in Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2005

Deterministic Annealing EM Algorithm in Acoustic Modeling for Speaker and Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2005

Sparse KPCA for Feature Extraction in Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

On the Use of Kernel PCA for Feature Extraction in Speech Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2004

Deterministic annealing EM algorithm in parameter estimation for acoustic model.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Parameter sharing and minimum classification error training of mixtures of factor analyzers for speaker identification.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Speech recognition using voice-characteristic-dependent acoustic models.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2000

Normalized Training for HMM-Based Visual Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2000 International Conference on Image Processing, 2000

1999

Intensity- and location-normalized training for HMM-based visual speech recognition.

[BibT_eX]

[DOI]