Hideyuki Mizuno

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Text and Time Series, 2019

2018

DNN-Based Speech Synthesis Using Speaker Codes.

Nobukatsu Hojo

IEICE Trans. Inf. Syst., 2018

2016

Objective Evaluation Using Association Between Dimensions Within Spectral Features for Statistical Parametric Speech Synthesis.

Taichi Asami

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

An Investigation of DNN-Based Speech Synthesis Using Speaker Codes.

Nobukatsu Hojo

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Statistical model training technique based on speaker clustering approach for HMM-based speech synthesis.

Speech Commun., 2015

Similar Speaker Selection Technique Based on Distance Metric Learning Using Highly Correlated Acoustic Features with Perceptual Voice Quality Similarity.

IEICE Trans. Inf. Syst., 2015

Sub-band text-to-speech combining sample-based spectrum with statistically generated spectrum.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis.

Speech Commun., 2014

Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis.

Hideharu Nakajima

Sumitaka Sakauchi

Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014

2013

Statistical model training technique for speech synthesis based on speaker class.

Noboru Miyazaki

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Which resemblance is useful to predict phrase boundary rise labels for Japanese expressive text-to-speech synthesis, numerically-expressed stylistic or distribution-based semantic?

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

HMM-based expressive speech synthesis based on phrase-level F0 context labeling.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2013

2012

Similar Speaker Selection Technique Based on Distance Metric Learning with Perceptual Voice Quality Similarity.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

HMM-Based Emphatic Speech Synthesis Using Unsupervised Context Labeling.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Correlation Analysis of Acoustic Features with Perceptual Voice Quality Similarity for Similar Speaker Selection.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

Speech database reduction method for corpus-based TTS system.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Unit selection using k-nearest neighbor search for concatenative speech synthesis.

Satoshi Takahashi

Proceedings of the 3rd International Universal Communication Symposium, 2009

2008

Segment selection method based on tonal validity evaluation using machine learning for concatenative speech synthesis.

Akihiro Yoshida

Kazunori Mano

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008

2005

Recording Script Design for Corpus-Based TTS System Based on Coverage of Various Phonetic Elements.

Kazunori Mano

Proceedings of the 2005 IEEE International Conference on Acoustics, Speech and Signal Processing, 2005

2004

Long vowel detection for letter-to-sound conversion for Japanese sourced words transliterated into the alphabet.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2001

A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction.

IEEE Trans. Speech Audio Process., 2001

A bilingual speech design tool: Sesign2001.

Proceedings of the 4th ITRW on Speech Synthesis, 2001

2000

A new Japanese TTS system based on speech-prosody database and speech modification.

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999

WebMessenger: a new framework to produce multimedia content by combining synthesized speech and moving pictures in the WWW environment.

Tsubasa Shinozaki

Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999

A Japanese text-to-speech system based on multi-form units with consideration of frequency distribution in Japanese.

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Development of speech design tool "SESIGN99" to enhance synthesized speech.

Shin'ya Nakajima

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A new F0 contour control method based on vector representation of F0 contour.

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1997

A new framework to provide high-controllability speech signal and the development of a workbench for it.

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1995

Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt.

Speech Commun., 1995

1994

A strategy for changing speaking styles in text-to-speech systems.

Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Speaking style conversion by changing prosodic parameters and formant frequencies.

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt.

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, Speech and Signal Processing, 1994

1993

Waveform-based speech synthesis approach with a formant frequency modification.

Tomohisa Hirokawa

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 1993

1990

Speech synthesis by optimum concatenation of phoneme segments.

Tetsuya Nomura