Hideyuki Mizuno

Orcid: 0000-0002-2591-7152

According to our database1, Hideyuki Mizuno authored at least 39 papers between 1990 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
F0 Controllable General-Purpose Vocoder Enhanced Using Speaker Vectors.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

2022
Quality Improvement of Children's Speech by Speaker Vector in General Purpose Vocoder.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

Introducing Speaker Vectors for Child Speech Synthesis in Neural Vocoders.
Proceedings of the Complex, Intelligent and Software Intensive Systems - Proceedings of the 16th International Conference on Complex, Intelligent and Software Intensive Systems (CISIS-2022), June 29, 2022

2019
Robust Gait Authentication Using Autoencoder and Decision Tree.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2019: Text and Time Series, 2019

2018
DNN-Based Speech Synthesis Using Speaker Codes.
IEICE Trans. Inf. Syst., 2018

2016
Objective Evaluation Using Association Between Dimensions Within Spectral Features for Statistical Parametric Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

An Investigation of DNN-Based Speech Synthesis Using Speaker Codes.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Statistical model training technique based on speaker clustering approach for HMM-based speech synthesis.
Speech Commun., 2015

Similar Speaker Selection Technique Based on Distance Metric Learning Using Highly Correlated Acoustic Features with Perceptual Voice Quality Similarity.
IEICE Trans. Inf. Syst., 2015

Sub-band text-to-speech combining sample-based spectrum with statistically generated spectrum.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis.
Speech Commun., 2014

Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis.
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation, 2014

2013
Statistical model training technique for speech synthesis based on speaker class.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Which resemblance is useful to predict phrase boundary rise labels for Japanese expressive text-to-speech synthesis, numerically-expressed stylistic or distribution-based semantic?
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

HMM-based expressive speech synthesis based on phrase-level F0 context labeling.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Similar Speaker Selection Technique Based on Distance Metric Learning with Perceptual Voice Quality Similarity.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
HMM-Based Emphatic Speech Synthesis Using Unsupervised Context Labeling.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Correlation Analysis of Acoustic Features with Perceptual Voice Quality Similarity for Similar Speaker Selection.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Speech database reduction method for corpus-based TTS system.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Unit selection using k-nearest neighbor search for concatenative speech synthesis.
Proceedings of the 3rd International Universal Communication Symposium, 2009

2008
Segment selection method based on tonal validity evaluation using machine learning for concatenative speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2008

2005
Recording Script Design for Corpus-Based TTS System Based on Coverage of Various Phonetic Elements.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Long vowel detection for letter-to-sound conversion for Japanese sourced words transliterated into the alphabet.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2001
A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction.
IEEE Trans. Speech Audio Process., 2001

A bilingual speech design tool: Sesign2001.
Proceedings of the 4th ITRW on Speech Synthesis, 2001

2000
A new Japanese TTS system based on speech-prosody database and speech modification.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
WebMessenger: a new framework to produce multimedia content by combining synthesized speech and moving pictures in the WWW environment.
Proceedings of the Third IEEE Workshop on Multimedia Signal Processing, 1999

A Japanese text-to-speech system based on multi-form units with consideration of frequency distribution in Japanese.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Development of speech design tool "SESIGN99" to enhance synthesized speech.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

A new F0 contour control method based on vector representation of F0 contour.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1997
A new framework to provide high-controllability speech signal and the development of a workbench for it.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1995
Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt.
Speech Commun., 1995

1994
A strategy for changing speaking styles in text-to-speech systems.
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994

Speaking style conversion by changing prosodic parameters and formant frequencies.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Waveform-based speech synthesis approach with a formant frequency modification.
Proceedings of the IEEE International Conference on Acoustics, 1993

1990
Speech synthesis by optimum concatenation of phoneme segments.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

Phoneme segment concatenation and excitation control based on spectral distortion criterion for speech synthesis.
Proceedings of the First International Conference on Spoken Language Processing, 1990

A new Japanese text-to-speech synthesizer based on COC synthesis method.
Proceedings of the First International Conference on Spoken Language Processing, 1990


  Loading...