We stand with Ukraine

We stand with Ukraine

Masami Akamine

According to our database¹, Masami Akamine authored at least 50 papers between 1990 and 2019.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2019

Transfer Learning for Unseen Slots in End-to-End Dialogue State Tracking.

[BibT_eX]

[DOI]

,

,

Hiroshi Fujimura

,

Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

2018

Out-of-Domain Slot Value Detection for Spoken Dialogue Systems with Context Information.

[BibT_eX]

[DOI]

,

,

,

Hiroshi Fujimura

,

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Dialog State Tracking for Unseen Values Using an Extended Attention Mechanism.

[BibT_eX]

[DOI]

,

,

Hiroshi Fujimura

,

Proceedings of the 9th International Workshop on Spoken Dialogue System Technology, 2018

2016

Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach.

[BibT_eX]

[DOI]

Tudor-Catalin Zorila

,

Yannis Stylianou

,

Tatsuma Ishihara

,

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Voice Activity Detection: Merging Source and Filter-based Information.

[BibT_eX]

[DOI]

,

Yannis Stylianou

,

,

IEEE Signal Process. Lett., 2016

Statistical Bandwidth Extension for Speech Synthesis Based on Gaussian Mixture Model with Sub-Band Basis Spectrum Model.

[BibT_eX]

[DOI]

,

Masatsune Tamura

,

Masahiro Morita

,

IEICE Trans. Inf. Syst., 2016

2015

Emotional transplant in statistical speech synthesis based on emotion additive model.

[BibT_eX]

[DOI]

,

,

Masahiro Morita

,

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A maximum likelihood approach to the detection of moments of maximum excitation and its application to high-quality speech parameterization.

[BibT_eX]

[DOI]

,

Yannis Stylianou

,

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Building HMM-TTS Voices on Diverse Data.

[BibT_eX]

[DOI]

,

,

Kayoko Yanagisawa

,

Norbert Braunschweiler

,

,

Mark J. F. Gales

,

IEEE J. Sel. Top. Signal Process., 2014

Integrated Expression Prediction and Speech Synthesis From Text.

[BibT_eX]

[DOI]

,

Mark J. F. Gales

,

Norbert Braunschweiler

,

,

IEEE J. Sel. Top. Signal Process., 2014

On the impact of excitation and spectral parameters for expressive statistical parametric speech synthesis.

[BibT_eX]

[DOI]

,

Comput. Speech Lang., 2014

GMM-based bandwidth extension using sub-band basis spectrum model.

[BibT_eX]

[DOI]

,

Masatsune Tamura

,

Masahiro Morita

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

Complex cepstrum for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

,

,

Mark J. F. Gales

Speech Commun., 2013

Photo-realistic expressive text to talking head synthesis.

[BibT_eX]

[DOI]

,

Robert Anderson

,

,

Norbert Braunschweiler

,

,

BalaKrishna Kolluru

,

,

,

,

Kayoko Yanagisawa

,

Yannis Stylianou

,

,

Mark J. F. Gales

,

Roberto Cipolla

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Minimum mean squared error based warped complex cepstrum analysis for statistical parametric speech synthesis.

[BibT_eX]

[DOI]

,

Mark J. F. Gales

,

Yannis Stylianou

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Complex cepstrum analysis based on the minimum mean squared error.

[BibT_eX]

[DOI]

,

,

Mark J. F. Gales

Proceedings of the IEEE International Conference on Acoustics, 2013

Training a supra-segmental parametric F0 model without interpolating F0.

[BibT_eX]

[DOI]

,

Mark J. F. Gales

,

,

Proceedings of the IEEE International Conference on Acoustics, 2013

Integrated automatic expression prediction and speech synthesis from text.

[BibT_eX]

[DOI]

,

Mark J. F. Gales

,

Norbert Braunschweiler

,

,

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Decision tree-based acoustic models for speech recognition.

[BibT_eX]

[DOI]

,

Jitendra Ajmera

EURASIP J. Audio Speech Music. Process., 2012

Combining multiple high quality corpora for improving HMM-TTS.

[BibT_eX]

[DOI]

,

,

,

,

Mark J. F. Gales

,

,

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

HMM-based speech synthesis using sub-band basis spectrum model.

[BibT_eX]

[DOI]

,

Masatsune Tamura

,

Masahiro Morita

,

Takehiko Kagoshima

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Histogram-based spectral equalization for HMM-based speech synthesis using mel-LSP.

[BibT_eX]

[DOI]

,

Masatsune Tamura

,

Masahiro Morita

,

Takehiko Kagoshima

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Speech factorization for HMM-TTS based on cluster adaptive training.

[BibT_eX]

[DOI]

,

,

Mark J. F. Gales

,

,

,

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Exploring Rich Expressive Information from Audiobook Data Using Cluster Adaptive Training.

[BibT_eX]

[DOI]

,

Mark J. F. Gales

,

,

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Complex cepstrum as phase information in statistical parametric speech synthesis.

[BibT_eX]

[DOI]

,

,

Mark J. F. Gales

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Decision Tree-Based Acoustic Models for Speech Recognition with Improved Smoothness.

[BibT_eX]

[DOI]

,

Jitendra Ajmera

IEICE Trans. Inf. Syst., 2011

One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model.

[BibT_eX]

[DOI]

Masatsune Tamura

,

Masahiro Morita

,

Takehiko Kagoshima

,

Proceedings of the IEEE International Conference on Acoustics, 2011

Continuous F0 in the source-excitation generation for HMM-based TTS: Do we need voiced/unvoiced classification?

[BibT_eX]

[DOI]

,

Mark J. F. Gales

,

Sabine Buchholz

,

,

Masatsune Tamura

,

,

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Sub-band basis spectrum model for pitch-synchronous log-spectrum and phase based on approximation of sparse coding.

[BibT_eX]

[DOI]

Masatsune Tamura

,

Takehiko Kagoshima

,

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Unit selection speech synthesis using multiple speech units at non-adjacent segments for prosody and waveform generation.

[BibT_eX]

[DOI]

Masatsune Tamura

,

Norbert Braunschweiler

,

Takehiko Kagoshima

,

Proceedings of the IEEE International Conference on Acoustics, 2010

Covariance clustering on Riemannian manifolds for acoustic model compression.

[BibT_eX]

[DOI]

Yusuke Shinohara

,

,

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Feedback loop for prosody prediction in concatenative speech synthesis.

[BibT_eX]

[DOI]

,

,

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Decision tree acoustic models for ASR.

[BibT_eX]

[DOI]

Jitendra Ajmera

,

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Bayesian feature enhancement using a mixture of unscented transformation for uncertainty decoding of noisy speech.

[BibT_eX]

[DOI]

Yusuke Shinohara

,

Proceedings of the IEEE International Conference on Acoustics, 2009

2008

Multilevel parametric-base F0 model for speech synthesis.

[BibT_eX]

[DOI]

,

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Comparative evaluation of different methods for voice activity detection.

[BibT_eX]

[DOI]

,

Koichi Yamamoto

,

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speech recognition using soft decision trees.

[BibT_eX]

[DOI]

Jitendra Ajmera

,

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Feature enhancement by speaker-normalized splice for robust speech recognition.

[BibT_eX]

[DOI]

Yusuke Shinohara

,

,

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

An F<sub>0</sub> contour control model using an F<sub>0</sub> contour codebook.

[BibT_eX]

[DOI]

Takehiko Kagoshima

,

Masahiro Morita

,

,

,

Yoshinori Shiga

Syst. Comput. Jpn., 2007

HMM-based speech recognition using decision trees instead of GMMs.

[BibT_eX]

[DOI]

,

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

1999

Automatic generation of synthesis units by unit selection based on closed-loop training.

[BibT_eX]

[DOI]

Takehiko Kagoshima

,

Syst. Comput. Jpn., 1999

Toshiba English text-to-speech synthesizer (TESS).

[BibT_eX]

[DOI]

,

Takehiko Kagoshima

,

Masahiro Morita

,

,

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

CELP speech coding based on an adaptive pulse position codebook.

[BibT_eX]

[DOI]

,

,

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS system.

[BibT_eX]

[DOI]

,

Masahiro Morita

,

Takehiko Kagoshima

,

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An F0 contour control model for totally speaker driven text to speech system.

[BibT_eX]

[DOI]

Takehiko Kagoshima

,

Masahiro Morita

,

,

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS).

[BibT_eX]

[DOI]

,

Takehiko Kagoshima

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A 2.4 kbps variable bit rate ADP-CELP speech coder.

[BibT_eX]

[DOI]

Masahiro Oshikiri

,

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

Automatic generation of speech synthesis units based on closed loop training.

[BibT_eX]

[DOI]

Takehiko Kagoshima

,

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1991

Adaptive bit-allocation between the pole-zero synthesis filter and excitation in CELP.

[BibT_eX]

[DOI]

,

Proceedings of the 1991 International Conference on Acoustics, 1991

1990

CELP coding with an adaptive density pulse excitation model.

[BibT_eX]

[DOI]

,

Proceedings of the 1990 International Conference on Acoustics, 1990

Loading...