Tomoki Koriyama
ORCID: 0000-0002-8347-5604
According to our database, Tomoki Koriyama authored at least 52 papers between 2010 and 2024.
Timeline
[Chart: publications per year, 2010 to 2024. Legend: Book, In proceedings, Article, PhD thesis, Dataset, Other]
Bibliography
2024
CoRR, 2024
Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech.
CoRR, 2024
2023
Duration-Aware Pause Insertion Using Pre-Trained Language Model for Multi-Speaker Text-To-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Deep Gaussian process based multi-speaker speech synthesis with latent speaker representation.
Speech Commun., 2021
Accent Modeling of Low-Resourced Dialect in Pitch Accent Language Using Variational Autoencoder.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Audiobook Speech Synthesis Conditioned by Cross-Sentence Context-Aware Word Embeddings.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Sequence-to-Sequence Learning for Deep Gaussian Process Based Speech Synthesis Using Self-Attention GP Layer.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Harmonic WaveGAN: GAN-Based Speech Waveform Generation Model with Harmonic Structure Discriminator.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Emotion-Controllable Speech Synthesis Using Emotion Soft Labels and Fine-Grained Prosody Factors.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Generative Moment Matching Network-Based Neural Double-Tracking for Synthesized and Natural Singing Voices.
IEICE Trans. Inf. Syst., 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Utterance-Level Sequential Modeling for Deep Gaussian Process Based Speech Synthesis Using Simple Recurrent Unit.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Generative Moment Matching Network-based Random Modulation Post-filter for DNN-based Singing Voice Synthesis and Neural Double-tracking.
Proceedings of the IEEE International Conference on Acoustics, 2019
A Training Method Using DNN-guided Layerwise Pretraining for Deep Gaussian Processes.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Speech Commun., 2018
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Duration prediction using multiple Gaussian process experts for GPR-based speech synthesis.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Enhanced F0 generation for GPR-based speech synthesis considering syllable-based prosodic features.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Speech emotion recognition using convolutional long short-term memory neural network and support vector machines.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
Unsupervised Stress Information Labeling Using Gaussian Process Latent Variable Model for Statistical Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
A speaker adaptation technique for Gaussian process regression based speech synthesis using feature space transform.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
HMM-based expressive singing voice synthesis with singing style control and robust pitch modeling.
Comput. Speech Lang., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
A comparison of speech synthesis systems based on GPR, HMM, and DNN with a small amount of training data.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Prosody generation using frame-based Gaussian process regression and classification for statistical parametric speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Prosodic variation enhancement using unsupervised context labeling for HMM-based expressive speech synthesis.
Speech Commun., 2014
IEEE J. Sel. Top. Signal Process., 2014
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014
Transform mapping using shared decision tree context clustering for HMM-based cross-lingual speech synthesis.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Accent type and phrase boundary estimation using acoustic and language models for automatic prosodic labeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Parametric speech synthesis based on Gaussian process regression using global variance and hyperparameter optimization.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
A style control technique for singing voice synthesis based on multiple-regression HSMM.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Frame-level acoustic modeling based on Gaussian process regression for statistical nonparametric speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
On the Use of Extended Context for HMM-Based Spontaneous Conversational Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010