Kei Hashimoto

Orcid: 0000-0003-2081-0396

According to our database, Kei Hashimoto authored at least 75 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Singing voice synthesis based on frame-level sequence-to-sequence models considering vocal timing deviation.
CoRR, 2023

Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System.
Proceedings of the IEEE International Conference on Acoustics, 2023

Singing Voice Synthesis Based on a Musical Note Position-Aware Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Autoregressive Variational Autoencoder with a Hidden Semi-Markov Model-Based Structured Attention for Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

PeriodNet: A Non-Autoregressive Raw Waveform Generative Model With a Structure Separating Periodic and Aperiodic Components.
IEEE Access, 2021

PeriodNet: A Non-Autoregressive Waveform Generation Model with a Structure Separating Periodic and Aperiodic Components.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fast and High-Quality Singing Voice Synthesis System Based on Convolutional Neural Networks.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Semi-Supervised Learning Based on Hierarchical Generative Models for End-to-End Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Singing voice synthesis based on convolutional neural networks.
CoRR, 2019

Low computational cost speech synthesis based on deep neural networks using hidden semi-Markov model structures.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Deep neural network based real-time speech vocoder with periodic and aperiodic inputs.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Impacts of input linguistic feature representation on Japanese end-to-end speech synthesis.
Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Speaker-dependent WaveNet-based Delay-free ADPCM Speech Coding.
Proceedings of the IEEE International Conference on Acoustics, 2019

Singing Voice Synthesis Based on Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

WaveNet-Based Zero-Delay Lossless Speech Coding.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Statistical Voice Conversion Based on WaveNet.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Image Recognition Based on Separable Lattice HMMs Using a Deep Neural Network for Output Probability Distributions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

The NITech text-to-speech system for the Blizzard Challenge 2018.
Proceedings of the Blizzard Challenge 2018, Hyderabad, India, September 8, 2018, 2018

Discriminative Feature Extraction Based on Sequential Variational Autoencoder for Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Singing Voice Conversion Using Posted Waveform Data on Music Social Media.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Speaker Adaptation for Speech Synthesis Based on Deep Neural Networks Using Hidden Semi-Markov Model Structures.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Image Recognition Based on Convolutional Neural Networks Using Features Generated from Separable Lattice Hidden Markov Models.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Recent Development of the DNN-based Singing Voice Synthesis System - Sinsy.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Speech Synthesis Using WaveNet Vocoder Based on Periodic/Aperiodic Decomposition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Articulatory Text-to-Speech Synthesis Using the Digital Waveguide Mesh Driven by a Deep Neural Network.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Image recognition based on discriminative models using features generated from separable lattice HMMs.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Analyzing orientation patterns of human skin fibroblasts using image gradient information.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

The NITech text-to-speech system for the Blizzard Challenge 2017.
Proceedings of the Blizzard Challenge 2017, Stockholm, Sweden, August 25, 2017, 2017

User Generated Dialogue Systems: uDialogue.
Proceedings of the Human-Harmonized Information Technology, Volume 2, 2017

2016
A Bayesian Approach to Image Recognition Based on Separable Lattice Hidden Markov Models.
IEICE Trans. Inf. Syst., 2016

Temporal modeling in neural network based statistical parametric speech synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Singing Voice Synthesis Based on Deep Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Redefining the Linguistic Context Feature Set for HMM and DNN TTS Through Position and Parsing.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Privacy-preserving sound to degrade automatic speaker verification performance.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Trajectory training considering global variance for speech synthesis based on neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

The NITech text-to-speech system for the Blizzard Challenge 2016.
Proceedings of the Blizzard Challenge 2016, Cupertino, CA, USA, September 16, 2016, 2016

2015
Simultaneous optimization of multiple tree structures for factor analyzed HMM-based speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The effect of neural networks in statistical parametric speech synthesis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The NITECH HMM-based text-to-speech system for the Blizzard Challenge 2015.
Proceedings of the Blizzard Challenge 2015, 2015

2014
Integration of Spectral Feature Extraction and Modeling for HMM-Based Speech Synthesis.
IEICE Trans. Inf. Syst., 2014

A mel-cepstral analysis technique restoring high frequency components from low-sampling-rate speech.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Integration of speaker and pitch adaptive training for HMM-based singing voice synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014.
Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014

2013
A Bayesian Framework Using Multiple Model Structures for Speech Recognition.
IEICE Trans. Inf. Syst., 2013

Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Integration of acoustic modeling and mel-cepstral analysis for HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2013

Separable lattice 2-D HMMs introducing state duration control for recognition of images with various variations.
Proceedings of the IEEE International Conference on Acoustics, 2013

Overview of NITECH HMM-based speech synthesis system for Blizzard Challenge 2013.
Proceedings of the Blizzard Challenge 2013, 2013

Image recognition based on hidden Markov eigen-image models using variational Bayesian method.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Impacts of machine translation and speech synthesis on speech-to-speech translation.
Speech Commun., 2012

A Bayesian Approach to Speaker Recognition Based on GMMs Using Multiple Model Structures.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A model structure integration based on a Bayesian framework for speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Face recognition based on separable lattice 2-D HMMs using variational Bayesian method.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2012.
Proceedings of the Blizzard Challenge 2012, Portland, OR, USA, September 14, 2012, 2012

2011
Bayesian Context Clustering Using Cross Validation for Speech Recognition.
IEICE Trans. Inf. Syst., 2011

Multi-Speaker Modeling with Shared Prior Distributions and Model Structures for Bayesian Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An analysis of machine translation and speech synthesis in speech-to-speech translation system.
Proceedings of the IEEE International Conference on Acoustics, 2011

Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2011.
Proceedings of the Blizzard Challenge 2011, Turin, Italy, September 2, 2011, 2011

2010
Bayesian speech synthesis framework integrating training and synthesis processes.
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010

A Deterministic Annealing-Based Training Algorithm For Statistical Machine Translation Models.
Proceedings of the 14th Annual conference of the European Association for Machine Translation, 2010

Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2010.
Proceedings of the Blizzard Challenge 2010, Kansai Science City, Japan, September 25, 2010, 2010

2009
A Reordering Model Using a Source-Side Parse-Tree for Statistical Machine Translation.
IEICE Trans. Inf. Syst., 2009

Reordering Model Using Syntactic Information of a Source Tree for Statistical Machine Translation.
Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation, 2009

Deterministic annealing based training algorithm for Bayesian speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A Bayesian approach to Hidden Semi-Markov Model based speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A Bayesian approach to HMM-based speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Acoustic modeling based on model structure annealing for speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Speaker recognition based on variational Bayesian method.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

