Takenori Yoshimura

Orcid: 0000-0003-3964-5677

According to our database1, Takenori Yoshimura authored at least 20 papers between 2013 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
SPTK4: An Open-Source Software Toolkit for Speech Signal Processing.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

Embedding a Differentiable Mel-Cepstral Synthesis Filter to a Neural Speech Synthesis System.
Proceedings of the IEEE International Conference on Acoustics, 2023

2021
ESPnet2-TTS: Extending the Edge of TTS Research.
CoRR, 2021

Spontaneous Speech Summarization: Transformers All The Way Through.
Proceedings of the 29th European Signal Processing Conference, 2021

An Ensemble Approach to Anomalous Sound Detection Based on Conformer-Based Autoencoder and Binary Classifier Incorporated with Metric Learning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

2020
End-to-End Automatic Speech Recognition Integrated with CTC-Based Voice Activity Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Espnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Speaker-dependent Wavenet-based Delay-free Adpcm Speech Coding.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Comparative Study on Transformer vs RNN in Speech Applications.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Mel-Cepstrum-Based Quantization Noise Shaping Applied to Neural-Network-Based Speech Waveform Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

WaveNet-Based Zero-Delay Lossless Speech Coding.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Statistical Voice Conversion Based on Wavenet.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

The NITech text-to-speech system for the Blizzard Challenge 2018.
Proceedings of the Blizzard Challenge 2018, Hyderabad, India, September 8, 2018, 2018

Discriminative Feature Extraction Based on Sequential Variational Autoencoder for Speaker Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Speech Synthesis Using WaveNet Vocoder Based on Periodic/Aperiodic Decomposition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Simultaneous Optimization of Multiple Tree-Based Factor Analyzed HMM for Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Articulatory Text-to-Speech Synthesis Using the Digital Waveguide Mesh Driven by a Deep Neural Network.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Simultaneous optimization of multiple tree structures for factor analyzed HMM-based speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2013
Cross-lingual speaker adaptation based on factor analysis using bilingual speech data for HMM-based speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013


  Loading...