Takuma Okamoto
Orcid: 0000-0001-9913-4647
According to our database1,
Takuma Okamoto
authored at least 44 papers
between 2010 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Fast Neural Speech Waveform Generative Models With Fully-Connected Layer-Based Upsampling.
IEEE Access, 2024
Convnext-TTS And Convnext-VC: Convnext-Based Fast End-To-End Sequence-To-Sequence Text-To-Speech And Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2024
FIRNet: Fundamental Frequency Controllable Fast Neural Vocoder With Trainable Finite Impulse Response Filter.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Harmonic-Net: Fundamental Frequency and Speech Rate Controllable Fast Neural Vocoder.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Continuous Action Space-Based Spoken Language Acquisition Agent Using Residual Sentence Embedding and Transformer Decoder.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
Speech Commun., 2022
IEEE J. Sel. Top. Signal Process., 2022
2021
Quasi-Periodic Parallel WaveGAN: A Non-Autoregressive Raw Waveform Generative Model With Pitch-Dependent Dilated Convolution Neural Network.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE Access, 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-Autoregressive Pitch-Dependent Dilated Convolution Model for Parametric Speech Generation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Investigations of Real-time Gaussian Fftnet and Parallel Wavenet Neural Vocoders with Simple Acoustic Features.
Proceedings of the IEEE International Conference on Acoustics, 2019
Horizontal 3D Sound Field Recording and 2.5D Synthesis with Omni-directional Circular Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2019
Tacotron-Based Acoustic Model Using Phoneme Alignment for Practical Neural Text-to-Speech Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
2.5D Localized Sound Zone Generation with a Circular Array of Fixed-Directivity Loudspeakers.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
An Investigation of Subband Wavenet Vocoder Covering Entire Audible Frequency Range with Limited Acoustic Features.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Development of Wearable Sheet-Type Shear Force Sensor and Measurement System that is Insusceptible to Temperature and Pressure.
Sensors, 2017
J. Inf. Hiding Multim. Signal Process., 2017
J. Inf. Hiding Multim. Signal Process., 2017
Angular spectrum decomposition-based 2.5D higher-order spherical harmonic sound field synthesis with a linear loudspeaker array.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017
Analytical approach to 2.5D sound field control using a circular double-layer array of fixed-directivity loudspeakers.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
2.5D higher order ambisonics for a sound field described by angular spectrum coefficients.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
A Spatial Extrapolation Method to Derive High-Order Ambisonics Data from Stereo Sources.
J. Inf. Hiding Multim. Signal Process., 2015
Analytical methods of generating multiple sound zones for open and baffled circular loudspeaker arrays.
Proceedings of the 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014
Generation of multiple sound zones by spatial filtering in wavenumber domain using a linear array of loudspeakers.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013
Proceedings of the 25th Canadian Conference on Computational Geometry, 2013
2010
J. Inf. Hiding Multim. Signal Process., 2010
Comparative performance evaluation of near 3D sound field reproduction system with directional loudspeakers and wave field synthesis.
Proceedings of the 4th International Universal Communication Symposium, 2010