Ryoichi Takashima
Orcid: 0000-0002-9808-0250
According to our database1,
Ryoichi Takashima
authored at least 71 papers
between 2009 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Fast Neural Speech Waveform Generative Models With Fully-Connected Layer-Based Upsampling.
IEEE Access, 2024
Dysarthric Speech Recognition Using Pseudo-Labeling, Self-Supervised Feature Learning, and a Joint Multi-Task Learning Approach.
IEEE Access, 2024
Training of VITS Model Reflecting the Duration of a Physically Unimpaired Speaker for a Text-to-speech System for a Person with a Stutter.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024
Speech Recognition for a Person With Cerebral Palsy Using Whisper Fine-Tuned on Japanese and English Dysarthric Speech.
Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024
Representation Learning Based on Variational Autoencoders for Imagined Speech Classification.
Proceedings of the 32nd European Signal Processing Conference, 2024
Generation of Colored Subtitle Images Based on Emotional Information of Speech Utterances.
Proceedings of the 32nd European Signal Processing Conference, 2024
Self-supervised learning using unlabeled speech with multiple types of speech disorder for disordered speech recognition.
Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility, 2024
Individuality-Preserving Speech Synthesis for Spinal Muscular Atrophy with a Tracheotomy.
Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility, 2024
2023
Harmonic-Net: Fundamental Frequency and Speech Rate Controllable Fast Neural Vocoder.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Zero-Shot Sound Event Classification Using a Sound Attribute Vector with Global and Local Feature Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023
2022
Phoneme-guided Dysarthric speech conversion With non-parallel data by joint training.
Signal Image Video Process., 2022
Learn to See Faster: Pushing the Limits of High-Speed Camera with Deep Underexposed Image Denoising.
CoRR, 2022
Optical Flow Regularization of Implicit Neural Representations for Video Frame Interpolation.
CoRR, 2022
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022
Data Augmentation for Dysarthric Speech Recognition Based on Text-to-Speech Synthesis.
Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022
Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022
2021
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation.
EURASIP J. Audio Speech Music. Process., 2021
IEEE Access, 2021
High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020
An Investigation of End-to-End Speech Recognition Using Model Adaptation for Dysarthric Speakers.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020
2019
Knowledge Transferability Between the Speech Data of Persons With Dysarthria Speaking Different Languages for Dysarthric Speech Recognition.
IEEE Access, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Investigation of Sequence-level Knowledge Distillation Methods for CTC Acoustic Models.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Improving Very Deep Time-Delay Neural Network With Vertical-Attention For Effectively Training CTC-Based ASR Systems.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Separation of vibration-derived sound signals based on fusion processing of vibration sensors and microphones.
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the 25th European Signal Processing Conference, 2017
Incremental training and constructing the very deep convolutional residual network acoustic models.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
Solving permutation problem with a cascade combination of phase difference entropy and power spectral correlation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Data Augmentation Using Multi-Input Multi-Output Source Separation for Deep Neural Network Based Acoustic Modeling.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Noise-Robust Voice Conversion Based on Sparse Spectral Mapping Using Non-negative Matrix Factorization.
IEICE Trans. Inf. Syst., 2014
A preliminary demonstration of exemplar-based voice conversion for articulation disorders using an individuality-preserving dictionary.
EURASIP J. Audio Speech Music. Process., 2014
Frequency domain acoustic echo reduction based on Kalman smoother with time-varying noise covariance matrix.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the 2013 IEEE/SICE International Symposium on System Integration, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Prediction of unlearned position based on local regression for single-channel talker localization using acoustic transfer function.
Proceedings of the IEEE International Conference on Acoustics, 2013
Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
A new multiple-kernel-learning weighting method for localizing human brain magnetic activity.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Robust feature extraction to utterance fluctuations due to articulation disorders based on sparse expression.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Consonant enhancement for articulation disorders based on non-negative matrix factorization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
Single-Channel Head Orientation Estimation Based on Discrimination of Acoustic Transfer Function.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Feature selection based on Multiple Kernel Learning for single-channel sound source localization using the acoustic transfer function.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
HMM-based separation of acoustic transfer function for single-channel sound source localization.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Single-Channel Talker Localization Based on Discrimination of Acoustic Transfer Functions.
EURASIP J. Adv. Signal Process., 2009
Monaural sound-source-direction estimation using the acoustic transfer function of an active microphone.
Proceedings of the 12th International Conference on Information Fusion, 2009