Ryoichi Takashima

Proceedings of the 13th IEEE Global Conference on Consumer Electronics, 2024

Representation Learning Based on Variational Autoencoders for Imagined Speech Classification.

[BibT_eX]

[DOI]

Proceedings of the 32nd European Signal Processing Conference, 2024

Generation of Colored Subtitle Images Based on Emotional Information of Speech Utterances.

[BibT_eX]

[DOI]

Proceedings of the 32nd European Signal Processing Conference, 2024

Self-supervised learning using unlabeled speech with multiple types of speech disorder for disordered speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility, 2024

Individuality-Preserving Speech Synthesis for Spinal Muscular Atrophy with a Tracheotomy.

[BibT_eX]

[DOI]

Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility, 2024

2023

Harmonic-Net: Fundamental Frequency and Speech Rate Controllable Fast Neural Vocoder.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Zero-Shot Sound Event Classification Using a Sound Attribute Vector with Global and Local Feature Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

EEG Source Estimation Using Deep Prior Without a Subject's Individual Lead Field.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Operatic Singing Voice Synthesis Using Diff-SVC.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

2022

Phoneme-guided Dysarthric speech conversion With non-parallel data by joint training.

[BibT_eX]

[DOI]

Signal Image Video Process., 2022

Learn to See Faster: Pushing the Limits of High-Speed Camera with Deep Underexposed Image Denoising.

[BibT_eX]

[DOI]

CoRR, 2022

Optical Flow Regularization of Implicit Neural Representations for Video Frame Interpolation.

[BibT_eX]

[DOI]

CoRR, 2022

Current Source Localization Using Deep Prior with Depth Weighting.

[BibT_eX]

[DOI]

CoRR, 2022

MEG Source Localization Using Deep Prior.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Comparative Evaluation of Neural Vocoders for Speech Synthesis of Operatic Singing.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Adaptation of a Pronunciation Dictionary for Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Yuya Sawa

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Data Augmentation for Dysarthric Speech Recognition Based on Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE Global Conference on Life Sciences and Technologies, 2022

Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Binary Attribute Embeddings for Zero-Shot Sound Event Classification.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021

Multimodal fusion for indoor sound source localization.

[BibT_eX]

[DOI]

Pattern Recognit., 2021

Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2021

Full-Band LPCNet: A Real-Time Neural Vocoder for 48 kHz Audio With a CPU.

[BibT_eX]

[DOI]

IEEE Access, 2021

High-Intelligibility Speech Synthesis for Dysarthric Speakers with LPCNet-Based TTS and CycleVAE-Based VC.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Data Augmentation Based on Frequency Warping for Recognition of Cleft Palate Speech.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Dysarthric Speech Recognition Based on Deep Metric Learning.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Two-Step Acoustic Model Adaptation for Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Convolutional neural networks Memory optimization Inference with Splitting Image.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

An Investigation of End-to-End Speech Recognition Using Model Adaptation for Dysarthric Speakers.

[BibT_eX]

[DOI]

Yuya Sawa

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

Opera Singing Voice Synthesis Considering Vowel Variations.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

FasterRCNN Monitoring of Road Damages: Competition and Deployment.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019

Knowledge Transferability Between the Speech Data of Persons With Dysarthria Speaking Different Languages for Dysarthric Speech Recognition.

[BibT_eX]

[DOI]

IEEE Access, 2019

Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Investigation of Sequence-level Knowledge Distillation Methods for CTC Acoustic Models.

[BibT_eX]

[DOI]

Sheng Li

Hisashi Kawai

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Improving Very Deep Time-Delay Neural Network With Vertical-Attention For Effectively Training CTC-Based ASR Systems.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

CTC Loss Function with a Unit-Level Ambiguity Penalty.

[BibT_eX]

[DOI]

Sheng Li

Hisashi Kawai

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Investigation of a Knowledge Distillation Method for CTC Acoustic Models.

[BibT_eX]

[DOI]

Sheng Li

Hisashi Kawai

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Separation of vibration-derived sound signals based on fusion processing of vibration sensors and microphones.

[BibT_eX]

[DOI]

Yohei Kawaguchi

Masahito Togami

Proceedings of the 25th European Signal Processing Conference, 2017

ADMM-based audio reconstruction for low-cost-sound-monitoring.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

Incremental training and constructing the very deep convolutional residual network acoustic models.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

An application of noise-robust speech translation using asynchronous smart devices.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Time-domain subsampling and reconstruction for microphone array.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Sub-Nyquist non-uniform sampling for low-cost sound monitoring.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Solving permutation problem with a cascade combination of phase difference entropy and power spectral correlation.

[BibT_eX]

[DOI]

Masahito Togami

Yusuke Fujita

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Data Augmentation Using Multi-Input Multi-Output Source Separation for Deep Neural Network Based Acoustic Modeling.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Unified ASR system using LGM-based source separation, noise-robust feature extraction, and word hypothesis selection.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Noise-Robust Voice Conversion Based on Sparse Spectral Mapping Using Non-negative Matrix Factorization.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2014

A preliminary demonstration of exemplar-based voice conversion for articulation disorders using an individuality-preserving dictionary.

[BibT_eX]

[DOI]

EURASIP J. Audio Speech Music. Process., 2014

Frequency domain acoustic echo reduction based on Kalman smoother with time-varying noise covariance matrix.

[BibT_eX]

[DOI]

Masahito Togami

Yohei Kawaguchi

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Exemplar-Based Voice Conversion Using Sparse Representation in Noisy Environments.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013

Noise-robust voice conversion based on spectral mapping on sparse space.

[BibT_eX]

[DOI]

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Voice conversion based on Non-negative Matrix Factorization in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE/SICE International Symposium on System Integration, 2013

Voice conversion in high-order eigen space using deep belief nets.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Prediction of unlearned position based on local regression for single-channel talker localization using acoustic transfer function.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Exemplar-based voice conversion in noisy environment.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Estimation of Talker's Head Orientation Based on Discrimination of the Shape of Cross-power Spectrum Phase Coefficients.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A new multiple-kernel-learning weighting method for localizing human brain magnetic activity.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Robust feature extraction to utterance fluctuations due to articulation disorders based on sparse expression.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

An adaboost-based weighting method for localizing human brain magnetic activity.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Consonant enhancement for articulation disorders based on non-negative matrix factorization.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011

Single-Channel Head Orientation Estimation Based on Discrimination of Acoustic Transfer Function.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Agglomerative Hierarchical Clustering of Emotions in Speech Based on Subjective Relative Similarity.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Feature selection based on Multiple Kernel Learning for single-channel sound source localization using the acoustic transfer function.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

HMM-based separation of acoustic transfer function for single-channel sound source localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Single-Channel Talker Localization Based on Discrimination of Acoustic Transfer Functions.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2009

Monaural sound-source-direction estimation using the acoustic transfer function of an active microphone.

[BibT_eX]

[DOI]