Masato Mimura

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

ASR Rescoring and Confidence Estimation with Electra.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An End-To-End Model from Speech to Clean Transcript for Parliamentary Meetings.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language.

[BibT_eX]

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Enhancing Monotonic Multihead Attention for Streaming ASR.

[BibT_eX]

[DOI]

Hirofumi Inaguma

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

CTC-Synchronous Training for Monotonic Attention Model.

[BibT_eX]

[DOI]

Hirofumi Inaguma

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-end Music-mixed Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Amenability versus non-exactness of dense subgroups of a compact group.

[BibT_eX]

[DOI]

J. Lond. Math. Soc., 2019

Multi-speaker Sequence-to-sequence Speech Synthesis for Data Augmentation in Acoustic-to-word Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Forward-Backward Attention Decoder.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Acoustic-to-Word Attention-Based Model Complemented with Character-Level CTC-Based Model.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Semi-Blind speech enhancement basedon recurrent neural network for source separation and dereverberation.

[BibT_eX]

[DOI]

Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Semi-supervised ensemble DNN acoustic model training.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Cross-domain speech recognition using nonparallel corpora with cycle-consistent adversarial networks.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-Target Learning for Noisy Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2015

Speech dereverberation using long short-term memory.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Deep autoencoders augmented with phone-class feature for reverberant speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Exploring deep neural networks and deep autoencoders in reverberant speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Unsupervised speaker adaptation of DNN-HMM by selecting similar speakers for lecture transcription.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2012

Bayesian Learning of a Language Model from Continuous Speech.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2012

2010

Learning a language model from continuous speech.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Semi-automated update of automatic transcription system for the Japanese national congress.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

Automatic transcription system for meetings of the Japanese national congress.

[BibT_eX]

[DOI]

Yuya Akita

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Language model transformation applied to lightly supervised training of acoustic model for congress meetings.

[BibT_eX]

[DOI]