Masato Mimura

According to our database1, Masato Mimura authored at least 53 papers between 2002 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Waveform-Domain Speech Enhancement Using Spectrogram Encoding for Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding.
CoRR, 2024

Alignment-Free Training for Transducer-based Multi-Talker ASR.
CoRR, 2024

Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation.
CoRR, 2024

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling.
CoRR, 2024

2023
Embedding Articulatory Constraints for Low-resource Speech Recognition Based on Large Pre-trained Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Time-Domain Speech Enhancement Assisted by Multi-Resolution Frequency Encoder and Decoder.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
On the spectrum and linear programming bound for hypergraphs.
Eur. J. Comb., 2022

Distilling the Knowledge of BERT for CTC-based ASR.
CoRR, 2022

Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Selective Multi-Task Learning For Speech Emotion Recognition Using Corpora Of Different Styles.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies.
Int. J. Asian Lang. Process., 2021

Solving linear equations in a vector space over a finite field.
Discret. Math., 2021

Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC).
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

Data Augmentation for ASR Using TTS Via a Discrete Representation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

ASR Rescoring and Confidence Estimation with Electra.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An End-To-End Model from Speech to Clean Transcript for Parliamentary Meetings.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Enhancing Monotonic Multihead Attention for Streaming ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

CTC-Synchronous Training for Monotonic Attention Model.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-end Music-mixed Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Amenability versus non-exactness of dense subgroups of a compact group.
J. Lond. Math. Soc., 2019

Multi-speaker Sequence-to-sequence Speech Synthesis for Data Augmentation in Acoustic-to-word Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Leveraging Sequence-to-Sequence Speech Synthesis for Enhancing Acoustic-to-Word Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Forward-Backward Attention Decoder.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Acoustic-to-Word Attention-Based Model Complemented with Character-Level CTC-Based Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Unsupervised Beamforming Based on Multichannel Nonnegative Matrix Factorization for Noisy Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An End-to-End Approach to Joint Social Signal Detection and Automatic Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Semi-Blind speech enhancement basedon recurrent neural network for source separation and dereverberation.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Semi-supervised ensemble DNN acoustic model training.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Cross-domain speech recognition using nonparallel corpora with cycle-consistent adversarial networks.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-Target Learning for Noisy Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature.
EURASIP J. Adv. Signal Process., 2015

Speech dereverberation using long short-term memory.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Deep autoencoders augmented with phone-class feature for reverberant speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Exploring deep neural networks and deep autoencoders in reverberant speech recognition.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Unsupervised speaker adaptation of DNN-HMM by selecting similar speakers for lecture transcription.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2012
Bayesian Learning of a Language Model from Continuous Speech.
IEICE Trans. Inf. Syst., 2012

2010
Learning a language model from continuous speech.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Semi-automated update of automatic transcription system for the Japanese national congress.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Automatic transcription system for meetings of the Japanese national congress.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Language model transformation applied to lightly supervised training of acoustic model for congress meetings.
Proceedings of the IEEE International Conference on Acoustics, 2009

2002
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002


  Loading...