Gakuto Kurata
According to our database1,
Gakuto Kurata
authored at least 61 papers
between 2002 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024
2023
Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2022
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Knowledge Distillation Leveraging Alternative Soft Targets from Non-Parallel Qualified Speech Data.
CoRR, 2021
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Generalized Knowledge Distillation from an Ensemble of Specialized Teachers Leveraging Unsupervised Neural Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Converting Written Language to Spoken Language with Neural Machine Translation for Language Modeling.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
Multi-Task CTC Training with Auxiliary Feature Reconstruction for End-to-End Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Improvements to N-gram Language Model Using Text Generated from Neural Language Model.
Proceedings of the IEEE International Conference on Acoustics, 2019
Data Augmentation Based on Vowel Stretch for Improving Children's Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Improved Knowledge Distillation from Bi-Directional to Uni-Directional LSTM CTC for End-to-End Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Effective joint training of denoising feature space transforms and Neural Network based acoustic models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
Leveraging Sentence-level Information with Encoder LSTM for Natural Language Understanding.
CoRR, 2016
Improved Neural Network-based Multi-label Classification with Better Initialization Leveraging Label Co-occurrence.
Proceedings of the NAACL HLT 2016, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Improved Neural Network Initialization by Grouping Context-Dependent Targets for Acoustic Modeling.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Speech recognition robust against speech overlapping in monaural recordings of telephone conversations.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
2015
Discriminative re-ranking for automatic speech recognition by leveraging invariant structures.
Speech Commun., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Leveraging phonetic context dependent invariant structure for continuous speech recognition.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014
2012
Speech Commun., 2012
Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech.
Speech Commun., 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Named entity recognition from Conversational Telephone Speech leveraging Word Confusion Networks for training and recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2007
IEICE Trans. Inf. Syst., 2007
Preliminary experiments toward automatic generation of new TTS voices from recorded speech alone.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the ACL 2006, 2006
2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
Proceedings of the Fourth NTCIR Workshop on Research in Information Access Technologies Information Retrieval, 2004
2002
Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002