Yui Sudo
ORCID: 0000-0003-2094-6701
According to our database, Yui Sudo authored at least 24 papers between 2019 and 2024.
Bibliography
2024
Online adaptation of Fourier series-based acoustic transfer function model and its application to sound source localization and separation.
Adv. Robotics, October, 2024
Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss.
CoRR, 2024
4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders.
CoRR, 2024
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
CoRR, 2024
Improving Noise Robustness of Automatic Speech Recognition Based on a Parallel Adapter Model with Near-Identity Initialization.
Proceedings of the Advances and Trends in Artificial Intelligence. Theory and Applications, 2024
Contextualized Automatic Speech Recognition With Attention-Based Bias Phrase Boosted Beam Search.
Proceedings of the IEEE International Conference on Acoustics, 2024
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2024
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Online Adaptation of Fourier Series Based Acoustic Transfer Function Model to Improve Sound Source Localization and Separation.
Proceedings of the 32nd IEEE International Conference on Robot and Human Interactive Communication, 2023
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Time-synchronous one-pass Beam Search for Parallel Online and Offline Transducers with Dynamic Block Training.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Flexible Evidence Model to Reduce Uncertainty Mismatch Between Speech Enhancement and ASR Based on Encoder-Decoder Architecture.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
Empirical Sampling from Latent Utterance-wise Evidence Model for Missing Data ASR based on Neural Encoder-Decoder Model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Multi-channel Environmental Sound Segmentation utilizing Sound Source Localization and Separation U-Net.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2021
2020
Adv. Robotics, 2020
Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, 2020
2019
Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019
Improvement of DOA Estimation by using Quaternion Output in Sound Event Localization and Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019