Tsubasa Ochiai
ORCID: 0000-0002-2519-2032
According to our database, Tsubasa Ochiai authored at least 65 papers between 2014 and 2025.
Bibliography
2025
CoRR, January, 2025
2024
Microphone Array Signal Processing and Deep Learning for Speech Enhancement: Combining model-based and data-driven approaches to parameter estimation and filtering [Special Issue On Model-Based and Data-Driven Audio Signal Processing].
IEEE Signal Process. Mag., November, 2024
Module-Based End-to-End Distant Speech Processing: A case study of far-field automatic speech recognition [Special Issue On Model-Based and Data-Driven Audio Signal Processing].
IEEE Signal Process. Mag., November, 2024
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling.
CoRR, 2024
CoRR, 2024
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Online Target Sound Extraction with Knowledge Distillation from Partially Non-Causal Teacher.
Proceedings of the IEEE International Conference on Acoustics, 2024
Neural Network-Based Virtual Microphone Estimation with Virtual Microphone and Beamformer-Level Multi-Task Loss.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Mask-Based Neural Beamforming for Moving Speakers With Self-Attention-Based Tracking.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
SoundBeam: Target Sound Extraction Conditioned on Sound-Class Labels and Enrollment Clues for Increased Performance and Continuous Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Streaming End-to-End Target-Speaker Automatic Speech Recognition and Activity Detection.
IEEE Access, 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Impact of Residual Noise and Artifacts in Speech Enhancement Errors on Intelligibility of Human and Machine.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.
Proceedings of the IEEE International Conference on Acoustics, 2021
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Simpleflat: A Simple Whole-Network Pre-Training Approach for RNN Transducer-Based End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation.
CoRR, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
DNN-supported Mask-based Convolutional Beamforming for Simultaneous Denoising, Dereverberation, and Source Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving Noise Robust Automatic Speech Recognition with Single-Channel Time-Domain Enhancement Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization.
Proceedings of the 28th European Signal Processing Conference, 2020
2019
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures.
IEEE J. Sel. Top. Signal Process., 2019
Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming.
IEEE J. Sel. Top. Signal Process., 2017
Does speech enhancement work with end-to-end ASR objectives?: Experimental analysis of multichannel end-to-end ASR.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017
Proceedings of the 34th International Conference on Machine Learning, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Cumulative moving averaged bottleneck speaker vectors for online speaker adaptation of CNN-based acoustic models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
Speaker Adaptive Training Localizing Speaker Modules in DNN for Hybrid DNN-HMM Speech Recognizers.
IEICE Trans. Inf. Syst., 2016
Bottleneck linear transformation network adaptation for speaker adaptive training-based hybrid DNN-HMM speech recognizer.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Speaker adaptive training for deep neural networks embedding linear transformation networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Proceedings of the IEEE International Conference on Acoustics, 2014