Takuya Higuchi
Orcid: 0000-0002-0361-7132
According to our database1,
Takuya Higuchi
authored at least 38 papers
between 2001 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
CoRR, 2024
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models.
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study.
CoRR, 2023
2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Stacked 1D Convolutional Networks for End-to-End Small Footprint Voice Trigger Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2018
Nonnegative Matrix Factorization With Basis Clustering Using Cepstral Distance Regularization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Integrating DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Unsupervised utterance-wise beamformer estimation with speech recognition-level criterion.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017
Learning speaker representation for neural network based multichannel speaker extraction.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Multichannel Speech Enhancement Approaches to DNN-Based Far-Field Speech Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Sparseness-based multichannel nonnegative matrix factorization for blind source separation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Semi-Supervised Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Optimization of Speech Enhancement Front-End with Speech Recognition-Level Criterion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Spatial correlation model based observation vector clustering and MVDR beamforming for meeting recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Unified approach for audio source separation with multichannel factorial HMM and DOA mixture model.
Proceedings of the 23rd European Signal Processing Conference, 2015
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Joint audio source separation and dereverberation based on multichannel factorial hidden Markov model.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2014
A unified approach for underdetermined blind signal separation and source activity detection by multichannel factorial hidden Markov models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Unified approach for underdetermined BSS, VAD, dereverberation and DOA estimation with multichannel factorial HMM.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
2001
Proceedings of the 2001 IEEE International Conference on Robotics and Automation, 2001