Keisuke Kinoshita
ORCID: 0009-0008-7987-8188
According to our database, Keisuke Kinoshita authored at least 173 papers between 1990 and 2024.
Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined Blind Source Separation and Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
SoundBeam: Target Sound Extraction Conditioned on Sound-Class Labels and Enrollment Clues for Increased Performance and Continuous Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems.
Proceedings of the IEEE International Conference on Acoustics, 2023
Switching Independent Vector Analysis and its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithms.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Putamen Atrophy Is a Possible Clinical Evaluation Index for Parkinson's Disease Using Human Brain Magnetic Resonance Imaging.
J. Imaging, 2022
Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening.
CoRR, 2022
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined BSS in Reverberant Environments.
Proceedings of the IEEE International Conference on Acoustics, 2022
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Tight Integration Of Neural- And Clustering-Based Diarization Through Deep Unfolding Of Infinite Gaussian Mixture Model.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Online Speech Dereverberation Using Mixture of Multichannel Linear Prediction Models.
IEEE Signal Process. Lett., 2021
Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm.
CoRR, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Comparison of Remote Experiments Using Crowdsourcing and Laboratory Experiments on Speech Intelligibility.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.
Proceedings of the IEEE International Conference on Acoustics, 2021
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain.
Proceedings of the IEEE International Conference on Acoustics, 2021
Low Latency Online Blind Source Separation Based on Joint Optimization with Blind Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Blind and Neural Network-Guided Convolutional Beamformer for Joint Denoising, Dereverberation, and Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Integrating End-to-End Neural and Clustering-Based Diarization: Getting the Best of Both Worlds.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Low Latency Online Source Separation and Noise Reduction Based on Joint Optimization with Dereverberation.
Proceedings of the 29th European Signal Processing Conference, 2021
Proceedings of the 29th European Signal Processing Conference, 2021
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2020
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech.
Speech Commun., 2020
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording.
CoRR, 2020
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation.
CoRR, 2020
Cognitive-Driven Convolutional Beamforming Using EEG-Based Auditory Attention Decoding.
Proceedings of the 30th IEEE International Workshop on Machine Learning for Signal Processing, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Predicting Intelligibility of Enhanced Speech Using Posteriors Derived from DNN-Based ASR System.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
DNN-supported Mask-based Convolutional Beamforming for Simultaneous Denoising, Dereverberation, and Source Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving Noise Robust Automatic Speech Recognition with Single-Channel Time-Domain Enhancement Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Tackling Real Noisy Reverberant Meetings with All-Neural Source Separation, Counting, and Diarization System.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization.
Proceedings of the 28th European Signal Processing Conference, 2020
IEEE Signal Process. Lett., 2019
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures.
IEEE J. Sel. Top. Signal Process., 2019
Simultaneous Denoising, Dereverberation, and Source Separation Using a Unified Convolutional Beamformer.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Simultaneous Denoising and Dereverberation for Low-Latency Applications Using Frame-by-Frame Online Unified Convolutional Beamformer.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Predicting Speech Intelligibility of Enhanced Speech Using Phone Accuracy of DNN-Based ASR System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Mask-based MVDR Beamformer for Noisy Multisource Environments: Introduction of Time-varying Spatial Covariance Model.
Proceedings of the IEEE International Conference on Acoustics, 2019
Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Estimation of Sampling Frequency Mismatch between Distributed Asynchronous Microphones under Existence of Source Movements with Stationary Time Periods Detection.
Proceedings of the IEEE International Conference on Acoustics, 2019
Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation.
Proceedings of the 27th European Signal Processing Conference, 2019
Projection Back onto Filtered Observations for Speech Separation with Distributed Microphone Array.
Proceedings of the 8th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Online Integration of DNN-Based and Spatial Clustering-Based Mask Estimation for Robust MVDR Beamforming.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Comparison of Reference Microphone Selection Algorithms for Distributed Microphone Array Based Speech Enhancement in Meeting Recognition Scenarios.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Meeting Recognition with Asynchronous Distributed Microphone Array Using Block-Wise Refinement of Mask-Based MVDR Beamformer.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Cumulative moving averaged bottleneck speaker vectors for online speaker adaptation of CNN-based acoustic models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Integrating DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Online environmental adaptation of CNN-based acoustic models using spatial diffuseness features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Unsupervised utterance-wise beamformer estimation with speech recognition-level criterion.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017
Learning speaker representation for neural network based multichannel speaker extraction.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017
Multichannel Speech Enhancement Approaches to DNN-Based Far-Field Speech Recognition.
New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research.
EURASIP J. Adv. Signal Process., 2016
Speech Intelligibility Prediction Based on the Envelope Power Spectrum Model with the Dynamic Compressive Gammachirp Auditory Filterbank.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the IECON 2016, 2016
Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution.
Proceedings of the 24th European Signal Processing Conference, 2016
EURASIP J. Adv. Signal Process., 2015
Exploiting spectro-temporal locality in deep learning based acoustic event detection.
EURASIP J. Audio Speech Music. Process., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Modeling inter-node acoustic dependencies with Restricted Boltzmann Machine for distributed microphone array based BSS.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Control of the position and attitude of a tethered quadrotor considering the influence of a tether.
Proceedings of the 10th Asian Control Conference, 2015
Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Fast segment search for corpus-based speech enhancement based on speech recognition technology.
Proceedings of the IEEE International Conference on Acoustics, 2014
Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction.
IEEE Trans. Speech Audio Process., 2013
Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds.
Comput. Speech Lang., 2013
The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Microphone-location dependent mask estimation for BSS using spatially distributed asynchronous microphones.
Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2013
On the robustness of distributed EM based BSS in asynchronous distributed microphone array scenarios.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Conditional emission densities for combining speech enhancement and recognition systems.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Blind source separation using spatially distributed microphones based on microphone-location dependent source activities.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 18th International Conference on Digital Signal Processing, 2013
An integration of source location cues for speech clustering in distributed microphone arrays.
Proceedings of the IEEE International Conference on Acoustics, 2013
Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera.
IEEE Trans. Speech Audio Process., 2012
Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition.
IEEE Signal Process. Mag., 2012
IEEE Signal Process. Lett., 2012
Distributed microphone array processing for speech source separation with classifier fusion.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2012
Example-based speech enhancement with joint utilization of spatial, spectral & temporal cues of speech and noise.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
A multichannel MMSE-based framework for joint blind source separation and noise reduction.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Single Channel Dereverberation Using Example-Based Speech Enhancement with Uncertainty Decoding Technique.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
IEEE Trans. Speech Audio Process., 2010
Real-time meeting recognition and understanding using distant microphones and omni-directional camera.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Blind upmix of stereo music signals using multi-step linear prediction based reverberation extraction.
Proceedings of the IEEE International Conference on Acoustics, 2010
Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information.
Speech Dereverberation, 2010
Suppression of Late Reverberation Effect on Speech Signal Using Long-Term Multiple-step Linear Prediction.
IEEE Trans. Speech Audio Process., 2009
Real-time speech enhancement in noisy reverberant multi-talker environments based on a location-independent room acoustics model.
Proceedings of the IEEE International Conference on Acoustics, 2009
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model.
IEEE Trans. Speech Audio Process., 2008
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2008
Blind speech dereverberation with multi-channel linear prediction based on short time fourier transform representation.
Proceedings of the IEEE International Conference on Acoustics, 2008
Principles and applications of dereverberation for noisy and reverberant audio signals.
Proceedings of the 42nd Asilomar Conference on Signals, Systems and Computers, 2008
IEEE Trans. Speech Audio Process., 2007
Robust blind dereverberation of speech signals based on characteristics of short-time speech segments.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007
Multi-step linear prediction based speech dereverberation in noisy reverberant environment.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Syst. Comput. Jpn., 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Spectral Subtraction Steered by Multi-Step Forward Linear Prediction For Single Channel Speech Dereverberation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Modulation enhancement of speech by a pre-processing algorithm for improving intelligibility in reverberant environments.
Speech Commun., 2005
Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005
Measurement of cricothyroid articulation using high-resolution MRI and 3d pattern matching.
Proceedings of the Fourth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004
Harmonicity based monaural speech dereverberation with time warping and F0 adaptive window.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Improving automatic speech recognition performance and speech intelligibility with harmonicity based dereverberation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 4th IEEE/RAS International Conference on Humanoid Robots, 2004
Advances in Neural Information Processing Systems 16, 2003
Improving speech intelligibility by steady-state suppression as pre-processing in small to medium sized halls.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Designing modulation filters for improving speech intelligibility in reverberant environments.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the 2000 Conference on Computer Vision and Pattern Recognition (CVPR 2000), 2000
Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992
Proceedings of IAPR Workshop on Machine Vision Applications, 1990