Tomohiro Nakatani
Orcid: 0000-0002-7487-7150
According to our database, Tomohiro Nakatani authored at least 307 papers between 1994 and 2024.
Bibliography
2024
DOA-informed switching independent vector extraction and beamforming for speech enhancement in underdetermined situations.
EURASIP J. Audio Speech Music. Process., December, 2024
Microphone Array Signal Processing and Deep Learning for Speech Enhancement: Combining model-based and data-driven approaches to parameter estimation and filtering [Special Issue On Model-Based and Data-Driven Audio Signal Processing].
IEEE Signal Process. Mag., November, 2024
Geometrically-Regularized Fast Independent Vector Extraction by Pure Majorization-Minimization.
IEEE Trans. Signal Process., 2024
Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
CoRR, 2024
Multi-Stream Diffusion Model for Probabilistic Integration of Model-Based and Data-Driven Speech Enhancement.
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Neural Network-Based Virtual Microphone Estimation with Virtual Microphone and Beamformer-Level Multi-Task Loss.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined Blind Source Separation and Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Mask-Based Neural Beamforming for Moving Speakers With Self-Attention-Based Tracking.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Impact of Residual Noise and Artifacts in Speech Enhancement Errors on Intelligibility of Human and Machine.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Fast Online Source Steering Algorithm for Tracking Single Moving Source Using Online Independent Vector Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2023
NoisyILRMA: Diffuse-Noise-Aware Independent Low-Rank Matrix Analysis for Fast Blind Source Extraction.
Proceedings of the 31st European Signal Processing Conference, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Modified Parametric Multichannel Wiener Filter for Low-latency Enhancement of Speech Mixtures with Unknown Number of Speakers.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Switching Independent Vector Analysis and its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithms.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening.
CoRR, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Multi-Frame Full-Rank Spatial Covariance Analysis for Underdetermined BSS in Reverberant Environments.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
ISS2: An Extension of Iterative Source Steering Algorithm for Majorization-Minimization-Based Independent Vector Analysis.
Proceedings of the 30th European Signal Processing Conference, 2022
2021
Block Coordinate Descent Algorithms for Auxiliary-Function-Based Independent Vector Extraction.
IEEE Trans. Signal Process., 2021
A Joint Diagonalization Based Efficient Approach to Underdetermined Blind Audio Source Separation Using the Multichannel Wiener Filter.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Independent Vector Extraction for Fast Joint Blind Source Separation and Dereverberation.
IEEE Signal Process. Lett., 2021
IEEE Signal Process. Lett., 2021
Online Speech Dereverberation Using Mixture of Multichannel Linear Prediction Models.
IEEE Signal Process. Lett., 2021
Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm.
CoRR, 2021
CoRR, 2021
Integration of Variational Autoencoder and Spatial Clustering for Adaptive Multi-Channel Neural Speech Separation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Comparison of Remote Experiments Using Crowdsourcing and Laboratory Experiments on Speech Intelligibility.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend.
Proceedings of the IEEE International Conference on Acoustics, 2021
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain.
Proceedings of the IEEE International Conference on Acoustics, 2021
Low Latency Online Blind Source Separation Based on Joint Optimization with Blind Dereverberation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Blind and Neural Network-Guided Convolutional Beamformer for Joint Denoising, Dereverberation, and Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Low Latency Online Source Separation and Noise Reduction Based on Joint Optimization with Dereverberation.
Proceedings of the 29th European Signal Processing Conference, 2021
Proceedings of the 29th European Signal Processing Conference, 2021
Proceedings of the 29th European Signal Processing Conference, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech.
Speech Commun., 2020
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation.
CoRR, 2020
Cognitive-Driven Convolutional Beamforming Using EEG-Based Auditory Attention Decoding.
Proceedings of the 30th IEEE International Workshop on Machine Learning for Signal Processing, 2020
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Predicting Intelligibility of Enhanced Speech Using Posteriors Derived from DNN-Based ASR System.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
DNN-supported Mask-based Convolutional Beamforming for Simultaneous Denoising, Dereverberation, and Source Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Convergence-Guaranteed Independent Positive Semidefinite Tensor Analysis Based on Student's T Distribution.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving Noise Robust Automatic Speech Recognition with Single-Channel Time-Domain Enhancement Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Tackling Real Noisy Reverberant Meetings with All-Neural Source Separation, Counting, and Diarization System.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving Speaker Discrimination of Target Speech Extraction With Time-Domain Speakerbeam.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization.
Proceedings of the 28th European Signal Processing Conference, 2020
Experimental Analysis of EM and MU Algorithms for Optimizing Full-rank Spatial Covariance Model.
Proceedings of the 28th European Signal Processing Conference, 2020
2019
Speech Processing for Digital Home Assistants: Combining signal processing with deep-learning techniques.
IEEE Signal Process. Mag., 2019
IEEE Signal Process. Lett., 2019
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures.
IEEE J. Sel. Top. Signal Process., 2019
Feature Based Domain Adaptation for Neural Network Language Models with Factorised Hidden Layers.
IEICE Trans. Inf. Syst., 2019
Simultaneous Denoising, Dereverberation, and Source Separation Using a Unified Convolutional Beamformer.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019
Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Simultaneous Denoising and Dereverberation for Low-Latency Applications Using Frame-by-Frame Online Unified Convolutional Beamformer.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Improving Transformer-Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Predicting Speech Intelligibility of Enhanced Speech Using Phone Accuracy of DNN-Based ASR System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
ILP-based Compressive Speech Summarization with Content Word Coverage Maximization and Its Oracle Performance Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Mask-based MVDR Beamformer for Noisy Multisource Environments: Introduction of Time-varying Spatial Covariance Model.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
FastMNMF: Joint Diagonalization Based Accelerated Algorithms for Multichannel Nonnegative Matrix Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2019
Joint Optimization of Neural Network-based WPE Dereverberation and Acoustic Model for Robust Online ASR.
Proceedings of the IEEE International Conference on Acoustics, 2019
A Unified Framework for Feature-based Domain Adaptation of Neural Network Language Models.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Computational Acceleration and Smart Initialization of Full-Rank Spatial Covariance Analysis.
Proceedings of the 27th European Signal Processing Conference, 2019
Maximum likelihood convolutional beamformer for simultaneous denoising and dereverberation.
Proceedings of the 27th European Signal Processing Conference, 2019
A Unifying Framework for Blind Source Separation Based on A Joint Diagonalizability Constraint.
Proceedings of the 27th European Signal Processing Conference, 2019
2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
IEEE Signal Process. Lett., 2018
FastFCA: A Joint Diagonalization Based Fast Algorithm for Audio Source Separation Using A Full-Rank Spatial Covariance Model.
CoRR, 2018
Online Integration of DNN-Based and Spatial Clustering-Based Mask Estimation for Robust MVDR Beamforming.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
FastFCA-AS: Joint Diagonalization Based Acceleration of Full-Rank Spatial Covariance Analysis for Separating any Number of Sources.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Rescoring N-Best Speech Recognition List Based on One-on-One Hypothesis Comparison Using Encoder-Classifier Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Sequence Training of Encoder-Decoder Model Using Policy Gradient for End-to-End Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Maximum-Likelihood Online Speaker Diarization in Noisy Meetings Based on Categorical Mixture Model and Probabilistic Spatial Dictionary.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Dual Frequency- and Block-Permutation Alignment for Deep Learning Based Block-Online Blind Source Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Permutation-Free cGMM: Complex Gaussian Mixture Model with Inverse Wishart Mixture Model Based Spatial Prior for Permutation-Free Source Separation and Source Counting.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Multiplicative Updates and Joint Diagonalization Based Acceleration for Under-Determined BSS Using a Full-Rank Spatial Covariance Model.
Proceedings of the 2018 IEEE Global Conference on Signal and Information Processing, 2018
Noisy cGMM: Complex Gaussian Mixture Model with Non-Sparse Noise Model for Joint Source Separation and Denoising.
Proceedings of the 26th European Signal Processing Conference, 2018
FastFCA: Joint Diagonalization Based Acceleration of Audio Source Separation Using a Full-Rank Spatial Covariance Model.
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Factorised Hidden Layer Based Domain Adaptation for Recurrent Neural Network Language Models.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Integration of Spatial Cue-Based Noise Reduction and Speech Model-Based Source Restoration for Real Time Speech Enhancement.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017
Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Predicting Speech Intelligibility Using a Gammachirp Envelope Distortion Index Based on the Signal-to-Distortion Ratio.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Uncertainty Decoding with Adaptive Sampling for Noise Robust DNN-Based Acoustic Modeling.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Cumulative moving averaged bottleneck speaker vectors for online speaker adaptation of CNN-based acoustic models.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Integrating DNN-based and spatial clustering-based mask estimation for robust MVDR beamforming.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Probabilistic spatial dictionary based online adaptive beamforming for meeting recognition in noisy and reverberant environments.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Online environmental adaptation of CNN-based acoustic models using spatial diffuseness features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Unsupervised utterance-wise beamformer estimation with speech recognition-level criterion.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Online meeting recognition in noisy environments with time-frequency mask based MVDR beamforming.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017
Data-driven and physical model-based designs of probabilistic spatial dictionary for online meeting diarization and adaptive beamforming.
Proceedings of the 25th European Signal Processing Conference, 2017
Learning speaker representation for neural network based multichannel speaker extraction.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Exploiting imbalanced textual and acoustic data for training prosodically-enhanced RNNLMs.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017
Multichannel Speech Enhancement Approaches to DNN-Based Far-Field Speech Recognition.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017
2016
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research.
EURASIP J. Adv. Signal Process., 2016
Differenced maximum mutual information criterion for robust unsupervised acoustic model adaptation.
Comput. Speech Lang., 2016
Sparseness-based multichannel nonnegative matrix factorization for blind source separation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Modeling audio directional statistics using a probabilistic spatial dictionary for speaker diarization in real meetings.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016
Speech Intelligibility Prediction Based on the Envelope Power Spectrum Model with the Dynamic Compressive Gammachirp Auditory Filterbank.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Optimization of Speech Enhancement Front-End with Speech Recognition-Level Criterion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Noise robust speech recognition using recent developments in neural networks for computer vision.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
A generative-discriminative hybrid approach to multi-channel noise reduction for robust automatic speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Real-time integration of statistical model-based speech enhancement with unsupervised noise PSD estimation using microphone array.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Modeling audio directional statistics using a complex bingham mixture model for blind source extraction from diffuse noise.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Multi-pass feature enhancement based on generative-discriminative hybrid approach for noise robust speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Spatial correlation model based observation vector clustering and MVDR beamforming for meeting recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Reverberation-robust underdetermined source separation with non-negative tensor double deconvolution.
Proceedings of the 24th European Signal Processing Conference, 2016
Complex angular central Gaussian mixture model for directional statistics in mask-based microphone array signal processing.
Proceedings of the 24th European Signal Processing Conference, 2016
2015
Blind Suppression of Nonstationary Diffuse Acoustic Noise Based on Spatial Covariance Matrix Decomposition.
J. Signal Process. Syst., 2015
Acoustic Event Detection in Speech Overlapping Scenarios Based on High-Resolution Spectral Input and Deep Learning.
IEICE Trans. Inf. Syst., 2015
EURASIP J. Adv. Signal Process., 2015
Exploiting spectro-temporal locality in deep learning based acoustic event detection.
EURASIP J. Audio Speech Music. Process., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Modeling inter-node acoustic dependencies with Restricted Boltzmann Machine for distributed microphone array based BSS.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Feature enhancement based on generative-discriminative hybrid approach with GMMs and DNNs for noise robust speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Permutation-free clustering of relative transfer function features for blind source separation.
Proceedings of the 23rd European Signal Processing Conference, 2015
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
2014
Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Relaxed disjointness based clustering for joint blind source separation and dereverberation.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014
Fast segment search for corpus-based speech enhancement based on speech recognition technology.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Unsupervised non-parametric Bayesian modeling of non-stationary noise for model-based noise suppression.
Proceedings of the IEEE International Conference on Acoustics, 2014
Spectrogram patch based acoustic event detection and classification in speech overlapping conditions.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014
Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition.
Proceedings of the 2014 IEEE Global Conference on Signal and Information Processing, 2014
2013
IEEE Trans. Speech Audio Process., 2013
A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction.
IEEE Trans. Speech Audio Process., 2013
IEEE ACM Trans. Audio Speech Lang. Process., 2013
Cluster-based dynamic variance adaptation for interconnecting speech enhancement pre-processor and speech recognizer.
Comput. Speech Lang., 2013
Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds.
Comput. Speech Lang., 2013
The REVERB challenge: A common evaluation framework for dereverberation and recognition of reverberant speech.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013
Source number estimation based on clustering of speech activity sequences for microphone array processing.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013
Microphone-location dependent mask estimation for BSS using spatially distributed asynchronous microphones.
Proceedings of the International Symposium on Intelligent Signal Processing and Communication Systems, 2013
On the robustness of distributed EM based BSS in asynchronous distributed microphone array scenarios.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Conditional emission densities for combining speech enhancement and recognition systems.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Blind source separation using spatially distributed microphones based on microphone-location dependent source activities.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Model-based noise suppression using unsupervised estimation of hidden Markov model for non-stationary noise.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Is speech enhancement pre-processing still relevant when using deep neural networks for acoustic modeling?
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 18th International Conference on Digital Signal Processing, 2013
Noise model transfer using affine transformation with application to large vocabulary reverberant speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013
An integration of source location cues for speech clustering in distributed microphone arrays.
Proceedings of the IEEE International Conference on Acoustics, 2013
Coupling beamforming with spatial and spectral feature based spectral enhancement and its application to meeting recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013
Permutation-free convolutive blind source separation via full-band clustering based on frequency-independent source presence priors.
Proceedings of the IEEE International Conference on Acoustics, 2013
Unsupervised discriminative adaptation using differenced maximum mutual information based linear regression.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 21st European Signal Processing Conference, 2013
2012
Generalization of Multi-Channel Linear Prediction Methods for Blind MIMO Impulse Response Shortening.
IEEE Trans. Speech Audio Process., 2012
Probabilistic Speaker Diarization With Bag-of-Words Representations of Speaker Angle Information.
IEEE Trans. Speech Audio Process., 2012
Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera.
IEEE Trans. Speech Audio Process., 2012
Making Machines Understand Us in Reverberant Rooms: Robustness Against Reverberation for Automatic Speech Recognition.
IEEE Signal Process. Mag., 2012
IEEE Signal Process. Lett., 2012
Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection.
Speech Commun., 2012
Distributed microphone array processing for speech source separation with classifier fusion.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2012
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012
Example-based speech enhancement with joint utilization of spatial, spectral & temporal cues of speech and noise.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Time-varying residual noise feature model estimation for multi-microphone speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A multichannel MMSE-based framework for joint blind source separation and noise reduction.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
LogMax observation model with MFCC-based spectral prior for reduction of highly nonstationary ambient noise.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
New analytical update rule for TDOA inference for underdetermined BSS in noisy environments.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Introduction of speech log-spectral priors into dereverberation based on Itakura-Saito distance minimization.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Noise suppression with unsupervised joint speaker adaptation and noise mixture model estimation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Sparse vector factorization for underdetermined BSS using wrapped-phase GMM and source log-spectral prior.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
New analytical calculation and estimation of TDOA for underdetermined BSS in noisy environments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
IEEE Trans. Speech Audio Process., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Reduction of Highly Nonstationary Ambient Noise by Integrating Spectral and Locational Characteristics of Speech and Noise for Robust ASR.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Single Channel Dereverberation Using Example-Based Speech Enhancement with Uncertainty Decoding Technique.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Speech enhancement based on log spectral envelope model and harmonicity-derived spectral mask, and its coupling with feature compensation.
Proceedings of the IEEE International Conference on Acoustics, 2011
Joint unsupervised learning of hidden Markov source models and source location models for multichannel source separation.
Proceedings of the IEEE International Conference on Acoustics, 2011
Non-stationary noise estimation method based on bias-residual component decomposition for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
Hybrid approach for multichannel source separation combining time-frequency mask with multi-channel Wiener filter.
Proceedings of the IEEE International Conference on Acoustics, 2011
Variance Compensation for Recognition of Reverberant Speech with Dereverberation Preprocessing.
Proceedings of the Robust Speech Recognition of Uncertain or Missing Data, 2011
2010
IEEE Trans. Speech Audio Process., 2010
Introduction to the Special Issue on Processing Reverberant Speech: Methodologies and Applications.
IEEE Trans. Speech Audio Process., 2010
Noise robust voice activity detection based on periodic to aperiodic component ratio.
Speech Commun., 2010
Real-time meeting recognition and understanding using distant microphones and omni-directional camera.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2010), May 30, 2010
Multichannel source separation based on source location cue with log-spectral shaping by hidden Markov source model.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Voice activity detection using frame-wise model re-estimation method based on Gaussian pruning with weight normalization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure.
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Single channel source separation based on sparse source observation model with harmonic constraint.
Proceedings of the IEEE International Conference on Acoustics, 2010
Blind upmix of stereo music signals using multi-step linear prediction based reverberation extraction.
Proceedings of the IEEE International Conference on Acoustics, 2010
Simultaneous clustering of mixing and spectral model parameters for blind sparse source separation.
Proceedings of the IEEE International Conference on Acoustics, 2010
Inverse Filtering for Speech Dereverberation Without the Use of Room Acoustics Information.
Proceedings of the Speech Dereverberation, 2010
2009
IEEE Trans. Speech Audio Process., 2009
Suppression of Late Reverberation Effect on Speech Signal Using Long-Term Multiple-step Linear Prediction.
IEEE Trans. Speech Audio Process., 2009
Static and Dynamic Variance Compensation for Recognition of Reverberant Speech With Dereverberation Preprocessing.
IEEE Trans. Speech Audio Process., 2009
Speech Commun., 2009
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009
A study of mutual front-end processing method based on statistical model for noise robust speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Stereo Source Separation and Source Counting with MAP Estimation with Dirichlet Prior Considering Spatial Aliasing Problem.
Proceedings of the Independent Component Analysis and Signal Separation, 2009
A speaker diarization method based on the probabilistic fusion of audio-visual location information.
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Real-time speech enhancement in noisy reverberant multi-talker environments based on a location-independent room acoustics model.
Proceedings of the IEEE International Conference on Acoustics, 2009
Robust speech dereverberation based on non-negativity and sparse nature of speech spectrograms.
Proceedings of the IEEE International Conference on Acoustics, 2009
Blind sparse source separation for unknown number of sources using Gaussian mixture model fitting with Dirichlet prior.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 17th European Signal Processing Conference, 2009
2008
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model.
IEEE Trans. Speech Audio Process., 2008
A method for fundamental frequency estimation and voicing decision: Application to infant utterances recorded in real acoustical environments.
Speech Commun., 2008
Missing feature speech recognition in a meeting situation with maximum SNR beamforming.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2008), 2008
Study of integration of statistical model-based voice activity detection and noise suppression.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Blind speech dereverberation with multi-channel linear prediction based on short time Fourier transform representation.
Proceedings of the IEEE International Conference on Acoustics, 2008
A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme.
Proceedings of the IEEE International Conference on Acoustics, 2008
Combined static and dynamic variance adaptation for efficient interconnection of speech enhancement pre-processor with speech recognizer.
Proceedings of the IEEE International Conference on Acoustics, 2008
An integrated method for blind separation and dereverberation of convolutive audio mixtures.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
Principles and applications of dereverberation for noisy and reverberant audio signals.
Proceedings of the 42nd Asilomar Conference on Signals, Systems and Computers, 2008
2007
IEEE Trans. Speech Audio Process., 2007
Robust blind dereverberation of speech signals based on characteristics of short-time speech segments.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2007), 2007
Multi-step linear prediction based speech dereverberation in noisy reverberant environment.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Noise robust front-end processing with voice activity detection based on periodic to aperiodic component ratio.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Two-Microphone Voice Activity Detection Based on the Homogeneity of the Direction of Arrival Estimates.
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition.
Speech Commun., 2006
Syst. Comput. Jpn., 2006
Study of noise robust voice activity detection based on periodic component to aperiodic component ratio.
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Spectral Subtraction Steered by Multi-Step Forward Linear Prediction For Single Channel Speech Dereverberation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Harmonicity Based Dereverberation for Improving Automatic Speech Recognition Performance and Speech Intelligibility.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Automatic Sound-Imitation Word Recognition from Environmental Sounds Focusing on Ambiguity Problem in Determining Phonemes.
Proceedings of the PRICAI 2004: Trends in Artificial Intelligence, 2004
Proceedings of the ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, 2004
Harmonicity based monaural speech dereverberation with time warping and F0 adaptive window.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Improving automatic speech recognition performance and speech intelligibility with harmonicity based dereverberation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Improvement in robustness of speech feature extraction method using sub-band based periodicity and aperiodicity decomposition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Disambiguation in determining phonemes of sound-imitation words for environmental sound recognition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
2003
Proceedings of the Advances in Neural Information Processing Systems 16, 2003
Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Robust fundamental frequency estimation against background noise and spectral distortion.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
1999
Harmonic sound stream segregation using localization and its application to speech stream segregation.
Speech Commun., 1999
1998
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998
1997
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997
1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Localization by harmonic structure and its application to harmonic sound stream segregation.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
Interfacing Sound Stream Segregation to Automatic Speech Recognition - Preliminary Results on Listening to Several Sounds Simultaneously.
Proceedings of the Thirteenth National Conference on Artificial Intelligence and Eighth Innovative Applications of Artificial Intelligence Conference, 1996
1995
Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA, USA, July 31, 1994