Joon-Hyuk Chang

Orcid: 0000-0003-2610-2323

According to our database1, Joon-Hyuk Chang authored at least 215 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Proper Error Estimation and Calibration for Attention-Based Encoder-Decoder Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Partitioning Attention Weight: Mitigating Adverse Effect of Incorrect Pseudo-Labels for Self-Supervised ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Efficient Lightweight Speaker Verification With Broadcasting CNN-Transformer and Knowledge Distillation Training of Self-Attention Maps.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Robust Time-of-Arrival-Based Splitting Mean Moving Object Localization.
IEEE Signal Process. Lett., 2024

Differentiable Duration Refinement Using Internal Division for Non-Autoregressive Text-to-Speech.
IEEE Signal Process. Lett., 2024

DM: Dual-path Magnitude Network for General Speech Restoration.
CoRR, 2024

Acoustic-Scene-Aware Target Sound Separation With Sound Embedding Refinement.
IEEE Access, 2024

Stationary Latent Weight Inference for Unreliable Observations from Online Test-Time Adaptation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Text-Only Unsupervised Domain Adaptation for Neural Transducer-Based ASR Personalization Using Synthesized Data.
Proceedings of the IEEE International Conference on Acoustics, 2024

Class: Continual Learning Approach for Speech Super-Resolution.
Proceedings of the IEEE International Conference on Acoustics, 2024

Improving Target Sound Extraction with Timestamp Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Adversarial Learning on Compressed Posterior Space for Non-Iterative Score-based End-to-End Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024

Generalized Specaugment via Multi-Rectangle Inverse Masking For Acoustic Scene Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Attention-based latent features for jointly trained end-to-end automatic speech recognition with modified speech enhancement.
J. King Saud Univ. Comput. Inf. Sci., March, 2023

Robust Localization Method Based on Non-Parametric Probability Density Estimation.
IEEE Access, 2023

Masked Frequency Modeling for Improving Packet Loss Concealment in Speech Transmission Systems.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Class Activation Mapping-Driven Data Augmentation: Masking Significant Regions for Enhanced Acoustic Scene Classification.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

HAD-ANC: A Hybrid System Comprising an Adaptive Filter and Deep Neural Networks for Active Noise Control.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Improving Joint Speech and Emotion Recognition Using Global Style Tokens.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

General-purpose Adversarial Training for Enhanced Automatic Speech Recognition Model Generalization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Intra-ensemble: A New Method for Combining Intermediate Outputs in Transformer-based Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Self-Distillation into Self-Attention Heads for Improving Transformer-based End-to-End Neural Speaker Diarization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Prior-free Guided TTS: An Improved and Efficient Diffusion-based Text-Guided Speech Synthesis.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Resolution Consistency Training on Time-Frequency Domain for Semi-Supervised Sound Event Detection.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SR-SRP: Super-Resolution based SRP-PHAT for Sound Source Localization and Tracking.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Deeply Supervised Curriculum Learning for Deep Neural Network-based Sound Source Localization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Audio Captioning Using Semantic Alignment Enhancer.
Proceedings of the 8th IEEE International Conference on Network Intelligence and Digital Content, 2023

Restoration of Face Mask-Induced Speech Intelligibility Degradation Via Neural Bandwidth Extension.
Proceedings of the 8th IEEE International Conference on Network Intelligence and Digital Content, 2023

Effective Masking Shapes Based Robust Data Augmentation for Acoustic Scene Classification.
Proceedings of the 8th IEEE International Conference on Network Intelligence and Digital Content, 2023

Selective Film Conditioning with CTC-Based ASR Probability for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

Noise-Aware Target Extension with Self-Distillation for Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Repackagingaugment: Overcoming Prediction Error Amplification in Weight-Averaged Speech Recognition Models Subject to Self-Training.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Transformer-Based End-to-End Speaker Diarization by Assigning Auxiliary Losses to Attention Heads.
Proceedings of the IEEE International Conference on Acoustics, 2023

Adaptive Time-Scale Modification for Improving Speech Intelligibility Based On Phoneme Clustering For Streaming Services.
Proceedings of the IEEE International Conference on Acoustics, 2023

M-CTRL: A Continual Representation Learning Framework with Slowly Improving Past Pre-Trained Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

CAN2V: Can-Bus Data-Based Seq2seq Model for Vehicle Velocity Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2023

A Multi-modal Teacher-student Framework for Improved Blood Pressure Estimation.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

Towards Robust Packet Loss Concealment System With ASR-Guided Representations.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Knowledge Distillation From Offline to Streaming Transducer: Towards Accurate and Fast Streaming Model by Matching Alignments.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Cross-Modal Learning for CTC-Based ASR: Leveraging CTC-Bertscore and Sequence-Level Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

AWMC: Online Test-Time Adaptation Without Mode Collapse for Continual Adaptation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Extending Self-Distilled Self-Supervised Learning For Semi-Supervised Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Revisiting Skipped Filter and Development of Robust Localization Method Based on Variational Bayesian Gaussian Mixture Algorithm.
IEEE Trans. Signal Process., 2022

Task-Specific Optimization of Virtual Channel Linear Prediction-Based Speech Dereverberation Front-End for Far-Field Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

VACE-WPE: Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Non-Autoregressive Fully Parallel Deep Convolutional Neural Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Supervised Learning Approach for Explicit Spatial Filtering of Speech.
IEEE Signal Process. Lett., 2022

Instance-level loss based multiple-instance learning for acoustic scene classification.
CoRR, 2022

Robust Localization Based on Mixed-Norm Minimization Criterion.
IEEE Access, 2022

NAS-TasNet: Neural Architecture Search for Time-Domain Speech Separation.
IEEE Access, 2022

FiLM Conditioning with Enhanced Feature to the Transformer-based End-to-End Noisy Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

CTRL: Continual Representation Learning to Transfer Information of Pre-trained for WAV2VEC 2.0.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Regularizing Transformer-based Acoustic Models by Penalizing Attention Weights.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Advanced Speaker Embedding with Predictive Variance of Gaussian Distribution for Speaker Adaptation in TTS.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

One-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

W2V2-Light: A Lightweight Version of Wav2vec 2.0 for Automatic Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

HYU Submission for the SASV Challenge 2022: Reforming Speaker Embeddings with Spoofing-Aware Conditioning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improved CNN-Transformer using Broadcasted Residual Learning for Text-Independent Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Convolutional Recurrent Neural Network with Auxiliary Stream for Robust Variable-Length Acoustic Scene Classification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Knowledge Distillation from Language Model to Acoustic Model: A Hierarchical Multi-Task Learning Approach.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-Scale Architecture and Device-Aware Data-Random-Drop Based Fine-Tuning Method for Acoustic Scene Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

Confidence Regularized Entropy for Polyphonic Sound Event Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Robust Localization Employing Weighted Least Squares Method Based on MM Estimator and Kalman Filter With Maximum Versoria Criterion.
IEEE Signal Process. Lett., 2021

Attention-Based Joint Training of Noise Suppression and Sound Event Detection for Noise-Robust Classification.
Sensors, 2021

Luminance-Degradation Compensation Based on Multistream Self-Attention to Address Thin-Film Transistor-Organic Light Emitting Diode Burn-In.
Sensors, 2021

Deep Q-network-based noise suppression for robust speech recognition.
Turkish J. Electr. Eng. Comput. Sci., 2021

Attribution Mask: Filtering Out Irrelevant Features By Recursively Focusing Attention on Inputs of DNNs.
CoRR, 2021

Modified MM Algorithm and Bayesian Expectation Maximization-Based Robust Localization Under NLOS Contaminated Environments.
IEEE Access, 2021

Document-Level Neural TTS Using Curriculum Learning and Attention Masking.
IEEE Access, 2021

MIMO Noise Suppression Preserving Spatial Cues for Sound Source Localization in Mobile Robot.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Deep Neural Network Calibration for E2E Speech Recognition System.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Zero-Shot Voice Cloning Using Variational Embedding with Attention Mechanism.
Proceedings of the 7th IEEE International Conference on Network Intelligence and Digital Content, 2021

Short-Utterance Embedding Enhancement Method Based on Time Series Forecasting Technique for Text-Independent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Robust Localization Based on ML-Type, Multi-Stage ML-Type, and Extrapolated Single Propagation UKF Methods Under Mixed LOS/NLOS Conditions.
IEEE Trans. Wirel. Commun., 2020

Cuffless Deep Learning-Based Blood Pressure Estimation for Smart Wristwatches.
IEEE Trans. Instrum. Meas., 2020

Multi-TALK: Multi-Microphone Cross-Tower Network for Jointly Suppressing Acoustic Echo and Background Noise.
Sensors, 2020

Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments.
Sensors, 2020

Robust range estimation algorithm based on hyper-tangent loss function.
IET Signal Process., 2020

Deep neural network ensemble for reducing artificial noise in bandwidth extension.
Digit. Signal Process., 2020

Delayless Block Individual-Weighting-Factors Sign Subband Adaptive Filters With an Improved Band-Dependent Variable Step-Size.
IEEE Access, 2020

End-to-End Speech Endpoint Detection Utilizing Acoustic and Language Modeling Knowledge for Online Low-Latency Speech Recognition.
IEEE Access, 2020

Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Attention Wave-U-Net for Acoustic Echo Cancellation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Robust Shrinkage Range Estimation Algorithms Based on Hampel and Skipped Filters.
Wirel. Commun. Mob. Comput., 2019

Smart Wristwatches Employing Finger-Conducted Voice Transmission System.
IEEE Trans. Ind. Informatics, 2019

State-Space Microphone Array Nonlinear Acoustic Echo Cancellation Using Multi-Microphone Near-End Speech Covariance.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Shrinkage sinusoidal phase estimation based on spherical simplex unscented transform and bootstrap method.
Trans. Emerg. Telecommun. Technol., 2019

Ensemble of jointly trained deep neural network-based acoustic models for reverberant speech recognition.
Digit. Signal Process., 2019

Robust LMedS-Based WLS and Tukey-Based EKF Algorithms Under LOS/NLOS Mixture Conditions.
IEEE Access, 2019

WLS Localization Using Skipped Filter, Hampel Filter, Bootstrapping and Gaussian Mixture EM in LOS/NLOS Conditions.
IEEE Access, 2019

Joint Optimization of Neural Acoustic Beamforming and Dereverberation with x-Vectors for Robust Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

DNN based multi-speaker speech synthesis with temporal auxiliary speaker ID embedding.
Proceedings of the International Conference on Electronics, Information, and Communication, 2019

2018
Sequential source localisation and range estimation based on shrinkage algorithm.
IET Signal Process., 2018

Dempster-Shafer theory for enhanced statistical model-based voice activity detection.
Comput. Speech Lang., 2018

Augmenting Bottleneck Features of Deep Neural Network Employing Motor State for Speech Recognition at Humanoid Robots.
CoRR, 2018

Sequential Deep Neural Networks Ensemble for Speech Bandwidth Extension.
IEEE Access, 2018

DNN-based Speech Recognition System dealing with Motor State as Auxiliary Information of DNN for Head Shaking Robot.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

2017
Oscillometric Blood Pressure Estimation Based on Deep Learning.
IEEE Trans. Ind. Informatics, 2017

Noncontact Sleep Study by Multi-Modal Sensor Fusion.
Sensors, 2017

Spectral difference for statistical model-based speech enhancement in speech recognition.
Multim. Tools Appl., 2017

Adaptive robust time-of-arrival source localization algorithm based on variable step size weighted block Newton method.
EURASIP J. Wirel. Commun. Netw., 2017

TOA source localization and DOA estimation algorithms using prior distribution for calibrated source.
Digit. Signal Process., 2017

Bayesian feature enhancement using independent vector analysis and reverberation parameter re-estimation for noisy reverberant speech recognition.
Comput. Speech Lang., 2017

Deep learning ensemble with asymptotic techniques for oscillometric blood pressure estimation.
Comput. Methods Programs Biomed., 2017

Oscillometric blood pressure estimation by combining nonparametric bootstrap with Gaussian mixture model.
Comput. Biol. Medicine, 2017

Deep Belief Networks Ensemble for Blood Pressure Estimation.
IEEE Access, 2017

2016
Closed-Form Localization for Distributed MIMO Radar Systems Using Time Delay Measurements.
IEEE Trans. Wirel. Commun., 2016

Improved Gaussian Mixture Regression Based on Pseudo Feature Generation Using Bootstrap in Blood Pressure Estimation.
IEEE Trans. Ind. Informatics, 2016

Packet Loss Concealment Based on Deep Neural Networks for Digital Speech Transmission.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

On using multivariate polynomial regression model with spectral difference for statistical model-based speech enhancement.
J. Syst. Archit., 2016

Time-of-arrival source localization based on weighted least squares estimator in line-of-sight/non-line-of-sight mixture environments.
Int. J. Distributed Sens. Networks, 2016

Closed-form two-step weighted-least-squares-based time-of-arrival source localisation using invariance property of maximum likelihood estimator in multiple-sample environment.
IET Commun., 2016

Robust time-of-arrival source localization employing error covariance of sample mean and sample median in line-of-sight/non-line-of-sight mixture environments.
EURASIP J. Adv. Signal Process., 2016

Novel adaptive muting technique for packet loss concealment of ITU-T G.722 using optimized parametric shaping functions.
EURASIP J. Audio Speech Music. Process., 2016

Ensemble of deep neural networks using acoustic environment classification for statistical model-based voice activity detection.
Comput. Speech Lang., 2016

Ensemble of Jointly Trained Deep Neural Network-Based Acoustic Models for Reverberant Speech Recognition.
CoRR, 2016

Blind estimation of reverberation time using deep neural network.
Proceedings of the IEEE International Conference on Network Infrastructure and Digital Content, 2016

An experimental study: The sufficient respiration rate detection technique via continuous wave Doppler radar.
Proceedings of the IEEE International Conference on Network Infrastructure and Digital Content, 2016

Dual-microphone voice activity detection based on using optimally weighted maximum a posteriori probabilities.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Robust closed-form time-of-arrival source localization based on α-trimmed mean and Hodges-Lehmann estimator under NLOS environments.
Signal Process., 2015

Efficient implementation techniques of an SVM-based speech/music classifier in SMV.
Multim. Tools Appl., 2015

Shrinkage-based biased signal-to-noise ratio estimator using pilot and data symbols for linearly modulated signals.
IET Commun., 2015

Estimated confidence interval from single blood pressure measurement based on algorithmic fusion.
Comput. Biol. Medicine, 2015

A statistical model-based voice activity detection using multiple DNNs and noise awareness.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Dual-Microphone Voice Activity Detection Technique Based on Two-Step Power Level Difference Ratio.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Frequency-Domain Volterra Filter Based on Data-Driven Soft Decision for Nonlinear Acoustic Echo Suppression.
IEEE Signal Process. Lett., 2014

A new television audience measurement framework using smart devices.
Multim. Tools Appl., 2014

Biased SNR estimation using pilot and data symbols in BPSK and QPSK systems.
J. Commun. Networks, 2014

Efficient Implementation of Statistical Model-Based Voice Activity Detection Using Taylor Series Approximation.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014

Shrinkage estimation-based source localization with minimum mean squared error criterion and minimum bias criterion.
Digit. Signal Process., 2014

A new a priori SNR estimator based on multiple linear regression technique for speech enhancement.
Digit. Signal Process., 2014

Enhanced muting method in packet loss concealment of ITU-t g.722 using sigmoid function with on-line optimized parameters.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Voice Activity Detection Based on Statistical Model Employing Deep Neural Network.
Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014

2013
Oscillometric Blood Pressure Estimation Based on Maximum Amplitude Algorithm Employing Gaussian Mixture Regression.
IEEE Trans. Instrum. Meas., 2013

Improved Speech-Presence Uncertainty Estimation Based on Spectral Gradient for Global Soft Decision-Based Speech Enhancement.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013

Online Sparse Volterra System Identification Using Projections onto Weighted <i>l</i><sub>1</sub> Balls.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2013

Noisy speech enhancement based on improved minimum statistics incorporating acoustic environment-awareness.
Digit. Signal Process., 2013

Enhanced muting method in packet loss concealment of ITU-t g.722 employing optimized sigmoid function.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Efficient implementation of an SVM-based speech/music classifier by enhancing temporal locality in support vector references.
IEEE Trans. Consumer Electron., 2012

On using acoustic environment classification for statistical model-based speech enhancement.
Speech Commun., 2012

Voice activity detection based on conditional MAP criterion incorporating the spectral gradient.
Signal Process., 2012

Enhancing support vector machine-based speech/music classification using conditional maximum a posteriori criterion.
IET Signal Process., 2012

Improvement of SVM-Based Speech/Music Classification Using Adaptive Kernel Technique.
IEICE Trans. Inf. Syst., 2012

A Statistical Model-Based Speech Enhancement Using Acoustic Noise Classification for Robust Speech Communication.
IEICE Trans. Commun., 2012

Integrated acoustic echo and background noise suppression technique based on soft decision.
EURASIP J. Adv. Signal Process., 2012

On using spectral gradient in conditional MAP criterion for robust voice activity detection.
Proceedings of the 3rd IEEE International Conference on Network Infrastructure and Digital Content, 2012

New techniques for improving the practicality of an SVM-based speech/music classifier.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Adaptive noise power estimation using spectral difference for robust speech enhancement.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Minima-controlled speech presence uncertainty tracking method for speech enhancement.
Signal Process., 2011

Speech Enhancement Based on Data-Driven Residual Gain Estimation.
IEICE Trans. Inf. Syst., 2011

Speech Enhancement Based on Adaptive Noise Power Estimation Using Spectral Difference.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2011

A Soft Decision-Based Speech Enhancement Using Acoustic Noise Classification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Frequency-Domain Double-Talk Detection Based on the Gaussian Mixture Model.
IEEE Signal Process. Lett., 2010

Double-talk detection based on soft decision for acoustic echo suppression.
Signal Process., 2010

Improved Global Soft Decision Incorporating Second-Order Conditional MAP in Speech Enhancement.
IEICE Trans. Inf. Syst., 2010

Discriminative Weight Training for Support Vector Machine-Based Speech/Music Classification in 3GPP2 SMV Codec.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2010

Efficient Speech Reinforcement Based on Low-Bit-Rate Speech Coding Parameters.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2010

Improved minima controlled recursive averaging technique using conditional maximum a posteriori criterion for speech enhancement.
Digit. Signal Process., 2010

Voice Activity Detection Based on Discriminative Weight Training Incorporating a Spectral Flatness Measure.
Circuits Syst. Signal Process., 2010

Voice activity detection based on statistical models and machine learning approaches.
Comput. Speech Lang., 2010

On using Gaussian mixture model for double-talk detection in acoustic echo suppression.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Toward detecting voice activity employing soft decision in second-order conditional MAP.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A statistical model-based double-talk detection incorporating soft decision.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Frequency Domain Acoustic Echo Suppression Based on Soft Decision.
IEEE Signal Process. Lett., 2009

Speech Enhancement Based on Minima Controlled Recursive Averaging Incorporating Second-Order Conditional MAP Criterion.
IEEE Signal Process. Lett., 2009

Global Soft Decision Employing Support Vector Machine For Speech Enhancement.
IEEE Signal Process. Lett., 2009

Efficient Implementation of Voiced/Unvoiced Sounds Classification Based on GMM for SMV Codec.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

Acoustic Environment Classification Based on SMV Speech Codec Parameters for Context-Aware Mobile Phone.
IEICE Trans. Inf. Syst., 2009

Speech/Music Classification Enhancement for 3GPP2 SMV Codec Based on Support Vector Machine.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

Speech Reinforcement Based on Soft Decision under Far-End Noise Environments.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

Discriminative weight training-based optimally weighted MFCC for gender identification.
IEICE Electron. Express, 2009

Soft decision-based acoustic echo suppression in a frequency domain.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Enhanced minimum statistics technique incorporating soft decision for noise suppression.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Speech enhancement based on minima controlled recursive averaging incorporating conditional maximum a posteriori criterion.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Analysis and Improvement of Speech/Music Classification for 3GPP2 SMV Based on GMM.
IEEE Signal Process. Lett., 2008

A Probabilistic Combination Method of Minimum Statistics and Soft Decision for Robust Noise Power Estimation in Speech Enhancement.
IEEE Signal Process. Lett., 2008

Discriminative Weight Training for a Statistical Model-Based Voice Activity Detection.
IEEE Signal Process. Lett., 2008

A Support Vector Machine-Based Gender Identification Using Speech Signal.
IEICE Trans. Commun., 2008

Frame Splitting Scheme for Error-Robust Audio Streaming over Packet-Switching Networks.
IEICE Trans. Commun., 2008

A Support Vector Machine-Based Voice Activity Detection Employing Effective Feature Vectors.
IEICE Trans. Commun., 2008

Group delay function for improved gender identification.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

A statistical model-based voice activity detection employing minimum classification error technique.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
A New Statistical Voice Activity Detection Based on UMP Test.
IEEE Signal Process. Lett., 2007

Voice activity detection based on a family of parametric distributions.
Pattern Recognit. Lett., 2007

Multiple statistical models for soft decision in noisy speech enhancement.
Pattern Recognit., 2007

Speech Enhancement Based on Perceptually Comfortable Residual Noise.
IEICE Trans. Commun., 2007

A Novel Approach to a Robust <i>a Priori</i> SNR Estimator in Speech Enhancement.
IEICE Trans. Commun., 2007

Improved Global Soft Decision Using Smoothed Global Likelihood Ratio for Speech Enhancement.
IEICE Trans. Commun., 2007

Residual echo reduction based on MMSE estimator in acoustic echo canceller.
IEICE Electron. Express, 2007

Complex laplacian probability density function for noisy speech enhancement.
IEICE Electron. Express, 2007

Voice activity detection based on support vector machine using effective feature vectors.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A uniformly most powerful test for statistical model-based voice activity detection.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Voice activity detection based on multiple statistical models.
IEEE Trans. Signal Process., 2006

A new structural approach in system identification with generalized analysis-by-synthesis for robust speech coding.
IEEE Trans. Speech Audio Process., 2006

Perceptual weighting filter for robust speech modification.
Signal Process., 2006

Signal modification incorporating perceptual weighting filter.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Warped discrete cosine transform-based noisy speech enhancement.
IEEE Trans. Circuits Syst. II Express Briefs, 2005

Statistical modeling of speech signals based on generalized gamma distribution.
IEEE Signal Process. Lett., 2005

Image probability distribution based on generalized gamma function.
IEEE Signal Process. Lett., 2005

Pitch estimation of speech signal based on adaptive lattice notch filter.
Signal Process., 2005

Multiband Vector Quantization Based on Inner Product for Wideband Speech Coding.
IEICE Trans. Inf. Syst., 2005

A new structural preprocessor for low-bit rate speech coding.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Voice Activity Detection based on Generalized Gamma Distribution.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Signal modification for robust speech coding.
IEEE Trans. Speech Audio Process., 2004

A Statistical Model-Based V/UV Decision under Background Noise Environments.
IEICE Trans. Inf. Syst., 2004

Distorted Speech Rejection for Automatic Speech Recognition in Wireless Communication.
IEICE Trans. Inf. Syst., 2004

Speech probability distribution based on generalized gama distribution.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Inner product based-multiband vector quantization for wideband speech coding at 16 kbps.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003
Likelihood ratio test with complex laplacian model for voice activity detection.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
A preprocessor for low-bit-rate speech coding.
IEEE Signal Process. Lett., 2002

Generalized analysis-by-synthesis based on system identification.
Proceedings of the IEEE International Conference on Acoustics, 2002

2000
Spectral enhancement based on global soft decision.
IEEE Signal Process. Lett., 2000

Speech enhancement: new approaches to soft decision.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000


  Loading...