Hong-Goo Kang

IEEE ACM Trans. Audio Speech Lang. Process., 2024

LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration.

[BibT_eX]

[DOI]

Sangmin Lee

CoRR, 2024

Optimization of DNN-based speaker verification model through efficient quantization technique.

[BibT_eX]

[DOI]

Yeona Hong

CoRR, 2024

Speaker-Independent Acoustic-to-Articulatory Inversion through Multi-Channel Attention Discriminator.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient, Cluster-Informed, Deep Speech Separation with Cross-Cluster Information in AD-HOC Wireless Acoustic Sensor Networks.

[BibT_eX]

[DOI]

Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

On Fine-Tuning Pre-Trained Speech Models With EMA-Target Self-Supervised Loss.

[BibT_eX]

[DOI]

Hejung Yang

Proceedings of the IEEE International Conference on Acoustics, 2024

Spectrum-Aware Neural Vocoder Based on Self-Supervised Learning for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 32nd European Signal Processing Conference, 2024

SC-ERM: Speaker-Centric Learning for Speech Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Electronics, Information, and Communication, 2024

On the Disentanglement and Robustness of Self-Supervised Speech Representations.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Electronics, Information, and Communication, 2024

Contextual Learning for Missing Speech Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Electronics, Information, and Communication, 2024

2023

Real-time neural speech enhancement based on temporal refinement network and channel-wise gating methods.

[BibT_eX]

[DOI]

Jinyoung Lee

Digit. Signal Process., March, 2023

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2023

C2C: Cough to COVID-19 Detection in BHI 2023 Data Challenge.

[BibT_eX]

[DOI]

Miseul Kim

CoRR, 2023

A High-Rate Extension to Soundstream.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Feature Normalization for Fine-tuning Self-Supervised Models in Speech Enhancement.

[BibT_eX]

[DOI]

Hejung Yang

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Contrastive Learning based Deep Latent Masking for Music Source Separation.

[BibT_eX]

[DOI]

Jihyun Kim

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

HD-DEMUCS: General Speech Restoration with Heterogeneous Decoders.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MF-PAM: Accurate Pitch Estimation through Periodicity Analysis and Multi-level Feature Fusion.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

HappyQuokka System for ICASSP 2023 Auditory EEG Challenge.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

End-to-End Neural Audio Coding in the MDCT Domain.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Style Modeling for Multi-Speaker Articulation-to-Speech.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Progressive Multi-Stage Neural Audio Codec with Psychoacoustic Loss and Discriminator.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Facetron: A Multi-Speaker Face-to-Speech Model Based on Cross-Modal Latent Representations.

[BibT_eX]

[DOI]

Proceedings of the 31st European Signal Processing Conference, 2023

BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0.

[BibT_eX]

[DOI]

Proceedings of the IEEE EMBS International Conference on Biomedical and Health Informatics, 2023

PDF-NET: Pitch-adaptive Dynamic Filter Network for Intra-gender Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

Consideration of Varying Training Lengths for Short-Duration Speaker Verification.

[BibT_eX]

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

Two-Stage Refinement of Magnitude and Complex Spectra for Real-Time Speech Enhancement.

[BibT_eX]

[DOI]

Jinyoung Lee

IEEE Signal Process. Lett., 2022

ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence.

[BibT_eX]

[DOI]

Sangshin Oh

Seyun Um

CoRR, 2022

AILTTS: Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech.

[BibT_eX]

[DOI]

CoRR, 2022

SASV Challenge 2022: A Spoofing Aware Speaker Verification Challenge Evaluation Plan.

[BibT_eX]

[DOI]

CoRR, 2022

Length-Normalized Representation Learning for Speech Signals.

[BibT_eX]

[DOI]

Kyungguen Byun

Seyun Um

IEEE Access, 2022

Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Light-Weight Speaker Verification with Global Context Information.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Adversarial Audio Synthesis Using a Harmonic-Percussive Discriminator.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Progressive Multi-Stage Neural Audio Coding with Guided References.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Facetron: Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations.

[BibT_eX]

[DOI]

CoRR, 2021

LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Self-supervised Complex Network for Machine Sound Anomaly Detection.

[BibT_eX]

[DOI]

Miseul Kim

Minh Tri Ho

Proceedings of the 29th European Signal Processing Conference, 2021

A Fast and Lightweight Text-To-Speech Model with Spectrum and Waveform Alignment Algorithms.

[BibT_eX]

[DOI]

Kihyuk Jeong

Huu-Kim Nguyen

Proceedings of the 29th European Signal Processing Conference, 2021

Disentangled Representations for Arabic Dialect Identification based on Supervised Clustering with Triplet Loss.

[BibT_eX]

[DOI]

Zainab Alhakeem

Yoohwan Kwon

Proceedings of the 29th European Signal Processing Conference, 2021

Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Stacked U-Net with High-Level Feature Transfer for Parameter Efficient Speech Enhancement.

[BibT_eX]

[DOI]

Jinyoung Lee

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020

Perfect Match: Self-Supervised Embeddings for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

Joon Son Chung

IEEE J. Sel. Top. Signal Process., 2020

Effective Emotion Transplantation in an End-to-End Text-to-Speech System.

[BibT_eX]

[DOI]

IEEE Access, 2020

Speaker-Adaptive Neural Vocoders for Parametric Speech Synthesis Systems.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Intra-Class Variation Reduction of Speaker Representation in Disentanglement Framework.

[BibT_eX]

[DOI]

Yoohwan Kwon

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Cross-Channel Attention-Based Wave-U-Net for Multi-Channel Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

MIRNet: Learning Multiple Identities Representations in Overlapped Speech.

[BibT_eX]

[DOI]

Hyewon Han

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Seeing Voices and Hearing Voices: Learning Discriminative Embeddings Using Cross-Modal Self-Supervision.

[BibT_eX]

[DOI]

Joon Son Chung

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

FaceFilter: Audio-Visual Speech Separation Using Still Images.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Emotional Speech Synthesis with Rich and Granularized Control.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Speaker-invariant Psychological Stress Detection Using Attention-based Network.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

A Study on Conditional Features for a Flow-based Neural Vocoder.

[BibT_eX]

[DOI]

Proceedings of the 54th Asilomar Conference on Signals, Systems, and Computers, 2020

2019

A Joint Learning Algorithm for Complex-Valued T-F Masks in Deep Learning-Based Single-Channel Speech Enhancement Systems.

[BibT_eX]

[DOI]

Jinkyu Lee

IEEE ACM Trans. Audio Speech Lang. Process., 2019

An Effective Style Token Weight Control Technique for End-to-End Emotional Speech Synthesis.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2019

Dry Electrode-Based Body Fat Estimation System with Anthropometric Data for Use in a Wearable Device.

[BibT_eX]

[DOI]

Sensors, 2019

Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems.

[BibT_eX]

[DOI]

CoRR, 2019

Orthonormal Embedding-based Deep Clustering for Single-channel Speech Separation.

[BibT_eX]

[DOI]

CoRR, 2019

Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment.

[BibT_eX]

[DOI]

Min-Jae Hwang

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Gradient-based Active Learning Query Strategy for End-to-end Speech Recognition.

[BibT_eX]

[DOI]

Yang Yuan

Proceedings of the IEEE International Conference on Acoustics, 2019

Perfect Match: Improved Cross-modal Embeddings for Audio-visual Synchronisation.

[BibT_eX]

[DOI]

Joon Son Chung

Proceedings of the IEEE International Conference on Acoustics, 2019

ExcitNet Vocoder: A Neural Excitation Model for Parametric Speech Synthesis Systems.

[BibT_eX]

[DOI]

Kyungguen Byun

Proceedings of the 27th European Signal Processing Conference, 2019

A Study on Acoustic Parameter Selection Strategies to Improve Deep Learning-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018

SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2018

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, 2018

Speaker-adaptive neural vocoders for statistical parametric speech synthesis systems.

[BibT_eX]

[DOI]

CoRR, 2018

Session details: Oral Session 2.

[BibT_eX]

[DOI]

Proceedings of the 2018 Workshop on Audio-Visual Scene Understanding for Immersive Multimedia, 2018

AVSU: Workshop on Audio-Visual Scene Understanding for Immersive Multimedia.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

A Deep Learning-based Stress Detection Algorithm with Speech Signal.

[BibT_eX]

[DOI]

Hyewon Han

Kyunggeun Byun

Proceedings of the 2018 Workshop on Audio-Visual Scene Understanding for Immersive Multimedia, 2018

A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Dnn-Based Wireless Positioning in an Outdoor Environment.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Modeling-By-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Two electrode based healthcare device for continuously monitoring ECG and BIA signals.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE EMBS International Conference on Biomedical & Health Informatics, 2018

2017

Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Deep bi-directional long short-term memory based speech enhancement for wind noise reduction.

[BibT_eX]

[DOI]

Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

A study on search grid points for data-driven 3-D beamsteering.

[BibT_eX]

[DOI]

Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

Continuous bladder volume monitoring system for wearable applications.

[BibT_eX]

[DOI]

Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

On pre-filtering strategies for the GCC-PHAT algorithm.

[BibT_eX]

[DOI]

Michael Graczyk

Jan Skoglund

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Parametric-based non-intrusive speech quality assessment by deep neural network.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016

A pitch-synchronous speech analysis and synthesis method for DNN-SPSS system.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016

Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Efficient deep neural networks for speech synthesis using bottleneck features.

[BibT_eX]

[DOI]

Won-Suk Jun

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015

A Priori SNR Estimation Using Air- and Bone-Conduction Microphones.

[BibT_eX]

[DOI]

Ho Seon Shin

Tim Fingscheidt

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Systematic integration of acoustic echo canceller and noise reduction modules for voice communication systems.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Coherent channel based subband multichannel dereverberation.

[BibT_eX]

[DOI]

JeeSok Lee

Sejin Oh

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Detection of fiducial points in ECG waves using iteration based adaptive thresholds.

[BibT_eX]

[DOI]

Wonjune Kang

Kyunguen Byun

Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

A constrained two-layer compression technique for ECG waves.

[BibT_eX]

[DOI]

Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

Robust formant features for speaker verification in the lombard effect.

[BibT_eX]

[DOI]

Ileun Kwak

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

An Efficient Multichannel Linear Prediction-Based Blind Equalization Algorithm in Near Common Zeros Condition.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2014

Knowledge-Based Manner Class Segmentation Based on the Acoustic Event and Landmark Detection Algorithm.

[BibT_eX]

[DOI]

Jung In Lee

Jeung-Yoon Choi

IEICE Trans. Inf. Syst., 2014

New aliasing cancelation algorithm for the transition between non-aliased and TDAC-based coding modes.

[BibT_eX]

[DOI]

Jeongook Song

EURASIP J. Audio Speech Music. Process., 2014

Fixed-point implementation of MPEG-D unified speech and audio coding decoder.

[BibT_eX]

[DOI]

Joonil Lee

Proceedings of the 19th International Conference on Digital Signal Processing, 2014

A maximum a Posterior-based reconstruction approach to speech bandwidth expansion in noise.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Factored adaptation of speaker and environment using orthogonal subspace transforms.

[BibT_eX]

[DOI]

Michael L. Seltzer

Proceedings of the IEEE International Conference on Acoustics, 2014

Detecting pathological speech using contour modeling of harmonic-to-noise ratio.

[BibT_eX]

[DOI]

Jung-Won Lee

Samuel Kim

Proceedings of the IEEE International Conference on Acoustics, 2014

Mean normalization of power function based cepstral coefficients for robust speech recognition in noisy environment.

[BibT_eX]

[DOI]

Soonho Baek

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Refinement of Landmark Detection and Extraction of Articulator-Free Features for Knowledge-Based Speech Recognition.

[BibT_eX]

[DOI]

Jung In Lee

Jeung-Yoon Choi

IEICE Trans. Inf. Syst., 2013

A source-filter based adaptive harmonic model and its application to speech prosody modification.

[BibT_eX]

[DOI]

JeeSok Lee

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Enhancement of spectral clarity for HMM-based text-to-speech systems.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Adaptive multichannel linear prediction based dereverberation in time-varying room environments.

[BibT_eX]

[DOI]

Proceedings of the 21st European Signal Processing Conference, 2013

Adaptation of HMM dynamic parameters in reverberant environment.

[BibT_eX]

[DOI]

Jinkyu Lee

Proceedings of the 21st European Signal Processing Conference, 2013

Vector Taylor series based HMM adaptation for generalized cepstrum in noisy environment.

[BibT_eX]

[DOI]

Soonho Baek

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Speech enhancement for pathological voice using time-frequency trajectory excitation modeling.

[BibT_eX]

[DOI]

Jongyoub Ryu

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Detecting pathological speech using local and global characteristics of harmonic-to-noise ratio.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Waveform Interpolation-Based Speech Analysis/Synthesis for HMM-Based TTS Systems.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2012

Two-stage source tracking method using a multiple linear regression model in the expanded phase domain.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2012

Survey of Speech Enhancement Supported by a Bone Conduction Microphone.

[BibT_eX]

[DOI]

Ho Seon Shin

Tim Fingscheidt

Proceedings of the 10th ITG Conference on Speech Communication, 2012

2011

An Interactive 3-D Audio System With Loudspeakers.

[BibT_eX]

[DOI]

Cha Zhang

Dinei A. F. Florêncio

IEEE Trans. Multim., 2011

Robust Session Variability Compensation for SVM Speaker Verification.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2011

A Two-Channel Noise Estimator for Speech Enhancement in a Highly Nonstationary Environment.

[BibT_eX]

[DOI]

Min-Seok Choi

IEEE ACM Trans. Audio Speech Lang. Process., 2011

Estimating redundancy information of selected features in multi-dimensional pattern classification.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2011

Transient noise reduction in speech signal with a modified long-term predictor.

[BibT_eX]

[DOI]

Min-Seok Choi

EURASIP J. Adv. Signal Process., 2011

Classification of Fricatives Using Feature Extrapolation of Acoustic-Phonetic Features in Telephone Speech.

[BibT_eX]

[DOI]

Jung-Won Lee

Jeung-Yoon Choi

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Enhanced long-term predictor for Unified Speech and Audio Coding.

[BibT_eX]

[DOI]

Jeongook Song

Hyen-O Oh

Proceedings of the IEEE International Conference on Acoustics, 2011

Signal and feature domain enhancement approaches for robust speech recognition.

[BibT_eX]

[DOI]

Jinkyu Lee

Soonho Baek

Proceedings of the 8th International Conference on Information, 2011

2010

Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information.

[BibT_eX]

[DOI]

Moo Young Kim

IEEE Trans. Speech Audio Process., 2010

On the Importance of Transition Regions for Automatic Speaker Recognition.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2010

Harmonic Enhancement in Low Bitrate Audio Coding Using an Efficient Long-Term Predictor.

[BibT_eX]

[DOI]

EURASIP J. Adv. Signal Process., 2010

Enhancing loudspeaker-based 3D audio with room modeling.

[BibT_eX]

[DOI]

Cha Zhang

Dinei A. F. Florêncio

Proceedings of the 2010 IEEE International Workshop on Multimedia Signal Processing, 2010

A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification.

[BibT_eX]

[DOI]

Kyu Jeong Han

Shrikanth S. Narayanan

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Personal 3D audio system with loudspeakers.

[BibT_eX]

[DOI]

Cha Zhang

Dinei A. F. Florêncio

Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Binaural loudness based speech reinforcement with a closed-form solution.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

On the Study of Noise Allocation for Speech Signal in Low Bit-Rate Audio Coding.

[BibT_eX]

[DOI]

Chang-Heon Lee

Hyen-O Oh

IEEE Signal Process. Lett., 2009

Normalized minimum-redundancy and maximum-relevancy based feature selection for speaker verification systems.

[BibT_eX]

[DOI]

Moo Young Kim

Proceedings of the IEEE International Conference on Acoustics, 2009

A robust time difference of arrival estimator in reverberant environments.

[BibT_eX]

[DOI]

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Speech Bandwidth Extension Using Temporal Envelope Modeling.

[BibT_eX]

[DOI]

Kyung-Tae Kim

Min-Ki Lee

IEEE Signal Process. Lett., 2008

Designing a unified speech/audio codec by adopting a single channel harmonic source separation module.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Applying a Speaker-Dependent Speech Compression Technique to Concatenative TTS Synthesizers.

[BibT_eX]

[DOI]

Chang-Heon Lee

Sung-Kyo Jung

IEEE Trans. Speech Audio Process., 2007

Speech quality estimation using packet loss effects in CELP-type speech coders.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A Soft-Decision Adaptation Mode Controller for an Efficient Frequency-Domain Generalized Sidelobe Canceller.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2007

2006

Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition.

[BibT_eX]

[DOI]

Chang-Heon Lee

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

An efficient segment-based speech compression technique for hand-held TTS systems.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

On the Use of Voting Methods for Speaker Identification Based on Various Resolution Filterbanks.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005

A fast adaptive-codebook search algorithm for G.723.1 speech coder.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2005

An information-theoretic perspective on feature selection in speaker recognition.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2005

Improving the Performance of the Minimum Statistics Noise Estimator for Single Channel Speech Enhancement.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005

Adaptive Microphone Array System with Two-Stage Adaptation Mode Controller.

[BibT_eX]

[DOI]

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005

A noise-robust pitch synchronous feature extraction algorithm for speaker recognition systems.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

An improved estimation of a priori speech absence probability for speech enhancement : in perspective of speech perception.

[BibT_eX]

[DOI]

Min-Seok Choi

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

An efficient transcoding algorithm for G.723.1 and G.729A speech coders: interoperability between mobile and IP network.

[BibT_eX]

[DOI]

Speech Commun., 2004

Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

On the time variability of vocal tract for speaker recognition.

[BibT_eX]

[DOI]

Samuel Kim

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Performance analysis of transcoding algorithms in packet-loss environments.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Theory for speaker recognition over IP.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

A pitch synchronous feature extraction method for speaker recognition.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

A bit-rate/bandwidth scalable speech coder based on ITU-T G.723.1 standard.

[BibT_eX]

[DOI]

Sung-Kyo Jung

Kyung-Tae Kim

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Improvement issues on transcoding algorithms: for the flexible usage to the various pairs of speech codec.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Improving the transcoding capability of speech coders.

[BibT_eX]

[DOI]

Richard V. Cox

IEEE Trans. Multim., 2003

Performance Comparison of Single and Multi-Stage Algebraic Codebooks.

[BibT_eX]

[DOI]

Sung-Kyo Jung

Dae Hee Youn

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2003

Transcoding algorithm for g.723.1 and AMR speech coders: for interoperability between voIP and mobile networks.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A packet loss concealment algorithm based on time-scale modification for CELP-type speech coders.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

A cascaded algebraic codebook structure to improve the performance of speech coder.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

An adaptive short-term postfilter based on pseudo-cepstral representation of line spectral frequencies.

[BibT_eX]

[DOI]

Speech Commun., 2002

A phase generation method for speech reconstruction from spectral envelope and pitch intervals.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

A frame erasure concealment algorithm based on gain parameter re-estimation for CELP coders.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2001

Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments.

[BibT_eX]

[DOI]

Richard C. Rose

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

A candidate for the ITU-T 4 kbit/s speech coding standard.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Low-rate quantization of spectrum parameters.

[BibT_eX]

[DOI]

Per Hedelin

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Low delay analysis/synthesis schemes for joint speech enhancement and low bit rate speech coding.

[BibT_eX]

[DOI]

Rainer Martin

Richard V. Cox

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Phase adjustment in waveform interpolation.

[BibT_eX]

[DOI]

Dipanjan Sen

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Pitch quantization in low bit-rate speech coding.

[BibT_eX]

[DOI]

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Quantization of the spectral envelope for sinusoidal coders.

[BibT_eX]

[DOI]

Yannis Stylianou

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

Improved regular pulse VSELP coding of speech at low bit-rates.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

A 3 channel digital CVSD bit-rate conversion system using a general purpose DSP.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996

A fast VSELP speech coder based on mutually orthonormal regular pulse vectors.

[BibT_eX]

[DOI]

Yong-Soo Choi