Nam Soo Kim
Orcid: 0000-0002-0568-4902
According to our database, Nam Soo Kim authored at least 217 papers between 1990 and 2024.
Bibliography
2024
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE Signal Process. Lett., 2024
IEEE Signal Process. Lett., 2024
SegINR: Segment-wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech.
CoRR, 2024
MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance.
CoRR, 2024
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction.
CoRR, 2024
Cons-KD: Dropout-Robust Knowledge Distillation for CTC-Based Automatic Speech Recognition.
IEEE Access, 2024
2023
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Text Implicates Prosodic Ambiguity: A Corpus for Intention Identification of the Korean Spoken Language.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2023
EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings.
CoRR, 2023
When Crowd Meets Persona: Creating a Large-Scale Open-Domain Persona Dialogue Corpus.
CoRR, 2023
MCR-Data2vec 2.0: Improving Self-supervised Speech Pre-training via Model-level Consistency Regularization.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the International Conference on Machine Learning, 2023
Multi-Resolution Sequence Aggregation and Model-Agnostic Framework for Time-Series Forecasting.
Proceedings of the IEEE International Conference on Acoustics, 2023
Improving Learning Objectives for Speaker Verification from the Perspective of Score Comparison.
Proceedings of the IEEE International Conference on Acoustics, 2023
Transduce and Speak: Neural Transducer for Text-To-Speech with Semantic Token Prediction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
IEEE Signal Process. Lett., 2022
SNAC: Speaker-Normalized Affine Coupling Layer in Flow-Based Architecture for Zero-Shot Multi-Speaker Text-to-Speech.
IEEE Signal Process. Lett., 2022
A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization.
IEEE Signal Process. Lett., 2022
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech.
CoRR, 2022
CoRR, 2022
Inter-KD: Intermediate Knowledge Distillation for CTC-Based Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
OpenKorPOS: Democratizing Korean Tokenization with Voting-Based Open Corpus Annotation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
"Feels Like I've Known You Forever": Empathy and Self-Awareness in Human Open-Domain Dialogs.
Proceedings of the 44th Annual Meeting of the Cognitive Science Society, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Gated Recurrent Context: Softmax-Free Attention for Online Encoder-Decoder Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-Supervised Speaker Verification.
IEEE Access, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 7th IEEE International Conference on Network Intelligence and Digital Content, 2021
Proceedings of the FAccT '21: 2021 ACM Conference on Fairness, 2021
Giving Space to Your Message: Assistive Word Segmentation for the Electronic Typing of Digital Minorities.
Proceedings of the DIS '21: Designing Interactive Systems Conference 2021, 2021
2020
Memory Attention: Robust Alignment Using Gating Mechanism for End-to-End Speech Synthesis.
IEEE Signal Process. Lett., 2020
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning.
CoRR, 2020
Disentangled Speaker and Nuisance Attribute Embedding for Robust Speaker Verification.
IEEE Access, 2020
Pay Attention to Categories: Syntax-Based Sentence Modeling with Metadata Projection Matrix.
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Discourse Component to Sentence (DC2S): An Efficient Human-Aided Construction of Paraphrase and Sentence Similarity Dataset.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Robust Text-Dependent Speaker Verification via Character-Level Information Preservation for the SdSV Challenge 2020.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Text Matters but Speech Influences: A Computational Analysis of Syntactic Ambiguity Resolution.
Proceedings of the 42nd Annual Meeting of the Cognitive Science Society, 2020
Towards an Efficient Code-Mixed Grapheme-to-Phoneme Conversion in an Agglutinative Language: A Case Study on To-Korean Transliteration.
Proceedings of the 4th Workshop on Computational Approaches to Code Switching, 2020
2019
Adversarially Learned Total Variability Embedding for Speaker Recognition with Random Digit Strings.
Sensors, 2019
Disambiguating Speech Intention via Audio-Text Co-attention Framework: A Case of Prosody-semantics Interface.
CoRR, 2019
Investigating an Effective Character-level Embedding in Korean Sentence Classification.
CoRR, 2019
End-to-End Multi-Channel Speech Enhancement Using Inter-Channel Time-Restricted Attention on Raw Waveform.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
DNN-based monaural speech enhancement with temporal and spectral variations equalization.
Digit. Signal Process., 2018
Speech Intention Understanding in a Head-final Language: A Disambiguation Utilizing Intonation-dependency.
CoRR, 2018
HashCount at SemEval-2018 Task 3: Concatenative Featurization of Tweet and Hashtags for Irony Detection.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Robust Time-Delay Estimation for Acoustic Indoor Localization in Reverberant Environments.
IEEE Signal Process. Lett., 2017
Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Overlapping acoustic event classification based on joint training with source separation.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
IEICE Trans. Inf. Syst., 2016
DNN-Based Feature Enhancement Using Joint Training Framework for Robust Multichannel Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the IEEE International Conference on Consumer Electronics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
IEEE Signal Process. Lett., 2015
Tampering Detection Scheme for Speech Signals using Formant Enhancement based Watermarking.
J. Inf. Hiding Multim. Signal Process., 2015
Target Source Separation Based on Discriminative Nonnegative Matrix Factorization Incorporating Cross-Reconstruction Error.
IEICE Trans. Inf. Syst., 2015
IEICE Trans. Inf. Syst., 2015
An acoustic data transmission system based on audio data hiding: method and performance evaluation.
EURASIP J. Audio Speech Music. Process., 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Discriminative nonnegative matrix factorization using cross-reconstruction error for source separation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Acoustic modeling and parameter generation using relevance vector machines for speech synthesis.
Proceedings of the 23rd European Signal Processing Conference, 2015
2014
IEEE Signal Process. Lett., 2014
Spectro-Temporal Filtering for Multichannel Speech Enhancement in Short-Time Fourier Transform Domain.
IEEE Signal Process. Lett., 2014
Factored Maximum Penalized Likelihood Kernel Regression for HMM-Based Style-Adaptive Speech Synthesis.
IEEE J. Sel. Top. Signal Process., 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Speaker Adaptation Using Nonlinear Regression Techniques for HMM-Based Speech Synthesis.
Proceedings of the 2014 Tenth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Speech enhancement combining statistical models and NMF with update of speech and noise bases.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Parametric multichannel noise reduction algorithm utilizing temporal correlations in reverberant environment.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
IEEE Trans. Speech Audio Process., 2013
IEICE Trans. Inf. Syst., 2013
Factored maximum likelihood kernelized regression for HMM-based singing voice synthesis.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Robust Audio Data Hiding Method Based on Phase of Modulated Complex Lapped Transform.
Proceedings of the Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2013
Blind method of estimating speech transmission index from reverberant speech signals.
Proceedings of the 21st European Signal Processing Conference, 2013
Blind method of estimating speech transmission index in room acoustics based on concept of modulation transfer function.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
IMM-based feature compensation robust to slowly time-varying noise and reverberation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
2012
IEEE Trans. Speech Audio Process., 2012
IEICE Trans. Inf. Syst., 2012
Outlier Detection and Removal for HMM-Based Speech Synthesis with an Insufficient Speech Database.
IEICE Trans. Inf. Syst., 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Quality Enhancement of Audio Watermarking for Data Transmission in Aerial Space Based on Segmental SNR Adjustment.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the International Multi-Conference on Systems, Signals & Devices, 2012
2011
IEICE Trans. Inf. Syst., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Decision Tree-Based Clustering with Outlier Detection for HMM-Based Speech Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
IEEE Signal Process. Lett., 2010
IEEE Signal Process. Lett., 2010
IEEE Signal Process. Lett., 2010
IEICE Trans. Inf. Syst., 2010
IEICE Trans. Inf. Syst., 2010
IEICE Trans. Inf. Syst., 2010
Implementation of HMM-Based Human Activity Recognition Using Single Triaxial Accelerometer.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2010
Voice activity detection based on statistical models and machine learning approaches.
Comput. Speech Lang., 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Phone mismatch penalty matrices for two-stage keyword spotting via multi-pass phone recognizer.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
IEEE Signal Process. Lett., 2009
IEEE Signal Process. Lett., 2009
IEICE Trans. Inf. Syst., 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
IEEE Signal Process. Lett., 2008
IEEE Signal Process. Lett., 2008
Frame Splitting Scheme for Error-Robust Audio Streaming over Packet-Switching Networks.
IEICE Trans. Commun., 2008
IEICE Trans. Inf. Syst., 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
IEEE Trans. Speech Audio Process., 2007
IEEE Signal Process. Lett., 2007
IEEE Signal Process. Lett., 2007
IEEE Signal Process. Lett., 2007
Pattern Recognit. Lett., 2007
Pattern Recognit., 2007
IEICE Trans. Commun., 2007
IEICE Trans. Inf. Syst., 2007
Improved Global Soft Decision Using Smoothed Global Likelihood Ratio for Speech Enhancement.
IEICE Trans. Commun., 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
IEEE Trans. Signal Process., 2006
A new structural approach in system identification with generalized analysis-by-synthesis for robust speech coding.
IEEE Trans. Speech Audio Process., 2006
IEEE Signal Process. Lett., 2006
IEEE Signal Process. Lett., 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
2005
IEEE Trans. Speech Audio Process., 2005
IEEE Signal Process. Lett., 2005
IEEE Signal Process. Lett., 2005
IEEE Signal Process. Lett., 2005
IEEE Signal Process. Lett., 2005
Signal Process., 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
IEEE Trans. Speech Audio Process., 2004
IEEE Signal Process. Lett., 2004
Speech Commun., 2004
IEICE Trans. Inf. Syst., 2004
Distorted Speech Rejection for Automatic Speech Recognition in Wireless Communication.
IEICE Trans. Inf. Syst., 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Inner product based-multiband vector quantization for wideband speech coding at 16 kbps.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Speech Commun., 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
IEEE Signal Process. Lett., 2001
IEEE Signal Process. Lett., 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999
1998
IEEE Signal Process. Lett., 1998
IEEE Signal Process. Lett., 1998
IEEE Signal Process. Lett., 1998
Speech Commun., 1998
1997
IEEE Trans. Speech Audio Process., 1997
IEEE Trans. Speech Audio Process., 1997
Model-based approach for robust speech recognition in noisy environments with multiple noise sources.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
1995
IEEE Trans. Speech Audio Process., 1995
1990
Proceedings of the First International Conference on Spoken Language Processing, 1990