Hoirin Kim
Orcid: 0000-0002-8787-6982
According to our database, Hoirin Kim authored at least 84 papers between 2003 and 2024.
Bibliography
2024
Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition.
CoRR, 2024
CoRR, 2024
STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
2023
AdaMS: Deep Metric Learning with Adaptive Margin and Adaptive Scale for Acoustic Word Discrimination.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Recycle-and-Distill: Universal Compression Strategy for Transformer-based Speech SSL Models with Attention Map Reusing and Masking Distillation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
Deep Metric Learning with Adaptive Margin and Adaptive Scale for Acoustic Word Discrimination.
CoRR, 2022
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning.
CoRR, 2022
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech.
IEEE Access, 2022
ACNN-VC: Utilizing Adaptive Convolution Neural Network for One-Shot Voice Conversion.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning with Spoofing Detection and Spoofing Type Classification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
2020
Single-Variable-Input Active Sidelobe Suppression Method for Synthesized Magnetic Field Focusing Technology and Its Optimization.
IEEE Trans. Ind. Electron., 2020
IEEE Signal Process. Lett., 2020
Interlayer Selective Attention Network for Robust Personalized Wake-Up Word Detection.
IEEE Signal Process. Lett., 2020
Multi-Scale Aggregation Using Feature Pyramid Module for Text-Independent Speaker Verification.
CoRR, 2020
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments.
IEEE Access, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification Using CTC-Based Soft VAD and Global Query Attention.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
2-D Synthesized Magnetic Field Focusing Technology With Loop Coils Distributed in a Rectangular Formation.
IEEE Trans. Ind. Electron., 2019
Spatial Pyramid Encoding with Convex Length Normalization for Text-Independent Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Additional Shared Decoder on Siamese Multi-View Encoders for Learning Acoustic Word Embeddings.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Self-Adaptive Soft Voice Activity Detection Using Deep Neural Networks for Robust Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Learning Self-Informed Feature Contribution for Deep Learning-Based Acoustic Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
CoRR, 2018
Joint Learning Using Denoising Variational Autoencoders for Voice Activity Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
2017
Development of distant multi-channel speech and noise databases for speech recognition by in-door conversational robots.
Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017
Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
CNN-based bottleneck feature for noise robust query-by-example spoken term detection.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
Dysarthric Speech Recognition Using Kullback-Leibler Divergence-Based Hidden Markov Model.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016
2015
Automatic Intelligibility Assessment of Dysarthric Speech Using Phonologically-Structured Sparse Linear Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Probabilistic Class Histogram Equalization Based on Posterior Mean Estimation for Robust Speech Recognition.
IEEE Signal Process. Lett., 2015
EURASIP J. Adv. Signal Process., 2015
Robust sound event classification using LBP-HOG based bag-of-audio-words feature representation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Discriminative likelihood score weighting based on acoustic-phonetic classification for speaker identification.
EURASIP J. Adv. Signal Process., 2014
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2014
2013
Dysarthric speech recognition using dysarthria-severity-dependent and speaker-adaptive models.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Robust detection of infant crying in adverse environments using weighted segmental two-dimensional linear frequency cepstral coefficients.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013
2012
Audio-Based Objectionable Content Detection Using Discriminative Transforms of Time-Frequency Dynamics.
IEEE Trans. Multim., 2012
Multiple Acoustic Model-Based Discriminative Likelihood Ratio Weighting for Voice Activity Detection.
IEEE Signal Process. Lett., 2012
Combination of Multiple Speech Dimensions for Automatic Assessment of Dysarthric Speech Intelligibility.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Automatic Assessment of Dysarthric Speech Intelligibility Based on Selected Phonetic Quality Features.
Proceedings of the Computers Helping People with Special Needs, 2012
2011
Reliable likelihood ratios for statistical model-based voice activity detector with low false-alarm rate.
EURASIP J. Adv. Signal Process., 2011
Automatic extraction of pornographic contents using radon transform based audio features.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011
2010
Robust speaker recognition based on filtering in autocorrelation domain and sub-band feature recombination.
Pattern Recognit. Lett., 2010
Acoustic Model Combination Incorporated With Mask-Based Multi-Channel Source Separation for Automatic Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2010
IEICE Trans. Inf. Syst., 2010
Utterance Verification Using State-Level Log-Likelihood Ratio with Frame and State Selection.
IEICE Trans. Inf. Syst., 2010
EURASIP J. Adv. Signal Process., 2010
Automatic detection of malicious sound using segmental two-dimensional mel-frequency cepstral coefficients and histograms of oriented gradients.
Proceedings of the 18th International Conference on Multimedia 2010, 2010
A robust target signal detector based on statistical models using binaural cross-similarity information.
Proceedings of the 18th European Signal Processing Conference, 2010
2009
IEEE Signal Process. Lett., 2009
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2009
2008
Text-Independent Speaker Identification using Soft Channel Selection in Home Robot Environments.
IEEE Trans. Consumer Electron., 2008
Histogram Equalization Utilizing Window-Based Smoothed CDF Estimation for Feature Compensation.
IEICE Trans. Inf. Syst., 2008
Utterance Verification Using Word Voiceprint Models Based on Probabilistic Distributions of Phone-Level Log-Likelihood Ratio and Phone Duration.
IEICE Trans. Inf. Syst., 2008
2007
IEEE Signal Process. Lett., 2007
IEICE Trans. Inf. Syst., 2007
Text-Independent Speaker Identification in a Distant-Talking Multi-Microphone Environment.
IEICE Trans. Inf. Syst., 2007
IEICE Trans. Inf. Syst., 2007
Compensating Acoustic Mismatch Using Class-Based Histogram Equalization for Robust Speech Recognition.
EURASIP J. Adv. Signal Process., 2007
Reliable Speaker Identification Using Multiple Microphones in Ubiquitous Robot Companion Environment.
Proceedings of the IEEE RO-MAN 2007, 2007
2006
Soft Counting Poisson Mixture Model-Based Polling Method for Speech/Nonspeech Classification.
IEICE Trans. Inf. Syst., 2006
Frequency Filtering for a Highly Robust Audio Fingerprinting Scheme in a Real-Noise Environment.
IEICE Trans. Inf. Syst., 2006
Intelligent broadcasting system and services for personalized semantic contents consumption.
Expert Syst. Appl., 2006
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006
2005
Audio Fingerprinting Scheme by Temporal Filtering for Audio Identification Immune to Channel-Distortion.
Proceedings of the Information Retrieval Technology, 2005
2004
Proceedings of the AI 2004: Advances in Artificial Intelligence, 2004
2003
Proceedings of the Signal and Image Processing (SIP 2003), 2003