Hung-Shin Lee

Orcid: 0000-0001-7044-9434

According to our database1, Hung-Shin Lee authored at least 54 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Leveraging Retrieval-Augmented Generation for Culturally Inclusive Hakka Chatbots: Design Insights and User Perceptions.
CoRR, 2024

Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition.
CoRR, 2024

Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages.
CoRR, 2024

Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture.
CoRR, 2024

VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka.
CoRR, 2024

Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation.
CoRR, 2024

2023
Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization.
IEEE Signal Process. Lett., 2023

The North System for Formosa Speech Recognition Challenge 2023.
CoRR, 2023

A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
CasNet: Investigating Channel Robustness for Speech Separation.
CoRR, 2022

A Teacher-student Framework for Unsupervised Speech Enhancement Using Noise Remixing Training and Two-stage Inference.
CoRR, 2022

Filter-based Discriminative Autoencoders for Children Speech Recognition.
CoRR, 2022

Multi-Target Filter and Detector for Speaker Diarization.
CoRR, 2022

Speech-enhanced and Noise-aware Networks for Robust Speech Recognition.
CoRR, 2022

Speech-enhanced and Noise-aware Networks for Robust Speech Recognition.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Chain-based Discriminative Autoencoders for Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

AlloST: Low-Resource Speech Translation Without Source Transcription.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Melody Harmonization Using Orderless Nade, Chord Balancing, and Blocked Gibbs Sampling.
Proceedings of the IEEE International Conference on Acoustics, 2021

Generation of Speaker Representations Using Heterogeneous Training Batch Assembly.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Using Taigi Dramas with Mandarin Chinese Subtitles to Improve Taigi Speech Recognition.
Proceedings of the 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2020

Joint Training of Guided Learning and Mean Teacher Models for Sound Event Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

The Academia Sinica Systems of Voice Conversion for VCC2020.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural Networks.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Multi-task Learning for Acoustic Modeling Using Articulatory Attributes.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

2017
A Replay Spoofing Detection System Based on Discriminative Autoencoders.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

基於i-vector與PLDA並使用GMM-HMM強制對位之自動語者分段標記系統 (Speaker Diarization based on I-vector PLDA Scoring and using GMM-HMM Forced Alignment) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Discriminative Autoencoders for Acoustic Modeling.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Discriminative autoencoders for speaker verification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Incorporating proximity information in relevance language modeling for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Clustering-based i-vector formulation for speaker recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Ensemble of machine learning algorithms for cognitive and physical speaker load detection.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Speaker verification using kernel-based binary classifiers with binary operation derived features.
Proceedings of the IEEE International Conference on Acoustics, 2014

I-vector based language modeling for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Subspace-based phonotactic language recognition using multivariate dynamic linear models.
Proceedings of the IEEE International Conference on Acoustics, 2013

A Study of Language Modeling for Chinese Spelling Check.
Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing, 2013

2012
Subspace-Based Feature Representation and Learning for Language Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Learning the Similarity of Audio Music in Bag-of-frames Representation from Tagged Music Data.
Proceedings of the 12th International Society for Music Information Retrieval Conference, 2011

2010
Exploiting semantic associative information in topic modeling.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

A Discriminative and Heteroscedastic Linear Feature Transformation for Multiclass Classification.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
相似度比率式鑑別分析應用於大詞彙連續語音辨識 (Likelihood Ratio Based Discriminant Analysis for Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

Empirical error rate minimization based linear discriminant analysis.
Proceedings of the IEEE International Conference on Acoustics, 2009

Generalized likelihood ratio discriminant analysis.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Improved Linear Discriminant Analysis Considering Empirical Pairwise Classification Error Rates.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Linear discriminant feature extraction using weighted classification confusion information.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Training data selection for improving discriminative training of acoustic models.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007


  Loading...