Berlin Chen
ORCID: 0000-0003-0693-8932
According to our database, Berlin Chen authored at least 264 papers between 1996 and 2024.
Bibliography
2024
An Effective Hierarchical Graph Attention Network Modeling Approach for Pronunciation Assessment.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition.
CoRR, 2024
Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment.
CoRR, 2024
Automated Speaking Assessment of Conversation Tests with Novel Graph-based Modeling on Spoken Response Coherence.
CoRR, 2024
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition.
CoRR, 2024
Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation.
CoRR, 2024
Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies.
CoRR, 2024
Speech-Aware Neural Diarization with Encoder-Decoder Attractor Guided by Attention Constraints.
CoRR, 2024
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement.
Proceedings of the IEEE International Conference on Acoustics, 2024
What Do Neural Networks Listen to? Exploring the Crucial Bands in Speech Enhancement Using SINC-Convolution.
Proceedings of the IEEE International Conference on Acoustics, 2024
DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
An Effective Pronunciation Assessment Approach Leveraging Hierarchical Transformers and Pre-training Strategies.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition.
CoRR, 2023
Effective Neural Modeling Leveraging Readability Features for Automated Essay Scoring.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023
Graph-Enhanced Transformer Architecture with Novel Use of CEFR Vocabulary Profile and Filled Pauses in Automated Speaking Assessment.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023
Proceedings of the 33rd IEEE International Workshop on Machine Learning for Signal Processing, 2023
A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning.
Proceedings of the 24th International Conference on Digital Signal Processing, 2023
Effective Graph-Based Modeling of Articulation Traits for Mispronunciation Detection and Diagnosis.
Proceedings of the IEEE International Conference on Acoustics, 2023
Preserving Phonemic Distinctions For Ordinal Regression: A Novel Loss Function For Automatic Pronunciation Assessment.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
Time-Reversal Enhancement Network With Cross-Domain Information for Noise-Robust Speech Recognition.
IEEE Multim., 2022
3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment.
CoRR, 2022
CoRR, 2022
Proceedings of the International Conference on Technologies and Applications of Artificial Intelligence, 2022
Peppanet: Effective Mispronunciation Detection and Diagnosis Leveraging Phonetic, Phonological, and Acoustic Cues.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Adaptive-FSN: Integrating Full-Band Extraction and Adaptive Sub-Band Encoding for Monaural Speech Enhancement.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
A Preliminary Study on Automated Speaking Assessment of English as a Second Language (ESL) Students.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022
Building an Enhanced Autoregressive Document Retriever Leveraging Supervised Contrastive Learning.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Maximum F1-Score Training for End-to-End Mispronunciation Detection and Diagnosis of L2 English Speech.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Exploring Non-Autoregressive End-to-End Neural Modeling for English Mispronunciation Detection and Diagnosis.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Conversational speech recognition leveraging effective fusion methods for cross-utterance language modeling.
CoRR, 2021
Effective FAQ Retrieval and Question Matching Tasks with Unsupervised Knowledge Injection.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021
Exploring the Integration of E2E ASR and Pronunciation Modeling for English Mispronunciation Detection.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021
A Preliminary Study on Environmental Sound Classification Leveraging Large-Scale Pretrained Model and Semi-Supervised Learning.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021
Proceedings of the International Joint Conference on Neural Networks, 2021
Cross-Domain Single-Channel Speech Enhancement Model with BI-Projection Fusion Module for Noise-Robust ASR.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021
Proceedings of the 29th European Signal Processing Conference, 2021
Towards Robust Mispronunciation Detection and Diagnosis for L2 English Learners with Accent-Modulating Methods.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
An Empirical Study on Transformer-Based End-to-End Speech Recognition with Novel Decoder Masking.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
FAQ Retrieval using Question-Aware Graph Convolutional Network and Contextualized Language Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Multi-Instrument Automatic Music Transcription With Self-Attention-Based Instance Segmentation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Enhanced Language Modeling with Proximity and Sentence Relatedness Information for Extractive Broadcast News Summarization.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020
CoRR, 2020
CoRR, 2020
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020
Exploring Disparate Language Model Combination Strategies for Mandarin-English Code-Switching ASR.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020
Innovative Pretrained-based Reranking Language Models for N-best Speech Recognition Lists.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020
Exploiting Text Prompts for the Development of an End-to-End Computer-Assisted Pronunciation Training System.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020
Multi-view Attention-based Speech Enhancement Model for Noise-robust Automatic Speech Recognition.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020
An End-to-End Mispronunciation Detection System for L2 English Speech Leveraging Novel Anti-Phone Modeling.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features.
Proceedings of the 28th European Signal Processing Conference, 2020
Exploring Feature Enhancement in The Modulation Spectrum Domain via Ideal Ratio Mask for Robust Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
Integrating LSA-based hierarchical conceptual space and machine learning methods for leveling the readability of domain-specific texts.
Nat. Lang. Eng., 2019
使用生成對抗網路於強健式自動語音辨識的應用(Exploiting Generative Adversarial Network for Robustness Automatic Speech Recognition).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019
基於階層式編碼架構之文本可讀性預測(A Hierarchical Encoding Framework for Text Readability Prediction).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019
探究端對端語音辨識於發音檢測與診斷(Investigating on Computer-Assisted Pronunciation Training Leveraging End-to-End Speech Recognition Techniques).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019
What do you learn from context? Probing for sentence structure in contextualized word representations.
Proceedings of the 7th International Conference on Learning Representations, 2019
Proceedings of the Innovative Technologies and Learning - Second International Conference, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Semi-supervised Training of Acoustic Models Leveraging Knowledge Transferred from Out-of-Domain Data.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the AISS 2019: 2019 International Conference on Advanced Information Science and System, 2019
Can You Tell Me How to Get Past Sesame Street? Sentence-Level Pretraining Beyond Language Modeling.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
IEEE ACM Trans. Audio Speech Lang. Process., 2018
CoRR, 2018
探索結合快速文本及卷積神經網路於可讀性模型之建立 (Exploring Combination of FastText and Convolutional Neural Networks for Building Readability Models) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018
探討聲學模型的合併技術與半監督鑑別式訓練於會議語音辨識之研究 (Investigating acoustic model combination and semi-supervised discriminative training for meeting speech recognition) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018
會議語音辨識使用語者資訊之語言模型調適技術 (On the Use of Speaker-Aware Language Model Adaptation Techniques for Meeting Speech Recognition) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018
探討鑑別式訓練聲學模型之類神經網路架構及優化方法的改進 (Discriminative Training of Acoustic Models Leveraging Improved Neural Network Architecture and Optimization Method) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018
Automatic Music Transcription Leveraging Generalized Cepstral Features and Deep Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 International Conference on Asian Language Processing, 2018
2017
A Position-Aware Language Modeling Framework for Extractive Broadcast News Speech Summarization.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2017
Exploring the Use of Neural Network based Features for Text Readability Classification.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017
Int. J. Comput. Linguistics Chin. Lang. Process., 2017
An Empirical Comparison of Contemporary Unsupervised Approaches for Extractive Speech Summarization.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017
序列標記與配對方法用於語音辨識錯誤偵測及修正 (On the Use of Sequence Labeling and Matching Methods for ASR Error Detection and Correction) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017
使用查詢意向探索與類神經網路於語音文件檢索之研究 (Exploring Query Intent and Neural Network modeling Techniques for Spoken Document Retrieval) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Exploring Low-Dimensional Structures of Modulation Spectra for Robust Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Exploring the Use of Significant Words Language Modeling for Spoken Document Retrieval.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Enhancing feature modulation spectra with dictionary learning approaches for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
A locality-preserving essence vector modeling framework for spoken document retrieval.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
Robust Speech Recognition via Enhancing the Complex-Valued Acoustic Spectrum in Modulation Domain.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Exploring the use of unsupervised query modeling techniques for speech recognition and summarization.
Speech Commun., 2016
Leveraging Multi-Task Learning with Neural Network Based Acoustic Modeling for Improved Meeting Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2016
Int. J. Comput. Linguistics Chin. Lang. Process., 2016
Evaluation Metric-related Optimization Methods for Mandarin Mispronunciation Detection.
Int. J. Comput. Linguistics Chin. Lang. Process., 2016
The development and evaluation of listening and speaking diagnosis and remedial teaching system.
Br. J. Educ. Technol., 2016
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
融合多任務學習類神經網路聲學模型訓練於會議語音辨識之研究(Leveraging Multi-task Learning with Neural Network Based Acoustic Modeling for Improved Meeting Speech Recognition) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016
使用字典學習法於強健性語音辨識(The Use of Dictionary Learning Approach for Robustness Speech Recognition) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016
基於深層類神經網路及表示學習技術之文件可讀性分類(Classification of Text Readability Based on Deep Neural Network and Representation Learning Techniques)[In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016
評估尺度相關最佳化方法於華語錯誤發音檢測之研究(Evaluation Metric-related Optimization Methods for Mandarin Mispronunciation Detection) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016
運用序列到序列生成架構於重寫式自動摘要(Exploiting Sequence-to-Sequence Generation Framework for Automatic Abstractive Summarization)[In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016
Novel Word Embedding and Translation-based Language Modeling for Extractive Speech Summarization.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Employing median filtering to enhance the complex-valued acoustic spectrograms in modulation domain for noise-robust speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Exploring Word Mover's Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Mispronunciation Detection Leveraging Maximum Performance Criterion Training of Acoustic Models and Decision Functions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the COLING 2016, 2016
Exploiting graph regularized nonnegative matrix factorization for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
Combining Relevance Language Modeling and Clarity Measure for Extractive Speech Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Extractive Broadcast News Summarization Leveraging Recurrent Neural Network Language Modeling Techniques.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Histogram equalization of contextual statistics of speech features for robust speech recognition.
Multim. Tools Appl., 2015
Int. J. Comput. Linguistics Chin. Lang. Process., 2015
Investigating Modulation Spectrum Factorization Techniques for Robust Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2015
表示法學習技術於節錄式語音文件摘要之研究(A Study on Representation Learning Techniques for Extractive Spoken Document Summarization) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015
可讀性預測於中小學國語文教科書及優良課外讀物之研究(A Study of Readability Prediction on Elementary and Secondary Chinese Textbooks and Excellent Extracurricular Reading Materials) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015
融合多種深層類神經網路聲學模型與分類技術於華語錯誤發音檢測之研究(Exploring Combinations of Various Deep Neural Network based Acoustic Models and Classification Techniques for Mandarin Mispronunciation Detection)[In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015
使用詞向量表示與概念資訊於中文大詞彙連續語音辨識之語言模型調適(Exploring Word Embedding and Concept Information for Language Model Adaptation in Mandarin Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015
調變頻譜分解之改良於強健性語音辨識(Several Refinements of Modulation Spectrum Factorization for Robust Speech Recognition) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Incorporating paragraph embeddings and density peaks clustering for spoken document summarization.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Incorporating proximity information in relevance language modeling for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Enhancing the complex-valued acoustic spectrograms in modulation domain for creating noise-robust features in speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
2014
Multim. Tools Appl., 2014
Exploring Concept Information for Mandarin Large Vocabulary Continuous Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2014
探究新穎語句模型化技術於節錄式語音摘要 (Investigating Novel Sentence Modeling Techniques for Extractive Speech Summarization) [In Chinese].
Proceedings of the 26th Conference on Computational Linguistics and Speech Processing, 2014
運用概念模型化技術於中文大詞彙連續語音辨識之語言模型調適 (Leveraging Concept Modeling Techniques for Language Model Adaptation in Mandarin Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 26th Conference on Computational Linguistics and Speech Processing, 2014
Enhanced language modeling for extractive speech summarization with sentence relatedness information.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A recurrent neural network language modeling framework for extractive speech summarization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014
Effective pseudo-relevance feedback for language modeling in extractive speech summarization.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Leveraging Effective Query Modeling Techniques for Speech Recognition and Summarization.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
Inf. Process. Manag., 2013
Inf. Process. Manag., 2013
改良語句模型技術於節錄式語音摘要之研究 (Improved Sentence Modeling Techniques for Extractive Speech Summarization) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013
改良調變頻譜統計圖等化法於強健性語音辨識之研究 (Improved Modulation Spectrum Histogram Equalization for Robust Speech Recognition) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013
Distribution-based feature normalization for robust speech recognition leveraging context and dynamics cues.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Incorporating proximity information for relevance language modeling in speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
The Influence of Guanxi Gradient on Crew Resource Management and Values in the Cockpit.
Proceedings of the Engineering Psychology and Cognitive Ergonomics. Applications and Services, 2013
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Speech Audio Process., 2012
Int. J. Comput. Linguistics Chin. Lang. Process., 2012
Int. J. Comput. Linguistics Chin. Lang. Process., 2012
Spoken Document Retrieval Leveraging Unsupervised and Supervised Topic Modeling Techniques.
IEICE Trans. Inf. Syst., 2012
遞迴式類神經網路語言模型應用額外資訊於語音辨識之研究 (Recurrent Neural Network-based Language Modeling with Extra Information Cues for Speech Recognition) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012
改良式統計圖等化法強鍵性語音辨識之研究 (Improved Histogram Equalization Methods for Robust Speech Recognition) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012
Leveraging distributional characteristics of modulation spectra for robust speech recognition.
Proceedings of the 11th International Conference on Information Science, 2012
Exploring Joint Equalization of Spatial-Temporal Contextual Statistics of Speech Features for Robust Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
Leveraging Kullback-Leibler Divergence Measures and Information-Rich Cues for Speech Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Robust speech recognition using spatial-temporal feature distribution characteristics.
Pattern Recognit. Lett., 2011
實證探究多種鑑別式語言模型於語音辨識之研究 (Empirical Comparisons of Various Discriminative Language Models for Speech Recognition) [In Chinese].
Proceedings of the 23rd Conference on Computational Linguistics and Speech Processing, 2011
機率式調變頻譜分解於強健性語音辨識 (Probabilistic Modulation Spectrum Factorization for Robust Speech Recognition) [In Chinese].
Poster Proceedings of the 23rd Conference on Computational Linguistics and Speech Processing, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
整合邊際資訊於鑑別式聲學模型訓練方法之比較研究 (A Comparative Study on Margin-Based Discriminative Training of Acoustic Models) [In Chinese].
Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing, 2010
鑑別式語言模型於語音辨識結果重新排序之研究 (Exploiting Discriminative Language Models for Reranking Speech Recognition Hypotheses) [In Chinese].
Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing, 2010
Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
A Discriminative and Heteroscedastic Linear Feature Transformation for Multiclass Classification.
Proceedings of the 20th International Conference on Pattern Recognition, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the Information Retrieval Technology, 2010
Proceedings of the ACL 2010, 2010
2009
Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2009
A Probabilistic Generative Framework for Extractive Broadcast News Speech Summarization.
IEEE Trans. Speech Audio Process., 2009
A Comparative Study of Probabilistic Ranking Models for Chinese Spoken Document Summarization.
ACM Trans. Asian Lang. Inf. Process., 2009
ACM Trans. Asian Lang. Inf. Process., 2009
Pattern Recognit. Lett., 2009
Proceedings of the Second Text Analysis Conference, 2009
相似度比率式鑑別分析應用於大詞彙連續語音辨識 (Likelihood Ratio Based Discriminant Analysis for Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009
主題語言模型於大詞彙連續語音辨識之研究 (On the Use of Topic Models for Large-Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009
Topic modeling for spoken document retrieval using word- and syllable-level information.
Proceedings of the third workshop on Searching spontaneous conversational speech, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Improved speech summarization with multiple-hypothesis representations and Kullback-Leibler divergence measures.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Latent topic modelling of word co-occurrence information for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
Pattern Recognit. Lett., 2008
Improved Minimum Phone Error based Discriminative Training of Acoustic Models for Mandarin Large Vocabulary Continuous Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2008
Improved Linear Discriminant Analysis Considering Empirical Pairwise Classification Error Rates.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Linear discriminant feature extraction using weighted classification confusion information.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Exploiting spatial-temporal feature distribution characteristics for robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
改善以最小化音素錯誤為基礎的鑑別式聲學模型訓練於中文連續語音辨識之研究 (Improved Minimum Phone Error based Discriminative Training of Acoustic Models for Chinese Continuous Speech Recognition) [In Chinese].
Proceedings of the 19th Conference on Computational Linguistics and Speech Processing, 2007
Subword-based position specific posterior lattices (s-PSPL) for indexing speech information.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
A unified probabilistic generative framework for extractive spoken document summarization.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Investigating the use of speech features and their corresponding distribution characteristics for robust speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007
2006
Exploring the use of latent topical information for statistical Chinese spoken document retrieval.
Pattern Recognit. Lett., 2006
Int. J. Pattern Recognit. Artif. Intell., 2006
An Empirical Study of Word Error Minimization Approaches for Mandarin Large Vocabulary Continuous Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006
統計圖等化法於雜訊語音辨識之進一步研究 (An Improved Histogram Equalization Approach for Robust Speech Recognition) [In Chinese].
Proceedings of the 18th Conference on Computational Linguistics and Speech Processing, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
On Using Entropy Information to Improve Posterior Probability-Based Confidence Measures.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Exploiting polynomial-fit histogram equalization and temporal average for robust speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Chinese Spoken Document Summarization Using Probabilistic Latent Topical Information.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Int. J. Comput. Linguistics Chin. Lang. Process., 2005
Lightly Supervised and Data-Driven Approaches to Mandarin Broadcast News Transcription.
Int. J. Comput. Linguistics Chin. Lang. Process., 2005
風險最小化準則在中文大詞彙連續語音辨識之研究 (Risk Minimization Criterion for Mandarin Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 17th Conference on Computational Linguistics and Speech Processing, 2005
Hierarchical topic organization and visual presentation of spoken documents using probabilistic latent semantic analysis (PLSA) for efficient retrieval/browsing applications.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Dynamic language model adaptation using latent topical information and automatic transcripts.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
2004
ACM Trans. Asian Lang. Inf. Process., 2004
Comput. Speech Lang., 2004
非監督式學習於中文電視新聞自動轉寫之初步應用 (Unsupervised Learning for Chinese Broadcast News Transcription) [In Chinese].
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
2002
Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese.
IEEE Trans. Speech Audio Process., 2002
A hierarchical tag-graph search scheme with layered grammar rules for spontaneous speech understanding.
Pattern Recognit. Lett., 2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Improved Chinese spoken document retrieval with hybrid modeling and data-driven indexing features.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
2001
Int. J. Comput. Process. Orient. Lang., 2001
Comparison of Word and Subword Indexing Techniques for Mandarin Chinese Spoken Document Retrieval.
Proceedings of the Advances in Multimedia Information Processing, 2001
Proceedings of the First International Conference on Human Language Technology Research, 2001
An HMM/n-gram-based linguistic processing approach for Mandarin spoken document retrieval.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
Proceedings of the IEEE International Conference on Acoustics, 2001
2000
Int. J. Pattern Recognit. Artif. Intell., 2000
Int. J. Comput. Process. Orient. Lang., 2000
Initial Experiments On Recognition of Internet-Accessible Compressed Mandarin Speech.
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Retrieval of broadcast news speech in Mandarin Chinese collected in Taiwan using syllable-level statistical characteristics.
Proceedings of the IEEE International Conference on Acoustics, 2000
1998
Large-Vocabulary Chinese Text/Speech Information Retrieval Using Mandarin Speech Queries.
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Hierarchical tag-graph search for spontaneous speech understanding in spoken dialog systems.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1996
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996