Berlin Chen

Orcid: 0000-0003-0693-8932

According to our database1, Berlin Chen authored at least 264 papers between 1996 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
An Effective Hierarchical Graph Attention Network Modeling Approach for Pronunciation Assessment.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

A Novel LLM-based Two-stage Summarization Approach for Long Dialogues.
CoRR, 2024

Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition.
CoRR, 2024

Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment.
CoRR, 2024

Automated Speaking Assessment of Conversation Tests with Novel Graph-based Modeling on Spoken Response Coherence.
CoRR, 2024

An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition.
CoRR, 2024

Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation.
CoRR, 2024

Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies.
CoRR, 2024

Speech-Aware Neural Diarization with Encoder-Decoder Attractor Guided by Attention Constraints.
CoRR, 2024

An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement.
Proceedings of the IEEE International Conference on Acoustics, 2024

What Do Neural Networks Listen to? Exploring the Crucial Bands in Speech Enhancement Using SINC-Convolution.
Proceedings of the IEEE International Conference on Acoustics, 2024

DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

An Effective Pronunciation Assessment Approach Leveraging Hierarchical Transformers and Pre-training Strategies.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition.
CoRR, 2023

Effective Neural Modeling Leveraging Readability Features for Automated Essay Scoring.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

Graph-Enhanced Transformer Architecture with Novel Use of CEFR Vocabulary Profile and Filled Pauses in Automated Speaking Assessment.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

NAaLOSS: Rethinking the Objective of Speech Enhancement.
Proceedings of the 33rd IEEE International Workshop on Machine Learning for Signal Processing, 2023

A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning.
Proceedings of the 24th International Conference on Digital Signal Processing, 2023

Effective Graph-Based Modeling of Articulation Traits for Mispronunciation Detection and Diagnosis.
Proceedings of the IEEE International Conference on Acoustics, 2023

Preserving Phonemic Distinctions For Ordinal Regression: A Novel Loss Function For Automatic Pronunciation Assessment.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Time-Reversal Enhancement Network With Cross-Domain Information for Noise-Robust Speech Recognition.
IEEE Multim., 2022

3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment.
CoRR, 2022

Geometric Learning of Hidden Markov Models via a Method of Moments Algorithm.
CoRR, 2022

Bi-Sep: A Multi-Resolution Cross-Domain Monaural Speech Separation Framework.
Proceedings of the International Conference on Technologies and Applications of Artificial Intelligence, 2022

Peppanet: Effective Mispronunciation Detection and Diagnosis Leveraging Phonetic, Phonological, and Acoustic Cues.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Adaptive-FSN: Integrating Full-Band Extraction and Adaptive Sub-Band Encoding for Monaural Speech Enhancement.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Preliminary Study on Automated Speaking Assessment of English as a Second Language (ESL) Students.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

Building an Enhanced Autoregressive Document Retriever Leveraging Supervised Contrastive Learning.
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing, 2022

Effective Cross-Utterance Language Modeling for Conversational Speech Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2022

Maximum F1-Score Training for End-to-End Mispronunciation Detection and Diagnosis of L2 English Speech.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Exploring Non-Autoregressive End-to-End Neural Modeling for English Mispronunciation Detection and Diagnosis.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Conversational speech recognition leveraging effective fusion methods for cross-utterance language modeling.
CoRR, 2021

Effective FAQ Retrieval and Question Matching Tasks with Unsupervised Knowledge Injection.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

Innovative Bert-Based Reranking Language Models for Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

A Study on Contextualized Language Modeling for Machine Reading Comprehension.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

Exploring the Integration of E2E ASR and Pronunciation Modeling for English Mispronunciation Detection.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

A Preliminary Study on Environmental Sound Classification Leveraging Large-Scale Pretrained Model and Semi-Supervised Learning.
Proceedings of the 33rd Conference on Computational Linguistics and Speech Processing, 2021

Cross-sentence Neural Language Models for Conversational Speech Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2021

Cross-Domain Single-Channel Speech Enhancement Model with BI-Projection Fusion Module for Noise-Robust ASR.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

End-to-End Mispronunciation Detection and Diagnosis From Raw Waveforms.
Proceedings of the 29th European Signal Processing Conference, 2021

Towards Robust Mispronunciation Detection and Diagnosis for L2 English Learners with Accent-Modulating Methods.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

TENET: A Time-Reversal Enhancement Network for Noise-Robust ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

An Empirical Study on Transformer-Based End-to-End Speech Recognition with Novel Decoder Masking.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

FAQ Retrieval using Question-Aware Graph Convolutional Network and Contextualized Language Model.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Multi-Instrument Automatic Music Transcription With Self-Attention-Based Instance Segmentation.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Enhanced Language Modeling with Proximity and Sentence Relatedness Information for Extractive Broadcast News Summarization.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020

Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020.
CoRR, 2020

Effective FAQ Retrieval and Question Matching With Unsupervised Knowledge Injection.
CoRR, 2020

A Study on Contextualized Language Modeling for FAQ Retrieval.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

Exploring Disparate Language Model Combination Strategies for Mandarin-English Code-Switching ASR.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

Innovative Pretrained-based Reranking Language Models for N-best Speech Recognition Lists.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

Exploiting Text Prompts for the Development of an End-to-End Computer-Assisted Pronunciation Training System.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

Multi-view Attention-based Speech Enhancement Model for Noise-robust Automatic Speech Recognition.
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing, 2020

An End-to-End Mispronunciation Detection System for L2 English Speech Leveraging Novel Anti-Phone Modeling.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

An Effective End-to-End Modeling Approach for Mispronunciation Detection.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Spoken Document Retrieval Leveraging Bert-Based Modeling and Query Reformulation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features.
Proceedings of the 28th European Signal Processing Conference, 2020

Exploring Feature Enhancement in The Modulation Spectrum Domain via Ideal Ratio Mask for Robust Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Integrating LSA-based hierarchical conceptual space and machine learning methods for leveling the readability of domain-specific texts.
Nat. Lang. Eng., 2019

使用生成對抗網路於強健式自動語音辨識的應用(Exploiting Generative Adversarial Network for Robustness Automatic Speech Recognition).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019

基於階層式編碼架構之文本可讀性預測(A Hierarchical Encoding Framework for Text Readability Prediction).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019

探究端對端語音辨識於發音檢測與診斷(Investigating on Computer-Assisted Pronunciation Training Leveraging End-to-End Speech Recognition Techniques).
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing, 2019

What do you learn from context? Probing for sentence structure in contextualized word representations.
Proceedings of the 7th International Conference on Learning Representations, 2019

An Innovative BERT-Based Readability Model.
Proceedings of the Innovative Technologies and Learning - Second International Conference, 2019

Polyphonic Music Transcription with Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Hierarchical Neural Summarization Framework for Spoken Documents.
Proceedings of the IEEE International Conference on Acoustics, 2019

Enhanced Bert-Based Ranking Models for Spoken Document Retrieval.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Semi-supervised Training of Acoustic Models Leveraging Knowledge Transferred from Out-of-Domain Data.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Modulation spectrum augmentation for robust speech recognition.
Proceedings of the AISS 2019: 2019 International Conference on Advanced Information Science and System, 2019

Can You Tell Me How to Get Past Sesame Street? Sentence-Level Pretraining Beyond Language Modeling.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
An Information Distillation Framework for Extractive Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Looking for ELMo's friends: Sentence-Level Pretraining Beyond Language Modeling.
CoRR, 2018

探索結合快速文本及卷積神經網路於可讀性模型之建立 (Exploring Combination of FastText and Convolutional Neural Networks for Building Readability Models) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

探討聲學模型的合併技術與半監督鑑別式訓練於會議語音辨識之研究 (Investigating acoustic model combination and semi-supervised discriminative training for meeting speech recognition) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

會議語音辨識使用語者資訊之語言模型調適技術 (On the Use of Speaker-Aware Language Model Adaptation Techniques for Meeting Speech Recognition ) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

探討鑑別式訓練聲學模型之類神經網路架構及優化方法的改進 (Discriminative Training of Acoustic Models Leveraging Improved Neural Network Architecture and Optimization Method) [In Chinese].
Proceedings of the 30th Conference on Computational Linguistics and Speech Processing, 2018

Automatic Music Transcription Leveraging Generalized Cepstral Features and Deep Learning.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Essence Vector-Based Query Modeling for Spoken Document Retrieval.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Investigating Manifold Learning Technique for Robust Speech Recognition.
Proceedings of the 2018 International Conference on Asian Language Processing, 2018

2017
A Position-Aware Language Modeling Framework for Extractive Broadcast News Speech Summarization.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2017

Exploring the Use of Neural Newtork based Features for Text Readability Classification.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

On the Use of Neural Network Modeling Techniques for Spoken Document Retrieval.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

An Empirical Comparison of Contemporary Unsupervised Approaches for Extractive Speech Summarization.
Int. J. Comput. Linguistics Chin. Lang. Process., 2017

序列標記與配對方法用於語音辨識錯誤偵測及修正 (On the Use of Sequence Labeling and Matching Methods for ASR Error Detection and Correction) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

探究不同領域文件之可讀性分析 (Exploring Readability Analysis on Multi-Domain Texts) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

使用查詢意向探索與類神經網路於語音文件檢索之研究 (Exploring Query Intent and Neural Network modeling Techniques for Spoken Document Retrieval) [In Chinese].
Proceedings of the 29th Conference on Computational Linguistics and Speech Processing, 2017

Discriminative Autoencoders for Acoustic Modeling.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Exploring Low-Dimensional Structures of Modulation Spectra for Robust Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Exploring the Use of Significant Words Language Modeling for Spoken Document Retrieval.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Enhancing feature modulation spectra with dictionary learning approaches for robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Leveraging manifold learning for extractive broadcast news summarization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A locality-preserving essence vector modeling framework for spoken document retrieval.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Neural relevance-aware query modeling for spoken document retrieval.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Investigating Siamese LSTM networks for text categorization.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Robust Speech Recognition via Enhancing the Complex-Valued Acoustic Spectrum in Modulation Domain.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Exploring the use of unsupervised query modeling techniques for speech recognition and summarization.
Speech Commun., 2016

Leveraging Multi-Task Learning with Neural Network Based Acoustic Modeling for Improved Meeting Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2016

The Use of Dictionary Learning Approach for Robustness Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2016

Evaluation Metric-related Optimization Methods for Mandarin Mispronunciation Detection.
Int. J. Comput. Linguistics Chin. Lang. Process., 2016

The development and evaluation of listening and speaking diagnosis and remedial teaching system.
Br. J. Educ. Technol., 2016

Extractive speech summarization leveraging convolutional neural network techniques.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

融合多任務學習類神經網路聲學模型訓練於會議語音辨識之研究(Leveraging Multi-task Learning with Neural Network Based Acoustic Modeling for Improved Meeting Speech Recognition) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

使用字典學習法於強健性語音辨識(The Use of Dictionary Learning Approach for Robustness Speech Recognition) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

基於深層類神經網路及表示學習技術之文件可讀性分類(Classification of Text Readability Based on Deep Neural Network and Representation Learning Techniques)[In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

評估尺度相關最佳化方法於華語錯誤發音檢測之研究(Evaluation Metric-related Optimization Methods for Mandarin Mispronunciation Detection) [In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

運用序列到序列生成架構於重寫式自動摘要(Exploiting Sequence-to-Sequence Generation Framework for Automatic Abstractive Summarization)[In Chinese].
Proceedings of the 28th Conference on Computational Linguistics and Speech Processing, 2016

Novel Word Embedding and Translation-based Language Modeling for Extractive Speech Summarization.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Employing median filtering to enhance the complex-valued acoustic spectrograms in modulation domain for noise-robust speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Exploring Word Mover's Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Mispronunciation Detection Leveraging Maximum Performance Criterion Training of Acoustic Models and Decision Functions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Improved spoken document summarization with coverage modeling techniques.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Learning to Distill: The Essence Vector Modeling Framework.
Proceedings of the COLING 2016, 2016

Exploiting graph regularized nonnegative matrix factorization for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

A novel paragraph embedding method for spoken document summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Combining Relevance Language Modeling and Clarity Measure for Extractive Speech Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Extractive Broadcast News Summarization Leveraging Recurrent Neural Network Language Modeling Techniques.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Histogram equalization of contextual statistics of speech features for robust speech recognition.
Multim. Tools Appl., 2015

Extractive Spoken Document Summarization with Representation Learning Techniques.
Int. J. Comput. Linguistics Chin. Lang. Process., 2015

Investigating Modulation Spectrum Factorization Techniques for Robust Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2015

表示法學習技術於節錄式語音文件摘要之研究(A Study on Representation Learning Techniques for Extractive Spoken Document Summarization) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

可讀性預測於中小學國語文教科書及優良課外讀物之研究(A Study of Readability Prediction on Elementary and Secondary Chinese Textbooks and Excellent Extracurricular Reading Materials) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

融合多種深層類神經網路聲學模型與分類技術於華語錯誤發音檢測之研究(Exploring Combinations of Various Deep Neural Network based Acoustic Models and Classification Techniques for Mandarin Mispro-nunciation Detection)[In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

使用詞向量表示與概念資訊於中文大詞彙連續語音辨識之語言模型調適(Exploring Word Embedding and Concept Information for Language Model Adaptation in Mandarin Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

調變頻譜分解之改良於強健性語音辨識(Several Refinements of Modulation Spectrum Factorization for Robust Speech Recognition) [In Chinese].
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing, 2015

Positional language modeling for extractive broadcast news speech summarization.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Leveraging word embeddings for spoken document summarization.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

I-vector based language modeling for query representation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Incorporating paragraph embeddings and density peaks clustering for spoken document summarization.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Incorporating proximity information in relevance language modeling for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Enhancing the complex-valued acoustic spectrograms in modulation domain for creating noise-robust features in speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Leveraging topical and positional cues for language modeling in speech recognition.
Multim. Tools Appl., 2014

Enhancing Query Formulation for Spoken Document Retrieval.
J. Inf. Sci. Eng., 2014

Exploring Concept Information for Mandarin Large Vocabulary Continuous Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2014

探究新穎語句模型化技術於節錄式語音摘要 (Investigating Novel Sentence Modeling Techniques for Extractive Speech Summarization) [In Chinese].
Proceedings of the 26th Conference on Computational Linguistics and Speech Processing, 2014

運用概念模型化技術於中文大詞彙連續語音辨識之語言模型調適 (Leveraging Concept Modeling Techniques for Language Model Adaptation in Mandarin Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 26th Conference on Computational Linguistics and Speech Processing, 2014

Enhanced language modeling for extractive speech summarization with sentence relatedness information.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Effective modulation spectrum factorization for robust speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A recurrent neural network language modeling framework for extractive speech summarization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Effective pseudo-relevance feedback for language modeling in extractive speech summarization.
Proceedings of the IEEE International Conference on Acoustics, 2014

I-vector based language modeling for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2014

Leveraging Effective Query Modeling Techniques for Speech Recognition and Summarization.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

A margin-based discriminative modeling approach for extractive speech summarization.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Spatial histogram equalization of complex-valued acoustic spectra in modulation domain for noise-robust speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Extractive speech summarization using evaluation metric-related training criteria.
Inf. Process. Manag., 2013

Leveraging relevance cues for language modeling in speech recognition.
Inf. Process. Manag., 2013

改良語句模型技術於節錄式語音摘要之研究 (Improved Sentence Modeling Techniques for Extractive Speech Summarization) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

改良調變頻譜統計圖等化法於強健性語音辨識之研究 (Improved Modulation Spectrum Histogram Equalization for Robust Speech Recognition) [In Chinese].
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing, 2013

Distribution-based feature normalization for robust speech recognition leveraging context and dynamics cues.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Incorporating proximity information for relevance language modeling in speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Sentence modeling for extractive speech summarization.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Weighted matrix factorization for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2013

Effective pseudo-relevance feedback for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2013

The Influence of Guanxi Gradient on Crew Resource Management and Values in the Cockpit.
Proceedings of the Engineering Psychology and Cognitive Ergonomics. Applications and Services, 2013

Effective pseudo-relevance feedback for language modeling in speech recognition.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
A Risk-Aware Modeling Framework for Speech Summarization.
IEEE Trans. Speech Audio Process., 2012

Spoken Document Retrieval With Unsupervised Query Modeling Techniques.
IEEE Trans. Speech Audio Process., 2012

A Comparative Study of Methods for Topic Modeling in Spoken Document Retrieval.
Int. J. Comput. Linguistics Chin. Lang. Process., 2012

Speech Recognition Leveraging Histogram Equalization Methods.
Int. J. Comput. Linguistics Chin. Lang. Process., 2012

Spoken Document Retrieval Leveraging Unsupervised and Supervised Topic Modeling Techniques.
IEICE Trans. Inf. Syst., 2012

遞迴式類神經網路語言模型應用額外資訊於語音辨識之研究 (Recurrent Neural Network-based Language Modeling with Extra Information Cues for Speech Recognition) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012

改良式統計圖等化法強鍵性語音辨識之研究 (Improved Histogram Equalization Methods for Robust Speech Recognition) [In Chinese].
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, 2012

Leveraging distributional characteristics of modulation spectra for robust speech recognition.
Proceedings of the 11th International Conference on Information Science, 2012

Exploring Joint Equalization of Spatial-Temporal Contextual Statistics of Speech Features for Robust Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Word Relevance Modeling for Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Constructing effective ranking models for speech summarization.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Leveraging Kullback-Leibler Divergence Measures and Information-Rich Cues for Speech Summarization.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Robust speech recognition using spatial-temporal feature distribution characteristics.
Pattern Recognit. Lett., 2011

實證探究多種鑑別式語言模型於語音辨識之研究 (Empirical Comparisons of Various Discriminative Language Models for Speech Recognition) [In Chinese].
Proceedings of the 23rd Conference on Computational Linguistics and Speech Processing, 2011

機率式調變頻譜分解於強健性語音辨識 (Probabilistic Modulation Spectrum Factorization for Robust Speech Recognition) [In Chinese].
Proceedings of the Poster Proceedings of the 23rd Conference on Computational Linguistics and Speech Processing, 2011

Leveraging Relevance Cues for Improved Spoken Document Retrieval.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

An Effective and Robust Framework for Transliteration Exploration.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Discriminative language modeling for speech recognition with relevance information.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Handling verbose queries for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2011

Relevance language modeling for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Query modeling for spoken document retrieval.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
整合邊際資訊於鑑別式聲學模型訓練方法之比較研究 (A Comparative Study on Margin-Based Discriminative Training of Acoustic Models) [In Chinese].
Proceedings of the 22th Conference on Computational Linguistics and Speech Processing, 2010

鑑別式語言模型於語音辨識結果重新排序之研究 (Exploiting Discriminative Language Models for Reranking Speech Recognition Hypotheses) [In Chinese].
Proceedings of the 22th Conference on Computational Linguistics and Speech Processing, 2010

Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Extractive speech summarization - from the view of decision theory.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A Discriminative and Heteroscedastic Linear Feature Transformation for Multiclass Classification.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Leveraging evaluation metric-related training criteria for speech summarization.
Proceedings of the IEEE International Conference on Acoustics, 2010

Latent topic modeling of word vicinity information for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Transliteration Retrieval Model for Cross Lingual Information Retrieval.
Proceedings of the Information Retrieval Technology, 2010

A Risk Minimization Framework for Extractive Speech Summarization.
Proceedings of the ACL 2010, 2010

2009
Exploring the Use of Speech Features and Their Corresponding Distribution Characteristics for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2009

A Probabilistic Generative Framework for Extractive Broadcast News Speech Summarization.
IEEE Trans. Speech Audio Process., 2009

A Comparative Study of Probabilistic Ranking Models for Chinese Spoken Document Summarization.
ACM Trans. Asian Lang. Inf. Process., 2009

Word Topic Models for Spoken Document Retrieval and Transcription.
ACM Trans. Asian Lang. Inf. Process., 2009

Training data selection for improving discriminative training of acoustic models.
Pattern Recognit. Lett., 2009

The NTNU Summarization System at TAC 2009.
Proceedings of the Second Text Analysis Conference, 2009

相似度比率式鑑別分析應用於大詞彙連續語音辨識 (Likelihood Ratio Based Discriminant Analysis for Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

主題語言模型於大詞彙連續語音辨識之研究 (On the Use of Topic Models for Large-Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing, 2009

Topic modeling for spoken document retrieval using word- and syllable-level information.
Proceedings of the third workshop on Searching spontaneous conversational speech, 2009

Hybrids of supervised and unsupervised models for extractive speech summarization.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Improved speech summarization with multiple-hypothesis representations and kullback-leibler divergence measures.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Empirical error rate minimization based linear discriminant analysis.
Proceedings of the IEEE International Conference on Acoustics, 2009

Latent topic modelling of word co-occurence information for spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2009

Generalized likelihood ratio discriminant analysis.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Extractive spoken document summarization for information retrieval.
Pattern Recognit. Lett., 2008

Improved Minimum Phone Error based Discriminative Training of Acoustic Models for Mandarin Large Vocabulary Continuous Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2008

Improved Linear Discriminant Analysis Considering Empirical Pairwise Classification Error Rates.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Position Information for Language Modeling in Speech Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Linear discriminant feature extraction using weighted classification confusion information.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Exploiting spatial-temporal feature distribution characteristics for robust speech recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
改善以最小化音素錯誤為基礎的鑑別式聲學模型訓練於中文連續語音辨識之研究 (Improved Minimum Phone Error based Discriminative Training of Acoustic Models for Chinese Continuous Speech Reconigtion) [In Chinese].
Proceedings of the 19th Conference on Computational Linguistics and Speech Processing, 2007

Subword-based position specific posterior lattices (s-PSPL) for indexing speech information.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Cluster-based polynomial-fit histogram equalization (CPHEQ) for robust speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

A unified probabilistic generative framework for extractive spoken document summarization.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Investigating Data Selection for Minimum Phone Error Training of Acoustic Models.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Improved Histogram Equalzaiton (HEQ) for Robust Speech Recogntion.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Word Topical Mixture Models for Extractive Spoken Document Summarization.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Word Topical Mixture Models for Dynamic Language Model Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Training data selection for improving discriminative training of acoustic models.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Investigating the use of speech features and their corresponding distribution characteristics for robust speech recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Spoken document summarization using relevant information.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Exploring the use of latent topical information for statistical Chinese spoken document retrieval.
Pattern Recognit. Lett., 2006

Voice retrieval of Mandarin broadcast news speech.
Int. J. Pattern Recognit. Artif. Intell., 2006

An Empirical Study of Word Error Minimization Approaches for Mandarin Large Vocabulary Continuous Speech Recognition.
Int. J. Comput. Linguistics Chin. Lang. Process., 2006

統計圖等化法於雜訊語音辨識之進一步研究 (An Improved Histogram Equalization Approach for Robust Speech Recognition) [In Chinese].
Proceedings of the 18th Conference on Computational Linguistics and Speech Processing, 2006

Extractive Chinese Spoken Document Summarization Using Probabilistic Ranking Models.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Minimum Phone Error (MPE) Model and Feature Training on Mandarin Broadcast News Task.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

On Using Entropy Information to Improve Posterior Probability-Based Confidence Measures.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

Exploiting polynomial-fit histogram equalization and temporal average for robust speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Chinese Spoken Document Summarization Using Probabilistic Latent Topical Information.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Spoken document understanding and organization.
IEEE Signal Process. Mag., 2005

MATBN: A Mandarin Chinese Broadcast News Corpus.
Int. J. Comput. Linguistics Chin. Lang. Process., 2005

Lightly Supervised and Data-Driven Approaches to Mandarin Broadcast News Transcription.
Int. J. Comput. Linguistics Chin. Lang. Process., 2005

風險最小化準則在中文大詞彙連續語音辨識之研究 (Risk Minimization Criterion for Mandarin Large Vocabulary Continuous Speech Recognition) [In Chinese].
Proceedings of the 17th Conference on Computational Linguistics and Speech Processing, 2005

Hierarchical topic organization and visual presentation of spoken documents using probabilistic latent semantic analysis (PLSA) for efficient retrieval/browsing applications.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Minimum word error based discriminative training of language models.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Speech retrieval of Mandarin broadcast news via mobile devices.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Dynamic language model adaptation using latent topical information and automatic transcripts.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
A discriminative HMM/N-gram-based retrieval approach for mandarin spoken documents.
ACM Trans. Asian Lang. Inf. Process., 2004

Mandarin-English Information (MEI): investigating translingual speech retrieval.
Comput. Speech Lang., 2004

非監督式學習於中文電視新聞自動轉寫之初步應用 (Unsupervised Learning for Chinese Broadcast News Transcription) [In Chinese].
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing, 2004

Statistical language model adaptation for Mandarin broadcast news transcription.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Statistical Chinese spoken document retrieval using latent topical information.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2002
Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese.
IEEE Trans. Speech Audio Process., 2002

A hierarchical tag-graph search scheme with layered grammar rules for spontaneous speech understanding.
Pattern Recognit. Lett., 2002

A data-driven indexing approach for Chinese spoken document retrieval.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Improved Chinese spoken document retrieval with hybrid modeling and data-driven indexing features.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Content-based Language Models for Spoken Document Retrieval.
Int. J. Comput. Process. Orient. Lang., 2001

Comparison of Word and Subword Indexing Techniques for Mandarin Chinese Spoken Document Retrieval.
Proceedings of the Advances in Multimedia Information Processing, 2001

Mandarin-English Information: Investigating Translingual Speech Retrieval.
Proceedings of the First International Conference on Human Language Technology Research, 2001

An HMM/n-gram-based linguistic processing approach for Mandarin spoken document retrieval.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Improved spoken document retrieval by exploring extra acoustic and linguistic cues.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Multi-scale-audio indexing for translingual spoken document retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Syllable-Based Chinese Text/Spoken Document Retrieval Using Text/Speech Queries.
Int. J. Pattern Recognit. Artif. Intell., 2000

Browsing the Chinese Web Pages Using Mandarin Speech.
Int. J. Comput. Process. Orient. Lang., 2000

Initial Experiments On Recognition of Internet-Accessible Compressed Mandarin Speech.
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000

Retrieval of mandarin broadcast news using spoken queries.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Retrieval of broadcast news speech in Mandarin Chinese collected in Taiwan using syllable-level statistical characteristics.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
Large-Vocabulary Chinese Text/Speech Information Retrieval Using Mandarin Speech Queries.
Proceedings of the 1998 International Symposium on Chinese Spoken Language Processing, 1998

Towards a Mandarin voice memo system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Hierarchical tag-graph search for spontaneous speech understanding in spoken dialog systems.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

A*-admissible key-phrase spotting with sub-syllable level utterance verification.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

1996
Speaker-independent mandarin polysyllabic word recognition.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996


  Loading...