Nancy F. Chen

Orcid: 0000-0003-0872-5877

According to our database1, Nancy F. Chen authored at least 187 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models.
CoRR, 2024

Semi-supervised Learning For Robust Speech Evaluation.
CoRR, 2024

Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization.
CoRR, 2024

MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders.
CoRR, 2024

MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues.
CoRR, 2024

LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs.
CoRR, 2024

PRESENT: Zero-Shot Text-to-Prosody Control.
CoRR, 2024

AudioBench: A Universal Benchmark for Audio Large Language Models.
CoRR, 2024

Dataset-Distillation Generative Model for Speech Emotion Recognition.
CoRR, 2024

Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework.
CoRR, 2024

CRAFT: Extracting and Tuning Cultural Instructions from the Wild.
CoRR, 2024

CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment.
CoRR, 2024

Resilience of Large Language Models for Noisy Instructions.
CoRR, 2024

Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems.
CoRR, 2024

Scaffolding Language Learning via Multi-modal Tutoring Systems with Pedagogical Instructions.
CoRR, 2024

Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing.
CoRR, 2024

Optimizing Code-Switching in Conversational Tutoring Systems: A Pedagogical Framework and Evaluation.
Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024

Exploring Self-supervised Logic-enhanced Training for Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Distilling Distributional Uncertainty from a Gaussian Process.
Proceedings of the IEEE International Conference on Acoustics, 2024

Noise Robust Distillation of Self-Supervised Speech Models via Correlation Metrics.
Proceedings of the IEEE International Conference on Acoustics, 2024

Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LOCOST: State-Space Models for Long Document Abstractive Summarization.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Granular Change Accuracy: A More Accurate Performance Metric for Dialogue State Tracking.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On Context Utilization in Summarization with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models' Understanding of Discourse Relations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Prompt Optimization via Adversarial In-Context Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Context Aggregation with Topic-focused Summarization for Personalized Medical Dialogue Generation.
Proceedings of the 6th Clinical Natural Language Processing Workshop, 2024

2023
Data Science Education: The Signal Processing Perspective [SP Education].
IEEE Signal Process. Mag., November, 2023

Modelling Inter-Rater Uncertainty in Spoken Language Assessment.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Prompt Optimization via Adversarial In-Context Learning.
CoRR, 2023

ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning.
CoRR, 2023

VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency.
CoRR, 2023

Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection.
CoRR, 2023

Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input.
CoRR, 2023

Instructive Dialogue Summarization with Query Aggregations.
CoRR, 2023

On Position Bias in Summarization with Large Language Models.
CoRR, 2023

Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards.
CoRR, 2023

LLM Based Multi-Document Summarization Exploiting Main-Event Biased Monotone Submodular Content Extraction.
CoRR, 2023

SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning.
CoRR, 2023

PromptSum: Parameter-Efficient Controllable Abstractive Summarization.
CoRR, 2023

Multiple output samples for each input in a single-output Gaussian process.
CoRR, 2023

LogicLLM: Exploring Self-supervised Logic-enhanced Training for Large Language Models.
CoRR, 2023

Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models.
CoRR, 2023

Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation.
CoRR, 2023

Inclusive AI for Language Learning.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

C3: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues.
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

Distilling knowledge from Gaussian process teacher to neural network student.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information.
Proceedings of the IEEE International Conference on Acoustics, 2023

Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Variational Gaussian Process Data Uncertainty.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Unsupervised Summarization Re-ranking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Guiding Computational Stance Detection with Expanded Stance Triangle Framework.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
A transformer-Based neural language model that synthesizes brain activation maps from free-form text queries.
Medical Image Anal., 2022

SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech.
CoRR, 2022

Are Current Task-oriented Dialogue Systems Able to Satisfy Impolite Users?
CoRR, 2022

Domain-specific Language Pre-training for Dialogue Comprehension on Clinical Inquiry-Answering Conversations.
CoRR, 2022

Large-Scale Acoustic Characterization of Singaporean Children's English Pronunciation.
CoRR, 2022

Entity-based De-noising Modeling for Controllable Dialogue Summarization.
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

Learning from Bootstrapping and Stepwise Reinforcement Reward: A Semi-Supervised Framework for Text Style Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Multimodal Dialogue State Tracking.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Variations of multi-task learning for spoken language assessment.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Dynamic Sliding Window Modeling for Abstractive Meeting Summarization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Incremental Context Aware Attentive Knowledge Tracing.
Proceedings of the IEEE International Conference on Acoustics, 2022

Progressive Continual Learning for Spoken Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Summary Candidates Fusion.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Singlish Message Paraphrasing: A Joint Task of Creole Translation and Text Normalization.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

CoHS-CQG: Context and History Selection for Conversational Question Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Domain-Shift Conditioning Using Adaptable Filtering Via Hierarchical Embeddings for Robust Chinese Spell Check.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Improving Multi-Party Dialogue Discourse Parsing via Domain Integration.
CoRR, 2021

DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing.
CoRR, 2021

Dynamic Sliding Window for Meeting Summarization.
CoRR, 2021

C<sup>3</sup>: Compositional Counterfactual Constrastive Learning for Video-grounded Dialogues.
CoRR, 2021

VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks.
CoRR, 2021

A Simple But Effective Approach to n-shot Task-Oriented Dialogue Augmentation.
CoRR, 2021

Coreference-Aware Dialogue Summarization.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Velocidapter: Task-oriented Dialogue Comprehension Modeling Pairing Synthetic Text Generation with Domain Adaptation.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Coherent and Concise Radiology Report Generation via Context Specific Image Representations and Orthogonal Sentence States.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

Text2Brain: Synthesis of Brain Activation Maps from Free-Form Text Query.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Multilingual Speech Evaluation: Case Studies on English, Malay and Tamil.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

WittyKiddy: Multilingual Spoken Language Learning for Kids.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues.
Proceedings of the 9th International Conference on Learning Representations, 2021

Senone-Aware Adversarial Multi-Task Training for Unsupervised Child to Adult Speech Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Controllable Neural Dialogue Summarization with Personal Named Entity Planning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Analyzing Code Embeddings for Coding Clinical Narratives.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Have We Solved The Hard Problem? It's Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Explanations in Predictive Analytics: Case Studies.
Proceedings of the Knowledge Graphs for eXplainable Artificial Intelligence: Foundations, 2020

Hierarchical Character Embeddings: Learning Phonological and Semantic Representations in Languages of Logographic Origin Using Recursive Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Hierarchical multimodal attention for end-to-end audio-visual scene-aware dialogue response generation.
Comput. Speech Lang., 2020

An End-to-End Document-Level Neural Discourse Parser Exploiting Multi-Granularity Representations.
CoRR, 2020

Adaptable Filtering using Hierarchical Embeddings for Chinese Spell Check.
CoRR, 2020

Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge.
CoRR, 2020

Computer-Assisted Language Learning System: Automatic Speech Evaluation for Children Learning Malay and Tamil.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Characterization of Singaporean Children's English: Comparisons to American and British Counterparts Using Archetypal Analysis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Feature Adaptation Using Adversarial Multi-Task Training for Automatic Evaluation of Children's Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conditional Neural Generation using Sub-Aspect Functions for Extractive News Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

UniConv: A Unified Conversational Neural Architecture for Multi-domain Task-oriented Dialogues.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multilingual Neural RST Discourse Parsing.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Uncertainty Modeling for Machine Comprehension Systems using Efficient Bayesian Neural Networks.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Phonology-Augmented Statistical Framework for Machine Transliteration Using Limited Linguistic Resources.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Reranking of Responses Using Transfer Learning for a Retrieval-Based Chatbot.
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

Set to Ordered Text: Generating Discharge Instructions from Medical Billing Codes.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Joint Learning of Word and Label Embeddings for Sequence Labelling in Spoken Language Understanding.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Topic-Aware Pointer-Generator Networks for Summarizing Spoken Conversations.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Reading Turn by Turn: Hierarchical Attention Architecture for Spoken Dialogue Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Isolating the Effects of Modeling Recursive Structures: A Case Study in Pronunciation Prediction of Chinese Characters.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019

Acoustic Characterization of Singaporean Children's English: Comparisons to American and British Counterparts.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019

2018
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks.
J. Signal Process. Syst., 2018

Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Re-ranking spoken term detection with acoustic exemplars of keywords.
Speech Commun., 2018

Topic and Keyword Identification for Low-resourced Speech Using Cross-Language Transfer Learning.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Recognizing Zero-Resourced Languages Based on Mismatched Machine Transcriptions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multimodal neural pronunciation modeling for spoken languages with logographic origin.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Attention-based Semantic Priming for Slot-filling.
Proceedings of the Seventh Named Entities Workshop, 2018

Statistical Machine Transliteration Baselines for NEWS 2018.
Proceedings of the Seventh Named Entities Workshop, 2018

NEWS 2018 Whitepaper.
Proceedings of the Seventh Named Entities Workshop, 2018

Report of NEWS 2018 Named Entity Transliteration Shared Task.
Proceedings of the Seventh Named Entities Workshop, 2018

2017
ASR for Under-Resourced Languages From Probabilistic Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Introduction to the Special Issue on End-to-End Speech and Language Processing.
IEEE J. Sel. Top. Signal Process., 2017

Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text.
CoRR, 2017

Pruning Strategies for Partial Search in Spoken Term Detection.
Proceedings of the Eighth International Symposium on Information and Communication Technology, 2017

Multi-Task Learning for Mispronunciation Detection on Singapore Children's Mandarin Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-Task Learning Using Mismatched Transcription for Under-Resourced Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Mismatched Crowdsourcing from Multiple Annotator Languages for Recognizing Zero-Resourced Languages: A Nullspace Clustering Approach.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
A Keyword-Aware Language Modeling Approach to Spoken Keyword Search.
J. Signal Process. Syst., 2016

Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL.
Speech Commun., 2016

Clustering-based Phonetic Projection in Mismatched Crowdsourcing Channels for Low-resourced ASR.
Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Mismatched Crowdsourcing based Language Perception for Under-resourced Languages.
Proceedings of the SLTU-2016, 2016

A many-to-one phone mapping approach for cross-lingual speech recognition.
Proceedings of the 2016 IEEE RIVF International Conference on Computing & Communication Technologies, 2016

Context Aware Mispronunciation Detection for Mandarin Pronunciation Training.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Perception of Tone in Whispered Mandarin Sentences: The Case for Singapore Mandarin.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Keyword search using query expansion for graph-based rescoring of hypothesized detections.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exemplar-inspired strategies for low-resource spoken keyword search in Swahili.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Speech recognition of under-resourced languages using mismatched transcriptions.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Regulating Orthography-Phonology Relationship for English to Thai Transliteration.
Proceedings of the Sixth Named Entity Workshop, 2016

2015
Corpus-based pronunciation variation rule analysis for singapore English.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Goodness of tone (GOT) for non-native Mandarin tone recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Phonology-augmented statistical transliteration for low-resource languages.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Tokenizing fundamental frequency variation for Mandarin tone error detection.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A keyword-aware grammar framework for LVCSR-based spoken keyword search.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Low-resource keyword search strategies for tamil.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

System and keyword dependent fusion for spoken term detection.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Multiple time-span feature fusion for deep neural network modeling.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A minimal-resource transliteration framework for vietnamese.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A whispered Mandarin corpus for speech technology applications.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Subspace Gaussian mixture model for computer-assisted language learning.
Proceedings of the IEEE International Conference on Acoustics, 2014

Discriminative score normalization for keyword search decision.
Proceedings of the IEEE International Conference on Acoustics, 2014

Strategies for Vietnamese keyword search.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Large-scale characterization of Mandarin pronunciation errors made by native speakers of European languages.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Minimal-resource phonetic language models to summarize untranscribed speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Analyzing and Interpreting Automatically Learned Rules Across Dialects.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Characterizing Deletion Transformations Across Dialects Using a Sophisticated Tying Mechanism.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Informative dialect recognition using context-dependent pronunciation modeling.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Large-scale analysis of formant frequency estimation variability in conversational telephone speech.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Dialect recognition using adapted phonetic models.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008


  Loading...