Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Optimizing Code-Switching in Conversational Tutoring Systems: A Pedagogical Framework and Evaluation.

[DOI]

Stella Xin Yin

Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024

SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning.

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Exploring Self-supervised Logic-enhanced Training for Large Language Models.

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Dataset-Distillation Generative Model for Speech Emotion Recognition.

[DOI]

Fabian Ritter Gutierrez

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Distilling Distributional Uncertainty from a Gaussian Process.

[DOI]

Jeremy H. M. Wong

Proceedings of the IEEE International Conference on Acoustics, 2024

Noise Robust Distillation of Self-Supervised Speech Models via Correlation Metrics.

[DOI]

Fabian Ritter Gutierrez

Proceedings of the IEEE International Conference on Acoustics, 2024

Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning.

[DOI]

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024

Resilience of Large Language Models for Noisy Instructions.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models.

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems.

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing.

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LOCOST: State-Space Models for Long Document Abstractive Summarization.

[DOI]

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Granular Change Accuracy: A More Accurate Performance Metric for Dialogue State Tracking.

[DOI]

Taha Aksu

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On Context Utilization in Summarization with Large Language Models.

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models' Understanding of Discourse Relations.

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Prompt Optimization via Adversarial In-Context Learning.

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Context Aggregation with Topic-focused Summarization for Personalized Medical Dialogue Generation.

[DOI]

Pavitra Krishnaswamy

Proceedings of the 6th Clinical Natural Language Processing Workshop, 2024

2023

Data Science Education: The Signal Processing Perspective [SP Education].

[DOI]

IEEE Signal Process. Mag., November, 2023

Modelling Inter-Rater Uncertainty in Spoken Language Assessment.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2023

Prompt Optimization via Adversarial In-Context Learning.

[DOI]

CoRR, 2023

ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning.

[DOI]

CoRR, 2023

VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency.

[DOI]

Vernon Toh

Ratish Puduppully

CoRR, 2023

Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input.

[DOI]

CoRR, 2023

On Position Bias in Summarization with Large Language Models.

[DOI]

CoRR, 2023

Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards.

[DOI]

CoRR, 2023

LLM Based Multi-Document Summarization Exploiting Main-Event Biased Monotone Submodular Content Extraction.

[DOI]

CoRR, 2023

PromptSum: Parameter-Efficient Controllable Abstractive Summarization.

[DOI]

CoRR, 2023

Multiple output samples for each input in a single-output Gaussian process.

[DOI]

CoRR, 2023

LogicLLM: Exploring Self-supervised Logic-enhanced Training for Large Language Models.

[DOI]

CoRR, 2023

Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models.

[DOI]

CoRR, 2023

Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation.

[DOI]

CoRR, 2023

Inclusive AI for Language Learning.

[DOI]

Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023

C3: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues.

[DOI]

Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023

Distilling knowledge from Gaussian process teacher to neural network student.

[DOI]

Jeremy H. M. Wong

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Instructive Dialogue Summarization with Query Aggregations.

[DOI]

Bin Wang

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations.

[DOI]

Hong Choon Oh

Pavitra Krishnaswamy

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection.

[DOI]

Hai Leong Chieu

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation.

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer.

[DOI]

Zhiqiang Hu

Roy Ka-Wei Lee

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Variational Gaussian Process Data Uncertainty.

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Unsupervised Summarization Re-ranking.

[DOI]

Mathieu Ravaut

Shafiq R. Joty

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Guiding Computational Stance Detection with Expanded Stance Triangle Framework.

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation.

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation.

[DOI]

Ibrahim Taha Aksu

Min-Yen Kan

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

A transformer-based neural language model that synthesizes brain activation maps from free-form text queries.

[DOI]

Medical Image Anal., 2022

SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech.

[DOI]

CoRR, 2022

Are Current Task-oriented Dialogue Systems Able to Satisfy Impolite Users?

[DOI]

Zhiqiang Hu

Roy Ka-Wei Lee

CoRR, 2022

Domain-specific Language Pre-training for Dialogue Comprehension on Clinical Inquiry-Answering Conversations.

[DOI]

Pavitra Krishnaswamy

CoRR, 2022

Large-Scale Acoustic Characterization of Singaporean Children's English Pronunciation.

[DOI]

CoRR, 2022

Entity-based De-noising Modeling for Controllable Dialogue Summarization.

[DOI]

Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

Learning from Bootstrapping and Stepwise Reinforcement Reward: A Semi-Supervised Framework for Text Style Transfer.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Multimodal Dialogue State Tracking.

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems.

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Variations of multi-task learning for spoken language assessment.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Dynamic Sliding Window Modeling for Abstractive Meeting Summarization.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models.

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Incremental Context Aware Attentive Knowledge Tracing.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Progressive Continual Learning for Spoken Keyword Spotting.

[DOI]

Yizheng Huang

Nana Hou

Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Summary Candidates Fusion.

[DOI]

Mathieu Ravaut

Shafiq R. Joty

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Singlish Message Paraphrasing: A Joint Task of Creole Translation and Text Normalization.

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

CoHS-CQG: Context and History Selection for Conversational Question Generation.

[DOI]

Proceedings of the 29th International Conference on Computational Linguistics, 2022

SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization.

[DOI]

Mathieu Ravaut

Shafiq R. Joty

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Domain-Shift Conditioning Using Adaptable Filtering Via Hierarchical Embeddings for Robust Chinese Spell Check.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Improving Multi-Party Dialogue Discourse Parsing via Domain Integration.

[DOI]

CoRR, 2021

DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing.

[DOI]

CoRR, 2021

Dynamic Sliding Window for Meeting Summarization.

[DOI]

CoRR, 2021

C3: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues.

[DOI]

CoRR, 2021

VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks.

[DOI]

CoRR, 2021

A Simple But Effective Approach to n-shot Task-Oriented Dialogue Augmentation.

[DOI]

CoRR, 2021

Coreference-Aware Dialogue Summarization.

[DOI]

Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Velocidapter: Task-oriented Dialogue Comprehension Modeling Pairing Synthetic Text Generation with Domain Adaptation.

[DOI]

Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Coherent and Concise Radiology Report Generation via Context Specific Image Representations and Orthogonal Sentence States.

[DOI]

Ai Ti Aw

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021

Text2Brain: Synthesis of Brain Activation Maps from Free-Form Text Query.

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021

Multilingual Speech Evaluation: Case Studies on English, Malay and Tamil.

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

WittyKiddy: Multilingual Spoken Language Learning for Kids.

[DOI]

Kye Min Tan

Shikang Ni

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues.

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Senone-Aware Adversarial Multi-Task Training for Unsupervised Child to Adult Speech Adaptation.

[DOI]

Richeng Duan

Proceedings of the IEEE International Conference on Acoustics, 2021

Controllable Neural Dialogue Summarization with Personal Named Entity Planning.

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Analyzing Code Embeddings for Coding Clinical Narratives.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Have We Solved The Hard Problem? It's Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence.

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Explanations in Predictive Analytics: Case Studies.

[DOI]

Proceedings of the Knowledge Graphs for eXplainable Artificial Intelligence: Foundations, 2020

Hierarchical Character Embeddings: Learning Phonological and Semantic Representations in Languages of Logographic Origin Using Recursive Neural Networks.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Hierarchical multimodal attention for end-to-end audio-visual scene-aware dialogue response generation.

[DOI]

Comput. Speech Lang., 2020

An End-to-End Document-Level Neural Discourse Parser Exploiting Multi-Granularity Representations.

[DOI]

CoRR, 2020

Adaptable Filtering using Hierarchical Embeddings for Chinese Spell Check.

[DOI]

CoRR, 2020

Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge.

[DOI]

CoRR, 2020

Computer-Assisted Language Learning System: Automatic Speech Evaluation for Children Learning Malay and Tamil.

[DOI]

Kye Min Tan

Richeng Duan

Ngoc Thuy Huong Helen Thai

Nur Farah Ain Suhaimi

Rajan Vellu

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Characterization of Singaporean Children's English: Comparisons to American and British Counterparts Using Archetypal Analysis.

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Unsupervised Feature Adaptation Using Adversarial Multi-Task Training for Automatic Evaluation of Children's Speech.

[DOI]

Richeng Duan

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conditional Neural Generation using Sub-Aspect Functions for Extractive News Summarization.

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

UniConv: A Unified Conversational Neural Architecture for Multi-domain Task-oriented Dialogues.

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues.

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multilingual Neural RST Discourse Parsing.

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Uncertainty Modeling for Machine Comprehension Systems using Efficient Bayesian Neural Networks.

[DOI]

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences.

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Phonology-Augmented Statistical Framework for Machine Transliteration Using Limited Linguistic Resources.

[DOI]

Hoang Gia Ngo

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring.

[DOI]

Nur Farah Ain Binte Sahimi

Jia Hui Hazel Lim

Shao Chuen Tong

Sharon Ong

Angela Ng

Sheldon Lee Shao Guang

Michael Ross Macdonald

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Reranking of Responses Using Transfer Learning for a Retrieval-Based Chatbot.

[DOI]

Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

Set to Ordered Text: Generating Discharge Instructions from Medical Billing Codes.

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Joint Learning of Word and Label Embeddings for Sequence Labelling in Spoken Language Understanding.

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Topic-Aware Pointer-Generator Networks for Summarizing Spoken Conversations.

[DOI]

Angela Ng

Sheldon Lee Shao Guang

Ai Ti Aw

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Reading Turn by Turn: Hierarchical Attention Architecture for Spoken Dialogue Comprehension.

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems.

[DOI]

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Isolating the Effects of Modeling Recursive Structures: A Case Study in Pronunciation Prediction of Chinese Characters.

[DOI]

Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019

Acoustic Characterization of Singaporean Children's English: Comparisons to American and British Counterparts.

[DOI]

Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019

2018

Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks.

[DOI]

Jinsong Zhang

J. Signal Process. Syst., 2018

Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription.

[DOI]

Mark A. Hasegawa-Johnson

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Re-ranking spoken term detection with acoustic exemplars of keywords.

[DOI]

Speech Commun., 2018

Topic and Keyword Identification for Low-resourced Speech Using Cross-Language Transfer Learning.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models.

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Recognizing Zero-Resourced Languages Based on Mismatched Machine Transcriptions.

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multimodal neural pronunciation modeling for spoken languages with logographic origin.

[DOI]

Hoang Gia Ngo

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Attention-based Semantic Priming for Slot-filling.

[DOI]

Proceedings of the Seventh Named Entities Workshop, 2018

Statistical Machine Transliteration Baselines for NEWS 2018.

[DOI]

Proceedings of the Seventh Named Entities Workshop, 2018

NEWS 2018 Whitepaper.

[DOI]

Proceedings of the Seventh Named Entities Workshop, 2018

Report of NEWS 2018 Named Entity Transliteration Shared Task.

[DOI]

Proceedings of the Seventh Named Entities Workshop, 2018

2017

ASR for Under-Resourced Languages From Probabilistic Transcription.

[DOI]

Mark A. Hasegawa-Johnson

Preethi Jyothi

Daniel McCloy

Majid Mirbagheri

Giovanni M. Di Liberto

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Introduction to the Special Issue on End-to-End Speech and Language Processing.

[DOI]

IEEE J. Sel. Top. Signal Process., 2017

Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text.

[DOI]

CoRR, 2017

Pruning Strategies for Partial Search in Spoken Term Detection.

[DOI]

Proceedings of the Eighth International Symposium on Information and Communication Technology, 2017

Multi-Task Learning for Mispronunciation Detection on Singapore Children's Mandarin Speech.

[DOI]

Rong Tong

Bin Ma

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models.

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Multi-Task Learning Using Mismatched Transcription for Under-Resourced Speech Recognition.

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Mismatched Crowdsourcing from Multiple Annotator Languages for Recognizing Zero-Resourced Languages: A Nullspace Clustering Approach.

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search.

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Low-resource spoken keyword search strategies in Georgian inspired by distinctive feature theory.

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

A Keyword-Aware Language Modeling Approach to Spoken Keyword Search.

[DOI]

J. Signal Process. Syst., 2016

Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL.

[DOI]

Speech Commun., 2016

Clustering-based Phonetic Projection in Mismatched Crowdsourcing Channels for Low-resourced ASR.

[DOI]

Preethi Jyothi

Lav R. Varshney

Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016

Mismatched Crowdsourcing based Language Perception for Under-resourced Languages.

[DOI]

Proceedings of the SLTU-2016, 2016

A many-to-one phone mapping approach for cross-lingual speech recognition.

[DOI]

Proceedings of the 2016 IEEE RIVF International Conference on Computing & Communication Technologies, 2016

Context Aware Mispronunciation Detection for Mandarin Pronunciation Training.

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples.

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees.

[DOI]

Kehuang Li

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Perception of Tone in Whispered Mandarin Sentences: The Case for Singapore Mandarin.

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages.

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese.

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Keyword search using query expansion for graph-based rescoring of hypothesized detections.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery.

[DOI]

Ann Lee

James R. Glass

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exemplar-inspired strategies for low-resource spoken keyword search in Swahili.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Speech recognition of under-resourced languages using mismatched transcriptions.

[DOI]

Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations.

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning.

[DOI]

Haizhou Li

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

Regulating Orthography-Phonology Relationship for English to Thai Transliteration.

[DOI]

Binh Minh Nguyen

Hoang Gia Ngo

Proceedings of the Sixth Named Entity Workshop, 2016

2015

Corpus-based pronunciation variation rule analysis for Singapore English.

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Goodness of tone (GOT) for non-native Mandarin tone recognition.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Phonology-augmented statistical transliteration for low-resource languages.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Tokenizing fundamental frequency variation for Mandarin tone error detection.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Unsupervised data selection and word-morph mixed language model for Tamil low-resource keyword search.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A keyword-aware grammar framework for LVCSR-based spoken keyword search.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Low-resource keyword search strategies for Tamil.

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2014

System and keyword dependent fusion for spoken term detection.

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Multiple time-span feature fusion for deep neural network modeling.

[DOI]

Chongjia Ni

Bin Ma

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search.

[DOI]

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A minimal-resource transliteration framework for Vietnamese.

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A whispered Mandarin corpus for speech technology applications.

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling.

[DOI]

I-Fan Chen

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Subspace Gaussian mixture model for computer-assisted language learning.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Discriminative score normalization for keyword search decision.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Strategies for Vietnamese keyword search.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Large-scale characterization of Mandarin pronunciation errors made by native speakers of European languages.

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Minimal-resource phonetic language models to summarize untranscribed speech.

[DOI]

Bin Ma

Haizhou Li

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Analyzing and Interpreting Automatically Learned Rules Across Dialects.

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

Characterizing Deletion Transformations Across Dialects Using a Sophisticated Tying Mechanism.

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Informative dialect recognition using context-dependent pronunciation modeling.

[DOI]

Pedro A. Torres-Carrasquillo

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Large-scale analysis of formant frequency estimation variability in conversational telephone speech.

[DOI]

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Dialect recognition using adapted phonetic models.

[DOI]