2025
Confidence-Based Self-Training for EMG-to-Speech: Leveraging Synthetic EMG for Robust Modeling.
CoRR, June, 2025
A correlation-permutation approach for speech-music encoders model merging.
CoRR, June, 2025
COGENT: A Curriculum-oriented Framework for Generating Grade-appropriate Educational Content.
CoRR, June, 2025
What Makes a Good Natural Language Prompt?
CoRR, June, 2025
Beyond Classification: Towards Speech Emotion Reasoning with Multitask AudioLLMs.
CoRR, June, 2025
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions.
CoRR, June, 2025
SingaKids: A Multilingual Multimodal Dialogic Tutor for Language Learning.
CoRR, June, 2025
Beyond In-Context Learning: Aligning Long-form Generation of Large Language Models via Task-Inherent Attribute Guidelines.
CoRR, June, 2025
Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems.
CoRR, May, 2025
Distilling a speech and music encoder with task arithmetic.
CoRR, May, 2025
Integrating Video and Text: A Balanced Approach to Multimodal Summary Generation and Evaluation.
CoRR, May, 2025
Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts.
CoRR, April, 2025
Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models.
CoRR, January, 2025
PRESENT: Zero-Shot Text-to-Prosody Control.
IEEE Signal Process. Lett., 2025
Transformer-based document-level discourse processing: Exploiting prior language knowledge and hierarchical parsing.
Comput. Speech Lang., 2025
AudioBench: A Universal Benchmark for Audio Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
Scaling Up Collaborative Dialogue Analysis: An AI-driven Approach to Understanding Dialogue Patterns in Computational Thinking Education.
Proceedings of the 15th International Learning Analytics and Knowledge Conference, 2025
Preference Optimization for Reasoning with Pseudo Feedback.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
DnA-Eval: Enhancing Large Language Model Evaluation through Decomposition and Aggregation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
Aligning Large Language Models with Human Opinions through Persona Selection and Value-Belief-Norm Reasoning.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs.
CoRR, 2024
MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond.
CoRR, 2024
MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models.
CoRR, 2024
Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework.
CoRR, 2024
CRAFT: Extracting and Tuning Cultural Instructions from the Wild.
CoRR, 2024
CrossIn: An Efficient Instruction Tuning Approach for Cross-Lingual Knowledge Alignment.
CoRR, 2024
Scaffolding Language Learning via Multi-modal Tutoring Systems with Pedagogical Instructions.
CoRR, 2024
SNIPER Training: Single-Shot Sparse Training for Text-to-Speech.
Proceedings of the IEEE Region 10 Conference, 2024
Semi-Supervised Learning for Robust Speech Evaluation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Optimizing Code-Switching in Conversational Tutoring Systems: A Pedagogical Framework and Evaluation.
Proceedings of the 25th Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2024
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Exploring Self-supervised Logic-enhanced Training for Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Dataset-Distillation Generative Model for Speech Emotion Recognition.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Distilling Distributional Uncertainty from a Gaussian Process.
Proceedings of the IEEE International Conference on Acoustics, 2024
Noise Robust Distillation of Self-Supervised Speech Models via Correlation Metrics.
Proceedings of the IEEE International Conference on Acoustics, 2024
Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2024, 2024
Resilience of Large Language Models for Noisy Instructions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
LOCOST: State-Space Models for Long Document Abstractive Summarization.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
Granular Change Accuracy: A More Accurate Performance Metric for Dialogue State Tracking.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
On Context Utilization in Summarization with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models' Understanding of Discourse Relations.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Prompt Optimization via Adversarial In-Context Learning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Context Aggregation with Topic-focused Summarization for Personalized Medical Dialogue Generation.
Proceedings of the 6th Clinical Natural Language Processing Workshop, 2024
2023
Data Science Education: The Signal Processing Perspective [SP Education].
IEEE Signal Process. Mag., November, 2023
Modelling Inter-Rater Uncertainty in Spoken Language Assessment.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Prompt Optimization via Adversarial In-Context Learning.
CoRR, 2023
ChOiRe: Characterizing and Predicting Human Opinions with Chain of Opinion Reasoning.
CoRR, 2023
VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit Consistency.
CoRR, 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input.
CoRR, 2023
On Position Bias in Summarization with Large Language Models.
CoRR, 2023
Controllable Multi-document Summarization: Coverage & Coherence Intuitive Policy with Large Language Model Based Rewards.
CoRR, 2023
LLM Based Multi-Document Summarization Exploiting Main-Event Biased Monotone Submodular Content Extraction.
CoRR, 2023
PromptSum: Parameter-Efficient Controllable Abstractive Summarization.
CoRR, 2023
Multiple output samples for each input in a single-output Gaussian process.
CoRR, 2023
LogicLLM: Exploring Self-supervised Logic-enhanced Training for Large Language Models.
CoRR, 2023
Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models.
CoRR, 2023
Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation.
CoRR, 2023
Inclusive AI for Language Learning.
Proceedings of the 9th Workshop on Speech and Language Technology in Education, 2023
C3: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues.
Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, 2023
Distilling knowledge from Gaussian process teacher to neural network student.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Picking the Underused Heads: A Network Pruning Perspective of Attention Head Selection for Fusing Dialogue Coreference Information.
Proceedings of the IEEE International Conference on Acoustics, 2023
Instructive Dialogue Summarization with Query Aggregations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023
Multi-label and Multi-target Sampling of Machine Annotation for Computational Stance Detection.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
CoAnnotating: Uncertainty-Guided Work Allocation between Human and Large Language Models for Data Annotation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Adapter-TST: A Parameter Efficient Method for Multiple-Attribute Text Style Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Variational Gaussian Process Data Uncertainty.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
Unsupervised Summarization Re-ranking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Guiding Computational Stance Detection with Expanded Stance Triangle Framework.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
A transformer-based neural language model that synthesizes brain activation maps from free-form text queries.
Medical Image Anal., 2022
SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech.
CoRR, 2022
Are Current Task-oriented Dialogue Systems Able to Satisfy Impolite Users?
CoRR, 2022
Domain-specific Language Pre-training for Dialogue Comprehension on Clinical Inquiry-Answering Conversations.
CoRR, 2022
Large-Scale Acoustic Characterization of Singaporean Children's English Pronunciation.
CoRR, 2022
Entity-based De-noising Modeling for Controllable Dialogue Summarization.
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022
Learning from Bootstrapping and Stepwise Reinforcement Reward: A Semi-Supervised Framework for Text Style Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022
Multimodal Dialogue State Tracking.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Variations of multi-task learning for spoken language assessment.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Dynamic Sliding Window Modeling for Abstractive Meeting Summarization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Incremental Context Aware Attentive Knowledge Tracing.
Proceedings of the IEEE International Conference on Acoustics, 2022
Progressive Continual Learning for Spoken Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2022
Towards Summary Candidates Fusion.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Singlish Message Paraphrasing: A Joint Task of Creole Translation and Text Normalization.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
CoHS-CQG: Context and History Selection for Conversational Question Generation.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
2021
Domain-Shift Conditioning Using Adaptable Filtering Via Hierarchical Embeddings for Robust Chinese Spell Check.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Improving Multi-Party Dialogue Discourse Parsing via Domain Integration.
CoRR, 2021
DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing.
CoRR, 2021
Dynamic Sliding Window for Meeting Summarization.
CoRR, 2021
C3: Compositional Counterfactual Contrastive Learning for Video-grounded Dialogues.
CoRR, 2021
VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks.
CoRR, 2021
A Simple But Effective Approach to n-shot Task-Oriented Dialogue Augmentation.
CoRR, 2021
Coreference-Aware Dialogue Summarization.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021
Velocidapter: Task-oriented Dialogue Comprehension Modeling Pairing Synthetic Text Generation with Domain Adaptation.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021
Coherent and Concise Radiology Report Generation via Context Specific Image Representations and Orthogonal Sentence States.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Papers, 2021
Text2Brain: Synthesis of Brain Activation Maps from Free-Form Text Query.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2021 - 24th International Conference, Strasbourg, France, September 27, 2021
Multilingual Speech Evaluation: Case Studies on English, Malay and Tamil.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
WittyKiddy: Multilingual Spoken Language Learning for Kids.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues.
Proceedings of the 9th International Conference on Learning Representations, 2021
Senone-Aware Adversarial Multi-Task Training for Unsupervised Child to Adult Speech Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Controllable Neural Dialogue Summarization with Personal Named Entity Planning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Analyzing Code Embeddings for Coding Clinical Narratives.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Have We Solved The Hard Problem? It's Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Explanations in Predictive Analytics: Case Studies.
Proceedings of the Knowledge Graphs for eXplainable Artificial Intelligence: Foundations, 2020
Hierarchical Character Embeddings: Learning Phonological and Semantic Representations in Languages of Logographic Origin Using Recursive Neural Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Hierarchical multimodal attention for end-to-end audio-visual scene-aware dialogue response generation.
Comput. Speech Lang., 2020
An End-to-End Document-Level Neural Discourse Parser Exploiting Multi-Granularity Representations.
CoRR, 2020
Adaptable Filtering using Hierarchical Embeddings for Chinese Spell Check.
CoRR, 2020
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge.
CoRR, 2020
Computer-Assisted Language Learning System: Automatic Speech Evaluation for Children Learning Malay and Tamil.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Characterization of Singaporean Children's English: Comparisons to American and British Counterparts Using Archetypal Analysis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Unsupervised Feature Adaptation Using Adversarial Multi-Task Training for Automatic Evaluation of Children's Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Conditional Neural Generation using Sub-Aspect Functions for Extractive News Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
UniConv: A Unified Conversational Neural Architecture for Multi-domain Task-oriented Dialogues.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Multilingual Neural RST Discourse Parsing.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Uncertainty Modeling for Machine Comprehension Systems using Efficient Bayesian Neural Networks.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Phonology-Augmented Statistical Framework for Machine Transliteration Using Limited Linguistic Resources.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Reranking of Responses Using Transfer Learning for a Retrieval-Based Chatbot.
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019
Set to Ordered Text: Generating Discharge Instructions from Medical Billing Codes.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Joint Learning of Word and Label Embeddings for Sequence Labelling in Spoken Language Understanding.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Topic-Aware Pointer-Generator Networks for Summarizing Spoken Conversations.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Reading Turn by Turn: Hierarchical Attention Architecture for Spoken Dialogue Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
Isolating the Effects of Modeling Recursive Structures: A Case Study in Pronunciation Prediction of Chinese Characters.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019
Acoustic Characterization of Singaporean Children's English: Comparisons to American and British Counterparts.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019
2018
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks.
J. Signal Process. Syst., 2018
Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Re-ranking spoken term detection with acoustic exemplars of keywords.
Speech Commun., 2018
Topic and Keyword Identification for Low-resourced Speech Using Cross-Language Transfer Learning.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Improving Mandarin Tone Mispronunciation Detection for Non-Native Learners with Soft-Target Tone Labels and BLSTM-Based Deep Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Recognizing Zero-Resourced Languages Based on Mismatched Machine Transcriptions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Multimodal neural pronunciation modeling for spoken languages with logographic origin.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Attention-based Semantic Priming for Slot-filling.
Proceedings of the Seventh Named Entities Workshop, 2018
Statistical Machine Transliteration Baselines for NEWS 2018.
Proceedings of the Seventh Named Entities Workshop, 2018
Proceedings of the Seventh Named Entities Workshop, 2018
Report of NEWS 2018 Named Entity Transliteration Shared Task.
Proceedings of the Seventh Named Entities Workshop, 2018
2017
ASR for Under-Resourced Languages From Probabilistic Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Introduction to the Special Issue on End-to-End Speech and Language Processing.
IEEE J. Sel. Top. Signal Process., 2017
Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text.
CoRR, 2017
Pruning Strategies for Partial Search in Spoken Term Detection.
Proceedings of the Eighth International Symposium on Information and Communication Technology, 2017
Multi-Task Learning for Mispronunciation Detection on Singapore Children's Mandarin Speech.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Multi-Task Learning Using Mismatched Transcription for Under-Resourced Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Mismatched Crowdsourcing from Multiple Annotator Languages for Recognizing Zero-Resourced Languages: A Nullspace Clustering Approach.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Low-resource spoken keyword search strategies in Georgian inspired by distinctive feature theory.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
A Keyword-Aware Language Modeling Approach to Spoken Keyword Search.
J. Signal Process. Syst., 2016
Large-scale characterization of non-native Mandarin Chinese spoken by speakers of European origin: Analysis on iCALL.
Speech Commun., 2016
Clustering-based Phonetic Projection in Mismatched Crowdsourcing Channels for Low-resourced ASR.
Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing, 2016
Mismatched Crowdsourcing based Language Perception for Under-resourced Languages.
Proceedings of the SLTU-2016, 2016
A many-to-one phone mapping approach for cross-lingual speech recognition.
Proceedings of the 2016 IEEE RIVF International Conference on Computing & Communication Technologies, 2016
Context Aware Mispronunciation Detection for Mandarin Pronunciation Training.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Perception of Tone in Whispered Mandarin Sentences: The Case for Singapore Mandarin.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
SingaKids-Mandarin: Speech Corpus of Singaporean Children Speaking Mandarin Chinese.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Keyword search using query expansion for graph-based rescoring of hypothesized detections.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Cross-lingual deep neural network based submodular unbiased data selection for low-resource keyword search.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Improving non-native mispronunciation detection and enriching diagnostic feedback with DNN-based speech attribute modeling.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Personalized mispronunciation detection and diagnosis based on unsupervised error pattern discovery.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Speech recognition of under-resourced languages using mismatched transcriptions.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016
Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Regulating Orthography-Phonology Relationship for English to Thai Transliteration.
Proceedings of the Sixth Named Entity Workshop, 2016
2015
Corpus-based pronunciation variation rule analysis for Singapore English.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015
Goodness of tone (GOT) for non-native Mandarin tone recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Phonology-augmented statistical transliteration for low-resource languages.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
iCALL corpus: Mandarin Chinese spoken by non-native speakers of European descent.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Tokenizing fundamental frequency variation for Mandarin tone error detection.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Unsupervised data selection and word-morph mixed language model for Tamil low-resource keyword search.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
A keyword-aware grammar framework for LVCSR-based spoken keyword search.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Low-resource keyword search strategies for Tamil.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
System and keyword dependent fusion for spoken term detection.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Multiple time-span feature fusion for deep neural network modeling.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
A novel keyword+LVCSR-filler based grammar network representation for spoken keyword search.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
A minimal-resource transliteration framework for Vietnamese.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A whispered Mandarin corpus for speech technology applications.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
A keyword-boosted sMBR criterion to enhance keyword search performance in deep neural network based acoustic modeling.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Subspace Gaussian mixture model for computer-assisted language learning.
Proceedings of the IEEE International Conference on Acoustics, 2014
Discriminative score normalization for keyword search decision.
Proceedings of the IEEE International Conference on Acoustics, 2014
Strategies for Vietnamese keyword search.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Large-scale characterization of Mandarin pronunciation errors made by native speakers of European languages.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Minimal-resource phonetic language models to summarize untranscribed speech.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Analyzing and Interpreting Automatically Learned Rules Across Dialects.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
Characterizing Deletion Transformations Across Dialects Using a Sophisticated Tying Mechanism.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Informative dialect recognition using context-dependent pronunciation modeling.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
A linguistically-informative approach to dialect recognition using dialect-discriminating context-dependent phonetic models.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Large-scale analysis of formant frequency estimation variability in conversational telephone speech.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
Dialect recognition using adapted phonetic models.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008