Chng Eng Siong
ORCID: 0000-0001-6257-7399
Affiliations:
- Nanyang Technological University, Singapore
According to our database, Chng Eng Siong authored at least 355 papers between 1994 and 2024.
Bibliography
2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs.
CoRR, 2024
LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation.
CoRR, 2024
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition.
CoRR, 2024
Continual Learning Optimizations for Auto-regressive Decoder of Multilingual ASR systems.
CoRR, 2024
CoRR, 2024
Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection.
CoRR, 2024
CoRR, 2024
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models.
CoRR, 2024
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model.
CoRR, 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators.
CoRR, 2024
Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Cross-Modality and Within-Modality Regularization for Audio-Visual Deepfake Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Low Resource Language Adaptation using Two-stage Regularization for Multilingual ASR.
Proceedings of the International Conference on Asian Language Processing, 2024
Proceedings of the International Conference on Asian Language Processing, 2024
2023
Generative error correction for code-switching speech recognition using large language models.
CoRR, 2023
CoRR, 2023
deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition.
CoRR, 2023
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Dual-Memory Multi-Modal Learning for Continual Spoken Keyword Spotting with Confidence Selection and Diversity Enhancement.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Dual Acoustic Linguistic Self-supervised Representation Learning for Cross-Domain Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Small Footprint Multi-channel Network for Keyword Spotting with Centroid Based Awareness.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Blind Estimation of Room Impulse Response from Monaural Reverberant Speech with Segmental Generative Neural Network.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Self-supervised Learning Representation based Accent Recognition with Persistent Accent Memory.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Local and Global Context Modeling with Relation Matching Task for Dialog Act Recognition.
Proceedings of the International Joint Conference on Neural Networks, 2023
Proceedings of the International Joint Conference on Neural Networks, 2023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Speech-Text Based Multi-Modal Training with Bidirectional Attention for Improved Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
deHuBERT: Disentangling Noise in a Self-Supervised Model for Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Singaporean Conversational English-Malay Code-Switching Speech: An Analysis Based on Code-switching Points and Part-of-Speech.
Proceedings of the International Conference on Asian Language Processing, 2023
CASSI: Contextual and Semantic Structure-based Interpolation Augmentation for Low-Resource NER.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
ASR Model Adaptation for Rare Words Using Synthetic Data Generated by Multiple Text-To-Speech Systems.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Study of Generative Adversarial Networks for Noisy Speech Simulation from Clean Speech.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Intelligent Information and Database Systems - 15th Asian Conference, 2023
An Empirical Study on Punctuation Restoration for English, Mandarin, and Code-Switching Speech.
Proceedings of the Intelligent Information and Database Systems - 15th Asian Conference, 2023
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition.
Speech Commun., 2022
Efficient Self-Supervised Learning Representations for Spoken Language Identification.
IEEE J. Sel. Top. Signal Process., 2022
Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin.
CoRR, 2022
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization.
CoRR, 2022
Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder.
CoRR, 2022
Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition.
CoRR, 2022
CoRR, 2022
CoRR, 2022
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness.
CoRR, 2022
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning.
CoRR, 2022
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Minimum Word Error Training For Non-Autoregressive Transformer-Based Code-Switching ASR.
Proceedings of the IEEE International Conference on Acoustics, 2022
Convmixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-Field Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2022
Automated Audio Captioning Using Transfer Learning and Reconstruction Latent Space Similarity Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
2021
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling.
CoRR, 2021
End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN.
CoRR, 2021
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training.
Proceedings of the IEEE International Conference on Acoustics, 2021
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Multitask-based joint learning approach to robust ASR for radio communication speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
End-to-End Speaker Age and Height Estimation using Attention Mechanism and Triplet Loss.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance.
CoRR, 2020
A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework.
CoRR, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
2019
CoRR, 2019
CoRR, 2019
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
QASA: Advanced Document Retriever for Open-Domain Question Answering by Learning to Rank Question-Aware Self-Attentive Document Representations.
Proceedings of the 3rd International Conference on Machine Learning and Soft Computing, 2019
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the Blizzard Challenge 2019, Vienna, Austria, September 23, 2019, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Improving code-switching speech recognition with data augmentation and system combination.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Speech Commun., 2018
Signal Process., 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Single Channel Speech Separation with Constrained Utterance Level Permutation Invariant Training Using Grid LSTM.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Unsupervised Domain Adaptation via Domain Adversarial Training for Speaker Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
An Investigation of Word Embeddings with Deep Bidirectional LSTM for Sentence Unit Detection in Automatic Speech Transcription.
Proceedings of the 2018 International Conference on Asian Language Processing, 2018
Proceedings of the 2018 International Conference on Asian Language Processing, 2018
Proceedings of the Seventh Named Entities Workshop, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
A hybrid neural network hidden Markov model approach for automatic story segmentation.
J. Ambient Intell. Humaniz. Comput., 2017
Proceedings of the Eighth International Symposium on Information and Communication Technology, 2017
Weighted Spatial Covariance Matrix Estimation for MUSIC Based TDOA Estimation of Speech Source.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Towards Age-friendly E-commerce Through Crowd-Improved Speech Recognition, Multimodal Search, and Personalized Speech Feedback.
Proceedings of the 2nd International Conference on Crowd Science and Engineering, 2017
On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 International Conference on Asian Language Processing, 2017
Proceedings of the 2017 International Conference on Asian Language Processing, 2017
Improving air traffic control speech intelligibility by reducing speaking rate effectively.
Proceedings of the 2017 International Conference on Asian Language Processing, 2017
Proceedings of the Human Aspects of IT for the Aged Population. Applications, Services and Contexts, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Low-resource spoken keyword search strategies in Georgian inspired by distinctive feature theory.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
Proceedings of the Intelligent Information and Database Systems - 9th Asian Conference, 2017
2016
Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization.
J. Signal Process. Syst., 2016
Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Multim. Tools Appl., 2016
Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation.
EURASIP J. Adv. Signal Process., 2016
Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And Channel Weighting.
CoRR, 2016
Spoofing detection under noisy conditions: a preliminary investigation and an initial database.
CoRR, 2016
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Semi-Supervised and Cross-Lingual Knowledge Transfer Learnings for DNN Hybrid Acoustic Models Under Low-Resource Conditions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Approximate search of audio queries by using DTW with phone time boundary and data augmentation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
An expectation-maximization eigenvector clustering approach to direction of arrival estimation of multiple speech sources.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Combining non-negative matrix factorization and deep neural networks for speech enhancement and automatic speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Keyword search using query expansion for graph-based rescoring of hypothesized detections.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Content-aware local variability vector for speaker verification with short utterance.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
I-vector based deep neural network acoustic model adaptation using multilingual language resource.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Beamforming networks using spatial covariance features for far-field speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Zero resource anti-spoofing detection for unit selection based synthetic speech using image spectrogram artifacts.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
Proceedings of the Intelligent Information and Database Systems - 8th Asian Conference, 2016
2015
Decoupling Word-Pair Distance and Co-occurrence Information for Effective Long History Context Language Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Multim. Tools Appl., 2015
Lang. Resour. Evaluation, 2015
Int. J. Asian Lang. Process., 2015
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
Regularized non-negative matrix factorization using alternating direction method of multipliers and its application to source separation.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Language independent query-by-example spoken term detection using N-best phone sequences and partial matching.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
A learning-based approach to direction of arrival estimation in noisy and reverberant environments.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Language-resource independent speech segmentation using cues from a spectrogram image.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Modelling Public Sentiment in Twitter: Using Linguistic Patterns to Enhance Supervised Learning.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2015
On statistical machine translation method for lexicon refinement in speech recognition.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Speech enhancement using beamforming and non-negative matrix factorization for robust speech recognition in the CHiME-3 challenge.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Non-negative matrix factorization using stable alternating direction method of multipliers for source separation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Distance metric learning for kernel density-based acoustic model under limited training data conditions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
2014
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2014
Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages.
IEICE Trans. Inf. Syst., 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Kernel density-based acoustic model with cross-lingual bottleneck features for resource limited LVCSR.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Feature compensation using linear combination of speaker and environment dependent correction vectors.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Generalization of temporal filter and linear transformation for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Improving language modeling by using distance and co-occurrence information of word-pairs and its application to LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2014
A Bayesian performance bound for time-delay of arrival based acoustic source tracking in a reverberant environment.
Proceedings of the 17th International Conference on Information Fusion, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
Multi-view features in a DNN-CRF model for improved sentence unit detection on English broadcast news.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014
2013
Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification.
IEEE Trans. Speech Audio Process., 2013
Softw. Pract. Exp., 2013
Overlapping sound event recognition using local spectrogram features and the generalised Hough transform.
Pattern Recognit. Lett., 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Attribute-based histogram equalization (HEQ) and its adaptation for robust speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Temporal filter design by minimum KL divergence criterion for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
Language diarization for conversational code-switch speech with pronunciation dictionary adaptation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
Modeling of term-distance and term-occurrence information for improving n-gram language model performance.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013
2012
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion.
IEEE Signal Process. Lett., 2012
Discriminative feature extraction for speech recognition using continuous output codes.
Pattern Recognit. Lett., 2012
Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features.
IEICE Trans. Inf. Syst., 2012
Integration of language identification into a recognition system for spoken conversations containing code-switches.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012
An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Overlapping Sound Event Recognition using Local Spectrogram Features with the Generalised Hough Transform.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Joint spectral and temporal normalization of features for robust recognition of noisy and reverberated speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
A first speech recognition system for Mandarin-English code-switch conversational speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the 2012 International Conference on Asian Language Processing, 2012
Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation HyTra@EACL 2012, 2012
A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
2011
IEICE Trans. Inf. Syst., 2011
Feature Normalization Using Structured Full Transforms for Robust Speech Recognition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Maximum likelihood adaptation of histogram equalization with constraint for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition.
IEEE Trans. Speech Audio Process., 2010
Pattern Recognit. Lett., 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 20th International Conference on Pattern Recognition, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010
2009
IEEE Trans. Speech Audio Process., 2009
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Efficient sparse self-similarity matrix construction for repeating sequence detection.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Cluster criterion functions in spectral subspace and their application in speaker clustering.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009
2008
IEEE Trans. Speech Audio Process., 2008
Efficient mobile phone Chinese optical character recognition systems by use of heuristic fuzzy rules and bigram Markov language models.
Appl. Soft Comput., 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008
Target-oriented phone selection from universal phone set for spoken language recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008
Proceedings of the IEEE International Conference on Acoustics, 2008
Proceedings of the International Conference on Computer Science and Software Engineering, 2008
2007
IEEE Trans. Multim., 2007
IEEE Signal Process. Lett., 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Using direction of arrival estimate and acoustic feature information in speaker diarization.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 13th International Conference on Parallel and Distributed Systems, 2007
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Speaker Diarization Using Direction of Arrival Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007 Evaluation.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007
2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006
Temporal Discrete Cosine Transform: Towards Longer Term Temporal Features for Speaker Verification.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Integrating Acoustic, Prosodic and Phonotactic Features for Spoken Language Identification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Int. J. Autom. Comput., 2005
Proceedings of the 13th ACM International Conference on Multimedia, 2005
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
2004
Proceedings of the 12th ACM International Conference on Multimedia, 2004
Proceedings of the 17th International Conference on Pattern Recognition, 2004
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
Event detection based on non-broadcast sports video.
Proceedings of the 2004 International Conference on Image Processing, 2004
Concurrent constant modulus algorithm and soft decision directed scheme for fractionally-spaced blind equalization.
Proceedings of IEEE International Conference on Communications, 2004
2000
Proceedings of the 2000 International Symposium on Chinese Spoken Language Processing, 2000
1996
Gradient radial basis function networks for nonlinear and nonstationary time series prediction.
IEEE Trans. Neural Networks, 1996
Orthogonal least-squares learning algorithm with local adaptation process for the radial basis function networks.
IEEE Signal Process. Lett., 1996
Proceedings of International Conference on Neural Networks (ICNN'96), 1996
1995
IEEE Trans. Signal Process., 1995
1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994