2025
Adversarially adaptive temperatures for decoupled knowledge distillation with applications to speaker verification.
Neurocomputing, 2025
Wi-Fi CSI fingerprinting-based indoor positioning using deep learning and vector embedding for temporal stability.
Expert Syst. Appl., 2025
2024
Automatic selection of spoken language biomarkers for dementia detection.
Neural Networks, January, 2024
Contrastive Self-Supervised Speaker Embedding With Sequential Disentanglement.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
DITA: DETR with improved queries for end-to-end temporal action detection.
Neurocomputing, 2024
VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis.
CoRR, 2024
Action Progression Networks for Temporal Action Detection in Videos.
IEEE Access, 2024
On the Effectiveness of Enrollment Speech Augmentation For Target Speaker Extraction.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024
Naturalistic Language-Related Movie-Watching fMRI Task for Detecting Neurocognitive Decline and Disorder.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024
Promoting Independence of Depression and Speaker Features for Speaker Disentanglement in Speech-Based Depression Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024
Contrastive Speaker Embedding With Sequential Disentanglement.
Proceedings of the IEEE International Conference on Acoustics, 2024
Dual Parameter-Efficient Fine-Tuning for Speaker Representation Via Speaker Prompt Tuning and Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2024
Asymmetric Clean Segments-Guided Self-Supervised Learning for Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024
Joseph: phonetic-aware speaker embedding for far-field speaker verification.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2024
2023
Avoiding dominance of speaker features in speech-based depression detection.
Pattern Recognit. Lett., September, 2023
Model-Agnostic Meta-Learning for Fast Text-Dependent Speaker Embedding Adaptation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Robust Speaker Verification Using Deep Weight Space Ensemble.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Cluster-Guided Unsupervised Domain Adaptation for Deep Speaker Embedding.
IEEE Signal Process. Lett., 2023
Phonetic-aware speaker embedding for far-field speaker verification.
CoRR, 2023
Progression-Guided Temporal Action Detection in Videos.
CoRR, 2023
Deep Segment-Attentive Network for Altered-Engine Recognition.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023
Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations.
Proceedings of the International Conference on Machine Learning, 2023
Discriminative Speaker Representation Via Contrastive Learning with Class-Aware Attention in Angular Space.
Proceedings of the IEEE International Conference on Acoustics, 2023
Feature Selection and Text Embedding for Detecting Dementia from Spontaneous Cantonese.
Proceedings of the IEEE International Conference on Acoustics, 2023
Cross-Domain adaptation in Distance Space for Speaker Verification.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Modeling Suprasegmental Information Using Finite Difference Network for End-to-End Speaker Verification.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Jointly Modelling Transcriptions and Phonemes with Optimal Features to Detect Dementia from Spontaneous Cantonese.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
Improving Speech Emotion Recognition With Adversarial Data Augmentation Network.
IEEE Trans. Neural Networks Learn. Syst., 2022
Contrastive Adversarial Domain Adaptation Networks for Speaker Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022
Aggregating Frame-Level Information in the Spectral Domain With Self-Attention for Speaker Embedding.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Mixture Representation Learning for Deep Speaker Embedding.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Inter-patient ECG classification with i-vector based unsupervised patient adaptation.
Expert Syst. Appl., 2022
Speaker Representation Learning via Contrastive Loss with Maximal Speaker Separability.
CoRR, 2022
A Survey on Text-Dependent and Text-Independent Speaker Verification.
IEEE Access, 2022
Automatic Selection of Discriminative Features for Dementia Detection in Cantonese-Speaking People.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
UNet-DenseNet for Robust Far-Field Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Disentangled Speaker Embedding for Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022
Robust Speaker Verification Using Population-Based Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Channel Interdependence Enhanced Speaker Embeddings for Far-Field Speaker Verification.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Age-Invariant Speaker Embedding for Diarization of Cognitive Assessments.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Mutual Information Enhanced Training for Speaker Embedding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Short-Time Spectral Aggregation for Speaker Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2021
A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer's Disease Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021
Speaker Turn Aware Similarity Scoring for Diarization of Speech-Based Cognitive Assessments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Dual Dropout Ranking of Linguistic Features for Alzheimer's Disease Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Guest Editorial: Modern Speech Processing and Learning.
J. Signal Process. Syst., 2020
I-Vector-Based Patient Adaptation of Deep Neural Networks for Automatic Heartbeat Classification.
IEEE J. Biomed. Health Informatics, 2020
Variational Domain Adversarial Learning With Mutual Information Maximization for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
A Framework for Adapting DNN Speaker Embedding Across Languages.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Learning Mixture Representation for Deep Speaker Embedding Using Attention.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Adversarial Separation and Adaptation Network for Far-Field Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Strategies for End-to-End Text-Independent Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Wav2Spk: A Simple DNN Architecture for Learning Speaker Embeddings from Waveforms.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Information Maximized Variational Domain Adversarial Learning for Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Multi-Level Deep Neural Network Adaptation for Speaker Verification Using MMD and Consistency Regularization.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Gaussian Models for CSI Fingerprinting in Practical Indoor Environment Identification.
Proceedings of the IEEE Global Communications Conference, 2020
2019
Towards End-to-End ECG Classification With Raw Signal Extraction and Deep Neural Networks.
IEEE J. Biomed. Health Informatics, 2019
Fingerprint Quality Classification for CSI-based Indoor Positioning Systems.
Proceedings of the ACM MobiHoc Workshop on Pervasive Systems in the IoT Era, 2019
Variational Domain Adversarial Learning for Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Semi-supervised Nuisance-attribute Networks for Domain Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2019
Adversarial Data Augmentation Network for Speech Emotion Recognition.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Guest Editorial: Advances in Deep Learning for Speech Processing.
J. Signal Process. Syst., 2018
Denoised Senone I-Vectors for Robust Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
DNN-Based Score Calibration With Multitask Learning for Noise Robust Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
Multisource I-Vectors Domain Adaptation Using Maximum Mean Discrepancy Based Autoencoders.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
SNR-Invariant Multitask Deep Neural Networks for Robust Speaker Verification.
IEEE Signal Process. Lett., 2018
Special issue on semantic data analytics and bioinformatics.
Int. J. Mach. Learn. Cybern., 2018
Predicting subcellular localization of multi-location proteins by improving support vector machines with an adaptive-decision scheme.
Int. J. Mach. Learn. Cybern., 2018
The Application of Machine Learning Techniques on Channel Frequency Response Based Indoor Positioning in Dynamic Environments.
Proceedings of the 2018 IEEE International Conference on Sensing, 2018
Reducing Domain Mismatch by Maximum Mean Discrepancy Based Autoencoders.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Unsupervised Domain Adaptation for Gender-Aware PLDA Mixture Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Patient-Specific Heartbeat Classification Based on I-Vector Adapted Deep Neural Networks.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018
2017
Transductive Learning for Multi-Label Protein Subchloroplast Localization Prediction.
IEEE ACM Trans. Comput. Biol. Bioinform., 2017
DNN-Driven Mixture of PLDA for Robust Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Fast scoring for PLDA with uncertainty propagation via i-vector grouping.
Comput. Speech Lang., 2017
Discriminative subspace modeling of SNR and duration variabilities for robust speaker verification.
Comput. Speech Lang., 2017
Protecting Genomic Privacy by a Sequence-Similarity Based Obfuscation Method.
CoRR, 2017
FUEL-mLoc: feature-unified prediction and explanation of multi-localization of cellular proteins in multiple organisms.
Bioinform., 2017
i-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Deep neural networks versus support vector machines for ECG arrhythmia classification.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017
2016
Guest Editorial: Advances in Machine Learning for Speech Processing.
J. Signal Process. Syst., 2016
Mem-mEN: Predicting Multi-Functional Types of Membrane Proteins by Interpretable Elastic Nets.
IEEE ACM Trans. Comput. Biol. Bioinform., 2016
Mixture of PLDA for Noise Robust I-Vector Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Robust scream sound detection via sound event partitioning.
Multim. Tools Appl., 2016
Sparse kernel machines with empirical kernel maps for PLDA speaker verification.
Comput. Speech Lang., 2016
Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins.
BMC Bioinform., 2016
Deep neural network driven mixture of PLDA for robust i-vector speaker verification.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Fast Scoring for PLDA with Uncertainty Propagation.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Senone I-vectors for robust speaker verification.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
SNR-invariant PLDA with multiple speaker subspaces.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
SNR-Invariant PLDA Modeling in Nonparametric Subspace for Robust Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Noise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA.
Int. J. Speech Technol., 2015
SNR-invariant PLDA modeling for robust speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Normalization of total variability matrix for i-vector/PLDA speaker verification.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Bottleneck features from SNR-adaptive denoising deep classifier for speaker identification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
Fast scoring for mixture of PLDA in i-vector/PLDA speaker verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015
2014
A study of voice activity detection techniques for NIST speaker recognition evaluations.
Comput. Speech Lang., 2014
Relevance vector machines with empirical likelihood-ratio kernels for PLDA speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Fusion of SNR-dependent PLDA models for noise robust speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
PLDA modeling in the fishervoice subspace for speaker verification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
SNR-dependent mixture of PLDA for noise robust speaker verification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Sound-event partitioning and feature normalization for robust sound-event detection.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014
Ensemble random projection for multi-label classification with application to protein subcellular localization.
Proceedings of the IEEE International Conference on Acoustics, 2014
Construction of discriminative Kernels from known and unknown non-targets for PLDA-SVM scoring.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning.
IEEE Trans. Speech Audio Process., 2013
Adaptive thresholding for multi-label SVM classification with application to protein subcellular localization prediction.
Proceedings of the IEEE International Conference on Acoustics, 2013
Likelihood-ratio empirical kernels for i-vector based PLDA-SVM scoring.
Proceedings of the IEEE International Conference on Acoustics, 2013
An ensemble classifier with random projection for predicting multi-label protein subcellular localization.
Proceedings of the 2013 IEEE International Conference on Bioinformatics and Biomedicine, 2013
2012
mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines.
BMC Bioinform., 2012
Utterance partitioning with acoustic vector resampling for i-vector based speaker verification.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Alleviating the small sample-size problem in i-vector based speaker verification.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
GOASVM: Protein subcellular localization prediction based on Gene ontology annotation and SVM.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Low-power SVM classifiers for sound event classification on mobile devices.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
PDA-SVM Hybrid: A Unified Model for Kernel-Based Supervised Classification.
J. Signal Process. Syst., 2011
Optimized Discriminative Kernel for SVM Scoring and Its Application to Speaker Verification.
IEEE Trans. Neural Networks, 2011
Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification.
Speech Commun., 2011
Protein subcellular localization prediction based on profile alignment and Gene Ontology.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011
Comparison of Voice Activity Detectors for Interview Speech in NIST Speaker Recognition Evaluation.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Addressing the Data-Imbalance Problem in Kernel-Based Speaker Verification via Utterance Partitioning and Speaker Comparison.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
The HKCUPU system for the NIST 2010 speaker recognition evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2011
2010
Feature Selection for Genomic Signal Processing: Unsupervised, Supervised, and Self-Supervised Scenarios.
J. Signal Process. Syst., 2010
Acoustic vector resampling for GMMSVM-based speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Speeding up subcellular localization by extracting informative regions of protein sequences for profile alignment.
Proceedings of the 2010 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, 2010
Truncation of protein sequences for fast profile alignment with application to subcellular localization.
Proceedings of the 2010 IEEE International Conference on Bioinformatics and Biomedicine, 2010
2009
A new adaptation approach to high-level speaker-model creation in speaker verification.
Speech Commun., 2009
Optimization of discriminative kernels in SVM speaker verification.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Fast GMM computation for speaker verification using scalar quantization and discrete densities.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Conditional random fields for the prediction of signal peptide cleavage sites.
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
PairProSVM: Protein Subcellular Localization Based on Local Pairwise Profile Alignment and SVM.
IEEE ACM Trans. Comput. Biol. Bioinform., 2008
Feature Selection for Self-Supervised Classification With Applications to Microarray and Sequence Data.
IEEE J. Sel. Top. Signal Process., 2008
Fusion of feature selection methods for pairwise scoring SVM.
Neurocomputing, 2008
High-level speaker verification via articulatory-feature based sequence kernels and SVM.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Fusion of cleavage site detection and pairwise alignment for fast subcellular localization.
Proceedings of the IEEE International Conference on Acoustics, 2008
2007
Speaker Verification via High-Level Feature Based Phonetic-Class Pronunciation Modeling.
IEEE Trans. Computers, 2007
Probabilistic feature-based transformation for speaker verification over telephone networks.
Neurocomputing, 2007
Environment adaptation for robust speaker verification by cascading maximum likelihood linear regression and reinforced learning.
Comput. Speech Lang., 2007
A New Adaptation Method for Speaker-Model Creation in High-Level Speaker Verification.
Proceedings of the Advances in Multimedia Information Processing, 2007
High-level feature-based speaker verification via articulatory phonetic-class pronunciation modeling.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Effects of Device Mismatch, Language Mismatch and Environmental Mismatch on Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2007
Feature Selection for Pairwise Scoring Kernels with Applications to Protein Subcellular Localization.
Proceedings of the IEEE International Conference on Acoustics, 2007
Adaptive Weight Estimation in Multi-Biometric Verification using Fuzzy Logic Decision Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
Blind Stochastic Feature Transformation for Channel Robust Speaker Verification.
J. VLSI Signal Process., 2006
Machine learning for multimodality genomic signal processing.
IEEE Signal Process. Mag., 2006
Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification.
Speech Commun., 2006
Symmetric and Asymmetric Multi-modality Biclustering Analysis for Microarray Data Matrix.
J. Bioinform. Comput. Biol., 2006
A Solution to the Curse of Dimensionality Problem in Pairwise Scoring Techniques.
Proceedings of the Neural Information Processing, 13th International Conference, 2006
A Comparison of Various Adaptation Methods for Speaker Verification With Limited Enrollment Data.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
On Consistent Fusion of Multimodal Biometrics.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Extraction of Speaker Features from Different Stages of DSR Front-Ends for Distributed Speaker Verification.
Int. J. Speech Technol., 2005
Channel robust speaker verification via Bayesian blind stochastic feature transformation.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Speaker verification via articulatory feature-based conditional pronunciation modeling with vowel and consonant mixture models.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Speaker Verification Using Adapted Articulatory Feature-based Conditional Pronunciation Modeling.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
A two-level fusion approach to multimodal biometric verification.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Multi-Metric and Multi-Substructure Biclustering Analysis for Gene Expression Data.
Proceedings of the Fourth International IEEE Computer Society Computational Systems Bioinformatics Conference, 2005
2004
Stochastic Feature Transformation with Divergence-Based Out-of-Handset Rejection for Robust Speaker Verification.
EURASIP J. Adv. Signal Process., 2004
Adaptive conditional pronunciation modeling using articulatory features for speaker verification.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
A new approach to channel robust speaker verification via constrained stochastic feature transformation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Articulatory feature-based conditional pronunciation modeling for speaker verification.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Multi-sample fusion with constrained feature transformation for robust speaker verification.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Maximum Likelihood and Maximum a Posteriori Adaptation for Distributed Speaker Recognition Systems.
Proceedings of the Biometric Authentication, First International Conference, 2004
Applying articulatory features to telephone-based speaker verification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Multi-sample data-dependent fusion of sorted score sequences for biometric verification.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Speaker verification based on g.729 and g.723.1 coder parameters and handset mismatch compensation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Environment adaptation for robust speaker verification.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Adaptive decision fusion for multi-sample speaker verification over GSM networks.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Cluster-Dependent Feature Transformation for Telephone-Based Speaker Verification.
Proceedings of the Audio-and Video-Based Biometrie Person Authentication, 2003
2002
A Comparative Study on Kernel-Based Probabilistic Neural Networks for Speaker Verification.
Int. J. Neural Syst., 2002
Sun-Yuan Kung, Speaker Verification from Coded Telephone Speech Using Stochastic Feature Transformation and Handset Identification.
Proceedings of the Advances in Multimedia Information Processing, 2002
Kernel-Based Probabilistic Neural Networks with Integrated Scoring Normalization for Speaker Verification.
Proceedings of the Advances in Multimedia Information Processing, 2002
Divergence-based out-of-class rejection for telephone handset identification.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Combining stochastic feature transformation and handset identification for telephone-based speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
A GMM-Based Handset Selector for Channel Mismatch Compensation with Applications to Speaker Identification.
Proceedings of the Advances in Multimedia Information Processing, 2001
2000
Estimation of elliptical basis function parameters by the EM algorithm with application to speaker verification.
IEEE Trans. Neural Networks Learn. Syst., 2000
A study of the Lamarckian evolution of recurrent neural networks.
IEEE Trans. Evol. Comput., 2000
A two-stage scoring method combining world and cohort models for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2000
1999
Adding learning to cellular genetic algorithms for training recurrent neural networks.
IEEE Trans. Neural Networks, 1999
Gaussian Mixture Models and Probabilistic Decision-Based Neural Networks for Pattern Classification: A Comparative Study.
Neural Comput. Appl., 1999
Determining the Optimal Number of Clusters by an Extended RPCL Algorithm.
J. Adv. Comput. Intell. Intell. Informatics, 1999
On the improvement of the real time recurrent learning algorithm for recurrent neural networks.
Neurocomputing, 1999
A conjugate gradient learning algorithm for recurrent neural networks.
Neurocomputing, 1999
A priori threshold determination for phrase-prompted speaker verification.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
A new cepstrum-based channel compensation method for speaker verification.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Elliptical basis function networks and radial basis function networks for speaker verification: a comparative study.
Proceedings of the International Joint Conference Neural Networks, 1999
1998
Empirical Analysis of the Factors that Affect the Baldwin Effect.
Proceedings of the Parallel Problem Solving from Nature, 1998
1995
A learning algorithm for Recurrent Radial Basis Function Networks.
Neural Process. Lett., 1995
Application of a fast real time recurrent learning algorithm to text-to-phoneme conversion.
Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995
1994
A lip-tracking system based on morphological processing and block matching techniques.
Signal Process. Image Commun., 1994
Lip-motion analysis for speech segmentation in noise.
Speech Commun., 1994
Speaker identification using multilayer perceptrons and radial basis function networks.
Neurocomputing, 1994