Kong-Aik Lee

Orcid: 0000-0001-9133-3000

Affiliations:
  • Institute for Infocomm Research, Singapore


According to our database1, Kong-Aik Lee authored at least 214 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Encoder-Decoder Calibration for Multimodal Machine Translation.
IEEE Trans. Artif. Intell., August, 2024

t-EER: Parameter-Free Tandem Evaluation of Countermeasures and Biometric Comparators.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Generalizing Speaker Verification for Spoof Awareness in the Embedding Space.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Cosine Scoring With Uncertainty for Neural Speaker Embedding.
IEEE Signal Process. Lett., 2024

NTU-NPU System for Voice Privacy 2024 Challenge.
CoRR, 2024

LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation.
CoRR, 2024

Room Impulse Responses help attackers to evade Deep Fake Detection.
CoRR, 2024

On the effectiveness of enrollment speech augmentation for Target Speaker Extraction.
CoRR, 2024

Towards Quantifying and Reducing Language Mismatch Effects in Cross-Lingual Speech Anti-Spoofing.
CoRR, 2024

Malacopula: adversarial automatic speaker verification attacks using a neural-based generalised Hammerstein model.
CoRR, 2024

ASVspoof 5: Crowdsourced Speech Data, Deepfakes, and Adversarial Attacks at Scale.
CoRR, 2024

Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection.
CoRR, 2024

Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis.
CoRR, 2024

Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding.
CoRR, 2024

Text-dependent Speaker Verification (TdSV) Challenge 2024: Challenge Evaluation Plan.
CoRR, 2024

VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis.
CoRR, 2024

Using Twitter Dataset for Social Listening in Singapore.
IEEE Access, 2024

Two-stage Semi-supervised Speaker Recognition with Gated Label Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

CPAUG: Refining Copy-Paste Augmentation for Speech Anti-Spoofing.
Proceedings of the IEEE International Conference on Acoustics, 2024

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Gradient Weighting for Speaker Verification in Extremely Low Signal-to-Noise Ratio.
Proceedings of the IEEE International Conference on Acoustics, 2024

Modeling Pseudo-Speaker Uncertainty in Voice Anonymization.
Proceedings of the IEEE International Conference on Acoustics, 2024

Adversarial Speech for Voice Privacy Protection from Personalized Speech Generation.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
A Dual Latent Variable Personalized Dialogue Agent.
SN Comput. Sci., March, 2023

Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition.
IEEE Trans. Inf. Forensics Secur., 2023

Meta-Generalization for Domain-Invariant Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

An Empirical Bayes Framework for Open-Domain Dialogue Generation.
CoRR, 2023

Partially Randomizing Transformer Weights for Dialogue Response Diversity.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

Disentangling Voice and Content with Self-Supervision for Speaker Recognition.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Single Integrated Spoofing-aware Speaker Verification Embeddings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Speaker-Aware Anti-spoofing.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Incorporating Uncertainty from Speaker Embedding Estimation to Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Speaker Recognition with Two-Step Multi-Modal Deep Cleansing.
Proceedings of the IEEE International Conference on Acoustics, 2023

Noise-Disentanglement Metric Learning for Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Probabilistic Back-ends for Online Speaker Recognition and Clustering.
Proceedings of the IEEE International Conference on Acoustics, 2023

Cross-Modal Audio-Visual Co-Learning for Text-Independent Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Leveraging Positional-Related Local-Global Dependency for Synthetic Speech Detection.
Proceedings of the IEEE International Conference on Acoustics, 2023

Self-Supervised Audio-Visual Speaker Representation with Co-Meta Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask.
IEEE Signal Process. Lett., 2022

Discriminative speaker embedding with serialized multi-layer multi-head attention.
Speech Commun., 2022

I4U System Description for NIST SRE'20 CTS Challenge.
CoRR, 2022

Noise-Robust Semi-supervised Multi-modal Machine Translation.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Deep Spectro-temporal Artifacts for Detecting Synthesized Speech.
Proceedings of the DDAM@MM 2022: Proceedings of the 1st International Workshop on Deepfake Detection for Audio Multimedia, 2022

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Scoring of Large-Margin Embeddings for Speaker Verification: Cosine or PLDA?
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Learning Domain-Invariant Transformation for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2022

Self-Supervised Speaker Recognition with Loss-Gated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances.
Proceedings of the IEEE International Conference on Acoustics, 2022

Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents.
Proceedings of the IEEE International Conference on Acoustics, 2022

DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation.
Proceedings of the 14th International Conference on Agents and Artificial Intelligence, 2022

A Randomized Link Transformer for Diverse Open-Domain Dialogue Generation.
Proceedings of the 4th Workshop on NLP for Conversational AI, 2022

2021
ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech.
IEEE Trans. Biom. Behav. Identity Sci., 2021

Xi-Vector Embedding for Speaker Recognition.
IEEE Signal Process. Lett., 2021

ASVtorch toolkit: Speaker verification with deep neural networks.
SoftwareX, 2021

Replay attack detection using variable-frequency resolution phase and magnitude features.
Comput. Speech Lang., 2021

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection.
CoRR, 2021

ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan.
CoRR, 2021

Benchmarking and challenges in security and privacy for voice biometrics.
CoRR, 2021

Generating Personalized Dialogue via Multi-Task Meta-Learning.
CoRR, 2021

Exploring Deep Learning for Joint Audio-Visual Lip Biometrics.
CoRR, 2021

Serialized Multi-Layer Multi-Head Attention for Neural Speaker Embedding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Joint Feature Enhancement and Speaker Recognition with Multi-Objective Task-Oriented Network.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Meta-Learning for Cross-Channel Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Replay-Attack Detection Using Features With Adaptive Spectro-Temporal Resolution.
Proceedings of the IEEE International Conference on Acoustics, 2021

COOPNet: Multi-Modal Cooperative Gender Prediction in Social Media User Profiling.
Proceedings of the IEEE International Conference on Acoustics, 2021

Task-aware Warping Factors in Mask-based Speech Enhancement.
Proceedings of the 29th European Signal Processing Conference, 2021

PL-EESR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

DeepLip: A Benchmark for Deep Learning-Based Audio-Visual Lip Biometrics.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Maximal Figure-of-Merit Framework to Detect Multi-Label Phonetic Features for Spoken Language Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech.
Comput. Speech Lang., 2020

Voice biometrics security: Extrapolating false alarm rate via hierarchical Bayesian modeling of speaker verification scores.
Comput. Speech Lang., 2020

NEC-TT System for Mixed-Bandwidth and Multi-Domain Speaker Recognition.
Comput. Speech Lang., 2020

Two decades into Speaker Recognition Evaluation - are we there yet?
Comput. Speech Lang., 2020

Using Multi-Resolution Feature Maps with Convolutional Neural Networks for Anti-Spoofing in ASV.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Neural i-vectors.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020


On Early-stop Clustering for Speaker Diarization.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Dynamic Margin Softmax Loss for Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Adversarial Separation Network for Speaker Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

SdSV Challenge 2020: Large-Scale Evaluation of Short-Duration Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Extrapolating False Alarm Rates in Automatic Speaker Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

NEC-TT Speaker Verification System for SRE'19 CTS Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

POCO: A Voice Spoofing and Liveness Detection Corpus Based on Pop Noise.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Deep Discriminative Embedding with Ranked Weight for Speaker Verification.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

A Generalized Framework for Domain Adaptation of PLDA in Speaker Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Introduction to Voice Presentation Attack Detection and Recent Advances.
Proceedings of the Handbook of Biometric Anti-Spoofing, 2019

Short-duration Speaker Verification (SdSV) Challenge 2020: the Challenge Evaluation Plan.
CoRR, 2019

The ASVspoof 2019 database.
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

Introduction to Voice Presentation Attack Detection and Recent Advances.
CoRR, 2019

Speaker Augmentation and Bandwidth Extension for Deep Speaker Embedding.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Unleashing the Unused Potential of i-Vectors Enabled by GPU Acceleration.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The NEC-TT 2018 Speaker Verification System.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Generalizing I-Vector Estimation for Rapid Speaker Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Generalized Variability Model for Speaker Verification.
IEEE Signal Process. Lett., 2018

Attention Mechanism in Speaker Recognition: What Does it Learn in Deep Speaker Embedding?
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Co-whitening of I-vectors for Short and Long Duration Speaker Verification.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

On the Importance of Analytic Phase of Speech Signals in Spoken Language Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Maximal Figure-of-Merit Embedding for Multi-Label Audio Classification.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Many-to-Many Voice Conversion based on Bottleneck Features with Variational Autoencoder for Non-parallel Training Data.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Direct Optimization of the Detection Cost for I-Vector-Based Spoken Language Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Incorporating Local Acoustic Variability Information into Short Duration Speaker Verification.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Gain Compensation for Fast i-Vector Extraction Over Short Duration.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017


The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

RedDots replayed: A new replay spoofing attack corpus for text-dependent speaker verification research.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Adaptation of PLDA for multi-source text-independent speaker verification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

I2R-NUS submission to oriental language recognition AP16-OL7 challenge.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Exploration of Local Variability in Text-Independent Speaker Verification.
J. Signal Process. Syst., 2016

Total Variability Modeling Using Source-Specific Priors.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Fantastic 4 system for NIST 2015 Language Recognition Evaluation.
CoRR, 2016

Rapid Computation of I-vector.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Deep Language: a comprehensive deep learning approach to end-to-end language recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

I2R Submission to the 2015 NIST Language Recognition I-vector Challenge.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Discriminating Languages in a Probabilistic Latent Subspace.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Neural networks based channel compensation for i-vector speaker verification.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Joint Speaker and Lexical Modeling for Short-Term Characterization of Speaker.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Twin Model G-PLDA for Duration Mismatch Compensation in Text-Independent Speaker Verification.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

An extensible speaker identification sidekit in Python.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Content-aware local variability vector for speaker verification with short utterance.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Quasi-Factorial Prior for i-vector Extraction.
IEEE Signal Process. Lett., 2015

Relevance factor of maximum a posteriori adaptation for GMM-NAP-SVM in speaker and language recognition.
Comput. Speech Lang., 2015

Sparse coding of total variability matrix.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The reddots platform for mobile crowd-sourcing of speech data.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

The reddots data collection for speaker recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Phone-centric local variability vector for text-constrained speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

A new study of GMM-SVM system for text-dependent speaker recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Source-specific informative prior for i-vector extraction.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Normalization of total variability matrix for i-vector/PLDA speaker verification.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Channel adaptation of plda for text-independent speaker verification.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Text-dependent speaker verification: Classifiers, databases and RSR2015.
Speech Commun., 2014

PLDA in the I-Supervector Space for Text-Independent Speaker Verification.
EURASIP J. Audio Speech Music. Process., 2014

Unifying Probabilistic Linear Discriminant Analysis Variants in Biometric Authentication.
Proceedings of the Structural, Syntactic, and Statistical Pattern Recognition, 2014

A Comparison of Categorical Attribute Data Clustering Methods.
Proceedings of the Structural, Syntactic, and Statistical Pattern Recognition, 2014

Text-Dependent Speaker Verification System in VHF Communication Channel.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Local Variability Modeling for Text-Independent Speaker Verification.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Single-sided approach to discriminative PLDA training for text-independent speaker verification without using expanded i-vector.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Local variability vector for text-independent speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Extended RSR2015 for text-dependent speaker verification over VHF channel.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Imposture classification for text-dependent speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Modelling the alternative hypothesis for text-dependent speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Minimum divergence estimation of speaker prior in multi-session PLDA scoring.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Sparse Classifier Fusion for Speaker Verification.
IEEE Trans. Speech Audio Process., 2013

Spoken Language Recognition: From Fundamentals to Practice.
Proc. IEEE, 2013


Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Multi-session PLDA scoring of i-vector for partially open-set speaker detection.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

ALIZE 3.0 - open source toolkit for state-of-the-art speaker recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic regularization of cross-entropy cost for speaker recognition fusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A study on GMM-SVM with adaptive relevance factor and its comparison with i-vector and JFA for speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Anti-model KL-SVM-NAP system for NIST SRE 2012 evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Phonetically-constrained PLDA modeling for text-dependent speaker verification with multiple short utterances.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification.
IEEE Trans. Speech Audio Process., 2012

Bhattacharyya-based GMM-SVM system with adaptive relevance factor for pair language recognition.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Variational Bayes logistic regression as regularized fusion for NIST SRE 2010.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Effect of Relevance Factor of Maximum a posteriori Adaptation for GMM-SVM in Speaker and Language Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

PLDA Modeling in I-Vector and Supervector Space for Speaker Verification.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

I-vectors in the context of phonetically-constrained short utterances for speaker verification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Study on the Relevance Factor of Maximum a Posteriori with GMM for Language Recognition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Spoken Language Recognition in the Latent Topic Simplex.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Regularized Logistic Regression Fusion for Speaker Verification.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Speech enhancement with masking properties in eigen-domain for colored noise.
Proceedings of the IEEE International Conference on Acoustics, 2011

Factored covariance modeling for text-independent speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

Classifier subset selection and fusion for speaker verification.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition.
IEEE Trans. Speech Audio Process., 2010

Factor analysis based spatial correlation modeling for speaker verification.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

MAP estimation of subspace transform for speaker recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A hybrid modeling strategy for GMM-SVM speaker recognition with adaptive relevance factor.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

The estimation and kernel metric of spectral correlation for text-independent speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Incorporating MAP estimation and covariance transform for SVM based speaker recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Approaching human listener accuracy with modern speaker verification.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Adaptive score fusion using Weighted Logistic Linear Regression for spoken language recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

A GMM-supervector approach to language recognition with adaptive relevance factor.
Proceedings of the 18th European Signal Processing Conference, 2010

Discrete expected likelihood kernel for SVM-based speaker verification.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition.
IEEE Signal Process. Lett., 2009

Target-aware language models for spoken language recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009


2008
NIST 2007 Language Recognition Evaluation: From the Perspective of IIR.
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation, 2008

Dimension reduction of the modulation spectrogram for speaker verification.
Proceedings of the Odyssey 2008: The Speaker and Language Recognition Workshop, 2008

Self-Organized Clustering for Feature Mapping in Language Recognition.
Proceedings of the 6th International Symposium on Chinese Spoken Language Processing, 2008

Characterizing speech utterances for speaker verification with sequence kernel SVM.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Spoken Language recognition using support vector machines with generative front-end.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
A GMM-based probabilistic sequence kernel for speaker verification.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

On Delayless Architecture for the Normalized Subband Adaptive Filter.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

2006
Inherent Decorrelating and Least Perturbation Properties of the Normalized Subband Adaptive Filter.
IEEE Trans. Signal Process., 2006

On the Subband Orthogonality of Cosine-Modulated Filter Banks.
IEEE Trans. Circuits Syst. II Express Briefs, 2006

Fusion of Acoustic and Tokenization Features for Speaker Recognition.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

The IIR Submission to CSLP 2006 Speaker Recognition Evaluation.
Proceedings of the Chinese Spoken Language Processing, 5th International Symposium, 2006

2005
Adaptive filtering using constrained subband updates.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

2004
Improving convergence of the NLMS algorithm using constrained subband updates.
IEEE Signal Process. Lett., 2004

Subband adaptive filtering using a multiple-constraint optimization criterion.
Proceedings of the 2004 12th European Signal Processing Conference, 2004


  Loading...