Jia Liu

Affiliations:
  • Tsinghua University, Beijing National Research Center for Information Science and Technology, Beijing, China
  • Tsinghua University, Department of Electronic Engineering, TNList, Beijing, China (PhD 1990)


According to our database1, Jia Liu authored at least 162 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models.
CoRR, 2024

CoopASD: Cooperative Machine Anomalous Sound Detection with Privacy Concerns.
CoRR, 2024

AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection.
CoRR, 2024

Improving Acoustic Scene Classification via Self-Supervised and Semi-Supervised Learning with Efficient Audio Transformer.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Semi-Supervised Acoustic Scene Classification with Test-Time Adaptation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Exploring Large Scale Pre-Trained Models for Robust Machine Anomalous Sound Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Unsupervised Anomaly Detection and Localization of Machine Audio: A Gan-Based Approach.
Proceedings of the IEEE International Conference on Acoustics, 2023

Decoupling Detectors for Scalable Anomaly Detection in AIoT Systems with Multiple Machines.
Proceedings of the IEEE Global Communications Conference, 2023

2022
Ensemble of Multiple Anomalous Sound Detectors.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2020
Staged Training Strategy and Multi-Activation for Audio Tagging with Noisy and Sparse Multi-Label Data.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Distance-Dependent Metric Learning.
IEEE Signal Process. Lett., 2019

Latent class model with application to speaker diarization.
EURASIP J. Audio Speech Music. Process., 2019

End-to-End Topic Classification without ASR.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2019

Large Margin Softmax Loss for Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-objective Optimization Training of PLDA for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Geometric Discriminant Analysis for I-vector Based Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Dilated-Gated Convolutional Neural Network with A New Loss Function on Sound Event Detection.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Lattice Based Transcription Loss for End-to-End Speech Recognition.
J. Signal Process. Syst., 2018

Local Pairwise Linear Discriminant Analysis for Speaker Verification.
IEEE Signal Process. Lett., 2018

Advanced recurrent network-based hybrid acoustic models for low resource speech recognition.
EURASIP J. Audio Speech Music. Process., 2018

Multiobjective Optimization Training of PLDA for Speaker Verification.
CoRR, 2018

VB-HMM Speaker Diarization with Enhanced and Refined Segment Representation.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Latent Class Model for Single Channel Speaker Diarization.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Exploring a Unified Attention-Based Pooling Framework for Speaker Verification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Speaker Embedding Extraction with Phonetic Information.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Text Prompted Speaker Verification Based on Phoneme Clustering with Earth Mover's Distane and Cauchy-Schwarz Divergence.
Proceedings of the 2018 2nd International Conference on Algorithms, Computing and Systems, 2018

Improved Phonotactic Language Recognition Using Collaborated Language Model.
Proceedings of the 5th IEEE International Conference on Cloud Computing and Intelligence Systems, 2018

Investigation of Frame Alignments for GMM-based Digit-prompted Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Investigation of Frame Alignments for GMM-based Text-prompted Speaker Verification.
CoRR, 2017

Ivec-PLDA-AHC priors for VB-HMM speaker diarization system.
Proceedings of the 2017 IEEE International Workshop on Signal Processing Systems, 2017

Deep neural networks based speaker modeling at different levels of phonetic granularity.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

An LSTM-CTC based verification system for proxy-word based OOV keyword search.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Comparison of multiple features and modeling methods for text-dependent speaker verification.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Gated convolutional networks based hybrid acoustic models for low resource speech recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Discriminative Boosting Algorithm for Diversified Front-End Phonotactic Language Recognition.
J. Signal Process. Syst., 2016

Maxout neurons for deep convolutional and LSTM neural networks in speech recognition.
Speech Commun., 2016

Investigation of Senone-based Long-Short Term Memory RNNs for Spoken Language Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Application of i-vector in speech and music classification.
Proceedings of the 2016 IEEE International Symposium on Signal Processing and Information Technology, 2016

A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction.
Proceedings of the 2016 IEEE International Symposium on Signal Processing and Information Technology, 2016

Gated recurrent units based hybrid acoustic models for robust speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Lattice based transcription loss for end-to-end speech recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

A study of variational method for text-independent speaker recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A Novel Discriminative Score Calibration Method for Keyword Search.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

THU-EE System Description for NIST LRE 2015.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Multi-resolution time frequency feature and complementary combination for short utterance speaker recognition.
Multim. Tools Appl., 2015

Regularized minimum class variance extreme learning machine for language recognition.
EURASIP J. Audio Speech Music. Process., 2015

THUEE language modeling method for the OpenKWS 2015 evaluation.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Convolutional maxout neural networks for speech separation.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Dialog state tracking using long short-term memory neural networks.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Using word confusion networks for slot filling in spoken language understanding.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Investigation of bottleneck features and multilingual deep neural networks for speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Simultaneous utilization of spectral magnitude and phase information to extract supervectors for speaker verification anti-spoofing.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Neuron sparseness versus connection sparseness in deep neural network for large vocabulary speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The THUEE system for the openKWS14 keyword search evaluation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Stacked bottleneck features for speaker verification.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

PRISM: A statistical modeling framework for text-independent speaker verification.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Improved system fusion for keyword search.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

High-performance Swahili keyword search with very limited language pack: The THUEE system for the OpenKWS15 evaluation.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Calibration of word posterior estimation in confusion networks for keyword search.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Efficient One-Pass Decoding with NNLM for Speech Recognition.
IEEE Signal Process. Lett., 2014

Spoken language recognition based on gap-weighted subsequence kernels.
Speech Commun., 2014

Empirically combining unnormalized NNLM and back-off N-gram for fast N-best rescoring in speech recognition.
EURASIP J. Audio Speech Music. Process., 2014

Homogenous ensemble phonotactic language recognition based on SVM supervector reconstruction.
EURASIP J. Audio Speech Music. Process., 2014

Text-Independent Speaker Verification via State Alignment.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Multi-scale kernels for short utterance speaker recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Deep belief network based CRF for spoken language understanding.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Word embeddings: A semi-supervised learning method for slot-filling in spoken dialog systems.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Speaker verification using Fisher vector.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Discriminative boosting regression backend for phonotactic language recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Improved multitaper PNCC feature for robust speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Phonotactic language recognition based on DNN-HMM acoustic model.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A new fast and memory effective i-vector extraction based on factor analysis of KLD derived GMM supervector.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Convolutional maxout neural networks for low-resource speech recognition.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Phonotactic language recognition based on time-gap-weighted lattice kernels.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Variance regularization of RNNLM for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Improved phonotactic language recognition based on RNN feature reconstruction.
Proceedings of the IEEE International Conference on Acoustics, 2014

Stochastic pooling maxout networks for low-resource speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Semi-supervised learning of dialogue acts using sentence similarity based on word embeddings.
Proceedings of the International Conference on Audio, 2014

Dereverberation for Speaker Identification in Meeting.
Proceedings of the HCI International 2014 - Posters' Extended Abstracts, 2014

Exploring the Large-Scale TDOA Feature Space for Speaker Diarization.
Proceedings of the HCI International 2014 - Posters' Extended Abstracts, 2014

2013
Exploiting articulatory features for pitch accent detection.
J. Zhejiang Univ. Sci. C, 2013

Fast Approximate Matching Algorithm for Phone-based Keyword Spotting.
J. Networks, 2013

Exploiting contextual information for prosodic event detection using auto-context.
EURASIP J. Audio Speech Music. Process., 2013

RNN language model with word clustering and class-based output layer.
EURASIP J. Audio Speech Music. Process., 2013

THU-EE system fusion for the NIST 2012 speaker recognition evaluation.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

MLP-HMM two-stage unsupervised training for low-resource languages on conversational telephone speech recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Parallel absolute-relative feature based phonotactic language recognition.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Audiovisual synthesis of exaggerated speech for corrective feedback in computer-assisted pronunciation training.
Proceedings of the IEEE International Conference on Acoustics, 2013

Simplified domain transfer multiple kernel learning for language recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Temporal kernel neural network language model.
Proceedings of the IEEE International Conference on Acoustics, 2013

I-matrix for text-independent speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Improve low-resource non-native mispronunciation detection with native speech by articulatory-based tandem feature.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

THUEE system for the Albayzin 2012 language recognition evaluation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Improving deep neural network acoustic models using unlabeled data.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

Combination of data borrowing strategies for low-resource LVCSR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Deep maxout neural networks for speech recognition.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012
Phone lattice reconstruction for embedded language recognition in LVCSR.
EURASIP J. Audio Speech Music. Process., 2012

Complementary combination in i-vector level for language recognition.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Automatic pitch accent detection using auto-context with acoustic features.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Improve mispronunciation detection with Tandem feature.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Articulatory Feature based Multilingual MLPs for Low-Resource Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Cross-Lingual and Ensemble MLPs Strategies for Low-Resource Speech Recognition.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012

2011
Automatic player labeling, tracking and field registration and trajectory mapping in broadcast soccer video.
ACM Trans. Intell. Syst. Technol., 2011

Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition.
IEEE Trans. Speech Audio Process., 2011

Robust speaker recognition in cross-channel condition based on Gaussian mixture model.
Multim. Tools Appl., 2011

Time-Frequency Cepstral Features and Combining Discriminative Training for Phonotactic Language Recognition.
J. Comput., 2011

Language Recognition Based on Acoustic Diversified Phone Recognizers and Phonotactic Feature Fusion.
IEICE Trans. Inf. Syst., 2011

Speaker segmentation and clustering based on the improved spectral clustering.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

Robust Audio Fingerprinting Based on Local Spectral Luminance Maxima Scheme.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Combining Lattice-Based Language Dependent and Independent Approaches for Out-of-Language Detection in LVCSR.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

State-Level Data Borrowing for Low-Resource Speech Recognition Based on Subspace GMMs.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Strategies for using MLP based features with limited target-language training data.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Multiple Background Models for Speaker Verification.
Proceedings of the Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28, 2010

Mandarin-English bilingual phone modeling and combining MPE based Discriminative training for cross-language speech recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Multi-feature combination for speaker recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Variant time-frequency cepstral features for speaker recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A fast query by humming system based on notes.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Combining Chinese spoken term detection systems via side-information conditioned linear logistic regression.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Integration of Complementary Phone Recognizers for Phonotactic Language Recognition.
Proceedings of the Information Computing and Applications - First International Conference, 2010

A CMLLR supervector kernel for SVM language recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Phone modeling and combining discriminative training for mandarinenglish bilingual speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

A modified Subband post-filtering approach for MVDR beamformer.
Proceedings of the 9th IEEE International Conference on Cognitive Informatics, 2010

2009
Efficient embedded speech recognition for very large vocabulary Mandarin car-navigation systems.
IEEE Trans. Consumer Electron., 2009

Automatic player detection, labeling and tracking in broadcast soccer video.
Pattern Recognit. Lett., 2009

Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

Two-layer network coordinate system for Internet distance prediction.
Proceedings of the International Conference on Ultra Modern Telecommunications, 2009

A Novel Embedded Speaker Verification on System on Chip.
Proceedings of the Sixth International Conference on Fuzzy Systems and Knowledge Discovery, 2009

A Combined De-correlation Method for Acoustic Feedback Cancellation in Hearing Aids.
Proceedings of the CSIE 2009, 2009 WRI World Congress on Computer Science and Information Engineering, March 31, 2009

2008
An Equalized Heteroscedastic Linear Discriminant Analysis Algorithm.
IEEE Signal Process. Lett., 2008

Speaker Clustering Aided by Visual Dialogue Analysis.
Proceedings of the Advances in Multimedia Information Processing, 2008

Addressing the out-of-vocabulary problem for large-scale Chinese spoken term detection.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Fusing multiple systems into a compact lattice index for chinese spoken term detection.
Proceedings of the IEEE International Conference on Acoustics, 2008

Fractional Fourier transform based auditory feature for language identification.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

Channel compensation technology in differential GSV-SVM speaker verification system.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

2007
English sentence stress detection system based on HMM framework.
Appl. Math. Comput., 2007

Subband Energy distance measure applied in multi-pass speech/non-speech discrimination.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Using confidence measures to evaluate the speaker turns in speaker segmentation.
Proceedings of the 9th International Symposium on Signal Processing and Its Applications, 2007

Two-Stage Method for Specific Audio Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2007

Confidence Measure Based Incremental Adaptation for Online Language Identification.
Proceedings of the Human-Computer Interaction. HCI Intelligent Multimodal Interaction Environments, 2007

Automatic Player Detection, Labeling and Tracking in Broadcast Soccer Video.
Proceedings of the British Machine Vision Conference 2007, 2007

A study of lattice-based spoken term detection for Chinese spontaneous speech.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Confidence Score Based Unsupervised Incremental Adaptation for OOV Words Detection.
Proceedings of the Structural, 2006

A Robust Acoustic Echo Canceller for Noisy Environment.
Proceedings of the 5th International Symposium on Chinese Spoken Language Processing, 2006

Perceptual Evaluation of Pronunciation Quality for Computer Assisted Language Learning.
Proceedings of the Technologies for E-Learning and Digital Entertainment, 2006

2004
Language identification using discriminative weighted language models.
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004

Comparison of Pronunciation Scores in Spoken Language Learning System.
Proceedings of the Advances in Web-Based Learning, 2004

Embedded speech recognition system on 8-bit MCU core.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Towards Robustness to Speech Rate in Mandarin All-Syllable Recognition.
J. Comput. Sci. Technol., 2003

Voice conversion with smoothed GMM and MAP adaptation.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A novel efficient decoding algorithm for CDHMM-based speech recognizer on chip.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
A Rejection Model Based on Multi-Layer Perceptrons for Mandarin Digit Recognition.
J. Comput. Sci. Technol., 2002

Acoustic model comparison for an embedded phoneme-based Mandarin name dialing system.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Real-time viterbi searching for practical telephone speech recognition systems.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Fast likelihood computation method using block-diagonal covariance matrices in hidden Markov model.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

Comparative study of linear feature transformation techniques for Mandarin digit string recognition.
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002

2001
Single-chip speech recognition system based on 8051 microcontroller core.
IEEE Trans. Consumer Electron., 2001

2000
Confidence measure based unsupervised speaker adaptation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Rejection based on a posteriori probability estimated by MLP with application for Mandarin voice dialer on ASIC.
Proceedings of the IEEE International Conference on Acoustics, 2000

1998
A novel robust speech recognition algorithm based on multi-models and integrated decision method.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998


  Loading...