Liang He

Orcid: 0000-0003-4076-7479

Affiliations:
  • Xinjiang University, School of Computer Science and Technology, Urumqi, China
  • Tsinghua University, Department of Electronic Engineering, TNLIST, Beijing, China


According to our database1, Liang He authored at least 90 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration.
EURASIP J. Audio Speech Music. Process., December, 2024

Improving Speaker Verification With Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

IIFC-Net: A Monaural Speech Enhancement Network With High-Order Information Interaction and Feature Calibration.
IEEE Signal Process. Lett., 2024

LMKG: A large-scale and multi-source medical knowledge graph for intelligent medicine applications.
Knowl. Based Syst., 2024

Prompt for extraction: Multiple templates choice model for event extraction.
Knowl. Based Syst., 2024

One Small and One Large for Document-level Event Argument Extraction.
CoRR, 2024

A Speaker Recognition Method Based on Stable Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024

Phase Continuity-Aware Self-Attentive Recurrent Network with Adaptive Feature Selection for Robust VAD.
Proceedings of the IEEE International Conference on Acoustics, 2024

Introducing Multilingual Phonetic Information to Speaker Embedding for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

SMMA-Net: An Audio Clue-Based Target Speaker Extraction Network with Spectrogram Matching and Mutual Attention.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Study on Graph Embedding for Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-View Speaker Embedding Learning for Enhanced Stability and Discriminability.
Proceedings of the IEEE International Conference on Acoustics, 2024

C-LLM: Learn to Check Chinese Spelling Errors Character by Character.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Audio-Visual Fusion Based on Interactive Attention for Person Verification.
Sensors, December, 2023

W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision.
EURASIP J. Audio Speech Music. Process., December, 2023

Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction.
CoRR, 2023

Graph Neural Network Backend for Speaker Recognition.
CoRR, 2023

MAKBQA: Multi-hop Knowledge Base Question Answering System Based on Sensors and Internet Agricultural Data.
Proceedings of the 20th Annual IEEE International Conference on Sensing, 2023

GhostVec: A New Threat to Speaker Privacy of End-to-End Speech Recognition System.
Proceedings of the ACM Multimedia Asia 2023, 2023

Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization.
Proceedings of the ACM Multimedia Asia 2023, 2023

A Study on Visualization of Voiceprint Feature.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Dynamic Fully-Connected Layer for Large-Scale Speaker Verification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MTANet: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Robust Training for Speaker Verification against Noisy Labels.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Speech Topic Classification Based on Pre-trained and Graph Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

A Joint Network Based on Interactive Attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

2022
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation.
IEEE Signal Process. Lett., 2022

A bimodal network based on Audio-Text-Interactional-Attention with ArcFace loss for speech emotion recognition.
Speech Commun., 2022

Multi-stage music separation network with dual-branch attention and hybrid convolution.
J. Intell. Inf. Syst., 2022

OR-Gate: A Noisy Label Filtering Method for Speaker Verification.
CoRR, 2022

I4U System Description for NIST SRE'20 CTS Challenge.
CoRR, 2022

THUEE system description for NIST 2020 SRE CTS challenge.
CoRR, 2022

How to Boost Anti-Spoofing with X-Vectors.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

End-to-end speech topic classification based on pre-trained model Wavlm.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

A Multi-grained based Attention Network for Semi-supervised Sound Event Detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Virtual Fully-Connected Layer for a Large-Scale Speaker Verification Dataset.
Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022

2021
End-to-End Cross-Lingual Spoken Language Understanding Model with Multilingual Pretraining.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improved Lightcnn with Attention Modules for Asv Spoofing Detection.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2020
Adaptive Multi-Scale Detection of Acoustic Events.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

MTF-CRNN: Multiscale Time-Frequency Convolutional Recurrent Neural Network for Sound Event Detection.
IEEE Access, 2020

Combined Vector Based on Factorized Time-delay Neural Network for Text-Independent Speaker Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

A Joint Detection-Classification Model for Weakly Supervised Sound Event Detection Using Multi-Scale Attention Method.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2020

THUEE System for NIST SRE19 CTS Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Distance-Dependent Metric Learning.
IEEE Signal Process. Lett., 2019

Latent class model with application to speaker diarization.
EURASIP J. Audio Speech Music. Process., 2019

THUEE system description for NIST 2019 SRE CTS Challenge.
CoRR, 2019

Large Margin Softmax Loss for Speaker Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-objective Optimization Training of PLDA for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2019

Geometric Discriminant Analysis for I-vector Based Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Local Pairwise Linear Discriminant Analysis for Speaker Verification.
IEEE Signal Process. Lett., 2018

Semi-supervised minimum redundancy maximum relevance feature selection for audio classification.
Multim. Tools Appl., 2018

Multiobjective Optimization Training of PLDA for Speaker Verification.
CoRR, 2018

Defect characterization of amorphous silicon thin film solar cell based on low frequency noise.
Sci. China Inf. Sci., 2018

VB-HMM Speaker Diarization with Enhanced and Refined Segment Representation.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Latent Class Model for Single Channel Speaker Diarization.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Exploring a Unified Attention-Based Pooling Framework for Speaker Verification.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Parallel Double Audio Fingerprinting.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Speaker Embedding Extraction with Phonetic Information.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Investigation of Frame Alignments for GMM-based Digit-prompted Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Fading channel modelling using single-hidden layer feedforward neural networks.
Multidimens. Syst. Signal Process., 2017

Investigation of Frame Alignments for GMM-based Text-prompted Speaker Verification.
CoRR, 2017

Ivec-PLDA-AHC priors for VB-HMM speaker diarization system.
Proceedings of the 2017 IEEE International Workshop on Signal Processing Systems, 2017

Deep neural networks based speaker modeling at different levels of phonetic granularity.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Comparison of multiple features and modeling methods for text-dependent speaker verification.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Semi-supervised feature selection for audio classification based on constraint compensated Laplacian score.
EURASIP J. Audio Speech Music. Process., 2016

Voice activity detection algorithm based on long-term pitch information.
EURASIP J. Audio Speech Music. Process., 2016

Investigation of Senone-based Long-Short Term Memory RNNs for Spoken Language Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

A study of variational method for text-independent speaker recognition.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016

Improving Deep Neural Networks Based Speaker Verification Using Unlabeled Data.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

THU-EE System Description for NIST LRE 2015.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Convolutional maxout neural networks for speech separation.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015

Investigation of bottleneck features and multilingual deep neural networks for speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Simultaneous utilization of spectral magnitude and phase information to extract supervectors for speaker verification anti-spoofing.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Stacked bottleneck features for speaker verification.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

PRISM: A statistical modeling framework for text-independent speaker verification.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

2014
Speaker verification using Fisher vector.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Improved multitaper PNCC feature for robust speaker verification.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

2013
I-matrix for text-independent speaker recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

THUEE system for the Albayzin 2012 language recognition evaluation.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012
Complementary combination in i-vector level for language recognition.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012

2011
Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition.
IEEE Trans. Speech Audio Process., 2011

2010
Multi-feature combination for speaker recognition.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Variant time-frequency cepstral features for speaker recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009

2008
Fractional Fourier transform based auditory feature for language identification.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008

Channel compensation technology in differential GSV-SVM speaker verification system.
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008


  Loading...