Liang He
Orcid: 0000-0003-4076-7479Affiliations:
- Xinjiang University, School of Computer Science and Technology, Urumqi, China
- Tsinghua University, Department of Electronic Engineering, TNLIST, Beijing, China
According to our database1,
Liang He
authored at least 90 papers
between 2008 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration.
EURASIP J. Audio Speech Music. Process., December, 2024
Improving Speaker Verification With Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IIFC-Net: A Monaural Speech Enhancement Network With High-Order Information Interaction and Feature Calibration.
IEEE Signal Process. Lett., 2024
LMKG: A large-scale and multi-source medical knowledge graph for intelligent medicine applications.
Knowl. Based Syst., 2024
Knowl. Based Syst., 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Phase Continuity-Aware Self-Attentive Recurrent Network with Adaptive Feature Selection for Robust VAD.
Proceedings of the IEEE International Conference on Acoustics, 2024
Introducing Multilingual Phonetic Information to Speaker Embedding for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024
SMMA-Net: An Audio Clue-Based Target Speaker Extraction Network with Spectrogram Matching and Mutual Attention.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
Sensors, December, 2023
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision.
EURASIP J. Audio Speech Music. Process., December, 2023
Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction.
CoRR, 2023
MAKBQA: Multi-hop Knowledge Base Question Answering System Based on Sensors and Internet Agricultural Data.
Proceedings of the 20th Annual IEEE International Conference on Sensing, 2023
Proceedings of the ACM Multimedia Asia 2023, 2023
Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization.
Proceedings of the ACM Multimedia Asia 2023, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
MTANet: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
2022
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation.
IEEE Signal Process. Lett., 2022
A bimodal network based on Audio-Text-Interactional-Attention with ArcFace loss for speech emotion recognition.
Speech Commun., 2022
Multi-stage music separation network with dual-branch attention and hybrid convolution.
J. Intell. Inf. Syst., 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022
2021
End-to-End Cross-Lingual Spoken Language Understanding Model with Multilingual Pretraining.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021
2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
MTF-CRNN: Multiscale Time-Frequency Convolutional Recurrent Neural Network for Sound Event Detection.
IEEE Access, 2020
Combined Vector Based on Factorized Time-delay Neural Network for Text-Independent Speaker Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
A Joint Detection-Classification Model for Weakly Supervised Sound Event Detection Using Multi-Scale Attention Method.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
EURASIP J. Audio Speech Music. Process., 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
IEEE Signal Process. Lett., 2018
Semi-supervised minimum redundancy maximum relevance feature selection for audio classification.
Multim. Tools Appl., 2018
Defect characterization of amorphous silicon thin film solar cell based on low frequency noise.
Sci. China Inf. Sci., 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Multidimens. Syst. Signal Process., 2017
CoRR, 2017
Proceedings of the 2017 IEEE International Workshop on Signal Processing Systems, 2017
Deep neural networks based speaker modeling at different levels of phonetic granularity.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Comparison of multiple features and modeling methods for text-dependent speaker verification.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
Semi-supervised feature selection for audio classification based on constraint compensated Laplacian score.
EURASIP J. Audio Speech Music. Process., 2016
EURASIP J. Audio Speech Music. Process., 2016
Investigation of Senone-based Long-Short Term Memory RNNs for Spoken Language Recognition.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Investigating Various Diarization Algorithms for Speaker in the Wild (SITW) Speaker Recognition Challenge.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015
Investigation of bottleneck features and multilingual deep neural networks for speaker verification.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Simultaneous utilization of spectral magnitude and phase information to extract supervectors for speaker verification anti-spoofing.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015
2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013
2012
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Orthogonal Subspace Combination Based on the Joint Factor Analysis for Text-Independent Speaker Recognition.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012
2011
Time-Frequency Cepstral Features and Heteroscedastic Linear Discriminant Analysis for Language Recognition.
IEEE Trans. Speech Audio Process., 2011
2010
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2009
2008
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008
Proceedings of the IEEE Asia Pacific Conference on Circuits and Systems, 2008