Ying Hu
Orcid: 0000-0001-7505-1767
Affiliations:
- Xinjiang University, School of Information Science and Engineering, Key Laboratory of Signal Detection and Processing, Urumqi, China
- Xi'an Jiaotong University, School of Electronics and Information Engineering, Xi'an, China (PhD 2016)
According to our database, Ying Hu authored at least 30 papers between 2016 and 2024.
Bibliography
2024
GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration.
EURASIP J. Audio Speech Music. Process., December, 2024
Improving Speaker Verification With Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IIFC-Net: A Monaural Speech Enhancement Network With High-Order Information Interaction and Feature Calibration.
IEEE Signal Process. Lett., 2024
Introducing Multilingual Phonetic Information to Speaker Embedding for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024
SMMA-Net: An Audio Clue-Based Target Speaker Extraction Network with Spectrogram Matching and Mutual Attention.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision.
EURASIP J. Audio Speech Music. Process., December, 2023
MTANet: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023
Speakeraugment: Data Augmentation for Generalizable Source Separation via Speaker Parameter Manipulation.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation.
IEEE Signal Process. Lett., 2022
A bimodal network based on Audio-Text-Interactional-Attention with ArcFace loss for speech emotion recognition.
Speech Commun., 2022
Multi-stage music separation network with dual-branch attention and hybrid convolution.
J. Intell. Inf. Syst., 2022
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022
2021
Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion.
Digit. Signal Process., 2021
End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Encoder-Decoder Based Pitch Tracking and Joint Model Training for Mandarin Tone Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Monaural Singing Voice and Accompaniment Separation Based on Gated Nested U-Net Architecture.
Symmetry, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2018
Defect characterization of amorphous silicon thin film solar cell based on low frequency noise.
Sci. China Inf. Sci., 2018
2017
2016
Proceedings of the Intelligent Computing Methodologies - 12th International Conference, 2016
Monaural Singing Voice Separation by Non-negative Matrix Partial Co-Factorization with Temporal Continuity and Sparsity Criteria.
Proceedings of the Intelligent Computing Methodologies - 12th International Conference, 2016