Ying Hu

Orcid: 0000-0001-7505-1767

Affiliations:
  • Xinjiang University, School of Information Science and Engineering, Key Laboratory of Signal Detection and Processing, Urumqi, China
  • Xi'an Jiaotong University, School of Electronics and Information Engineering, Xi'an, China (PhD 2016)


According to our database1, Ying Hu authored at least 29 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
GLFER-Net: a polyphonic sound source localization and detection network based on global-local feature extraction and recalibration.
EURASIP J. Audio Speech Music. Process., December, 2024

Improving Speaker Verification With Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

IIFC-Net: A Monaural Speech Enhancement Network With High-Order Information Interaction and Feature Calibration.
IEEE Signal Process. Lett., 2024

Introducing Multilingual Phonetic Information to Speaker Embedding for Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision.
EURASIP J. Audio Speech Music. Process., December, 2023

MTANet: Multi-band Time-frequency Attention Network for Singing Melody Extraction from Polyphonic Music.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

CRA-DIFFUSE: Improved Cross-Domain Speech Enhancement Based on Diffusion Model with T-F Domain Pre-Denoising.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Speech Topic Classification Based on Pre-trained and Graph Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

A Joint Network Based on Interactive Attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Speakeraugment: Data Augmentation for Generalizable Source Separation via Speaker Parameter Manipulation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation.
IEEE Signal Process. Lett., 2022

A bimodal network based on Audio-Text-Interactional-Attention with ArcFace loss for speech emotion recognition.
Speech Commun., 2022

Multi-stage music separation network with dual-branch attention and hybrid convolution.
J. Intell. Inf. Syst., 2022

Dual-Path Hybrid Attention Network for Monaural Speech Separation.
IEEE Access, 2022

How to Boost Anti-Spoofing with X-Vectors.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

A Multi-grained based Attention Network for Semi-supervised Sound Event Detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Mining Hard Samples Locally And Globally For Improved Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Virtual Fully-Connected Layer for a Large-Scale Speaker Verification Dataset.
Proceedings of the Biometric Recognition - 16th Chinese Conference, 2022

2021
Dual Attention Network for Pitch Estimation of Monophonic Music.
Symmetry, 2021

Connectionist temporal classification loss for vector quantized variational autoencoder in zero-shot voice conversion.
Digit. Signal Process., 2021

End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Encoder-Decoder Based Pitch Tracking and Joint Model Training for Mandarin Tone Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Monaural Singing Voice and Accompaniment Separation Based on Gated Nested U-Net Architecture.
Symmetry, 2020

A Lightweight Model Based on Separable Convolution for Speech Emotion Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018
Defect characterization of amorphous silicon thin film solar cell based on low frequency noise.
Sci. China Inf. Sci., 2018

2017
Mandarin tone modeling using recurrent neural networks.
CoRR, 2017

2016
Scene Text Detection Based on Text Probability and Pruning Algorithm.
Proceedings of the Intelligent Computing Methodologies - 12th International Conference, 2016

Monaural Singing Voice Separation by Non-negative Matrix Partial Co-Factorization with Temporal Continuity and Sparsity Criteria.
Proceedings of the Intelligent Computing Methodologies - 12th International Conference, 2016


  Loading...