Leying Zhang

Orcid: 0009-0008-1424-9604

According to our database1, Leying Zhang authored at least 17 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction.
CoRR, February, 2025

SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation.
CoRR, January, 2025

2024
Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling.
CoRR, 2024

DDTSE: Discriminative Diffusion Model for Target Speech Extraction.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Knowledge Distillation from Discriminative Model to Generative Model with Parallel Architecture for Speech Enhancement.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

PromptTTS 2: Describing and Generating Voices with Text Prompt.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Generation-Based Target Speech Extraction with Speech Discretization and Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction.
CoRR, 2023

PromptTTS 2: Describing and Generating Voices with Text Prompt.
CoRR, 2023

Adaptive Large Margin Fine-Tuning For Robust Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2023

Segment Anything Model (SAM) for Medical Image Segmentation: A Preliminary Review.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2023

2022
The SJTU X-LANCE Lab System for CNSRC 2022.
CoRR, 2022

Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Fuzzy Clustering Algorithm-Segmented MRI Images in Analysis of Effects of Mental Imagery on Neurorehabilitation of Stroke Patients.
Sci. Program., 2021

Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
A novel dual-domain clustering algorithm for inhomogeneous spatial point event.
Data Technol. Appl., 2020


  Loading...