Liumeng Xue

Orcid: 0000-0003-2815-8494

According to our database, Liumeng Xue authored at least 26 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Multi-Level Temporal-Channel Speaker Retrieval for Zero-Shot Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder.
CoRR, 2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM.
CoRR, 2024

SingVisio: Visual analytics of diffusion model for singing voice conversion.
Comput. Graph., 2024

SponTTS: Modeling and Transferring Spontaneous Style for TTS.
Proceedings of the IEEE International Conference on Acoustics, 2024

An Initial Investigation of Neural Replay Simulator for Over-The-Air Adversarial Perturbations to Automatic Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2024

Transfer the Linguistic Representations from TTS to Accent Conversion with Non-Parallel Data.
Proceedings of the IEEE International Conference on Acoustics, 2024


2023
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit.
CoRR, 2023

SponTTS: modeling and transferring spontaneous style for TTS.
CoRR, 2023

Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion.
CoRR, 2023

Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion.
CoRR, 2023

Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features.
Proceedings of the IEEE International Conference on Acoustics, 2023

HIGNN-TTS: Hierarchical Prosody Modeling With Graph Neural Networks for Expressive Long-Form TTS.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Multi-granularity Semantic and Acoustic Stress Prediction for Expressive TTS.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
ParaTTS: Learning Linguistic and Prosodic Cross-Sentence Information in Paragraph-Based TTS.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Cycle consistent network for end-to-end style transfer TTS training.
Neural Networks, 2021

Controllable Emotion Transfer For End-to-End Speech Synthesis.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

2020
On the localness modeling for the self-attention based end-to-end speech synthesis.
Neural Networks, 2020

Building a controllable expressive speech synthesis system with multiple emotion strengths.
Cogn. Syst. Res., 2020

2019
Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-to-End Speech Synthesis.
IEEE Access, 2019

Building a Mixed-Lingual Neural TTS System with Only Monolingual Data.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Control Emotion Intensity for LSTM-Based Expressive Speech Synthesis.
Proceedings of the Data Science, 2019

