Yayue Deng

Orcid: 0009-0003-7642-4942

According to our database1, Yayue Deng authored at least 14 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

2020
2021
2022
2023
2024
0
1
2
3
4
5
6
7
8
9
4
1
4
2
2
1

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Retrieval Augmented Generation in Prompt-based Text-to-Speech Synthesis with Context-Aware Contrastive Language-Audio Pretraining.
CoRR, 2024

Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model.
CoRR, 2024

FMPAF: How Do Fed Chairs Affect the Financial Market? A Fine-grained Monetary Policy Analysis Framework on Their Language.
CoRR, 2024

ExpressiveSinger: Synthesizing Expressive Singing Voice as an Instrument.
Proceedings of the 14th IEEE International Symposium on Chinese Spoken Language Processing, 2024

Frame-Level Emotional State Alignment Method for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Concss: Contrastive-based Context Comprehension for Dialogue-Appropriate Prosody in Conversational Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2024

CAG: A Consistency-Adaptive Text-Image Alignment Generation for Joint Multimodal Entity-Relation Extraction.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis.
CoRR, 2023

CMCU-CSS: Enhancing Naturalness via Commonsense-based Multi-modal Context Understanding in Conversational Speech Synthesis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

M<sup>2</sup>-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

2020
A Dynamic 3D Pronunciation Teaching Model Based on Pronunciation Attributes and Anatomy.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020


  Loading...