Benlai Tang

According to our database1, Benlai Tang authored at least 12 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling.
CoRR, 2024

2023
Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWP.
CoRR, 2023

CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

TranssionADD: A Multi-frame Reinforcement Based Sequence Tagging Model for Audio Deepfake Detection.
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), 2023

2022
Towards high-fidelity singing voice conversion with acoustic reference and contrastive predictive coding.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Towards Using Clothes Style Transfer for Scenario-Aware Person Video Generation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Towards Realistic Visual Dubbing with Heterogeneous Sources.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

PPG-Based Singing Voice Conversion with Adversarial Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech.
CoRR, 2020

2018
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder.
CoRR, 2018

2016
Application of pronunciation knowledge on phoneme recognition by LSTM neural network.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016


  Loading...