Hieu-Thi Luong

Orcid: 0000-0002-4772-5995

According to our database, Hieu-Thi Luong authored at least 19 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
NTU-NPU System for Voice Privacy 2024 Challenge.
CoRR, 2024

LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation.
CoRR, 2024

Room Impulse Responses help attackers to evade Deep Fake Detection.
CoRR, 2024

Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection.
CoRR, 2024

2023
Controlling Multi-Class Human Vocalization Generation via a Simple Segment-based Labeling Scheme.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2021
LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example.
CoRR, 2021

Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance.
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

2020
Deep learning based voice cloning framework for a unified system of text-to-speech and voice conversion.
PhD thesis, 2020

NAUTILUS: A Versatile Voice Cloning System.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Latent linguistic embedding for cross-lingual text-to-speech and voice conversion.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019
A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation.
CoRR, 2019

Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Bootstrapping Non-Parallel Voice Conversion from Speaker-Adaptive Text-to-Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder.
IEEE Access, 2018

Scaling and Bias Codes for Modeling Speaker-Adaptive DNN-Based Speech Synthesis Systems.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Adapting and controlling DNN-based speech synthesis using input codes.
Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

2016
A non-expert Kaldi recipe for Vietnamese Speech Recognition System.
Proceedings of the Third International Workshop on Worldwide Language Service Infrastructure and Second Workshop on Open Infrastructures and Analysis Frameworks for Human Language Technologies WLSI/OIAF4HLT@COLING, 2016

