We stand with Ukraine

We stand with Ukraine

Hieu-Thi Luong

Orcid: 0000-0002-4772-5995

According to our database¹, Hieu-Thi Luong authored at least 19 papers between 2016 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

NTU-NPU System for Voice Privacy 2024 Challenge.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2024

Room Impulse Responses help attackers to evade Deep Fake Detection.

[BibT_eX]

[DOI]

,

Duc-Tuan Truong

,

,

CoRR, 2024

Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection.

[BibT_eX]

[DOI]

Duc-Tuan Truong

,

,

,

,

,

CoRR, 2024

2023

Controlling Multi-Class Human Vocalization Generation via a Simple Segment-based Labeling Scheme.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2021

LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

CoRR, 2021

Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021

2020

Deep learning based voice cloning framework for a unified system of text-to-speech and voice conversion.

[BibT_eX]

[DOI]

PhD thesis, 2020

NAUTILUS: A Versatile Voice Cloning System.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

IEEE ACM Trans. Audio Speech Lang. Process., 2020

Latent linguistic embedding for cross-lingual text-to-speech and voice conversion.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

2019

A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

CoRR, 2019

Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora.

[BibT_eX]

[DOI]

,

,

Junichi Yamagishi

,

Nobuyuki Nishizawa

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Bootstrapping Non-Parallel Voice Conversion from Speaker-Adaptive Text-to-Speech.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018

Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder.

[BibT_eX]

[DOI]

,

,

,

Junichi Yamagishi

,

,

Nobuaki Minematsu

IEEE Access, 2018

Scaling and Bias Codes for Modeling Speaker-Adaptive DNN-Based Speech Synthesis Systems.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation.

[BibT_eX]

[DOI]

,

Junichi Yamagishi

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects.

[BibT_eX]

[DOI]

,

,

Junichi Yamagishi

,

Nobuyuki Nishizawa

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Adapting and controlling DNN-based speech synthesis using input codes.

[BibT_eX]

[DOI]

,

,

Gustav Eje Henter

,

Junichi Yamagishi

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

A non-expert Kaldi recipe for Vietnamese Speech Recognition System.

[BibT_eX]

[DOI]

,

Proceedings of the Third International Workshop on Worldwide Language Service Infrastructure and Second Workshop on Open Infrastructures and Analysis Frameworks for Human Language Technologies WLSI/OIAF4HLT@COLING, 2016

Loading...