Trung Hieu Nguyen

  • Institute for Infocomm Research, A*STAR, Singapore

According to our database, Trung Hieu Nguyen authored at least 33 papers between 2007 and 2024.

Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions.
CoRR, 2024

Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis.
CoRR, 2024

MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2024

SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance.
Proceedings of the IEEE International Conference on Acoustics, 2024

Are Soft Prompts Good Zero-Shot Learners for Speech Recognition?
Proceedings of the IEEE International Conference on Acoustics, 2024

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention.
CoRR, 2023

ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Adaptive Knowledge Distillation Between Text and Speech Pre-Trained Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Contrastive Speech Mixup for Low-Resource Keyword Spotting.
Proceedings of the IEEE International Conference on Acoustics, 2023

Auxiliary Pooling Layer For Spoken Language Understanding.
Proceedings of the IEEE International Conference on Acoustics, 2023

CPT: Cross-Modal Prefix-Tuning for Speech-To-Text Translation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram.
Proceedings of the IEEE International Conference on Acoustics, 2021

Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses.
Proceedings of the IEEE International Conference on Acoustics, 2021

Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019


I2R-NUS submission to oriental language recognition AP16-OL7 challenge.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Fantastic 4 system for NIST 2015 Language Recognition Evaluation.
CoRR, 2016

I2R Submission to the 2015 NIST Language Recognition I-vector Challenge.
Proceedings of the Odyssey 2016: The Speaker and Language Recognition Workshop, 2016

Joint Speaker and Lexical Modeling for Short-Term Characterization of Speaker.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

The 2015 NIST Language Recognition Evaluation: The Shared View of I2R, Fantastic4 and SingaMS.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Speaker diarization in meetings domain.
PhD thesis, 2015

The RedDots platform for mobile crowd-sourcing of speech data.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

On the use of Bhattacharyya based GMM distance and neural net features for identification of cognitive load levels.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Extended RSR2015 for text-dependent speaker verification over VHF channel.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Bhattacharyya distance based emotional dissimilarity measure in multi-dimensional space for emotion classification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Bhattacharyya distance based emotional dissimilarity measure for emotion classification.
Proceedings of the IEEE International Conference on Acoustics, 2013

The IIR NIST SRE 2008 and 2010 summed channel speaker recognition systems.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Cluster criterion functions in spectral subspace and their application in speaker clustering.
Proceedings of the IEEE International Conference on Acoustics, 2009

T-test distance and clustering criterion for speaker diarization.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Using direction of arrival estimate and acoustic feature information in speaker diarization.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Speaker Diarization Using Direction of Arrival Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007 Evaluation.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007
