Cong-Thanh Do

Orcid: 0000-0003-1748-2846

According to our database, Cong-Thanh Do authored at least 33 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding.
CoRR, 2024

Improving Accented Speech Recognition Using Data Augmentation Based on Unsupervised Text-to-Speech Synthesis.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
Domain Adaptive Self-supervised Training of Automatic Speech Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and Rescoring.
Proceedings of the IEEE International Conference on Acoustics, 2023

Towards a Unified End-to-End Language Understanding System for Speech and Text Inputs.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Robust multi-sensor generalized labeled multi-Bernoulli filter.
Signal Process., 2022

Multi-object tracking with an adaptive generalized labeled multi-Bernoulli filter.
Signal Process., 2022

Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

An Adaptive Multi-Sensor Generalised Labelled Multi-Bernoulli Filter for Linear Gaussian Models.
Proceedings of the 11th International Conference on Control, 2022

2021
A Tractable Multi-target Detection Model for Line-of-Sight Measurements.
Proceedings of the 2021 International Conference on Control, 2021

Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Learning Noise Invariant Features Through Transfer Learning For Robust End-to-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Tracking Multiple Marine Ships via Multiple Sensors with Unknown Backgrounds.
Sensors, 2019

Tracking Multiple Targets from Multistatic Doppler Radar with Unknown Probability of Detection.
Sensors, 2019

End-to-End Speech Recognition with High-Frame-Rate Features Extraction.
CoRR, 2019

Multiple marine ships tracking from multistatic Doppler data with unknown clutter rate.
Proceedings of the 2019 International Conference on Control, 2019

Subband Temporal Envelope Features and Data Augmentation for End-to-end Recognition of Distant Conversational Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Weighting Time-Frequency Representation of Speech Using Auditory Saliency for Automatic Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Multistatic Doppler-Based Marine Ships Tracking.
Proceedings of the 2018 International Conference on Control, 2018

2017
Improved Automatic Speech Recognition Using Subband Temporal Envelope Features and Time-Delay Neural Network Denoising Autoencoder.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2014
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification.
IEEE Signal Process. Lett., 2014

Speech-to-text development for Slovak, a low-resourced language.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Objective evaluation of HMM-based speech synthesis system using Kullback-Leibler divergence.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Augmenting short-term cepstral features with long-term discriminative features for speaker verification of telephone data.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech.
Speech Commun., 2012

Combining cepstral normalization and cochlear implant-like speech processing for microphone array-based speech recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Cochlear implant-like processing of speech signal for speaker verification.
Proceedings of the ISCA Workshop on Statistical And Perceptual Audition, 2012

2010
On Normalized MSE Analysis of Speech Fundamental Frequency in the Cochlear Implant-Like Spectrally Reduced Speech.
IEEE Trans. Biomed. Eng., 2010

On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR.
IEEE Trans. Speech Audio Process., 2010

Recognizing cochlear implant-like spectrally reduced speech with HMM-based ASR: experiments with MFCCs and PLP coefficients.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Area of mouth opening estimation from speech acoustics using blind deconvolution technique.
Proceedings of the Auditory-Visual Speech Processing, 2009

