Van Hai Do

Orcid: 0000-0002-9554-5171

According to our database, Van Hai Do authored at least 35 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Transfer learning methods for low-resource speech accent recognition: A case study on Vietnamese language.
Eng. Appl. Artif. Intell., 2024

Human Behavior Modeling in Speech Transcribing Process via Pretrained Speech Recognition Models.
Proceedings of the International Joint Conference on Neural Networks, 2024

2023
Probing Speech Quality Information in ASR Systems.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

LightVoc: An Upsampling-Free GAN Vocoder Based On Conformer And Inverse Short-time Fourier Transform.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

A Gaussian Distribution Labeling Method for Speech Quality Assessment.
Proceedings of the Computational Data and Social Networks - 12th International Conference, 2023

An Automatic Pipeline For Building Emotional Speech Dataset.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Improving Self-supervised Audio Representation based on Contrastive Learning with Conformer Encoder.
Proceedings of the 11th International Symposium on Information and Communication Technology, 2022

Improving Vietnamese Accent Recognition Using ASR Transfer Learning.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

2021
Development of Accent Recognition Systems for Vietnamese Speech.
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

2020
Agent/Client Speech Identification for Mixed-Channel Conversation in Customer Service Call Centers.
Proceedings of the International Conference on Asian Language Processing, 2020

2018
Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions.
Proceedings of the Ninth International Symposium on Information and Communication Technology, 2018

2017
Development of a Vietnamese speech recognition system for Viettel call center.
Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017

Multi-Task Learning Using Mismatched Transcription for Under-Resourced Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Low-resource spoken keyword search strategies in Georgian inspired by distinctive feature theory.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Mismatched crowdsourcing: Mining latent skills to acquire speech transcriptions.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016
A many-to-one phone mapping approach for cross-lingual speech recognition.
Proceedings of the 2016 IEEE RIVF International Conference on Computing & Communication Technologies, 2016

Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Approximate search of audio queries by using DTW with phone time boundary and data augmentation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exemplar-inspired strategies for low-resource spoken keyword search in Swahili.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Speech recognition of under-resourced languages using mismatched transcriptions.
Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Improving Efficiency of Sentence Boundary Detection by Feature Selection.
Proceedings of the Intelligent Information and Database Systems - 8th Asian Conference, 2016

2015
Acoustic modeling for speech recognition under limited training data conditions.
PhD thesis, 2015

Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages.
Int. J. Asian Lang. Process., 2015

A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

On the study of very low-resource language keyword search.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Distance metric learning for kernel density-based acoustic model under limited training data conditions.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages.
IEICE Trans. Inf. Syst., 2014

TANDEM-bottleneck feature combination using hierarchical Deep Neural Networks.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Kernel density-based acoustic model with cross-lingual bottleneck features for resource limited LVCSR.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
A study on LVCSR and keyword search for Tagalog.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Context-dependent phone mapping for LVCSR of under-resourced languages.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
Context dependant phone mapping for cross-lingual acoustic modeling.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

A Phone Mapping Technique for Acoustic Modeling of Under-Resourced Languages.
Proceedings of the 2012 International Conference on Asian Language Processing, 2012
