Van Hai Do

Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

2020

Agent/Client Speech Identification for Mixed-Channel Conversation in Customer Service Call Centers.

[BibT_eX]

[DOI]

Van Tuan Mai

Proceedings of the International Conference on Asian Language Processing, 2020

2018

Multitask Learning for Phone Recognition of Underresourced Languages Using Mismatched Transcription.

[BibT_eX]

[DOI]

Mark A. Hasegawa-Johnson

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Symposium on Information and Communication Technology, 2018

2017

Development of a Vietnamese speech recognition system for Viettel call center.

[BibT_eX]

[DOI]

Proceedings of the 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, 2017

Multi-Task Learning Using Mismatched Transcription for Under-Resourced Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory.

[BibT_eX]

[DOI]

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Mismatched crowdsourcing: Mining latent skills to acquire speech transcriptions.

[BibT_eX]

[DOI]

Preethi Jyothi

Wenda Chen

Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016

A many-to-one phone mapping approach for cross-lingual speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE RIVF International Conference on Computing & Communication Technologies, 2016

Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Approximate search of audio queries by using DTW with phone time boundary and data augmentation.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Exemplar-inspired strategies for low-resource spoken keyword search in Swahili.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Speech recognition of under-resourced languages using mismatched transcriptions.

[BibT_eX]

[DOI]

Proceedings of the 2016 International Conference on Asian Language Processing, 2016

Improving Efficiency of Sentence Boundary Detection by Feature Selection.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Information and Database Systems - 8th Asian Conference, 2016

2015

Acoustic modeling for speech recognition under limited training data conditions

[BibT_eX]

[DOI]

PhD thesis, 2015

Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages.

[BibT_eX]

[DOI]

Int. J. Asian Lang. Process., 2015

A comparative study of BNF and DNN multilingual training on cross-lingual low-resource speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

On the study of very low-resource language keyword search.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Distance metric learning for kernel density-based acoustic model under limited training data conditions.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 2014

TANDEM-bottleneck feature combination using hierarchical Deep Neural Networks.

[BibT_eX]

[DOI]

Mirco Ravanelli

Adam Janin

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Kernel density-based acoustic model with cross-lingual bottleneck features for resource limited LVCSR.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013

A study on LVCSR and keyword search for tagalog.

[BibT_eX]

[DOI]

Korbinian Riedhammer