Viet Anh Trinh

Orcid: 0000-0002-1660-6627

According to our database¹, Viet Anh Trinh authored at least 19 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

2018

2019

2020

2021

2022

2023

2024

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy?

[BibT_eX]

[DOI]

CoRR, 2024

Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing.

[BibT_eX]

[DOI]

CoRR, 2024

Automatic Speech Recognition Tuned for Child Speech in the Classroom.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

Tracking Classroom Movement Patterns with Person Re-ID.

[BibT_eX]

[DOI]

Proceedings of the 17th International Conference on Educational Data Mining, 2024

2023

Adaptive Endpointing with Deep Contextual Multi-Armed Bandits.

[BibT_eX]

[DOI]

Venkatesh Ravichandran

Viet Anh Trinh

Proceedings of the IEEE International Conference on Acoustics, 2023

Towards Accurate and Real-Time End-of-Speech Estimation.

[BibT_eX]

[DOI]

Venkatesh Ravichandran

Proceedings of the IEEE International Conference on Acoustics, 2023

Two-Pass Endpoint Detection for Speech Recognition.

[BibT_eX]

[DOI]

Venkatesh Ravichandran

Roland Maas

Ariya Rastrow

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Identifying, Evaluating and Applying Importance Maps for Speech.

[BibT_eX]

[DOI]

Viet Anh Trinh

PhD thesis, 2022

Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Importantaug: A Data Augmentation Agent for Speech.

[BibT_eX]

[DOI]

Viet Anh Trinh

Hassan Salami Kavaki

Michael I. Mandel

Proceedings of the IEEE International Conference on Acoustics, 2022

Unsupervised Speech Enhancement with Speech Recognition Embedding and Disentanglement Losses.

[BibT_eX]

[DOI]

Viet Anh Trinh

Sebastian Braun

Proceedings of the IEEE International Conference on Acoustics, 2022

2021

Directly Comparing the Listening Strategies of Humans and Machines.

[BibT_eX]

[DOI]

Viet Anh Trinh

Michael I. Mandel

IEEE ACM Trans. Audio Speech Lang. Process., 2021

New Dataset and Strong Baselines for the Grammatical Error Correction of Russian.

[BibT_eX]

[DOI]

Viet Anh Trinh

Alla Rozovskaya

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement.

[BibT_eX]

[DOI]

CoRR, 2020

Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks.

[BibT_eX]

[DOI]

CoRR, 2020

Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks.

[BibT_eX]

[DOI]

CoRR, 2020

Large Scale Evaluation of Importance Maps in Automatic Speech Recognition.

[BibT_eX]

[DOI]

Viet Anh Trinh

Michael I. Mandel

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018

Bubble Cooperative Networks for Identifying Important Speech Cues.

[BibT_eX]

[DOI]

Viet Anh Trinh

Brian McFee

Michael I. Mandel

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Concatenative Resynthesis with Improved Training Signals for Speech Enhancement.

[BibT_eX]

[DOI]

Ali Raza Syed

Viet Anh Trinh

Michael I. Mandel

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Viet Anh Trinh

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...