Viet Anh Trinh

Orcid: 0000-0002-1660-6627

According to our database, Viet Anh Trinh authored at least 19 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy?
CoRR, 2024

Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing.
CoRR, 2024

Automatic Speech Recognition Tuned for Child Speech in the Classroom.
Proceedings of the IEEE International Conference on Acoustics, 2024

Tracking Classroom Movement Patterns with Person Re-ID.
Proceedings of the 17th International Conference on Educational Data Mining, 2024

2023
Adaptive Endpointing with Deep Contextual Multi-Armed Bandits.
Proceedings of the IEEE International Conference on Acoustics, 2023

Towards Accurate and Real-Time End-of-Speech Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Two-Pass Endpoint Detection for Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Identifying, Evaluating and Applying Importance Maps for Speech.
PhD thesis, 2022

Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ImportantAug: A Data Augmentation Agent for Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

Unsupervised Speech Enhancement with Speech Recognition Embedding and Disentanglement Losses.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Directly Comparing the Listening Strategies of Humans and Machines.
IEEE/ACM Trans. Audio Speech Lang. Process., 2021

New Dataset and Strong Baselines for the Grammatical Error Correction of Russian.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Combining Spatial Clustering with LSTM Speech Models for Multichannel Speech Enhancement.
CoRR, 2020

Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks.
CoRR, 2020

Enhancement of Spatial Clustering-Based Time-Frequency Masks using LSTM Neural Networks.
CoRR, 2020

Large Scale Evaluation of Importance Maps in Automatic Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018
Bubble Cooperative Networks for Identifying Important Speech Cues.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Concatenative Resynthesis with Improved Training Signals for Speech Enhancement.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

