Jeong Hun Yeo

Orcid: 0009-0002-8135-6625

According to our database, Jeong Hun Yeo authored at least 12 papers between 2022 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of five.

Bibliography

2024
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model.
IEEE Trans. Multim., 2024

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language.
CoRR, 2024

Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units.
CoRR, 2024

Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Visual Speech Recognition for Languages with Limited Labeled Data Using Automatic Labels from Whisper.
Proceedings of the IEEE International Conference on Acoustics, 2024

Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-Training and Multi-Modal Tokens.
Proceedings of the IEEE International Conference on Acoustics, 2024

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model.
CoRR, 2023

Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-Temporal Lip-Audio Memory for Visual Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
