Mana Ihori
According to our database1,
Mana Ihori
authored at least 33 papers
between 2019 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Retrieval, Masking, and Generation: Feedback Comment Generation using Masked Comment Examples.
Proceedings of the 16th International Natural Language Generation Conference, 2023
Leveraging Language Embeddings for Cross-Lingual Self-Supervised Speech Representation Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023
Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning.
Proceedings of the 31st European Signal Processing Conference, 2023
2022
Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
2021
Large-Context Conversational Representation Learning: Self-Supervised Learning For Conversational Documents.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks Using Switching Tokens.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model.
Proceedings of the 13th International Conference on Natural Language Generation, 2020
Sequence-Level Consistency Training for Semi-Supervised End-to-End Automatic Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Unsupervised Domain Adversarial Training in Angular Space for Facial Expression Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
Generalized Large-Context Language Models Based on Forward-Backward Hierarchical Recurrent Encoder-Decoder Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Improving Speech-Based End-of-Turn Detection Via Cross-Modal Representation Learning with Punctuated Text Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019