Shota Orihashi

According to our database1, Shota Orihashi authored at least 29 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Born-Again Multi-task Self-training for Multi-task Facial Emotion Recognition.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

2023
Open-Set Recognition for Facial-Expression Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2023

Distilling Knowledge of Bidirectional Language Model for Scene Text Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2023

Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Audio Visual Scene-Aware Dialog Generation with Transformer-based Video Representations.
CoRR, 2022

Interactive Co-Learning with Cross-Modal Transformer for Audio-Visual Emotion Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Fully Shareable Scene Text Recognition Modeling for Horizontal and Vertical Writing.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
GAN-Based Image Compression Using Mutual Information for Optimizing Subjective Image Similarity.
IEICE Trans. Inf. Syst., 2021

Large-Context Conversational Representation Learning: Self-Supervised Learning For Conversational Documents.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Enrollment-Less Training for Personalized Voice Activity Detection.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks Using Switching Tokens.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Hierarchical Transformer-Based Large-Context End-To-End ASR with Large-Context Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Audio-Visual Speech Separation Using Cross-Modal Correspondence Loss.
Proceedings of the IEEE International Conference on Acoustics, 2021

MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training.
Proceedings of the IEEE International Conference on Acoustics, 2021

Hierarchical Knowledge Distillation for Dialogue Sequence Labeling.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model.
Proceedings of the 13th International Conference on Natural Language Generation, 2020

Unsupervised Domain Adversarial Training in Angular Space for Facial Expression Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Subjective Quality Driven Image Encoding Method Using Image Completion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
GAN-based Image Compression Using Mutual Information Maximizing Regularization.
Proceedings of the Picture Coding Symposium, 2019

2016
Improvement of H.265/HEVC encoding for 8K UHDTV by detecting motion complexity.
Proceedings of the IEEE International Conference on Consumer Electronics, 2016

2015
An Adaptive H.265/HEVC Encoding Control for 8K UHDTV Movies Based on Motion Complexity Estimation.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Improvement of 8K UHDTV picture quality for H.265/HEVC by global zoom estimation.
Proceedings of the IEEE International Conference on Consumer Electronics, 2015


  Loading...