Ryota Tanaka

Orcid: 0009-0005-1958-4442

According to our database1, Ryota Tanaka authored at least 13 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Empirical Analysis of Large Vision-Language Models against Goal Hijacking via Visual Prompt Injection.
CoRR, 2024

3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach.
Proceedings of the 7th ACM International Workshop on Multimedia Content Analysis in Sports, 2024

Pseudo-label based unsupervised fine-tuning of a monocular 3D pose estimation model for sports motions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

How Well Do Vision Models Encode Diagram Attributes?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from a Monocular Camera and IMUs.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from Inertial Sensors.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2021
VisualMRC: Machine Reading Comprehension on Document Images.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Context and knowledge aware conversational model and system combination for grounded response generation.
Comput. Speech Lang., 2020

Fact-based Dialogue Generation with Convergent and Divergent Decoding.
CoRR, 2020

Different Modal Stereo: Simultaneous Estimation of Stereo Image Disparity and Modality Translation.
Proceedings of the 15th International Joint Conference on Computer Vision, 2020

2019
An Ensemble Dialogue System for Facts-Based Sentence Generation.
CoRR, 2019


  Loading...