Taichi Nishimura

ORCID: 0000-0001-8725-7164

According to our database, Taichi Nishimura authored at least 22 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts.
CoRR, 2024

Language-based Audio Moment Retrieval.
CoRR, 2024

DETECLAP: Enhancing Audio-Visual Representation Learning with Object Information.
CoRR, 2024

Lighthouse: A User-Friendly Library for Reproducible Video Moment Retrieval and Highlight Detection.
CoRR, 2024

BioVL-QR: Egocentric Biochemical Video-and-Language Dataset Using Micro QR Codes.
CoRR, 2024

Text-driven Affordance Learning from Egocentric Vision.
CoRR, 2024

On the Audio Hallucinations in Large Audio-Video Language Models.
CoRR, 2024

Automatic Construction of a Large-Scale Corpus for Geoparsing Using Wikipedia Hyperlinks.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
State-aware video procedural captioning.
Multim. Tools Appl., October, 2023

Large-scale Vision-Language Models Learn Super Images for Efficient and High-Performance Partially Relevant Video Retrieval.
CoRR, 2023

Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos.
CoRR, 2023

2022
Recipe Generation from Unsegmented Cooking Videos.
CoRR, 2022

Recipe Recommendation for Balancing Ingredient Preference and Daily Nutrients.
Proceedings of the CEA++@MM 2022: Proceedings of the 1st International Workshop on Multimedia for Cooking, 2022

Multimodal Dish Pairing: Predicting Side Dishes to Serve with a Main Dish.
Proceedings of the CEA++@MM 2022: Proceedings of the 1st International Workshop on Multimedia for Cooking, 2022

Image Description Dataset for Language Learners.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Cross-modal Representation Learning for Understanding Manufacturing Procedure.
Proceedings of the Cross-Cultural Design. Applications in Learning, Arts, Cultural Heritage, Creative Industries, and Virtual Reality, 2022

Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Structure-Aware Procedural Text Generation From an Image Sequence.
IEEE Access, 2021

Egocentric Biochemical Video-and-Language Dataset.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

2020
Visual Grounding Annotation of Recipe Flow Graph.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Procedural Text Generation from a Photo Sequence.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

Frame Selection for Producing Recipe with Pictures from an Execution Video of a Recipe.
Proceedings of the 11th Workshop on Multimedia for Cooking and Eating Activities, 2019

