Yasutomo Kawanishi

Orcid: 0000-0002-3799-4550

According to our database, Yasutomo Kawanishi authored at least 115 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Machine Learning-Based Interpretable Modeling for Subjective Emotional Dynamics Sensing Using Facial EMG.
Sensors, March, 2024

Correction to: Computational measurement of perceived pointiness from pronunciation.
Multim. Tools Appl., March, 2024

Computational measurement of perceived pointiness from pronunciation.
Multim. Tools Appl., March, 2024

Action Selection Learning for Multi-label Multi-view Action Recognition.
CoRR, 2024

Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association.
CoRR, 2024

J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution.
CoRR, 2024

Multi-View Video-Based Learning: Leveraging Weak Labels for Frame-Level Perception.
CoRR, 2024

Category-Level Object Pose Estimation in Heavily Cluttered Scenes by Generalized Two-Stage Shape Reconstructor.
IEEE Access, 2024

Relationship-Aware Unknown Object Detection for Open-Set Scene Graph Generation.
IEEE Access, 2024

Image-Collection Summarization Using Scene-Graph Generation With External Knowledge.
IEEE Access, 2024

Zero-Shot Pill-Prescription Matching With Graph Convolutional Network and Contrastive Learning.
IEEE Access, 2024

Subjective Baggage-Weight Estimation Based on Human Walking Behavior.
IEEE Access, 2024

Interpolating the Text-to-Image Correspondence Based on Phonetic and Phonological Similarities for Nonword-to-Image Generation.
IEEE Access, 2024

Pedestrian's Gaze Object Detection in Traffic Scene.
Proceedings of the 19th International Joint Conference on Computer Vision, 2024

Frame-Level Latent Embedding Using Weak Labels for Multi-View Action Recognition.
Proceedings of the 7th IEEE International Conference on Multimedia Information Processing and Retrieval, 2024

One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-Scale and Action Label Features.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Future Pose Prediction from 3D Human Skeleton Sequence with Surrounding Situation.
Sensors, January, 2023

Progressive Learning of a Multimodal Classifier Accounting for Different Modality Combinations.
Sensors, 2023

DeePoint: Pointing Recognition and Direction Estimation From A Fixed View.
CoRR, 2023

IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining.
CoRR, 2023

An Approach to Generate a Caption for an Image Collection Using Scene Graph Generation.
IEEE Access, 2023

End-to-End Gaze Grounding of a Person Pictured from Behind.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

Subjective Baggage-Weight Estimation from Gait: Can You Estimate How Heavy the Person Feels?
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

MVA2023 Small Object Detection Challenge for Spotting Birds: Dataset, Methods, and Results.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Image Impression Estimation by Clustering People with Similar Tastes.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Combining Knowledge Distillation and Transfer Learning for Sensor Fusion in Visible and Thermal Camera-based Person Classification.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Small Object Detection for Birds with Swin Transformer.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Human Pose Prediction by Progressive Generation in Multi-scale Frequency Domain.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Multimodal Cascaded Framework with Metric Learning Robust to Missing Modalities for Person Classification.
Proceedings of the 14th Conference on ACM Multimedia Systems, 2023

Towards Captioning an Image Collection from a Combined Scene Graph Representation Approach.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Audio-Visual Sensor Fusion Framework Using Person Attributes Robust to Missing Visual Modality for Person Recognition.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Operative Action Captioning for Estimating System Actions.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

DeePoint: Visual Pointing Recognition and Direction Estimation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
SDOF-Tracker: Fast and Accurate Multiple Human Tracking by Skipped-Detection and Optical-Flow.
IEICE Trans. Inf. Syst., November, 2022

What Should the System Do Next?: Operative Action Captioning for Estimating System Actions.
CoRR, 2022

Towards Open-Set Scene Graph Generation With Unknown Objects.
IEEE Access, 2022

A Multimodal Sensor Fusion Framework Robust to Missing Modalities for Person Recognition.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

Intuitive Gait Modeling using Mimetic-Words for Gait Description and Generation.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Detection of distant eye-contact using spatio-temporal pedestrian skeletons.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022

Label-based Multiple Object Ensemble Tracking with Randomized Frame Dropping.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Audio and Video-based Emotion Recognition using Multimodal Transformers.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Toward Surroundings-Aware Temporal Prediction of 3D Human Skeleton Sequence.
Proceedings of the Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges, 2022

Butsukusa: A Conversational Mobile Robot Describing Its Own Observations and Internal States.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2022

Detection of Birds in a 3D Environment Referring to Audio-Visual Information.
Proceedings of the 18th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2022

2021
Soft-Boundary Label Relaxation with class placement constraints for semantic segmentation of the railway environment.
Pattern Recognit. Lett., 2021

Aggregating Everyday Outfits by Incremental Clustering With Interactive User Adaptation.
IEEE Access, 2021

Imageability- and Length-Controllable Image Captioning.
IEEE Access, 2021

Tell as You Imagine: Sentence Imageability-Aware Image Captioning.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

Pointedness of an Image: Measuring How Pointy an Image is Perceived.
Proceedings of the HCI International 2021 - Posters - 23rd HCI International Conference, 2021

2020
Estimating the imageability of words by mining visual characteristics from crawled image data.
Multim. Tools Appl., 2020

Multiple Human Tracking Using an Omnidirectional Camera with Local Rectification and World Coordinates Representation.
IEICE Trans. Inf. Syst., 2020

Attribute-Aware Loss Function for Accurate Semantic Segmentation Considering the Pedestrian Orientations.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2020

SOANets: Encoder-decoder based Skeleton Orientation Alignment Network for White Cane User Recognition from 2D Human Skeleton Sequence.
Proceedings of the 15th International Joint Conference on Computer Vision, 2020

More-Natural Mimetic Words Generation for Fine-Grained Gait Description.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Browsing Visual Sentiment Datasets Using Psycholinguistic Groundings.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Imageability Estimation using Visual and Language Features.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Modeling Eye-Gaze Behavior of Electric Wheelchair Drivers via Inverse Reinforcement Learning.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

Median-Shape Representation Learning for Category-Level Object Pose Estimation in Cluttered Environments.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Ω-GAN: Object Manifold Embedding GAN for Image Generation by Disentangling Parameters into Pose and Shape Manifolds.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

LFIR2Pose: Pose Estimation from an Extremely Low-resolution FIR image Sequence.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Occlusion-Aware Skeleton Trajectory Representation for Abnormal Behavior Detection.
Proceedings of the Frontiers of Computer Vision - 26th International Workshop, 2020

2019
Estimating the visual variety of concepts by referring to Web popularity.
Multim. Tools Appl., 2019

Summarization of Multiple News Videos Considering the Consistency of Audio-Visual Contents.
Int. J. Semantic Comput., 2019

Estimation of the Attractiveness of Food Photography Based on Image Features.
IEICE Trans. Inf. Syst., 2019

Multiple Human Tracking using Multi-Cues including Primitive Action Features.
CoRR, 2019

Pedestrian Intensive Scanning for Active-scan LIDAR.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

Hard Negative Mining from in-Vehicle Camera Images based on Multiple Observations of Background Patterns.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

Next Viewpoint Recommendation by Pose Ambiguity Minimization for Accurate Object Pose Estimation.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

An Analysis of How Driver Experience Affects Eye-Gaze Behavior for Robotic Wheelchair Operation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Similar Seasonal-Geo-Region Mining Based on Visual Concepts in Social Media Photos.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Exemplar-Based Pseudo-Viewpoint Rotation for White-Cane User Recognition from a 2D Human Pose Sequence.
Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2019

Scene-Adaptive Driving Area Prediction Based on Automatic Label Acquisition from Driving Information.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

Semantic Segmentation of Railway Images Considering Temporal Continuity.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

2018
Pedestrian Detectability Estimation Considering Visual Adaptation to Drastic Illumination Change.
IEICE Trans. Inf. Syst., 2018

Voting-based Hand-Waving Gesture Spotting from a Low-Resolution Far-Infrared Image Sequence.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Attribute-aware Semantic Segmentation of Road Scenes for Understanding Pedestrian Orientations.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Estimation of Driver's Insight for Safe Passing based on Pedestrian Attributes.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Analyzing Headlight Flicker Patterns for Improving the Pedestrian Detectability from a Driver.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Estimating the Scene-wise Reliability of LiDAR Pedestrian Detectors.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Gaze-Inspired Learning for Estimating the Attractiveness of a Food Photo.
Proceedings of the 2018 IEEE International Symposium on Multimedia, 2018

Which Content in a Booklet is he/she Reading? Reading Content Estimation using an Indoor Surveillance Camera.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Localizing the Gaze Target of a Crowd of People.
Proceedings of the Computer Vision - ACCV 2018 Workshops, 2018

2017
Regression of feature scale tracklets for decimeter visual localization.
Image Vis. Comput., 2017

Human Wearable Attribute Recognition Using Probability-Map-Based Decomposition of Thermal Infrared Images.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Can We Detect Pedestrians using Low-resolution LIDAR? - Integration of Multi-frame Point-clouds.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 5: VISAPP, Porto, Portugal, February 27, 2017

Wheelchair-user Detection Combined with Parts-based Tracking.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 5: VISAPP, Porto, Portugal, February 27, 2017

Deep Manifold Embedding for 3D Object Pose Estimation.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 5: VISAPP, Porto, Portugal, February 27, 2017

Can a Driver Assistance System Determine if a Driver is Perceiving a Pedestrian? - Consideration of the Driver's Visual Adaptation to Illumination Change.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 4: VISAPP, Porto, Portugal, February 27, 2017

Detection of Similar Geo-Regions Based on Visual Concepts in Social Photos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Monocular localization within sparse voxel maps.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Driver's decision analysis in terms of pedestrian attributes - A case study in passing by a pedestrian.
Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017

Summarization of News Videos Considering the Consistency of Auditory and Visual Contents.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Automatic Selection of Web Contents Towards Automatic Authoring of a Video Biography.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Estimation of the Attractiveness of Food Photography Focusing on Main Ingredients.
Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities in conjunction with The 2017 International Joint Conference on Artificial Intelligence, Melbourne, Australia, August 20, 2017

Toward Describing Human Gaits by Onomatopoeias.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Trajectory Ensemble: Multiple Persons Consensus Tracking Across Non-overlapping Multiple Cameras over Randomly Dropped Camera Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Action recognition from extremely low-resolution thermal image sequence.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

2016
Hand Waving Gesture Detection using a Far-infrared Sensor Array with Thermo-spatial Region of Interest.
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016

Image Transformation of Eye Areas for Synthesizing Eye-contacts in Video Conferencing.
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016

Parts Selective DPM for detection of pedestrians possessing an umbrella.
Proceedings of the 2016 IEEE Intelligent Vehicles Symposium, 2016

Misclassification tolerable learning for robust pedestrian orientation classification.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Moving camera background-subtraction for obstacle detection on railway tracks.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

A classification method of cooking operations based on eye movement patterns.
Proceedings of the Ninth Biennial ACM Symposium on Eye Tracking Research & Applications, 2016

A Study on Estimating the Attractiveness of Food Photography.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Recognition of Texting-While-Walking by Joint Features Based on Arm and Head Poses.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Pedestrian orientation classification utilizing single-chip coaxial RGB-ToF camera.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

Distant Pedestrian Re-detection from an In-vehicle Camera Based on Detections by Other Vehicles.
Proceedings of the IEEE 18th International Conference on Intelligent Transportation Systems, 2015

Detector ensemble based on false positive mining for pedestrian detection.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Tracking Pedestrians Across Multiple Cameras via Partial Relaxation of Spatio-Temporal Constraint and Utilization of Route Cue.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Riemannian Set-level Common-Near-Neighbor Analysis for Multiple-shot Person Re-identification.
Proceedings of the 13th IAPR International Conference on Machine Vision Applications, 2013

2010
Privacy-Protected Camera for the Sensing Web.
Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems. Applications, 2010

2009
Background Estimation Based on Device Pixel Structures for Silhouette Extraction.
Proceedings of the Computer Vision, 2009

