Ichiro Ide

Orcid: 0000-0003-3942-9296

  • Nagoya University, Japan
  • University of Tokyo, Japan (PhD 2000)

According to our database, Ichiro Ide authored at least 217 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.







Correction to: Computational measurement of perceived pointiness from pronunciation.
Multim. Tools Appl., March, 2024

Computational measurement of perceived pointiness from pronunciation.
Multim. Tools Appl., March, 2024

Cross-modal recipe retrieval based on unified text encoder with fine-grained contrastive learning.
Knowl. Based Syst., 2024

Action Selection Learning for Multi-label Multi-view Action Recognition.
CoRR, 2024

Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association.
CoRR, 2024

Category-Level Object Pose Estimation in Heavily Cluttered Scenes by Generalized Two-Stage Shape Reconstructor.
IEEE Access, 2024

Image-Collection Summarization Using Scene-Graph Generation With External Knowledge.
IEEE Access, 2024

Zero-Shot Pill-Prescription Matching With Graph Convolutional Network and Contrastive Learning.
IEEE Access, 2024

Interpolating the Text-to-Image Correspondence Based on Phonetic and Phonological Similarities for Nonword-to-Image Generation.
IEEE Access, 2024

Investigating Conceptual Blending of a Diffusion Model for Improving Nonword-to-Image Generation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-Scale and Action Label Features.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

R-DiP: Re-ranking Based Diffusion Pre-computation for Image Retrieval.
Proceedings of the Database and Expert Systems Applications, 2024

Lightweight Maize Disease Detection through Post-Training Quantization with Similarity Preservation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Introduction to special issue "SOICT 2022".
Informatica (Slovenia), 2023

IPA-CLIP: Integrating Phonetic Priors into Vision and Language Pretraining.
CoRR, 2023

An Approach to Generate a Caption for an Image Collection Using Scene Graph Generation.
IEEE Access, 2023

Discovering Phonesthemic Clusters in Readings of Kanji Characters toward Exploring Phonestheme in Japanese.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

Towards Achieving Lightweight Deep Neural Network for Precision Agriculture with Maize Disease Detection.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

MVA2023 Small Object Detection Challenge for Spotting Birds: Dataset, Methods, and Results.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Image Impression Estimation by Clustering People with Similar Tastes.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Small Object Detection for Birds with Swin Transformer.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Towards Captioning an Image Collection from a Combined Scene Graph Representation Approach.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

RecipeMeta: Metapath-enhanced Recipe Recommendation on Heterogeneous Recipe Network.
Proceedings of the ACM Multimedia Asia 2023, 2023

Nonword-to-Image Generation Considering Perceptual Association of Phonetically Similar Words.
Proceedings of the 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2023

Special issue on "The Eighth International Symposium on Information and Communication Technology - SoICT 2017".
J. Heuristics, 2022

A Novel Approach for Pill-Prescription Matching with GNN Assistance and Contrastive Learning.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Intuitive Gait Modeling using Mimetic-Words for Gait Description and Generation.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Action Semantic Alignment for Image Captioning.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Detection of Birds in a 3D Environment Referring to Audio-Visual Information.
Proceedings of the 18th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2022

Soft-Boundary Label Relaxation with class placement constraints for semantic segmentation of the railway environment.
Pattern Recognit. Lett., 2021

Imageability- and Length-Controllable Image Captioning.
IEEE Access, 2021

Tell as You Imagine: Sentence Imageability-Aware Image Captioning.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021

MMArt-ACM'21: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

A multi-modal dataset for analyzing the imageability of concepts across modalities.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Pointedness of an Image: Measuring How Pointy an Image is Perceived.
Proceedings of the HCI International 2021 - Posters - 23rd HCI International Conference, 2021

Estimating the imageability of words by mining visual characteristics from crawled image data.
Multim. Tools Appl., 2020

Introduction to Special Issue "SoICT 2019".
Informatica (Slovenia), 2020

Attribute-Aware Loss Function for Accurate Semantic Segmentation Considering the Pedestrian Orientations.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2020

SOANets: Encoder-decoder based Skeleton Orientation Alignment Network for White Cane User Recognition from 2D Human Skeleton Sequence.
Proceedings of the 15th International Joint Conference on Computer Vision, 2020

More-Natural Mimetic Words Generation for Fine-Grained Gait Description.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Browsing Visual Sentiment Datasets Using Psycholinguistic Groundings.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

Imageability Estimation using Visual and Language Features.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

CEA'20: The 12th Workshop on Multimedia for Cooking and Eating Activities.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

MMArt-ACM'20: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2020.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Modeling Eye-Gaze Behavior of Electric Wheelchair Drivers via Inverse Reinforcement Learning.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

Median-Shape Representation Learning for Category-Level Object Pose Estimation in Cluttered Environments.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Ω-GAN: Object Manifold Embedding GAN for Image Generation by Disentangling Parameters into Pose and Shape Manifolds.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

LFIR2Pose: Pose Estimation from an Extremely Low-resolution FIR image Sequence.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Occlusion-Aware Skeleton Trajectory Representation for Abnormal Behavior Detection.
Proceedings of the Frontiers of Computer Vision - 26th International Workshop, 2020

Estimating the visual variety of concepts by referring to Web popularity.
Multim. Tools Appl., 2019

ITS+DM Hackathon (ITSC 2017): Lane Departure Prediction With Naturalistic Driving Data.
IEEE Intell. Transp. Syst. Mag., 2019

Summarization of Multiple News Videos Considering the Consistency of Audio-Visual Contents.
Int. J. Semantic Comput., 2019

Estimation of the Attractiveness of Food Photography Based on Image Features.
IEICE Trans. Inf. Syst., 2019

IEICE Trans. Inf. Syst., 2019

Pedestrian Intensive Scanning for Active-scan LIDAR.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

Hard Negative Mining from in-Vehicle Camera Images based on Multiple Observations of Background Patterns.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

Next Viewpoint Recommendation by Pose Ambiguity Minimization for Accurate Object Pose Estimation.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

An Analysis of How Driver Experience Affects Eye-Gaze Behavior for Robotic Wheelchair Operation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Application of Data Augmentation for Accurate Attractiveness Estimation for Food Photography.
Proceedings of the 11th Workshop on Multimedia for Cooking and Eating Activities, 2019

Similar Seasonal-Geo-Region Mining Based on Visual Concepts in Social Media Photos.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Exemplar-Based Pseudo-Viewpoint Rotation for White-Cane User Recognition from a 2D Human Pose Sequence.
Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2019

Scene-Adaptive Driving Area Prediction Based on Automatic Label Acquisition from Driving Information.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

Semantic Segmentation of Railway Images Considering Temporal Continuity.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

Video Querying.
Proceedings of the Encyclopedia of Database Systems, Second Edition, 2018

Introduction to special issue SoICT 2017.
Informatica (Slovenia), 2018

Pedestrian Detectability Estimation Considering Visual Adaptation to Drastic Illumination Change.
IEICE Trans. Inf. Syst., 2018

Voting-based Hand-Waving Gesture Spotting from a Low-Resolution Far-Infrared Image Sequence.
Proceedings of the IEEE Visual Communications and Image Processing, 2018

Attribute-aware Semantic Segmentation of Road Scenes for Understanding Pedestrian Orientations.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Estimation of Driver's Insight for Safe Passing based on Pedestrian Attributes.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Analyzing Headlight Flicker Patterns for Improving the Pedestrian Detectability from a Driver.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Estimating the Scene-wise Reliability of LiDAR Pedestrian Detectors.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Gaze-Inspired Learning for Estimating the Attractiveness of a Food Photo.
Proceedings of the 2018 IEEE International Symposium on Multimedia, 2018

A study on the factors affecting the attractiveness of food photography.
Proceedings of the Joint Workshop on Multimedia for Cooking and Eating Activities and Multimedia Assisted Dietary Management, 2018

Localizing the Gaze Target of a Crowd of People.
Proceedings of the Computer Vision - ACCV 2018 Workshops, 2018

Regression of feature scale tracklets for decimeter visual localization.
Image Vis. Comput., 2017

Single Camera Vehicle Localization Using Feature Scale Tracklets.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Human Wearable Attribute Recognition Using Probability-Map-Based Decomposition of Thermal Infrared Images.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2017

Proposal of a spectral random dots marker using local feature for posture estimation.
Proceedings of the 2017 IEEE Virtual Reality, 2017

Can We Detect Pedestrians using Low-resolution LIDAR? - Integration of Multi-frame Point-clouds.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 5: VISAPP, Porto, Portugal, February 27, 2017

Wheelchair-user Detection Combined with Parts-based Tracking.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 5: VISAPP, Porto, Portugal, February 27, 2017

Deep Manifold Embedding for 3D Object Pose Estimation.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 5: VISAPP, Porto, Portugal, February 27, 2017

Can a Driver Assistance System Determine if a Driver is Perceiving a Pedestrian? - Consideration of the Driver's Visual Adaptation to Illumination Change.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 4: VISAPP, Porto, Portugal, February 27, 2017

Detection of Similar Geo-Regions Based on Visual Concepts in Social Photos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Monocular localization within sparse voxel maps.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Driver's decision analysis in terms of pedestrian attributes - A case study in passing by a pedestrian.
Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017

Summarization of News Videos Considering the Consistency of Auditory and Visual Contents.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Automatic Selection of Web Contents Towards Automatic Authoring of a Video Biography.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Estimation of the Attractiveness of Food Photography Focusing on Main Ingredients.
Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities in conjunction with The 2017 International Joint Conference on Artificial Intelligence, Melbourne, Australia, August 20, 2017

Toward Describing Human Gaits by Onomatopoeias.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Trajectory Ensemble: Multiple Persons Consensus Tracking Across Non-overlapping Multiple Cameras over Randomly Dropped Camera Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Action recognition from extremely low-resolution thermal image sequence.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

Reconstructing Face Image from the Thermal Infrared Spectrum to the Visible Spectrum.
Sensors, 2016

Using Super-Pixels and Human Probability Map for Automatic Human Subject Segmentation.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2016

Hand Waving Gesture Detection using a Far-infrared Sensor Array with Thermo-spatial Region of Interest.
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016

Image Transformation of Eye Areas for Synthesizing Eye-contacts in Video Conferencing.
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016

Parts Selective DPM for detection of pedestrians possessing an umbrella.
Proceedings of the 2016 IEEE Intelligent Vehicles Symposium, 2016

Misclassification tolerable learning for robust pedestrian orientation classification.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Moving camera background-subtraction for obstacle detection on railway tracks.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

A classification method of cooking operations based on eye movement patterns.
Proceedings of the Ninth Biennial ACM Symposium on Eye Tracking Research & Applications, 2016

A Study on Estimating the Attractiveness of Food Photography.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Recognition of Texting-While-Walking by Joint Features Based on Arm and Head Poses.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

Multi-Modal Scene Duplicate Detection from News Videos Focusing on Human Faces.
Int. J. Semantic Comput., 2015

Estimation of Human Orientation using Coaxial RGB-Depth Images.
Proceedings of the VISAPP 2015, 2015

Adaptive Reference Image Selection for Temporal Object Removal from Frontal In-vehicle Camera Image Sequences.
Proceedings of the VISAPP 2015, 2015

Generation of a Video Summary on a News Topic Based on SNS Responses to News Stories.
Proceedings of the Fourth International Workshop on Crowdsourcing for Multimedia, 2015

Pedestrian orientation classification utilizing single-chip coaxial RGB-ToF camera.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

Distant Pedestrian Re-detection from an In-vehicle Camera Based on Detections by Other Vehicles.
Proceedings of the IEEE 18th International Conference on Intelligent Transportation Systems, 2015

Typicality analysis of the combination of ingredients in a cooking recipe for assisting the arrangement of ingredients.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

Tastes and Textures Estimation of Foods Based on the Analysis of Its Ingredients List and Image.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2015 Workshops, 2015

Position Interpolation Using Feature Point Scale for Decimeter Visual Localization.
Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop, 2015

Detector ensemble based on false positive mining for pedestrian detection.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

Environment Adaptive Pedestrian Detection using In-vehicle Camera and GPS.
Proceedings of the VISAPP 2014, 2014

Exemplar-based Human Body Super-resolution for Surveillance Camera Systems.
Proceedings of the VISAPP 2014, 2014

Nagoya University at TRECVID 2014: the Instance Search Task.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Event Detection based on Twitter Enthusiasm Degree for Generating a Sports Highlight Video.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Estimation of the Representative Story Transition in a Chronological Semantic Structure of News Topics.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Single camera vehicle localization using SURF scale and dynamic time warping.
Proceedings of the 2014 IEEE Intelligent Vehicles Symposium Proceedings, 2014

Estimation of traffic sign visibility considering local and global features in a driving environment.
Proceedings of the 2014 IEEE Intelligent Vehicles Symposium Proceedings, 2014

Prediction of driver's pedestrian detectability by image processing adaptive to visual fields of view.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014

Scene Duplicate Detection from News Videos Using Image-Audio Matching Focusing on Human Faces.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Spatial People Density Estimation from Multiple Viewpoints by Memory Based Regression.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Vision-Based Vehicle Localization Using a Visual Street Map with Embedded SURF Scale.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Human Tracking Using a Far-Infrared Sensor Array and a Thermo-Spatial Sensitive Histogram.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

Cross-Pose Face Recognition - A Virtual View Generation Approach Using Clustering Based LVTM.
IEICE Trans. Inf. Syst., 2013

Segmentation of Human Instances Using Grab-cut and Active Shape Model Feedback.
Proceedings of the 13. IAPR International Conference on Machine Vision Applications, 2013

Detection of Biased Broadcast Sports Video Highlights by Attribute-Based Tweets Analysis.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Recompilation of Broadcast Videos Based on Real-World Scenarios.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Automatic authoring of a domestic cooking video based on the description of cooking instructions.
Proceedings of the 5th international workshop on Multimedia for cooking & eating activities, 2013

Pedestrian detection by scene dependent classifiers with generative learning.
Proceedings of the 2013 IEEE Intelligent Vehicles Symposium (IV), 2013

Construction of a traffic sign detector based on voting type co-training.
Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems, 2013

Speech Shot Extraction from Broadcast News Videos.
Int. J. Semantic Comput., 2012

Efficient Tracking of News Topics Based on Chronological Semantic Structures in a Large-Scale News Video Archive.
IEICE Trans. Inf. Syst., 2012

Real-time marker-free patient registration for electromagnetic navigated bronchoscopy: a phantom study.
Int. J. Comput. Assist. Radiol. Surg., 2012

Integration of Generative Learning and Multiple Pose Classifiers for Pedestrian Detection.
Proceedings of the VISAPP 2012, 2012

Overview of the ACM multimedia 2012 workshop on multimedia for cooking and eating activities (CEA'12).
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Smart VideoCooKing: a multimedia cooking recipe browsing application on portable devices.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Visibility estimation of traffic signals under rainy weather conditions for smart driving support.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Construction of a Local Attraction Map According to Social Visual Attention.
Proceedings of the Intelligent Interactive Multimedia: Systems and Services, 2012

Estimation of the human performance for pedestrian detectability based on visual search and motion features.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Robust Face Super-Resolution Using Free-Form Deformations for Low-Quality Surveillance Video.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Detection and Classification of Repetitious Human Motions Combining Shift Variant and Invariant Features.
Proceedings of the 2012 Third International Conference on Emerging Security Technologies, 2012

Subtraction-Based Forward Obstacle Detection Using Illumination Insensitive Feature for Driving-Support.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Virtual View Generation Using Clustering Based Local View Transition Model.
Proceedings of the Computer Vision - ACCV 2012 Workshops, 2012

Power-Efficient Hardware Architecture of K-Means Clustering With Bayesian-Information-Criterion Processor for Multimedia Processing Applications.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2011

On-road Obstacle Detection by Comparing Present and Past In-vehicle Camera Images.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011

Video CooKing: Towards the Synthesis of Multimedia Cooking Recipes.
Proceedings of the Advances in Multimedia Modeling, 2011

Scene segmentation of wedding party videos by scenario-based matching with example videos.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Why did the prime minister resign?: generation of event explanations from large news repositories.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

3-D line segment reconstruction using an in-vehicle camera for free space detection.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2011

Road image update using in-vehicle camera images and aerial image.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2011

Estimation of traffic sign visibility considering temporal environmental changes for smart driver assistance.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2011

Intelligent traffic sign detector: Adaptive learning based on online gathering of training samples.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2011

Detection of Inconsistency Between Subject and Speaker Based on the Co-occurrence of Lip Motion and Voice Towards Speech Scene Extraction from News Videos.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

Low Resolution QR-Code Recognition by Applying Super-Resolution Using the Property of QR-Codes.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

A Hilbert warping method for handwriting gesture recognition.
Pattern Recognit., 2010

A Multimodal Constellation Model for Object Image Classification.
EURASIP J. Image Video Process., 2010

PageRank with Text Similarity and Video Near-Duplicate Constraints for News Story Re-ranking.
Proceedings of the Advances in Multimedia Modeling, 2010

Estimation of traffic sign visibility toward smart driver assistance.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2010

Multimedia Supplementation to a Cooking Recipe Text for Facilitating Its Understanding to Inexperienced Users.
Proceedings of the 12th IEEE International Symposium on Multimedia, 2010

Removal of Moving Objects from a Street-View Image by Fusing Multiple Image Sequences.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Region-Based Image Transform for Transition Between Object Appearances.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Classification of Near-Duplicate Video Segments Based on Their Appearance Patterns.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Vehicle Ego-Localization by Matching In-Vehicle Camera Images to an Aerial Image.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Frontal Face Generation from Multiple Low-Resolution Non-frontal Faces for Face Recognition.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Improvement of a Traffic Sign Detector by Retrospective Gathering of Training Samples from In-Vehicle Camera Image Sequences.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Video Querying.
Proceedings of the Encyclopedia of Database Systems, 2009

Incremental Unsupervised-Learning of Appearance Manifold with View-Dependent Covariance Matrix for Face Recognition from Video Sequences.
IEICE Trans. Inf. Syst., 2009

News Topic Tracking and Re-ranking with Query Expansion Based on Near-Duplicate Detection.
Proceedings of the Advances in Multimedia Information Processing, 2009

Recognition of Road Markings from In-Vehicle Camera Images by a Generative Learning Method.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2009), 2009

A Multimodal Constellation Model for Object Category Recognition.
Proceedings of the Advances in Multimedia Modeling, 2009

Large-scale news topic tracking and key-scene ranking with video near-duplicate constraints.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Finding replaceable materials in cooking recipe texts considering characteristic cooking actions.
Proceedings of the ACM multimedia 2009 workshop on Multimedia for cooking and eating activities, 2009

Overview of the ACM multimedia 2009 workshop on multimedia for cooking and eating activities (CEA'09).
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Labeling News Topic Threads with Wikipedia Entries.
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009

Adaptive division of feature space for rapid detection of near-duplicate video segments.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Low-Resolution Character Recognition by Video-Based Super-Resolution.
Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009

Recognition of camera-captured low-quality characters using motion blur information.
Pattern Recognit., 2008

Construction of Appearance Manifold with Embedded View-Dependent Covariance Matrix for 3D Object Recognition.
IEICE Trans. Inf. Syst., 2008

Cross-Lingual Retrieval of Identical News Events by Near-Duplicate Video Segment Detection.
Proceedings of the Advances in Multimedia Modeling, 2008

Eigenspace interpolation for appearance-based object recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

A Hilbert warping method for camera-based finger-writing recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

A Hilbert Warping Algorithm for Recognizing Characters from Moving Camera.
Proceedings of the Eighth IAPR International Workshop on Document Analysis Systems, 2008

Generation of Training Data by Degradation Models for Traffic Sign Symbol Recognition.
IEICE Trans. Inf. Syst., 2007

Appearance Manifold with Embedded Covariance Matrix for Robust 3D Object Recognition.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2007), 2007

mediaWalker: a video archive explorer based on time-series semantic structure.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Genre-Adaptive Near-Duplicate Video Segment Detection.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Interpolation Between Eigenspaces Using Rotation in Multiple Dimensions.
Proceedings of the Computer Vision, 2007

Assembling personal speech collections by monologue scene detection from a news video archive.
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006

Planning of Multiple Camera Arrangement for Object Recognition in Parametric Eigenspace.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Change detection in streetscapes from GPS coordinated omni-directional image sequences.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Identification of degraded traffic sign symbols by a generative learning method.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Exploiting Topic Thread Structures in a News Video Archive for the Semi-Automatic Generation of Video Summaries.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Visibility Estimation in Foggy Conditions by In-Vehicle Camera and Radar.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Raindrop Detection from In-Vehicle Video Camera Images for Rainfall Judgment.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Spatiotemporal Density Feature Analysis to Detect Liver Cancer from Abdominal CT Angiography.
Proceedings of the Computer Vision, 2006

Associating semantically structured cooking videos with their preparation steps.
Syst. Comput. Jpn., 2005

Cooking navi: assistant for daily cooking in kitchen.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Automated Nomenclature of Bronchial Branches Extracted from CT Images and Its Application to Biopsy Path Planning in Virtual Bronchoscopy.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2005

trackThem: Exploring a Large-Scale News Video Archive by Tracking Human Relations.
Proceedings of the Information Retrieval Technology, 2005

Recognition of Very Low-Resolution Characters from Motion Images Captured by a Portable Digital Camera.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Key Image Extraction from a News Video Archive for Visualizing Its Semantic Structure.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Mining Large-Scale Broadcast Video Archives Towards Inter-video Structuring.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Multimedia Integration for Cooking Video Indexing.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Topic Threading for Structuring a Large-Scale News Video Archive.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

Compilation of dictionaries for semantic attribute analysis of television news captions.
Syst. Comput. Jpn., 2003

Threading news video topics.
Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003

Topic-based inter-video structuring of a large-scale news video corpus.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Associating Cooking Video Segments with Preparation Steps.
Proceedings of the Image and Video Retrieval, Second International Conference, 2003

An object detection method for describing soccer games from video.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

An automatic video indexing method based on shot classification.
Syst. Comput. Jpn., 2001

An attribute based news video indexing.
Proceedings of the 2001 ACM workshops on Multimedia: multimedia information retrieval, Ottawa, ON, Canada, September 30, 2001

Scene identification in news video by character region segmentation.
Proceedings of the ACM Multimedia 2000 Workshops, Los Angeles, CA, USA, October 30, 2000

Associating cooking video with related textbook.
Proceedings of the ACM Multimedia 2000 Workshops, Los Angeles, CA, USA, October 30, 2000

Structural analysis of cooking preparation steps in Japanese.
Proceedings of the Fifth International Workshop on Information Retrieval with Asian Languages, 2000, Hong Kong, China, September 30, 2000

Associating video with related documents.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Relating Graphical Features with Concept Classes for Automatic News Video Indexing.
Proceedings of the IJCAI-99 Workshop on Intelligent Information Integration, 1999

Automatic Video Indexing Based on Shot Classification.
Proceedings of the Advanced Multimedia Content Processing, First International Conference, 1998
