Keiji Yanai

Orcid: 0000-0002-0431-183X

According to our database1, Keiji Yanai authored at least 211 papers between 1998 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Multi-Style Shape Matching GAN for Text Images.
IEICE Trans. Inf. Syst., 2024

CLIPFontDraw: Stylizing Fonts With CLIP.
IEEE Access, 2024

Training-Free Region Prediction with Stable Diffusion.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

HOI as Embeddings: Advancements of Model Representation Capability in Human-Object Interaction Detection.
Proceedings of the 7th IEEE International Conference on Multimedia Information Processing and Retrieval, 2024

RecipeSD: Injecting Recipe into Food Image Synthesis with Stable Diffusion.
Proceedings of the 2nd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2024

Font Style Translation in Scene Text Images with CLIPstyler.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Act-ChatGPT: Introducing Action Features into Multi-modal Large Language Models for Video Understanding.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Improving Cross-Modal Recipe Embeddings with Cross Decoder.
Proceedings of the Fifth Workshop on Intelligent Cross-Data Analysis and Retrieval, 2024

2023
Focusing on what to decode and what to train: Efficient Training with HOI Split Decoders and Specific Target Guided DeNoising.
CoRR, 2023

QAHOI: Query-Based Anchors for Human-Object Interaction Detection.
Proceedings of the 18th International Conference on Machine Vision and Applications, 2023

Transformer-Based Cross-Modal Recipe Embeddings with Large Batch Training.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Virtual Try-On Considering Temporal Consistency for Videoconferencing.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Contextual Associated Triplet Queries for Panoptic Scene Graph Generation.
Proceedings of the ACM Multimedia Asia 2023, 2023

VQ-VDM: Video Diffusion Models with 3D VQGAN.
Proceedings of the ACM Multimedia Asia 2023, 2023

Mask-based Food Image Synthesis with Cross-Modal Recipe Embeddings.
Proceedings of the ACM Multimedia Asia 2023, 2023

MADiMa '23: 8th International Workshop on Multimedia Assisted Dietary Management.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CalorieCam360: Simultaneous Eating Action Recognition of Multiple People Using an Omnidirectional Camera.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

HowToEat: Exploring Human Object Interaction and Eating Action in Eating Scenarios.
Proceedings of the 8th International Workshop on Multimedia Assisted Dietary Management, 2023

2022
FASSD-Net: Fast and Accurate Real-Time Semantic Segmentation for Embedded Systems.
IEEE Trans. Intell. Transp. Syst., 2022

Material Translation Based on Neural Style Transfer with Ideal Style Image Retrieval.
Sensors, 2022

Zero-Shot Font Style Transfer with a Differentiable Renderer.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

Parallel Queries for Human-Object Interaction Detection.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

Text-based Image Editing for Food Images with CLIP.
Proceedings of the 7th International Workshop on Multimedia Assisted Dietary Management, 2022

Real Scale Hungry Networks: Real Scale 3D Reconstruction of a Dish and a Plate using Implicit Function and a Single RGB-D Image.
Proceedings of the 7th International Workshop on Multimedia Assisted Dietary Management, 2022

MADiMa'22: 7th International Workshop on Multimedia Assisted Dietary Management.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SetMealAsYouLike: Sketch-based Set Meal Image Synthesis with Plate Annotations.
Proceedings of the 7th International Workshop on Multimedia Assisted Dietary Management, 2022

DepthGrillCam: A Mobile Application for Real-time Eating Action Recording Using RGB-D Images.
Proceedings of the 7th International Workshop on Multimedia Assisted Dietary Management, 2022

Unseen Food Segmentation.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Continual Learning in Vision Transformer.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

StyleGAN-based CLIP-guided Image Shape Manipulation.
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022

2021
Cross-Modal Recipe Embeddings by Disentangling Recipe Contents and Dish Styles.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

3D Mesh Reconstruction of Foods from a Single Image.
Proceedings of the AI & Food'21: Proceedings of the 3rd Workshop on AIxFood, 2021

Ketchup GAN: A New Dataset for Realistic Synthesis of Letters on Food.
Proceedings of the MMArt-ACM '21: Proceedings of the 2021 International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021, 2021

Multi-Style Transfer Generative Adversarial Network for Text Images.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Pop'n Food: 3D Food Model Estimation System from a Single Image.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Region-Based Food Calorie Estimation for Multiple-Dish Meals.
Proceedings of the CEA '21: Proceedings of the 13th International Workshop on Multimedia for Cooking and Eating Activities, 2021

Few-Shot and Zero-Shot Semantic Segmentation for Food Images.
Proceedings of the CEA '21: Proceedings of the 13th International Workshop on Multimedia for Cooking and Eating Activities, 2021

Ketchup As You Like: Drawing Editor for Foods.
Proceedings of the IEEE International Conference on Artificial Intelligence and Virtual Reality, 2021

Pose Sequence Generation with a GCN and an Initial Pose Generator.
Proceedings of the Pattern Recognition - 6th Asian Conference, 2021

2020
Weakly supervised semantic segmentation using distinct class specific saliency maps.
Comput. Vis. Image Underst., 2020

CalorieCaptorGlass: Food Calorie Estimation based on Actual Size using HoloLens and Deep Learning.
Proceedings of the 2020 IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2020

UEC at TRECVID 2020: INS and ActEV.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Hungry networks: 3D mesh reconstruction of a dish and a plate from a single dish image for estimating food volume.
Proceedings of the MMAsia 2020: ACM Multimedia Asia, 2020

Iconify: Converting Photographs into Icons.
Proceedings of the 2020 Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia, 2020

Style Image Retrieval for Improving Material Translation Using Neural Style Transfer.
Proceedings of the 2020 Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia, 2020

Food Image Generation and Translation and Its Application to Augmented Reality.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Training of Multiple and Mixed Tasks with a Single Network Using Feature Modulation.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Fast and Accurate Real-Time Semantic Segmentation with Dilated Asymmetric Convolutions.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

UEC-FoodPix Complete: A Large-Scale Food Image Segmentation Dataset.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Rescue Dog Action Recognition by Integrating Ego-Centric Video, Sound and Sensor Information.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Mask-based Style-Controlled Image Synthesis Using a Mask Style Encoder.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

IPN Hand: A Video Dataset and Benchmark for Real-Time Continuous Hand Gesture Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Weakly-Supervised Plate And Food Region Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019
Editorial for the ICMR 2018 special issue.
Int. J. Multim. Inf. Retr., 2019

Webly-Supervised Food Detection with Foodness Proposal.
IEICE Trans. Inf. Syst., 2019

Simultaneous Estimation of Dish Locations and Calories with Multi-Task Learning.
IEICE Trans. Inf. Syst., 2019

Enchanting Your Noodles: A Gustatory Manipulation Interface by Using GAN-based Real-time Food-to-Food Translation.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2019

Enchanting Your Noodles: GAN-based Real-time Food-to-Food Translation and Its Impact on Vision-induced Gustatory Manipulation.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces, 2019

MADiMA'19: 5th International Workshop on Multimedia Assisted Dietary Management.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Unseen Food Creation by Mixing Existing Food Images with Conditional StyleGAN.
Proceedings of the 5th International Workshop on Multimedia Assisted Dietary Management, 2019

A New Large-scale Food Image Segmentation Dataset and Its Application to Food Calorie Estimation Based on Grains of Rice.
Proceedings of the 5th International Workshop on Multimedia Assisted Dietary Management, 2019

Ramen as You Like: Sketch-based Food Image Generation and Editing.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

DepthCalorieCam: A Mobile Application for Volume-Based FoodCalorie Estimation using Depth Cameras.
Proceedings of the 5th International Workshop on Multimedia Assisted Dietary Management, 2019

A Large-Scale Analysis of Regional Tendency of Twitter Photos Using Only Image Features.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Image-Based Estimation of Real Food Size for Accurate Food Calorie Estimation.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Partial Image Texture Translation Using Weakly-Supervised Semantic Segmentation.
Proceedings of the New Frontiers in Artificial Intelligence, 2019

DeepTaste: Augmented Reality Gustatory Manipulation with GAN-Based Real-Time Food-to-Food Translation.
Proceedings of the 2019 IEEE International Symposium on Mixed and Augmented Reality, 2019

Self-Supervised Difference Detection for Weakly-Supervised Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Mosquito Larvae Image Classification Based on DenseNet and Guided Grad-CAM.
Proceedings of the Pattern Recognition and Image Analysis - 9th Iberian Conference, 2019

Self-supervised Difference Detection for Refinement CRF and Seed Interpolation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Analyzing Regional Food Trends with Geo-tagged Twitter Food Photos.
Proceedings of the 2019 International Conference on Content-Based Multimedia Indexing, 2019

Large-Scale Twitter Food Photo Mining and Its Applications.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Pre-trained and Shared Encoder in Cycle-Consistent Adversarial Networks to Improve Image Quality.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

Attention Guided Unsupervised Image-to-Image Translation with Progressively Growing Strategy.
Proceedings of the Pattern Recognition, 2019

Continual Learning of Image Translation Networks Using Task-Dependent Weight Selection Masks.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

SSA-GAN: End-to-End Time-Lapse Video Generation with Spatial Self-Attention.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

2018
An integration of bottom-up and top-down salient cues on RGB-D data: saliency from objectness versus non-objectness.
Signal Image Video Process., 2018

Image-Based Food Calorie Estimation Using Recipe Information.
IEICE Trans. Inf. Syst., 2018

An Integration of Bottom-up and Top-Down Salient Cues on RGB-D Data: Saliency from Objectness vs. Non-Objectness.
CoRR, 2018

AR DeepCalorieCam V2: food calorie estimation with CNN and AR-based actual size estimation.
Proceedings of the 24th ACM Symposium on Virtual Reality Software and Technology, 2018

Ramen spoon eraser: CNN-based photo transformation for improving attractiveness of ramen photos.
Proceedings of the 24th ACM Symposium on Virtual Reality Software and Technology, 2018

AR DeepCalorieCam: An iOS App for Food Calorie Estimation with Augmented Reality.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Magical Rice Bowl: A Real-time Food Category Changer.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Food image generation using a large amount of food images with conditional GAN: ramenGAN and recipeGAN.
Proceedings of the Joint Workshop on Multimedia for Cooking and Eating Activities and Multimedia Assisted Dietary Management, 2018

Food category transfer with conditional cycleGAN and a large-scale food image dataset.
Proceedings of the Joint Workshop on Multimedia for Cooking and Eating Activities and Multimedia Assisted Dietary Management, 2018

Multi-task learning of dish detection and calorie estimation.
Proceedings of the Joint Workshop on Multimedia for Cooking and Eating Activities and Multimedia Assisted Dietary Management, 2018

FoodChangeLens: CNN-Based Food Transformation on HoloLens.
Proceedings of the 2018 IEEE International Conference on Artificial Intelligence and Virtual Reality, 2018

Word-Conditioned Image Style Transfer.
Proceedings of the Computer Vision - ACCV 2018 Workshops, 2018

Font Style Transfer Using Neural Style Transfer and Unsupervised Cross-domain Transfer.
Proceedings of the Computer Vision - ACCV 2018 Workshops, 2018

2017
Guest Editorial Nutrition Informatics: From Food Monitoring to Dietary Management.
IEEE J. Biomed. Health Informatics, 2017

Simultaneous estimation of food categories and calories with multi-task CNN.
Proceedings of the Fifteenth IAPR International Conference on Machine Vision Applications, 2017

DeepStyleCam: A Real-Time Style Transfer App on iOS.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Image-Based Food Calorie Estimation Using Knowledge on Food Categories, Ingredients and Cooking Directions.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Conditional Fast Style Transfer Network.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Partial style transfer using weakly supervised semantic segmentation.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Unseen Style Transfer Based on a Conditional Fast Style Transfer Network.
Proceedings of the 5th International Conference on Learning Representations, 2017

Comparison of Two Approaches for Direct Food Calorie Estimation.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2017, 2017

Scene Text Eraser.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

Neural Font Style Transfer.
Proceedings of the First Workshop of Machine Learning, 2017

Learning Food Image Similarity for Food Image Retrieval.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Twitter Photo Geo-Localization Using Both Textual and Visual Features.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Predicting Segmentation "Easiness" from the Consistency for Weakly-Supervised Segmentation.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

Estimating Food Calories for Multiple-Dish Food Photos.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

2016
Event photo mining from Twitter using keyword bursts and image clustering.
Neurocomputing, 2016

Automatic Retrieval of Action Video Shots from the Web Using Density-Based Cluster Analysis and Outlier Detection.
IEICE Trans. Inf. Syst., 2016

Visual Event Mining from the Twitter Stream.
Proceedings of the 25th International Conference on World Wide Web, 2016

UEC at TRECVID 2016 AVS task.
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Caffe2C: A Framework for Easy Implementation of CNN-based Mobile Applications.
Proceedings of the Adjunct Proceedings of the 13th International Conference on Mobile and Ubiquitous Systems: Computing Networking and Services, 2016

GrillCam: A Real-Time Eating Action Recognition System.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Efficient Mobile Implementation of A CNN-based Object Recognition System.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

DeepFoodCam: A DCNN-based Real-time Mobile Food Recognition System.
Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management, 2016

Foodness Proposal for Multiple Food Detection by Training of Single Food Images.
Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management, 2016

An Automatic Calorie Estimation System of Food Images on a Smartphone.
Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management, 2016

Overview of the ACM MultiMedia 2016 International Workshop on Multimedia Assisted Dietary Management.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

CNN-based Style Vector for Style Image Retrieval.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Weakly-supervised segmentation by combining CNN feature maps and object saliency maps.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Distinct Class-Specific Saliency Maps for Weakly Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2016, 2016

A System to Help Amateurs Take Pictures of Delicious Looking Food.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015
FoodCam: A real-time food recognition system on a smartphone.
Multim. Tools Appl., 2015

VisualTextualRank: An Extension of VisualRank to Large-Scale Video Shot Extraction Exploiting Tag Co-occurrence.
IEICE Trans. Inf. Syst., 2015

A system to support the amateurs to take a delicious-looking picture of foods.
Proceedings of the SIGGRAPH Asia 2015 Mobile Graphics and Interactive Applications, 2015

Automatic Construction of Action Datasets Using Web Videos with Density-Based Cluster Analysis and Outlier Detection.
Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Twitter Event Photo Detection Using both Geotagged Tweets and Non-geotagged Photo Tweets.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Food image recognition using deep convolutional network with pre-training and fine-tuning.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

A visual analysis on recognizability and discriminability of onomatopoeia words with DCNN features.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

CNN-Based Food Image Segmentation Without Pixel-Wise Annotation.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2015 Workshops, 2015

Low-bit representation of linear classifier weights for mobile large-scale image classification.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
ILSVRC on a Smartphone.
IPSJ Trans. Comput. Vis. Appl., 2014

A Cooking Recipe Recommendation System with Visual Recognition of Food Ingredients.
Int. J. Interact. Mob. Technol., 2014

Automatic extraction of relevant video shots of specific actions exploiting Web data.
Comput. Vis. Image Underst., 2014

UEC at TRECVID 2014 SIN task.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Twitter Food Photo Mining and Analysis for One Hundred Kinds of Foods.
Proceedings of the Advances in Multimedia Information Processing - PCM 2014, 2014

A Dense SURF and Triangulation Based Spatio-temporal Feature for Action Recognition.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

FoodCam: A Real-Time Mobile Food Recognition System Employing Fisher Vector.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

FoodCam-256: A Large-scale Real-time Mobile Food RecognitionSystem employing High-Dimensional Features and Compression of Classifier Weights.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Real-Time Photo Mining from the Twitter Stream: Event Photo Discovery and Food Photo Detection.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

Real-time eating action recognition system on a smartphone.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Food image recognition with deep convolutional features.
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

Automatic Expansion of a Food Image Dataset Leveraging Existing Categories with Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Offline 1000-Class Classification on a Smartphone.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

Hand Detection and Tracking in Videos for Fine-Grained Action Recognition.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014

2013
Summarization of Egocentric Moving Videos for Generating Walking Route Guidance.
Proceedings of the Image and Video Technology - 6th Pacific-Rim Symposium, 2013

Visual Analysis of Tag Co-occurrence on Nouns and Adjectives.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Large-scale web video shot ranking based on visual features and tag co-occurrence.
Proceedings of the ACM Multimedia Conference, 2013

UEC, Tokyo at MediaEval 2013 Retrieving Diverse Social Images Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

[Demo paper] mirurecipe: A mobile cooking recipe recommendation system with food ingredient recognition.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Visual event mining from geo-tweet photos.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

[Demo paper] twitter visual event mining system.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

A Spatio-temporal Feature Based on Triangulation of Dense SURF.
Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, 2013

Real-Time Mobile Food Recognition System.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

Rapid Mobile Object Recognition Using Fisher Vector.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

2012
UEC at TRECVID 2012 SIN and MED task.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Real-time mobile recipe recommendation system using food ingredient recognition.
Proceedings of the 2nd ACM international workshop on Interactive multimedia on mobile and portable devices, 2012

World seer: a realtime geo-tweet photo mapping system.
Proceedings of the International Conference on Multimedia Retrieval, 2012

Multiple-food recognition considering co-occurrence employing manifold ranking.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Visualization of Real-World Events with Geotagged Tweet Photos.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Recognition of Multiple-Food Images by Detecting Candidate Regions.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Automatic collection of Web video shots corresponding to specific actions using Web images.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011
GeoVisualRank: a ranking method of geotagged imagesconsidering visual similarity and geo-location proximity.
Proceedings of the 20th International Conference on World Wide Web, 2011

UEC at TRECVID 2011 SIN and MED task.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Automatic construction of an action video shot database using web videos.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
Mining Regional Representative Photos from Consumer-Generated Geotagged Photos.
Proceedings of the Handbook of Social Network Technologies and Applications, 2010

UEC at TRECVID 2010 Semantic Indexing Task.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Region-based automatic web image selection.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Automatic Construction of a Folksonomy-Based Visual Ontology.
Proceedings of the 12th IEEE International Symposium on Multimedia, 2010

Image Recognition of 85 Food Categories by Feature Fusion.
Proceedings of the 12th IEEE International Symposium on Multimedia, 2010

Geotagged Photo Recognition Using Corresponding Aerial Photos with Multiple Kernel Learning.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

A SURF-Based Spatio-Temporal Feature for Feature-Fusion-Based Action Recognition.
Proceedings of the Trends and Topics in Computer Vision, 2010

Geotagged Image Recognition by Combining Three Different Kinds of Geolocation Features.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
Mining cultural differences from a large number of geotagged photos.
Proceedings of the 18th International Conference on World Wide Web, 2009

UEC at TRECVID 2009 High Level Feature Task.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Can Geotags Help Image Recognition?.
Proceedings of the Advances in Image and Video Technology, Third Pacific Rim Symposium, 2009

Detecting "In-Play" Photos in Sports News Photo Database.
Proceedings of the Advances in Multimedia Information Processing, 2009

Detecting cultural differences using consumer-generated geotagged photos.
Proceedings of the Second International Workshop on Location and the Web, 2009

Web image gathering with region-based bag-of-features and multiple instance learning.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

An analysis of the relation between visual concepts and geo-locations using geotagged images on the web.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A food image recognition system with Multiple Kernel Learning.
Proceedings of the International Conference on Image Processing, 2009

A visual analysis of the relationship between word concepts and geographical locations.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

Extracting Spatio-temporal Local Features Considering Consecutiveness of Motions.
Proceedings of the Computer Vision, 2009

2008
Automatic web image selection with a probabilistic latent topic model.
Proceedings of the 17th International Conference on World Wide Web, 2008

UEC at TRECVID 2008 High Level Feature Task.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Objects over the World.
Proceedings of the Advances in Multimedia Information Processing, 2008

Web Image Gathering with a Part-Based Object Recognition Method.
Proceedings of the Advances in Multimedia Modeling, 2008

Rushes summarization based on color, motion and face.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Web image selection with PLSA.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Web video retrieval based on the Earth Mover's Distance by integrating color, motion and sound.
Proceedings of the International Conference on Image Processing, 2008

Associating Faces and Names in Japanese Photo News Articles on the Web.
Proceedings of the 22nd International Conference on Advanced Information Networking and Applications, 2008

2007
Image collector III: a web image-gathering system with bag-of-keypoints.
Proceedings of the 16th International Conference on World Wide Web, 2007

UEC at TRECVID 2007 High Level Feature Task.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

The Photo News Flusher: A Photo-News Clustering Browser.
Proceedings of the Advances in Multimedia Information Processing, 2007

2006
Finding visual concepts by web image mining.
Proceedings of the 15th international conference on World Wide Web, 2006

Automatic "Go" record generation from a TV program.
Proceedings of the 12th International Conference on Multi Media Modeling (MMM 2006), 2006

Cross Modal Disambiguation.
Proceedings of the Toward Category-Level Object Recognition, 2006

2005
Image Collector II: A System to Gather a Large Number of Images from the Web.
IEICE Trans. Inf. Syst., 2005

UEC at TRECVID 2005 High Level Feature Task - Web Images Meet TRECVID.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Image region entropy: a measure of "visualness" of web images associated with one concept.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Evaluation strategies for image understanding and retrieval.
Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

Probabilistic web image gathering.
Proceedings of the 7th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2005

2004
A fast image-gathering system from the World-Wide Web using a PC cluster.
Image Vis. Comput., 2004

2003
Web Image Mining toward Generic Image Recognition.
Proceedings of the Twelfth International World Wide Web Conference - Posters, 2003

Image Collector II : An Over-One-Thousand-Image-Gathering System.
Proceedings of the Twelfth International World Wide Web Conference - Posters, 2003

Generic image classification using visual knowledge on the web.
Proceedings of the Eleventh ACM International Conference on Multimedia, 2003

Image collector II: a system for gathering more than one thousand images from the Web for one keyword.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

2002
Recognition of indoor images employing supporting relation between objects.
Syst. Comput. Jpn., 2002

Image Classification by Web Images.
Proceedings of the PRICAI 2002: Trends in Artificial Intelligence, 2002

An Experiment on Generic Image Classification Using Web Images.
Proceedings of the Advances in Multimedia Information Processing, 2002

2001
A Fast Image-Gathering System on the World-Wide Web Using a PC Cluster.
Proceedings of the Web Intelligence: Research and Development, 2001

Image Collector: An Image-Gathering System From The World-Wide Web Employing Keyword-Based Search Engines.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

2000
A Multi-Resolution Image Understanding System Based on Multi-agent Architecture for High-Resolution Images.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2000), 2000

Recognition of Indoor Images Employing Qualitative Model Fitting and Supporting Relation between Objects.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

1998
An architecture of object recognition system for various images based on multi-agents.
Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998


  Loading...