Kiyoharu Aizawa

Orcid: 0000-0003-2146-6275

According to our database, Kiyoharu Aizawa authored at least 459 papers between 1986 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Awards

IEEE Fellow

IEEE Fellow 2016, "For contributions to model-based coding and multimedia lifelogging".

Bibliography

2024
Negative Learning to Prevent Undesirable Misclassification.
IEICE Trans. Inf. Syst., January, 2024

Self-Labeling Framework for Open-Set Domain Adaptation With Few Labeled Samples.
IEEE Trans. Multim., 2024

Content-Adaptive Optimization Framework for Universal Deep Image Compression.
IEICE Trans. Inf. Syst., 2024

FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe Generation.
CoRR, 2024

Training-Free Sketch-Guided Diffusion with Latent Optimization.
CoRR, 2024

Investigating the Perception of Facial Anonymization Techniques in 360° Videos.
CoRR, 2024

Generalized Out-of-Distribution Detection and Beyond in Vision Language Model Era: A Survey.
CoRR, 2024

MangaUB: A Manga Understanding Benchmark for Large Multimodal Models.
CoRR, 2024

Privacy Protection and Video Manipulation in Immersive Media.
CoRR, 2024

Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models.
CoRR, 2024

Measure and Improve Your Food: Ingredient Estimation Based Nutrition Calculator.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Manga109Dialog: A Large-Scale Dialogue Dataset for Comics Speaker Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Cross-Lingual Learning in Multilingual Scene Text Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Entity-NeRF: Detecting and Removing Moving Entities in Urban Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Guest Editorial Introduction to the Special Issue on Video Transformers.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Quality Enhancement of Conventional Compression with a Learned Side Bitstream.
IEICE Trans. Inf. Syst., August, 2023

Semantic-Driven Initial Image Construction for Guided Image Synthesis in Diffusion Model.
CoRR, 2023

Can Pre-trained Networks Detect Familiar Out-of-Distribution Data?
CoRR, 2023

Open-Set Domain Adaptation with Visual-Language Foundation Models.
CoRR, 2023

Zero-Shot In-Distribution Detection in Multi-Object Settings Using Vision-Language Foundation Models.
CoRR, 2023

Comprehensive Comparisons of Uniform Quantization in Deep Image Compression.
IEEE Access, 2023

Universal Deep Image Compression via Content-Adaptive Optimization with Adapters.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data Augmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Automatic Dataset Creation from User-generated Recipes for Ingredient-centric Food Image Analysis.
Proceedings of the ACM Multimedia Asia 2023, 2023

Open-Vocabulary Segmentation Approach for Transformer-Based Food Nutrient Estimation.
Proceedings of the ACM Multimedia Asia 2023, 2023

360RVW: Fusing Real 360° Videos and Interactive Virtual Worlds.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Guided Image Synthesis via Initial Image Editing in Diffusion Model.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Text-to-Image Fashion Retrieval with Fabric Textures.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Noise-Avoidance Sampling for Annotation Missing Object Detection.
Proceedings of the IEEE International Conference on Image Processing, 2023

Restorable Visible and Infrared Image Fusion.
Proceedings of the IEEE International Conference on Image Processing, 2023

A Structure-Guided Diffusion Model for Large-Hole Image Completion.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Evaluating the Stability of Deep Image Quality Assessment with Respect to Image Scaling.
IEICE Trans. Inf. Syst., October, 2022

Movie Map for Virtual Exploration in a City.
IEICE Trans. Inf. Syst., 2022

Non-uniform Sampling Strategies for NeRF on 360° images.
CoRR, 2022

A Structure-Guided Diffusion Model for Large-Hole Diverse Image Completion.
CoRR, 2022

Distortion-Aware Self-Supervised 360° Depth Estimation from A Single Equirectangular Projection Image.
CoRR, 2022

Field-of-View IoU for Object Detection in 360° Images.
CoRR, 2022

Saliency-Based Multiple Region of Interest Detection From a Single 360° Image.
IEEE Access, 2022

Fast Nonlinear Image Unblending.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Wearable Camera Based Food Logging System.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

FoodLog Athl: Multimedia Food Recording Platform for Dietary Guidance and Food Monitoring.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

SLGAN: Style- and Latent-Guided Generative Adversarial Network for Desirable Makeup Transfer and Removal.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022

Recipe-oriented Food Logging for Nutritional Management.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Prediction of Mental State from Food Images.
Proceedings of the CEA++@MM 2022: Proceedings of the 1st International Workshop on Multimedia for Cooking, 2022

Recipe Recording by Duplicating and Editing Standard Recipe.
Proceedings of the CEA++@MM 2022: Proceedings of the 1st International Workshop on Multimedia for Cooking, 2022

Towards Content-Aware Pixel-Wise Comic Panel Segmentation.
Proceedings of the Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges, 2022

Dual-Erp Representation for Object Detection in 360° Images.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Translation of Illustration Artist Style Using Sailormoonredraw Data.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

SVG Vector Font Generation for Chinese Characters with Transformer.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts.
Proceedings of the Computer Vision - ECCV 2022, 2022

Non-uniform Sampling Strategies for NeRF on 360° images.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Self-Labeling Framework for Novel Category Discovery over Domains.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Computational attention model for children, adults and the elderly.
Multim. Tools Appl., 2021

Estimation of Semantic Impressions from Portraits.
IEICE Trans. Inf. Syst., 2021

Noisy Localization Annotation Refinement for Object Detection.
IEICE Trans. Inf. Syst., 2021

NTIRE 2021 Challenge on Perceptual Image Quality Assessment.
CoRR, 2021

A Novel Perspective for Positive-Unlabeled Learning via Noisy Labels.
CoRR, 2021

Comic Image Inpainting via Distance Transform.
Proceedings of the SA '21: SIGGRAPH Asia 2021 Technical Communications, Tokyo, Japan, December 14, 2021

UrbanMM'21: 1st International Workshop on Multimedia Computing for Urban Data.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

RecipeLog: Recipe Authoring App for Accurate Food Recording.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Comprehensive Comparisons Of Uniform Quantizers For Deep Image Compression.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

360° Single Image Super Resolution via Distortion-Aware Network and Distorted Perspective Images.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Improving The Quality Of Illustrations: Transforming Amateur Illustrations To A Professional Standard.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Learned Image Compression With Super-Resolution Residual Modules and DISTS Optimization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

What if We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

World Food Atlas Project.
Proceedings of the CEA '21: Proceedings of the 13th International Workshop on Multimedia for Cooking and Eating Activities, 2021

Boosting Personalized Food Image Classifier by Sharing Food Records.
Proceedings of the CEA '21: Proceedings of the 13th International Workshop on Multimedia for Cooking and Eating Activities, 2021

Intersection Prediction from Single 360° Image via Deep Detection of Possible Direction of Travel.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Noisy Annotation Refinement for Object Detection.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Context-Patch Face Hallucination Based on Thresholding Locality-Constrained Representation and Reproducing Learning.
IEEE Trans. Cybern., 2020

Significance of Softmax-Based Features in Comparison to Distance Metric Learning-Based Features.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Distance Surface for Event-Based Optical Flow.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Users' Preference Prediction of Real Estate Properties Based on Floor Plan Analysis.
IEICE Trans. Inf. Syst., 2020

Building a Manga Dataset "Manga109" With Annotations for Multimedia Applications.
IEEE Multim., 2020

Building Movie Map - A Tool for Exploring Areas in a City - and its Evaluation.
CoRR, 2020

Unsupervised Embedding Learning by Noisy Similarity Label Optimization.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Building Movie Map - A Tool for Exploring Areas in a City - and its Evaluations.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Urban Movie Map for Walkers: Route View Synthesis using 360° Videos.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Font Search Across Various Languages Based on Multimodal Learning.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Channel-Level Variable Quantization Network for Deep Image Compression.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Translating Adult's Focus of Attention to Elderly's.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

The Aleatoric Uncertainty Estimation Using a Separate Formulation with Virtual Residuals.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Few-Shot Font Generation with Deep Metric Learning.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Unknown Class Label Cleaning For Learning With Open-Set Noisy Labels.
Proceedings of the IEEE International Conference on Image Processing, 2020

Estimation Of Impression Associated With Portraits Using Facial Landmarks And Visual Features.
Proceedings of the IEEE International Conference on Image Processing, 2020

Multi-task Curriculum Framework for Open-Set Semi-supervised Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Ultra Low Bitrate Learned Image Compression by Selective Detail Decoding.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Image Recognition-Based Tool for Food Recording and Analysis: FoodLog.
Proceedings of the Connected Health in Smart Cities, 2020

2019
Category-Based Deep CCA for Fine-Grained Venue Discovery From Multimodal Data.
IEEE Trans. Neural Networks Learn. Syst., 2019

Emotype: Expressing emotions by changing typeface in mobile messenger texting.
Multim. Tools Appl., 2019

Face hallucination through differential evolution parameter map learning with facial structure prior.
Inf. Sci., 2019

Personalized Food Image Classifier Considering Time-Dependent and Item-Dependent Food Distribution.
IEICE Trans. Inf. Syst., 2019

Recognition of Multiple Food Items in A Single Photo for Use in A Buffet-Style Restaurant.
IEICE Trans. Inf. Syst., 2019

MeshDepth: Disconnected Mesh-based Deep Depth Prediction.
CoRR, 2019

Computational Attention System for Children, Adults and Elderly.
CoRR, 2019

Social Font Search by Multimodal Feature Embedding.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Walker's Movie Map: Route View Synthesis Using Omni-directional Videos.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

FoodLog: Multimedia Food Recording Platform and its Application.
Proceedings of the 5th International Workshop on Multimedia Assisted Dietary Management, 2019

Assist Users' Interactions in Font Search with Unexpected but Useful Concepts Generated by Multimodal Learning.
Proceedings of the 2019 on International Conference on Multimedia Retrieval, 2019

Synthesis of Screentone Patterns of Manga Characters.
Proceedings of the IEEE International Symposium on Multimedia, 2019

Identification Of Buildings In Street Images Using Map Information.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Optical Flow Based Line Drawing Frame Interpolation Using Distance Transform to Support Inbetweenings.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Reinforcing the Robustness of a Deep Neural Network to Adversarial Examples by Using Color Quantization of Training Image Data.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Impression Estimation for Deformed Portraits With a Landmark-Based Ranking Network.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Zero-Shot Semantic Segmentation via Variational Mapping.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

TriDepth: Triangular Patch-Based Deep Depth Prediction.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Unsupervised Out-of-Distribution Detection by Maximum Classifier Discrepancy.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Object-Aware Instance Labeling for Weakly Supervised Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Aspect-Ratio-Preserving Multi-Patch Image Aesthetics Score Prediction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Multi-Task Learning based on Separable Formulation of Depth Estimation and its Uncertainty.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Deep Neural Network-Based Click-Through Rate Prediction using Multimodal Features of Online Banners.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

2018
PQTable: Nonexhaustive Fast Search for Product-Quantized Codes Using Hash Tables.
IEEE Trans. Multim., 2018

Personalized Classifier for Food Image Recognition.
IEEE Trans. Multim., 2018

Photo aesthetic quality estimation using visual complexity features.
Multim. Tools Appl., 2018

Efficiency-enhanced cost-volume filtering featuring coarse-to-fine strategy.
Multim. Tools Appl., 2018

An unsupervised service annotation by review analysis.
Int. J. Big Data Intell., 2018

Parallel Grid Pooling for Data Augmentation.
CoRR, 2018

Object Detection for Comics using Manga109 Annotations.
CoRR, 2018

Adaptation of manga face representation for accurate clustering.
Proceedings of the SIGGRAPH Asia 2018 Posters, Tokyo, Japan, December 04-07, 2018, 2018

Session details: Brand New Ideas.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Users' Preference Prediction of Real Estates Featuring Floor Plan Analysis using FloorNet.
Proceedings of the 2018 ACM Workshop on Multimedia for Real Estate Tech, 2018

Computer Vision Based and FPRank Based Tag Recommendation for Social Popularity Enhancement.
Proceedings of the 23rd International Conference on Intelligent User Interfaces Companion, 2018

FontMatcher: Font Image Paring for Harmonious Digital Graphic Design.
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018

Resource intensity for menu items: how much land is required to provide for each dish?
Proceedings of the Joint Workshop on Multimedia for Cooking and Eating Activities and Multimedia Assisted Dietary Management, 2018

Bag-of-foods: analysis of personal foodlogging data.
Proceedings of the Joint Workshop on Multimedia for Cooking and Eating Activities and Multimedia Assisted Dietary Management, 2018

Food Image Recognition by Personalized Classifier.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Billboard Saliency Detection in Street Videos for Adults and Elderly.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Similar floor plan retrieval featuring multi-task learning of layout type classification and room presence prediction.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Measurement and evaluation of comfort levels of apartments using IoT sensors.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Prediction of the time to pregnancy by kernel density estimation on lifestyle questionnaire.
Proceedings of the IEEE International Conference on Consumer Electronics, 2018

Signboard Saliency Detection in Street Videos.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Scale Drift Correction of Camera Geo-Localization Using Geo-Tagged Images.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Joint Optimization Framework for Learning With Noisy Labels.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Mask-SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Cross-Domain Weakly-Supervised Object Detection Through Progressive Domain Adaptation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Fast and Robust Estimation for Unit-Norm Constrained Linear Fitting Problems.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Local and Global Optimization Techniques in Graph-Based Clustering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
DrawFromDrawings: 2D Drawing Assistance via Stroke Interpolation with a Sketch Database.
IEEE Trans. Vis. Comput. Graph., 2017

Depth Estimation Using an Infrared Dot Projector and an Infrared Color Stereo Camera.
IEEE Trans. Circuits Syst. Video Technol., 2017

Sketch-based manga retrieval using manga109 dataset.
Multim. Tools Appl., 2017

Efficient human action recognition using histograms of motion gradients and VLAD with descriptor shape information.
Multim. Tools Appl., 2017

PQTable: Non-exhaustive Fast Search for Product-quantized Codes using Hash Tables.
CoRR, 2017

Gaze Distribution Analysis and Saliency Prediction Across Age Groups.
CoRR, 2017

Spatio-Temporal VLAD Encoding for Human Action Recognition in Videos.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Panel: Cross-media Intelligence.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

A Tag Recommendation System for Popularity Boosting.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

PQk-means: Billion-scale Clustering for Product-quantized Codes.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

MatPlanner: Plan Your Days in Conferences by Resolving Conflicting Events.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Simple, Efficient and Effective Encodings of Local Deep Features for Video Action Recognition.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

VenueNet: Fine-Grained Venue Discovery by Deep Correlation Learning.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Become Popular in SNS: Tag Recommendation using FolkPopularityRank to Enhance Social Popularity.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

FolkPopularityRank: Tag Recommendation for Enhancing Social Popularity using Text Tags in Content Sharing Services.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Prediction of Individual Eating Habits Using Short-Term Food Recording.
Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities in conjunction with The 2017 International Joint Conference on Artificial Intelligence, Melbourne, Australia, August 20, 2017

Prediction of users' facial attractiveness on an online dating website.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

How competitive are you: Analysis of people's attractiveness in an online dating system.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Hyperlapse generation of omnidirectional videos by adaptive sampling based on 3D camera positions.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Sketch-Based Manga Retrieval Using Deep Features.
Proceedings of the 2nd International Workshop on coMics Analysis, 2017

cGAN-Based Manga Colorization Using a Single Training Image.
Proceedings of the 2nd International Workshop on coMics Analysis, 2017

Object detection refinement using Markov random field based pruning and learning based rescoring.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Developmental Changes in Ambient and Focal Visual Processing Strategies.
Proceedings of the Human Vision and Electronic Imaging 2017, Burlingame, CA, USA, 29 January 2017, 2017

Residual Expansion Algorithm: Fast and Effective Optimization for Nonconvex Least Squares Problems.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

You Will Succeed or Not? Matching Prediction in a Marriage Consulting Service.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Efficient Optimization of Convolutional Neural Networks Using Particle Swarm Optimization.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Distributed Representation of Dish Names in Food Related Web Services for Associative Search and Nutrition Estimation.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Data-Driven Geometric Face Image Smilization Featuring Moving Least Square Based Deformation.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Fooling Neural Networks in Face Attractiveness Evaluation: Adversarial Examples with High Attractiveness Score But Low Subjective Score.
Proceedings of the Third IEEE International Conference on Multimedia Big Data, 2017

Age-adapted saliency model with depth bias.
Proceedings of the ACM Symposium on Applied Perception, 2017

2016
Very fast generation of content-preserved photo collage under canvas size constraint.
Multim. Tools Appl., 2016

City-view image location identification by multiple geo-social media and graph-based image cluster refinement.
J. Vis. Commun. Image Represent., 2016

Review-Based Service Profiling and Recommendation.
Proceedings of the 2016 Joint 8th International Conference on Soft Computing and Intelligent Systems (SCIS) and 17th International Symposium on Advanced Intelligent Systems (ISIS), 2016

Typeface Emotion Analysis for Communication on Mobile Messengers.
Proceedings of the 1st International Workshop on Multimedia Alternate Realities, 2016

Multimedia for personal health and health care.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Food Search Based on User Feedback to Assist Image-based Food Recording Systems.
Proceedings of the 2nd International Workshop on Multimedia Assisted Dietary Management, 2016

Sketch simplification by classifying strokes.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Interactive region segmentation for manga.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Manga109 dataset and creation of metadata.
Proceedings of the 1st International Workshop on coMics ANalysis, 2016

Boosting VLAD with double assignment using deep features for action recognition in videos.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Generation of representative meal names for food recording data by using web search results.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

The log-normal distribution of the size of objects in daily meal images and its application to the efficient reduction of object proposals.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Text detection in manga by combining connected-component-based and region-based classifications.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Multimodal learning for image popularity prediction on social media.
Proceedings of the IEEE International Conference on Consumer Electronics-Taiwan, 2016

Uncalibrated Photometric Stereo by Stepwise Optimization Using Principal Components of Isotropic BRDFs.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Histograms of Motion Gradients for real-time video classification.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

Towards Online Impression Prediction of Oral Presentations Using Soft Coding.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Service Annotation and Profiling by Review Analysis.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

Audience Ratings Prediction of TV Dramas Based on the Cast and Their Popularity.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

FoodLog: Multimedia Food Recording Tools for Diverse Applications.
Proceedings of the Human-Harmonized Information Technology, Volume 1 - Vertical Impact, 2016

2015
FoodLog: Multimedia Tool for Healthcare Applications.
IEEE Multim., 2015

Sketch-based Manga Retrieval using Manga109 Dataset.
CoRR, 2015

A prediction model on 3D model compression and its printed quality based on subjective study.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2015

Power of Tags: Predicting Popularity of Social Media in Geo-Spatial and Temporal Contexts.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

Prediction of User Ratings of Oral Presentations using Label Relations.
Proceedings of the 1st International Workshop on Affect & Sentiment in Multimedia, 2015

Selective K-means Tree Search.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

An Interactive System based on Yes-No Questions for Affective Image Retrieval.
Proceedings of the 1st International Workshop on Affect & Sentiment in Multimedia, 2015

Fast Face Model Reconstruction and Synthesis Using an RGB-D Camera and Its Subjective Evaluation.
Proceedings of the 2015 IEEE International Symposium on Multimedia, 2015

Repositioning the salient region of videos by using active illumination.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Searching for nearest neighbors with a dense space partitioning.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

A layered method for determining manga text bubble reading order.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Highly Accurate Food/Non-Food Image Classification Based on a Deep Convolutional Neural Network.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2015 Workshops, 2015

PQTable: Fast Exact Asymmetric Distance Neighbor Search for Product Quantization Using Hash Tables.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Separation of Manga Line Drawings and Screentones.
Proceedings of the 36th Annual Conference of the European Association for Computer Graphics, 2015

A Discourse Search Engine Based on Rhetorical Structure Theory.
Proceedings of the Advances in Information Retrieval, 2015

Food Category Representatives: Extracting Categories from Meal Names in Food Recordings and Recipe Data.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Interactive segmentation for manga using lossless thinning and coarse labeling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Personal rating prediction for on-line video lectures using gaze information.
Proceedings of the 10th International Conference on Information, 2015

Depth Estimation Based on an Infrared Projector and an Infrared Color Stereo Camera by Using Cross-Based Dynamic Programming with Cost Volume Filter.
Proceedings of the 2015 International Conference on 3D Vision, 2015

Personalized Travel Navigation and Photo-Shooting Navigation Using Large-Scale Geotags.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

2014
Photometric Stereo Using Sparse Bayesian Regression for General Diffuse Surfaces.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Building Friend Wall for Local Photo Repository by Using Social Attribute Annotation.
J. Multim., 2014

Self-similarity-based partial near-duplicate video retrieval and alignment.
Int. J. Multim. Inf. Retr., 2014

Large-Scale Geosocial Multimedia [Guest editorial].
IEEE Multim., 2014

Dynamic stochastic resonance-based improved logo extraction in discrete cosine transform domain.
Comput. Electr. Eng., 2014

Reference-based manga colorization by graph correspondence using quadratic programming.
Proceedings of the SIGGRAPH Asia 2014 Technical Briefs, 2014

IllustStyleMap: visualization of illustrations based on similarity of drawing style of authors.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2014

Diffusion: change the ambience of a space with a small amount of ink.
Proceedings of the SIGGRAPH Asia 2014 Posters, Shenzhen, China, December 3-6, 2014, 2014

Interactive segmentation for manga.
Proceedings of the Special Interest Group on Computer Graphics and Interactive Techniques Conference, 2014

SVM is not always confident: Telling whether the output from multiclass SVM is true or false by analysing its confidence values.
Proceedings of the IEEE 16th International Workshop on Multimedia Signal Processing, 2014

Empirical Observation of User Activities: Check-ins, Venue Photos and Tips in Foursquare.
Proceedings of the First International Workshop on Internet-Scale Multimedia Management, 2014

Emerging Topics on Personalized and Localized Multimedia Information Systems.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Social Popularity Score: Predicting Numbers of Views, Comments, and Favorites of Social Photos Using Only Annotations.
Proceedings of the First International Workshop on Internet-Scale Multimedia Management, 2014

Food Detection and Recognition Using Convolutional Neural Network.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Degree of loop assessment in microvideo.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Sketch2Manga: Sketch-based manga retrieval.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Multi-stage object classification featuring confidence analysis of classifier and inclined local Naive Bayes nearest neighbor.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Coarse-to-fine strategy for efficient cost-volume filtering.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

MangaWall: Generating manga pages for real-time applications.
Proceedings of the IEEE International Conference on Acoustics, 2014

Simultaneous acquisition of multiple images with higher dynamic range.
Proceedings of the IEEE International Conference on Acoustics, 2014

Frequency statistics of words used in Japanese food records of FoodLog.
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

Summary for the workshop on smart technology for cooking and eating activities (CEA'14).
Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2014

Relationship Between Visual Complexity and Aesthetics: Application to Beauty Prediction of Photos.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Photometric Stereo Using Constrained Bivariate Regression for General Isotropic Surfaces.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Food Balance Estimation by Using Personal Dietary Tendencies in a Multimedia Food Log.
IEEE Trans. Multim., 2013

SIFT-Based Non-blind Watermarking Robust to Non-linear Geometrical Distortions.
IEICE Trans. Inf. Syst., 2013

Cooperative estimation of human motion and surfaces using multiview videos.
Comput. Vis. Image Underst., 2013

A novel approach for combined rotational and translational motion estimation using Frame Projection Warping.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

SNAPPER: Fashion Coordinate Image Retrieval System.
Proceedings of the Ninth International Conference on Signal-Image Technology & Internet-Based Systems, 2013

Saliency Detection-Based Mixture of Reality and Non-Photorealistic Rendering Effects for Artistic Visualization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Navilog: A Museum Guide and Location Logging System Based on Image Recognition.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Action recognition using invariant features under unexampled viewing conditions.
Proceedings of the ACM Multimedia Conference, 2013

Kanji snap: an OCR-based smartphone application for learning Japanese kanji characters.
Proceedings of the ACM Multimedia Conference, 2013

Workshop summary for the 5th international workshop on multimedia for cooking and eating activities (CEA'13).
Proceedings of the ACM Multimedia Conference, 2013

Depth map up-sampling using cost-volume filtering.
Proceedings of the 11th IVMSP Workshop: 3D Image/Video Technologies and Applications, 2013

Depth map inpainting and super-resolution based on internal statistics of geometry and appearance.
Proceedings of the IEEE International Conference on Image Processing, 2013

FoodLog: Smartphone based multimedia food recording tool.
Proceedings of the 23rd International Conference on Artificial Reality and Telexistence, 2013

Depth estimation by cost volume with spatial-temporal cross-based local multipoint filter using projecting infrared patterns.
Proceedings of the Conference on Visual Media Production 2013, 2013

PicWall: Photo collage on-the-fly.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

When disparity meets distance: HEVC compression of double-faced depth map.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Determination of emotional content of video clips by low-level audiovisual features - A dimensional and categorial experimental approach.
Multim. Tools Appl., 2012

Real-time tracking of humans and visualization of their future footsteps in public indoor environments - An intelligent interactive system for public entertainment.
Multim. Tools Appl., 2012

Enhancement of Depth Maps With Alpha Channel Estimation for 3-D Video.
IEEE J. Sel. Top. Signal Process., 2012

Food region segmentation in meal images using touch points.
Proceedings of the ACM multimedia 2012 workshop on Multimedia for cooking and eating activities, 2012

Social Attribute Annotation for Personal Photo Collection.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Noise attenuation performance of mura apertures in photographic cameras.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Internal noise-induced contrast enhancement of dark images.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Intra texture prediction based on repetitive pixel replenishment.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Automated awareness and visualization of online presence.
Proceedings of the Joint International Conference on Human-Centered Computer Environments, 2012

Dynamic stochastic resonance-based watermark extraction from audio signals in SVD domain.
Proceedings of the 20th European Signal Processing Conference, 2012

Robust photometric stereo using sparse regression.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Confidence-based refinement of corrupted depth maps.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
Interactive Manga retargeting.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2011

Image collection summarization for search result overviewing on mobile devices.
Proceedings of the 2011 international ACM workshop on Interactive multimedia on mobile and portable devices, 2011

High efficient distributed video coding with parallelized design for cloud computing.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Image-based Calorie Content Estimation for Dietary Assessment.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

Clustering meal images in a web-based dietary management system.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Marker-less human pose estimation and surface reconstruction using a segmented model.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Robust watermark extraction using SVD-based dynamic stochastic resonance.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Interactive social, spatial and temporal querying for multimedia retrieval.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011

2010
Affective Audio-Visual Words and Latent Topic Driving Model for Realizing Movie Affective Scene Classification.
IEEE Trans. Multim., 2010

Large-scale image and video search: Challenges, technologies, and trends.
J. Vis. Commun. Image Represent., 2010

Location identification for visitor behavior log in museum.
Proceedings of the 9th International Conference on Virtual Reality Continuum and its Applications in Industry, 2010

Approaches to 3D video compression.
Proceedings of the Visual Communications and Image Processing 2010, 2010

Bit allocation of vertices and colors for patch-based coding in time-varying meshes.
Proceedings of the Picture Coding Symposium, 2010

3D pose estimation in high dimensional search spaces with local memorization.
Proceedings of the Picture Coding Symposium, 2010

Image-based dietary information mining for community creation in a social network.
Proceedings of the second ACM SIGMM workshop on Social media, 2010

Image-based indoor positioning system: fast image matching using omnidirectional panoramic images.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

Automatic trailer generation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Detecting Resized JPEG Images by Analyzing High Frequency Elements in DCT Coefficients.
Proceedings of the Sixth International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2010), 2010

Detecting Dominant Motion Flows in Unstructured/Structured Crowd Scenes.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Image processing based approach to food balance analysis for personal food logging.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010

Patch-based compression for Time-Varying Meshes.
Proceedings of the International Conference on Image Processing, 2010

Automatic preview video generation for mesh sequences.
Proceedings of the International Conference on Image Processing, 2010

Interacting with location-based multimedia using sketches.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Estimating Human Body and Head Orientation Change to Detect Visual Attention Direction.
Proceedings of the Computer Vision - ACCV 2010 Workshops, 2010

Wearable Video Retrieval and Navigation System using GPS Data.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
Sketch-Based Spatial Queries for Retrieving Human Locomotion Patterns From Continuously Archived GPS Data.
IEEE Trans. Multim., 2009

Temporal Segmentation of 3-D Video by Histogram-Based Feature Vectors.
IEEE Trans. Circuits Syst. Video Technol., 2009

Motion Segmentation for Time-Varying Mesh Sequences Based on Spherical Registration.
EURASIP J. Adv. Signal Process., 2009

Motion Editing for Time-Varying Mesh.
EURASIP J. Adv. Signal Process., 2009

Sketch-Based Spatial Queries for the Retrieval of Human Locomotion Patterns in Smart Environments.
Adv. Multim., 2009

Sketch-on-Map: Spatial Queries for Retrieving Human Locomotion Patterns from Continuously Archived GPS Data.
Proceedings of the Advances in Multimedia Modeling, 2009

Retrieving multimedia travel stories using location data and spatial queries.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

FoodLog: capture, analysis and retrieval of personal food images via web.
Proceedings of the ACM multimedia 2009 workshop on Multimedia for cooking and eating activities, 2009

Latent topic driving model for movie affective scene classification.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

A Euclidean-geodesic shape distribution for retrieval of time-varying mesh sequences.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Retrieval of Time-Varying Mesh and motion capture data using 2D video queries based on silhouette shape descriptors.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

A degree-of-edit ranking for consumer generated video retrieval.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Affective video segment retrieval for consumer generated videos based on correlation between emotions and emotional audio events.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

An object-based non-blind watermarking that is robust to non-linear geometrical distortion attacks.
Proceedings of the International Conference on Image Processing, 2009

Tracking of humans and estimation of body/head orientation from top-view single camera for visual focus of attention analysis.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

2008
Large-scale image sensing by a group of smart image sensors.
Parallel Comput., 2008

Ubiquitous Home: Retrieval of Experiences in a Home Environment.
IEICE Trans. Inf. Syst., 2008

Robust Object-Based Watermarking Using Feature Matching.
IEICE Trans. Inf. Syst., 2008

Motion tracking of time-varying mesh through surface gradient matching with multi-temporal registration.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2008

Embedded Tags and Visual Querying for Face Photo Retrieval.
Proceedings of the Advances in Multimedia Information Processing, 2008

Audio Analysis for Multimedia Retrieval from a Ubiquitous Home.
Proceedings of the Advances in Multimedia Modeling, 2008

Interactive retrieval for multi-camera surveillance systems featuring spatio-temporal summarization.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Food log by analyzing food images.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

High level activity annotation of daily experiences by a combination of a wearable device and Wi-Fi based positioning system.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Hierarchical mesh decomposition and motion tracking for Time-Varying-Meshes.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Geometry compression for time-varying meshes using coarse and fine levels of quantization and run-length encoding.
Proceedings of the International Conference on Image Processing, 2008

Error analysis of 3Dc-based normal map compression and its application to optimized quantization.
Proceedings of the IEEE International Conference on Acoustics, 2008

Model-Based Analysis and Synthesis of Time-Varying Mesh.
Proceedings of the Articulated Motion and Deformable Objects, 5th International Conference, 2008

2007
Reconstructing Dense Light Field From Array of Multifocus Images for Novel View Synthesis.
IEEE Trans. Image Process., 2007

Time-Varying Mesh Compression Using an Extended Block Matching Algorithm.
IEEE Trans. Circuits Syst. Video Technol., 2007

Retrieval of Images Captured by Car Cameras Using Its Front and Side Views and GPS Data.
IEICE Trans. Inf. Syst., 2007

Summarization of 3D Video by Rate-Distortion Trade-off.
IEICE Trans. Inf. Syst., 2007

Special Section on Advanced Image Technology.
IEICE Trans. Inf. Syst., 2007

Motion Segmentation and Retrieval for 3D Video Based on Modified Shape Distribution.
EURASIP J. Adv. Signal Process., 2007

An Interactive Multimedia Diary for the Home.
Computer, 2007

Motion Structure Parsing and Motion Editing in 3D Video.
Proceedings of the Advances in Multimedia Modeling, 2007

Spatial querying for retrieval of locomotion patterns in smart environments.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Emerging Issues for Multimedia Analysis and Applications.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

Content-Based Cross Search for Human Motion Data using Time-Varying Mesh and Motion Capture Data.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Deformation of Time-Varying-Mesh Based on Semantic Human Model.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Fast and Robust Motion Tracking for Time-Varying Mesh Featuring Reeb-Graph-Based Skeleton Fitting and its Application to Motion Retrieval.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Visual Tracking of Pedestrians Jointly using Wi-Fi Location System on Distributed Camera Network.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

View-Based Web Page Retrieval using Interactive Sketch Query.
Proceedings of the International Conference on Image Processing, 2007

Geometrically Invariant Object-Based Watermarking using SIFT Feature.
Proceedings of the International Conference on Image Processing, 2007

Tracking Persons using Particle Filter Fusing Visual and Wi-Fi Localizations for Widely Distributed Camera.
Proceedings of the International Conference on Image Processing, 2007

Multi-Sensor Fusion Tracking Using Visual Information and Wi-Fi Location Estimation.
Proceedings of the 2007 First ACM/IEEE International Conference on Distributed Smart Cameras, 2007

Highly Efficient VQ-Based Normal Map Compression using Quality Estimation Model.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Sensor Network for Event Retrieval in a Home Like Ubiquitous Environment.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Effects of physical display size and amplitude of oscillation on visually induced motion sickness.
Proceedings of the ACM Symposium on Virtual Reality Software and Technology, 2006

Motion Composition of 3D Video.
Proceedings of the Advances in Multimedia Information Processing, 2006

Motion Segmentation of 3D Video using Modified Shape Distribution.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Key Frame Extraction in 3D Video by Rate-Distortion Optimization.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

3D Video Compression Based on Extended Block Matching Algorithm.
Proceedings of the International Conference on Image Processing, 2006

Creation of an Electronic Chronicle for a Ubiquitous Home: Sensing, Analysis and Evaluation.
Proceedings of the 22nd International Conference on Data Engineering Workshops, 2006

Fast and Efficient Normal Map Compression Based on Vector Quantization.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Similar Motion Retrieval for Dynamic 3D Mesh Based on Modified Shape Distributions.
Proceedings of the 27th Annual Conference of the European Association for Computer Graphics, 2006

Motion Editing in 3D Video Database.
Proceedings of the 3rd International Symposium on 3D Data Processing, 2006

2005
Reconstructing arbitrarily focused images from two differently focused images using linear filters.
IEEE Trans. Image Process., 2005

Proposal of meta-data in a database for identification using face images.
Syst. Comput. Jpn., 2005

Accuracy enhancement of function-oriented web image classification.
Proceedings of the 14th international conference on World Wide Web, 2005

Mathematical PSNR Prediction Model Between Compressed Normal Maps and Rendered 3D Images.
Proceedings of the Advances in Multimedia Information Processing, 2005

Multimedia Retrieval from a Large Number of Sources in a Ubiquitous Environment.
Proceedings of the Advances in Multimedia Information Processing, 2005

Digitizing Personal Experiences: Capture and Retrieval of Life Log.
Proceedings of the 11th International Conference on Multi Media Modeling (MMM 2005), 2005

Evaluation of video summarization for a large number of cameras in ubiquitous home.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

Video handover for retrieval in a ubiquitous environment using floor sensor data.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Depth estimation for synthesizing arbitrary view images by random access IBR sensor array.
Proceedings of the 2005 International Conference on Image Processing, 2005

Mathematical error analysis of normal map compression based on unity condition.
Proceedings of the 2005 International Conference on Image Processing, 2005

3D video segmentation using point distance histograms.
Proceedings of the 2005 International Conference on Image Processing, 2005

An adaptive video stabilization method for reducing visually induced motion sickness.
Proceedings of the 2005 International Conference on Image Processing, 2005

Direct filtering method for image based rendering.
Proceedings of the 2005 International Conference on Image Processing, 2005

Person Tracking and Multicamera Video Retrieval Using Floor Sensors in a Ubiquitous Environment.
Proceedings of the Image and Video Retrieval, 4th International Conference, 2005

Context-Based Video Retrieval for Life-Log Applications.
Proceedings of the Multimedia Content and the Semantic Web, 2005

2004
Editorial.
EURASIP J. Adv. Signal Process., 2004

Layered indexing of home video based on audio signals.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

Home video summarization by shot characteristics and user's feedback.
Proceedings of the Storage and Retrieval Methods and Applications for Multimedia 2004, 2004

All-focused light field rendering.
Proceedings of the 15th Eurographics Workshop on Rendering Techniques, 2004

Novel Concept for Video Retrieval in Life Log Application.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Automatic Categorization for WWW Images with Applications for Retrieval Navigation.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Reconstructing dense light field from a multi-focus images array.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Capturing life-log and retrieval based on contexts.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Restoration and demosaicing for pixel mixture images in dsc video clips.
Proceedings of the 2004 International Conference on Image Processing, 2004

Virtual view synthesis through linear processing without geometry.
Proceedings of the 2004 International Conference on Image Processing, 2004

2003
A Solid-State, Simultaneous Wide Angle - Detailed View Video Surveillance Camera.
Proceedings of the 8th International Fall Workshop on Vision, Modeling, and Visualization, 2003

Panel Position Notes - Video Coding: Present and Future.
Proceedings of the Visual Content Processing and Representation, 2003

Three-dimensional image representation of buildings utilizing heterogeneous information for multimedia ambiance communication.
Proceedings of the Visual Communications and Image Processing 2003, 2003

Object-based approach to image-based rendering with linear filters using defocus information.
Proceedings of the Visual Communications and Image Processing 2003, 2003

Context-based video retrieval system for the life-log applications.
Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003

Wearable imaging system for summarizing personal experiences.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

Wide dynamic range imaging by sensitivity adjustable CMOS image sensor.
Proceedings of the 2003 International Conference on Image Processing, 2003

A novel image-based rendering method by linear filtering of multiple focused images acquired by a camera array.
Proceedings of the 2003 International Conference on Image Processing, 2003

Indexing of Personal Video Captured by a Wearable Imaging System.
Proceedings of the Image and Video Retrieval, Second International Conference, 2003

2002
Virtual view generation by linear processing of two differently focused images.
Proceedings of the 29th International Conference on Computer Graphics and Interactive Techniques, 2002

Ubiquitous Displays for Cellular Phone Based Personal Information Environments.
Proceedings of the Advances in Multimedia Information Processing, 2002

Summarization of wearable videos using support vector machine.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

Real-time objects tracking by using smart image sensor and FPGA.
Proceedings of the 2002 International Conference on Image Processing, 2002

Arbitrary view and focus image generation: rendering object-based shifting and focussing effect by linear filtering.
Proceedings of the 2002 International Conference on Image Processing, 2002

Three dimensional modeling of large-scale real environment by fusing range data, texture images, and airborne altimetry data.
Proceedings of the 2002 International Conference on Image Processing, 2002

Scope-Based Interaction - A Technique for Interaction in an Image-Based Virtual Environment.
Proceedings of the Eighth Eurographics Workshop on Virtual Environments, 2002

Construction of Large-Scale Virtual Environment by Fusing Range Data, Texture Images, and Airborne Altimetry Data.
Proceedings of the 1st International Symposium on 3D Data Processing Visualization and Transmission (3DPVT 2002), 2002

2001
Multimedia ambiance communication.
IEEE Signal Process. Mag., 2001

A computational image sensor with adaptive pixel-based integration time.
IEEE J. Solid State Circuits, 2001

Structure analysis of natural scenes using census transform and region competition.
Proceedings of the Visual Communications and Image Processing 2001, 2001

Universal watermark estimation scheme based on error probabilities.
Proceedings of the Security and Watermarking of Multimedia Contents III, 2001

New similarity measure for color image indexing.
Proceedings of the Storage and Retrieval for Media Databases 2001, 2001

Motion Segmentation with Census Transform.
Proceedings of the Advances in Multimedia Information Processing, 2001

Automatic Summarization of Wearable Video - Indexing Subjective Interest.
Proceedings of the Advances in Multimedia Information Processing, 2001

Programmable spatially variant multiresolution readout capability on a sensor focal plane.
Proceedings of the 2001 International Symposium on Circuits and Systems, 2001

New method of on-sensor A/D conversion.
Proceedings of the 2001 International Symposium on Circuits and Systems, 2001

Foreground Extraction Based On Logical Operation Of 3-D Array Frame Difference.
Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

All-focused image generation and 3D modeling of microscopic images of insects.
Proceedings of the 2001 International Conference on Image Processing, 2001

Pixel independent random access image sensor for real time image-based rendering system.
Proceedings of the 2001 International Conference on Image Processing, 2001

A new approach to depth range detection by producing depth-dependent blurring effect.
Proceedings of the 2001 International Conference on Image Processing, 2001

Summarizing wearable video.
Proceedings of the 2001 International Conference on Image Processing, 2001

2000
Producing object-based special effects by fusing multiple differently focused images.
IEEE Trans. Circuits Syst. Video Technol., 2000

High-quality stereo panorama generation using a three-camera system.
Proceedings of the Visual Communications and Image Processing 2000, 2000

Inverse filters for generation of arbitrarily focused images.
Proceedings of the Visual Communications and Image Processing 2000, 2000

Implicit 3D Approach to Image Generation: Object-Based Visual Effects by Linear Processing of Multiple Differently Focused Images.
Proceedings of the Multi-Image Analysis, 2000

A New Computational Image Sensor with Programmable Spatially Variant Multiresolution Readout Capability.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2000), 2000

New A/D Conversion Technique for Smart Image Sensor.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2000), 2000

Digital Watermarking using Inter-Block Correlation: Extension to JPEG Coded Domain.
Proceedings of the 2000 International Symposium on Information Technology (ITCC 2000), 2000

Object-Based Visual Effects by using Multi-focus Images and Its Real-Time Implementation.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Generation of a Disparity Panorama Using a 3-Camera Capturing System.
Proceedings of the 2000 International Conference on Image Processing, 2000

Inverse Filters for Reconstruction of Arbitrarily Focused Images from two Differently Focused Images.
Proceedings of the 2000 International Conference on Image Processing, 2000

A Computational Image Sensor with Pixel-Based Integration Time Control.
Proceedings of the 2000 International Conference on Image Processing, 2000

New design and implementation of adaptive-integration-time image sensor.
Proceedings of the IEEE International Conference on Acoustics, 2000

Acquisition of 3D Image Representation in Multimedia Ambiance Communication using 3D Laser Scanner and Digital Camera.
Proceedings of the Conference on Three-Dimensional Image Capture and Applications III, 2000

1999
Implementation of a 2D motion vector detection on image sensor focal plane.
Proceedings of the 1999 International Symposium on Circuits and Systems, ISCAS 1999, Orlando, Florida, USA, May 30, 1999

Very fast tracking and depth estimation by using focal plane compression sensors.
Proceedings of the 1999 International Symposium on Circuits and Systems, ISCAS 1999, Orlando, Florida, USA, May 30, 1999

Software Based Object Tracking with Visual Feature Integration.
Proceedings of the 1999 International Conference on Image Processing, 1999

Registration and Blur Estimation Methods for Multiple Differently Focused Images.
Proceedings of the 1999 International Conference on Image Processing, 1999

Multi-Media Ambiance Communication Based on Actual Moving Pictures.
Proceedings of the 1999 International Conference on Image Processing, 1999

Real-Time Image Processing by Using Image Compression Sensor.
Proceedings of the 1999 International Conference on Image Processing, 1999

Digital Watermarking Using Inter-Block Correlation.
Proceedings of the 1999 International Conference on Image Processing, 1999

Producing Object-Based Special Visual Effects by Integrating Multiple Differently Focused Images: Implicit 3D Approach to Image Content Manipulation.
Proceedings of the 1999 International Conference on Image Processing, 1999

1998
Acquisition of an all-focused image by the use of multiple differently focused images.
Syst. Comput. Jpn., 1998

Generation of arbitrarily focused images by using multiple differently focused images.
J. Electronic Imaging, 1998

Selection/Substitution of Visual Features for Object Tracking.
Proceedings of IAPR Workshop on Machine Vision Applications, 1998

A New Image Sensor with Space Variant Sampling Control on a Focal Plane.
Proceedings of IAPR Workshop on Machine Vision Applications, 1998

New Design and Implementation of On-Sensor-Compression.
Proceedings of IAPR Workshop on Machine Vision Applications, 1998

Spatially Variant Flexible Sampling Control Integrated on an Image Sensor.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

128 x 128 Pixels Image Sensor for On-Sensor-Compression.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Motion Adaptive Image Sensor.
Proceedings of the ASP-DAC '98, 1998

1997
On sensor image compression.
IEEE Trans. Circuits Syst. Video Technol., 1997

Implementations of on Sensor Image Compression and Comparisons Between Pixel and Column Parallel Architectures.
Proceedings of the 1997 International Conference on Image Processing, 1997

1996
Video Enhancement Sensor Using Motion Adaptive Storage Time.
Proceedings of IAPR Workshop on Machine Vision Applications, 1996

Focal Plane Compression Sensors Based on Pixel Parallel and Column Parallel Architectures.
Proceedings of IAPR Workshop on Machine Vision Applications, 1996

Iterative reconstruction of an all-focused image by using multiple differently focused images.
Proceedings of the 1996 International Conference on Image Processing, 1996

Structural motion segmentation based on probabilistic clustering.
Proceedings of the 1996 International Conference on Image Processing, 1996

On sensor image compression for high pixel rate imaging: pixel parallel and column parallel architectures.
Proceedings of the 1996 International Conference on Image Processing, 1996

1995
A teleconferencing system capable of multiple person eye contact (MPEC) using half mirrors and cameras placed at common points of extended lines of gaze.
IEEE Trans. Circuits Syst. Video Technol., 1995

Model-based image coding: advanced video coding techniques for very low bit-rate applications.
Proc. IEEE, 1995

Detection and Tracking of Facial Features by Using Edge Pixel Counting and Deformable Circular Template Matching.
IEICE Trans. Inf. Syst., 1995

A multiple person eye contact (MPEC) teleconferencing system.
Proceedings of the 1995 International Conference on Image Processing, 1995

1994
Estimation of camera parameters from image sequence for model-based video coding.
IEEE Trans. Circuits Syst. Video Technol., 1994

Analysis and synthesis of facial image sequences in model-based image coding.
IEEE Trans. Circuits Syst. Video Technol., 1994

Two Approaches for Image-processing Based High Resolution Image Acquisition.
Proceedings of the 1994 International Conference on Image Processing, 1994

A Novel Image Sensor for Video Compression.
Proceedings of the 1994 International Conference on Image Processing, 1994

Use of steerable viewing window (SVW) to improve the visual sensation in face to face teleconferencing.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Fractionally spaced equalizers with adaptive sampling.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

An image processing algorithm for a super high definition imaging scheme with multiple different-aperture cameras.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Motion estimation using multiple image sensors.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Very high resolution imaging scheme with multiple different-aperture cameras.
Signal Process. Image Commun., 1993

Motion estimation with wavelet transform and the application to motion compensated interpolation.
Proceedings of the IEEE International Conference on Acoustics, 1993

Subpixel registration for a high resolution imaging scheme using multiple imagers.
Proceedings of the IEEE International Conference on Acoustics, 1993

1992
A scheme for acquiring very high resolution images using multiple cameras.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991
VIWOB: an interactive programming environment for distributed simulations.
Proceedings of the 1991 International Conference on Acoustics, 1991

1990
Real-time facial action image synthesis system driven by speech and text.
Proceedings of the Visual Communications and Image Processing '90: Fifth in a Series, 1990

1989
Model-based analysis synthesis image coding (MBASIC) system for a person's face.
Signal Process. Image Commun., 1989

An intelligent facial image coding driven by speech and phoneme.
Proceedings of the IEEE International Conference on Acoustics, 1989

1986
Adaptive discrete cosine transform image coding using gain/shape vector quantizers.
Proceedings of the IEEE International Conference on Acoustics, 1986

Adaptive discrete cosine transform coding with vector quantization for color images.
Proceedings of the IEEE International Conference on Acoustics, 1986

