Shin'ichi Satoh

Orcid: 0000-0001-6995-6447

Affiliations:
  • National Institute of Informatics, Multimedia Information Research Division, Tokyo, Japan
  • National Center for Science Information Systems (NACSIS), Tokyo, Japan
  • Carnegie Mellon University, School of Computer Science, Pittsburgh, PA, USA
  • Univesity of Tokyo, Japan (PhD 1992)


According to our database1, Shin'ichi Satoh authored at least 402 papers between 1990 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Physical Adversarial Attack Meets Computer Vision: A Decade Survey.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

DisCO: Portrait Distortion Correction with Perspective-Aware 3D GANs.
Int. J. Comput. Vis., November, 2024

Improving deep metric learning via self-distillation and online batch diffusion process.
Vis. Intell., 2024

Deep Counterfactual Representation Learning for Visual Recognition Against Weather Corruptions.
IEEE Trans. Multim., 2024

The SkatingVerse Workshop & Challenge: Methods and Results.
CoRR, 2024

The Effects of Short Video-Sharing Services on Video Copy Detection.
CoRR, 2024

Vers une pédagogie inclusive : une classification multimodale des illustrations de manuels scolaires pour des environnements d'apprentissage adaptés.
Proceedings of the Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, 2024

Utiliser l'explicabilité des modèles pour mettre en évidence les expressions genrées dans la parole.
Proceedings of the Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, 2024

Matting by Generation.
Proceedings of the ACM SIGGRAPH 2024 Conference Papers, 2024

Hierarchical Debiasing and Noisy Correction for Cross-domain Video Tube Retrieval.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

ExCEDA: Unlocking Attention Paradigms in Extended Duration E-Classrooms by Leveraging Attention-Mechanism Models.
Proceedings of the 7th IEEE International Conference on Multimedia Information Processing and Retrieval, 2024

Mitigating robust overfitting via self-residual-calibration regularization (Abstract Reprint).
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers.
Proceedings of the Computer Vision - ECCV 2024, 2024

Unveiling Learner Dynamics: The ECLIPSE Dataset and NeuralGaze Framework for Prolonged Engagement Assessment in Online Learning.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Contributing Dimension Structure of Deep Feature for Coreset Selection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Towards Robust Person Re-Identification by Defending Against Universal Attackers.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Mitigating robust overfitting via self-residual-calibration regularization.
Artif. Intell., April, 2023

Geo-Localization via Ground-to-Satellite Cross-View Image Retrieval.
IEEE Trans. Multim., 2023

Progressive Motion Boosting for Video Frame Interpolation.
IEEE Trans. Multim., 2023

Unsupervised Domain Adaptation for Person Re-Identification Via Individual-Preserving and Environmental-Switching Cyclic Generation.
IEEE Trans. Multim., 2023

Win-Win by Competition: Auxiliary-Free Cloth-Changing Person Re-Identification.
IEEE Trans. Image Process., 2023

TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting.
CoRR, 2023

Rethinking Adversarial Training with A Simple Baseline.
CoRR, 2023

The 3rd Anti-UAV Workshop & Challenge: Methods and Results.
CoRR, 2023

Certified Zeroth-order Black-Box Defense with Robust UNet Denoiser.
CoRR, 2023

Improving Adversarial Robustness via Information Bottleneck Distillation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

How People Watch Videos? Viewer Behavior Analysis for Video Archive Summarization.
Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long Videos, 2023

Event-based High-speed Ball Detection in Sports Video.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content.
Proceedings of the 1st International Workshop on Deep Multimodal Learning for Information Retrieval, 2023

RanLayNet: A Dataset for Document Layout Detection used for Domain Adaptation and Generalization.
Proceedings of the ACM Multimedia Asia 2023, 2023

Beyond Domain Gap: Exploiting Subjectivity in Sketch-Based Person Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

From Scarcity to Understanding: Transfer Learning for the Extremely Low Resource Irish Sign Language.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Single Image Deblurring with Row-dependent Blur Magnitude.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Referring Image Segmentation via Joint Mask Contextual Embedding Learning and Progressive Alignment Network.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GEC-DCL: Grammatical Error Correction Model with Dynamic Context Learning for Paragraphs and Scholarly Papers.
Proceedings of the Big Data and Artificial Intelligence - 11th International Conference, 2023

HOTCOLD Block: Fooling Thermal Infrared Detectors with a Novel Wearable Design.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Only a Few Classes Confusing: Pixel-Wise Candidate Labels Disambiguation for Foggy Scene Understanding.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Guest Editorial: Learning From Noisy Multimedia Data.
IEEE Trans. Multim., 2022

Weakly-Supervised Learning With Complementary Heatmap for Retinal Disease Detection.
IEEE Trans. Medical Imaging, 2022

Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion.
IEEE Trans. Image Process., 2022

Vehicle Counting in Very Low-Resolution Aerial Images via Cross-Resolution Spatial Consistency and Intraresolution Time Continuity.
IEEE Trans. Geosci. Remote. Sens., 2022

Capturing Small, Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement.
IEEE Trans. Circuits Syst. Video Technol., 2022

Discovering regression-detection bi-knowledge transfer for unsupervised cross-domain crowd counting.
Neurocomputing, 2022

Self-distillation with Online Diffusion on Batch Manifolds Improves Deep Metric Learning.
CoRR, 2022

Physical Adversarial Attack meets Computer Vision: A Decade Survey.
CoRR, 2022

Improving Generalization of Metric Learning via Listwise Self-distillation.
CoRR, 2022

On Assisting Diagnoses of Pareidolia by Emulating Patient Behavior.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Personalized Fashion Recommendation Using Pairwise Attention.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Towards Causality Inference for Very Important Person Localization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Reference-Guided Texture and Structure Inference for Image Inpainting.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Multiple Object Tracking from appearance by hierarchically clustering tracklets.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
DotSCN: Group Re-Identification via Domain-Transferred Single and Couple Representation Learning.
IEEE Trans. Circuits Syst. Video Technol., 2021

Community Detection Using Restrained Random-Walk Similarity.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

A multimedia document browser based on multilayer networks.
Multim. Tools Appl., 2021

Uncertainty-Aware Semantic Guidance and Estimation for Image Inpainting.
IEEE J. Sel. Top. Signal Process., 2021

Classification of large-scale image database of various skin diseases using deep learning.
Int. J. Comput. Assist. Radiol. Surg., 2021

MADGAN: unsupervised medical anomaly detection GAN using multiple adjacent brain MRI slice reconstruction.
BMC Bioinform., 2021

Imageability- and Length-Controllable Image Captioning.
IEEE Access, 2021

Pose-aware Outfit Transfer between Unpaired in-the-wild Fashion Images.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Improving Camouflaged Object Detection with the Uncertainty of Pseudo-edge Labels.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Efficient Nearest Neighbor Search by Removing Anti-hub.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Video Action Retrieval Using Action Recognition Model.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

A multi-modal dataset for analyzing the imageability of concepts across modalities.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Continuous and Gradual Style Changes of Graphic Designs with Generative Model.
Proceedings of the IUI '21: 26th International Conference on Intelligent User Interfaces, 2021

Hierarchical Attention Image-Text Alignment Network For Person Re-Identification.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021

Unsupervised Common Particular Object Discovery and Localization by Analyzing a Match Graph.
Proceedings of the IEEE International Conference on Acoustics, 2021

Image Inpainting Guided by Coherence Priors of Semantics and Textures.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learning to Attack Real-World Models for Person Re-identification via Virtual-Guided Meta-Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Illumination-Adaptive Person Re-Identification.
IEEE Trans. Multim., 2020

How to Extract More Information With Less Burden: Fundus Image Classification and Retinal Disease Localization With Ophthalmologist Intervention.
IEEE J. Biomed. Health Informatics, 2020

Learning Sparse and Identity-Preserved Hidden Attributes for Person Re-Identification.
IEEE Trans. Image Process., 2020

Long-Term Background Redundancy Reduction for Earth Observatory Video Coding.
IEEE Trans. Circuits Syst. Video Technol., 2020

SDL: Spectrum-Disentangled Representation Learning for Visible-Infrared Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2020

Rephrasing Visual Questions by Specifying the Entropy of the Answer Distribution.
IEICE Trans. Inf. Syst., 2020

An Entropy Clustering Approach for Assessing Visual Question Difficulty.
IEEE Access, 2020

NII_UIT AT TRECVID 2020.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Is AutoML a practical way of tackling DSDI Task?
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Progressive Domain Adaptation for Robot Vision Person Re-identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

An Interactive Design for Visualizable Person Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Effective and Efficient: Toward Open-world Instance Re-identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Searching For Desired Person Doing Desired Action based on Visual and Audio Feature in Large Scale Video Database.
Proceedings of the International Conference on Multimedia Analysis and Pattern Recognition, 2020

Beyond Intra-modality: A Survey of Heterogeneous Person Re-identification.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

When Pedestrian Detection Meets Nighttime Surveillance: A New Benchmark.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

FashionGraph: Understanding fashion data using scene graph generation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Siamese-structure Deep Neural Network Recognizing Changes in Facial Expression According to the Degree of Smiling.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Motion Feedback Design for Video Frame Interpolation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes.
Proceedings of the Computer Vision - ECCV 2020, 2020

Analysis of Typefaces Designed for Readers with Developmental Dyslexia - Insights from Neural Networks.
Proceedings of the Document Analysis Systems - 14th IAPR International Workshop, 2020

ADINet: Attribute Driven Incremental Network for Retinal Image Classification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Incremental Re-Identification by Cross-Direction and Cross-Ranking Adaption.
IEEE Trans. Multim., 2019

Video instance search via spatial fusion of visual words and object proposals.
Int. J. Multim. Inf. Retr., 2019

GAN-based Multiple Adjacent Brain MRI Slice Reconstruction for Unsupervised Alzheimer's Disease Diagnosis.
CoRR, 2019

Beyond Intra-modality Discrepancy: A Comprehensive Survey of Heterogeneous Person Re-identification.
CoRR, 2019

Group Re-identification via Transferred Single and Couple Representation Learning.
CoRR, 2019

Illumination-Adaptive Person Re-identification.
CoRR, 2019

Learning More with Less: GAN-based Medical Image Augmentation.
CoRR, 2019

Evaluating Face Tracking for Political Analysis in Japanese News Over a Long Period of Time.
Proceedings of the 2019 IEEE/WIC/ACM International Conference on Web Intelligence, 2019


Poses Guide Spatiotemporal Model for Vehicle Re-identification.
Proceedings of the MultiMedia Modeling - 25th International Conference, 2019

Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV Series.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

DoT-GNN: Domain-Transferred Graph Neural Network for Group Re-identification.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Gastric Cancer Detection from Endoscopic Images Using Synthesis by GAN.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Burst-survive Temporal Matching Kernel with Fibonacci Periods.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning to Reduce Dual-Level Discrepancy for Infrared-Visible Person Re-Identification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning More with Less: Conditional PGGAN-based Data Augmentation for Brain Metastases Detection Using Highly-Rough Annotation on MR Images.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

GAN-Based Multiple Adjacent Brain MRI Slice Reconstruction for Unsupervised Alzheimer's Disease Diagnosis.
Proceedings of the Computational Intelligence Methods for Bioinformatics and Biostatistics, 2019

SpotFake: A Multi-modal Framework for Fake News Detection.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Group Re-Identification via Transferred Representation and Adaptive Fusion.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Maybe Look Closer? Detecting Trolling Prone Images on Instagram.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Fundus Image Classification and Retinal Disease Localization with Limited Supervision.
Proceedings of the Pattern Recognition - 5th Asian Conference, 2019

Efficient Image Retrieval via Decoupling Diffusion into Online and Offline Processing.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Supervised Distributed Hashing for Large-Scale Multimedia Retrieval.
IEEE Trans. Multim., 2018

Learning From Cross-Domain Media Streams for Event-of-Interest Discovery.
IEEE Trans. Multim., 2018

Person Reidentification via Discrepancy Matrix and Matrix Metric.
IEEE Trans. Cybern., 2018

An adjustable-purpose image watermarking technique by particle swarm optimization.
Multim. Tools Appl., 2018

Introduction to Special Issue of the 23rd International Conference on Multimedia Modeling (MMM 2017).
Multim. Tools Appl., 2018

Towards robots reasoning about group behavior of museum visitors: Leader detection and group tracking.
J. Ambient Intell. Smart Environ., 2018

Digital watermarking for deep neural networks.
Int. J. Multim. Inf. Retr., 2018

Multilevel Thresholding Color Image Segmentation Using a Modified Artificial Bee Colony Algorithm.
IEICE Trans. Inf. Syst., 2018

Speech Reconstitution using Multi-view Silent Videos.
CoRR, 2018

Multimodal Co-Training for Selecting Good Examples from Webly Labeled Video.
CoRR, 2018

NII_HITACHI_UIT at TRECVID 2018.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

Incremental Deep Hidden Attribute Learning.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Exploring Temporal Communities in Mass Media Archives.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Reconfigurable Inverted Index.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Cascaded SR-GAN for Scale-Adaptive Low Resolution Person Re-identification.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Revisiting Column-Wise Vector Quantization for Memory-Efficient Matrix Multiplication.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Tracked Instance Search.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Discriminative Learning of Open-Vocabulary Object Retrieval and Localization by Negative Phrase Augmentation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Generating "Visual Clouds" from Multiplex Networks for TV News Archive Query Visualization.
Proceedings of the 2018 International Conference on Content-Based Multimedia Indexing, 2018

2017
Scalable Face Track Retrieval in Video Archives Using Bag-of-Faces Sparse Representation.
IEEE Trans. Circuits Syst. Video Technol., 2017

Evaluation of multiple features for violent scenes detection.
Multim. Tools Appl., 2017

Instance search retrospective with focus on TRECVID.
Int. J. Multim. Inf. Retr., 2017

Audience Behavior Mining: Integrating TV Ratings with Multimedia Content.
IEEE Multim., 2017

Efficient large-scale multi-class image classification by learning balanced trees.
Comput. Vis. Image Underst., 2017

Consensus-based Sequence Training for Video Captioning.
CoRR, 2017

Query-Adaptive R-CNN for Open-Vocabulary Object Detection and Retrieval.
CoRR, 2017

Active Learning for Structured Prediction from Partially Labeled Data.
CoRR, 2017


NTT Communication Science Laboratories and National Institute of Informatics at TRECVID 2017 Instance Search.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

A modification of retake detection using simple signature and LCS algorithm.
Proceedings of the 18th IEEE/ACIS International Conference on Software Engineering, 2017

Information Retrieval Model using Generalized Pareto Distribution and Its Application to Instance Search.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Semantic Extraction and Object Proposal for Video Search.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

Statistical Inference of Gaussian-Laplace Distribution for Person Verification.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

FaceCloud: Heterogeneous Cloud Visualization of Multiplex Networks for Multimedia Archive Exploration.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

MANet: A Modal Attention Network for Describing Videos.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

P-S Instance Retrieval via Early Elimination and Late Expansion.
Proceedings of the Workshop on Visual Analysis in Smart and Connected Communities, 2017

Region-Based Image Retrieval Revisited.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Embedding Watermarks into Deep Neural Networks.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Video Indexing, Search, Detection, and Description with Focus on TRECVID.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

Temporal Matching Kernel with Embedded Stability-Sensitive Filter.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Estimating political leanings from mass media via graph-signal restoration with negative edges.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Energy based fast event retrieval in video with temporal match kernel.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Community detection using random-walk similarity and application to image clustering.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Face2Graph: Base de données graphe et visualisation pour l'annotation d'archives vidéos.
Proceedings of the 17ème Journées Francophones Extraction et Gestion des Connaissances, 2017

Deep Multi-label Hashing for Large-Scale Visual Search Based on Semantic Graph.
Proceedings of the Web and Big Data - First International Joint Conference, 2017

2016
Visual Analytics of Political Networks From Face-Tracking of News Video.
IEEE Trans. Multim., 2016

Web Image Search Re-Ranking With Click-Based Similarity and Typicality.
IEEE Trans. Image Process., 2016

Introduction of New Associate Editors.
IEEE Trans. Circuits Syst. Video Technol., 2016

Bidirectional extraction and recognition of scene text with layout consistency.
Int. J. Document Anal. Recognit., 2016

Bayesian Exponential Inverse Document Frequency and Region-of-Interest Effect for Enhancing Instance Search Accuracy.
IEICE Trans. Inf. Syst., 2016

Human Action Recognition from Depth Videos Using Pool of Multiple Projections with Greedy Selection.
IEICE Trans. Inf. Syst., 2016

Query Bootstrapping: A Visual Mining Based Query Expansion.
IEICE Trans. Inf. Syst., 2016

Adaptive Substring Extraction and Modified Local NBNN Scoring for Binary Feature-based Local Mobile Visual Search without False Positives.
CoRR, 2016

Image Retrieval with Fisher Vectors of Binary Features.
CoRR, 2016

When face-tracking meets social networks: a story of politics in news videos.
Appl. Netw. Sci., 2016


Concept-Level Multimodal Ranking of Flickr Photo Tags via Recall Based Weighting.
Proceedings of the 2016 ACM Workshop on Multimedia COMMONS, 2016

News Archive Exploration Combining Face Detection and Tracking with Network Visual Analytics.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

NII-UIT at MediaEval 2016 Predicting Media Interestingness Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016

Unsupervised Estimation of Video Continuity Model from Large-Scale Video Archives and Its Application to Shot Boundary Detection.
Proceedings of the IEEE International Symposium on Multimedia, 2016

Audience Behavior Mining by Integrating TV Ratings with Multimedia Contents.
Proceedings of the IEEE International Symposium on Multimedia, 2016

Geometric verification using semi-2D constraints for 3D object retrieval.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Unsupervised learning of supervoxel embeddings for video Segmentation.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Using node relationships for hierarchical classification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Image sentiment analysis using latent correlations among visual, textual, and sentiment views.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Efficient Large Scale Image Classification via Prediction Score Decomposition.
Proceedings of the Computer Vision - ECCV 2016, 2016

Large-Scale R-CNN with Classifier Adaptive Quantization.
Proceedings of the Computer Vision - ECCV 2016, 2016

Faster R-CNN Features for Instance Search.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

Video Event Detection by Exploiting Word Dependencies from Image Captions.
Proceedings of the COLING 2016, 2016

2015
Guest editorial: selected papers from ICIMCS 2012.
Multim. Syst., 2015

Automated real-time video surveillance summarization framework.
J. Real Time Image Process., 2015

Enhanced Visualization of News Shot Cloud with Employing Circular Layout.
Proceedings of the 8th International Symposium on Visual Information Communication and Interaction, 2015


A Social Network Analysis of Face Tracking in News Video.
Proceedings of the 11th International Conference on Signal-Image Technology & Internet-Based Systems, 2015

Cross-View Action Recognition by Projection-Based Augmentation.
Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Query-adaptive late fusion with neural network for instance search.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Large scale multi-class classification using latent classifiers.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

Human Action recognition from depth videos using multi-projection based representation.
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015

NII-UIT Browser: A Multimodal Video Search System.
Proceedings of the MultiMedia Modeling - 21st International Conference, 2015

Temporal Matching Kernel with Explicit Feature Maps.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Multimedia Event Detection Using Event-Driven Multiple Instance Learning.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

NII-UIT at MediaEval 2015 Affective Impact of Movies Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

PVSS: portable visual search service for researchers.
Proceedings of the 7th International Conference on Internet Multimedia Computing and Service, 2015

Learning Balanced Trees for Large Scale Image Classification.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

Local feature reliability measure using multiview synthetic images for mobile visual search.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Multimedia Event Detection Using Segment-Based Approach for Motion Feature.
J. Signal Process. Syst., 2014

BM25 With Exponential IDF for Instance Search.
IEEE Trans. Multim., 2014

An interaction model between human and system for intuitive graphical search interface.
Knowl. Inf. Syst., 2014

BIG-OH: BInarization of gradient orientation histograms.
Image Vis. Comput., 2014

Dense Segmentation of Textured Fruits in Video Sequences.
Proceedings of the VISAPP 2014, 2014

VabCut: A Video Extension of GrabCut for Unsupervised Video Foreground Object Segmentation.
Proceedings of the VISAPP 2014, 2014

Nagoya University at TRECVID 2014: the Instance Search Task.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

National Institute of Informatics, Japan at TRECVID 2014.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Image Annotation Fusing Content-Based and Tag-Based Technique Using Support Vector Machine and Vector Space Model.
Proceedings of the Tenth International Conference on Signal-Image Technology and Internet-Based Systems, 2014

<i>Recommend-Me</i>: recommending query regions for image search.
Proceedings of the Symposium on Applied Computing, 2014

Tell Me about TV Commercials of This Product.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

NII-UIT: A Tool for Known Item Search by Sequential Pattern Filtering.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014

Efficient Cross-Domain Image Retrieval by Multi-Level Matching and Spatial Verification for Structural Similarity.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

User Emotion Sensing in Search Process based on Chromatic Sensation.
Proceedings of the 1st ACM International Workshop on Human Centered Event Understanding from Multimedia, 2014

An Application Search Interface Including Sense-related Search Facets.
Proceedings of the International Conference on Multimedia Retrieval, 2014

NII-UIT at MediaEval 2014 Violent Scenes Detection Affect Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2014 Workshop, 2014

A practical spatial re-ranking method for instance search from videos.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Sum-max video pooling for complex event recognition.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Multi-image aggregation for better visual object retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2014

Binary feature-based image retrieval with effective indexing and scoring.
Proceedings of the IEEE 3rd Global Conference on Consumer Electronics, 2014

Image Flows Visualization for Inter-media Comparison.
Proceedings of the IEEE Pacific Visualization Symposium, 2014

2013
Annotation propagation in image databases using similarity graphs.
ACM Trans. Multim. Comput. Commun. Appl., 2013

Ultrahigh-Speed TV Commercial Detection, Extraction, and Matching.
IEEE Trans. Circuits Syst. Video Technol., 2013

Human gesture recognition system for TV viewing using time-of-flight camera.
Multim. Tools Appl., 2013

A Framework for Video Segmentation using Global and Local Features.
Int. J. Pattern Recognit. Artif. Intell., 2013

Face Retrieval in Large-Scale News Video Datasets.
IEICE Trans. Inf. Syst., 2013

The Future of Multimedia Analysis and Mining: Visions from the Shonan Meeting.
IEEE Multim., 2013

Improving the performance of SIFT and CSLBP for image copy detection.
Proceedings of the 36th International Conference on Telecommunications and Signal Processing, 2013

Violent scene detection using mid-level feature.
Proceedings of the 4th International Symposium on Information and Communication Technology, 2013

Evaluation of low-level features for detecting violent scenes in videos.
Proceedings of the 2013 International Conference on Soft Computing and Pattern Recognition, 2013

A Negative Sample Image Selection Method Referring to Semantic Hierarchical Structure for Image Annotation.
Proceedings of the Ninth International Conference on Signal-Image Technology & Internet-Based Systems, 2013

A video navigation interface using multi-faceted search hierarchies.
Proceedings of the Multimedia Systems Conference 2013, 2013

NII-UIT-VBS: A Video Browsing Tool for Known Item Search.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

Connect commercial films with realities.
Proceedings of the International Conference on Multimedia Retrieval, 2013

NII-UIT at MediaEval 2013 Violent Scenes Detection Affect Task.
Proceedings of the MediaEval 2013 Multimedia Benchmark Workshop, 2013

Self-localization and Navigation in Dynamic Search Hierarchy for Video Retrieval Interface.
Proceedings of the Knowledge, Information and Creativity Support Systems: Recent Trends, Advances and Solutions - Selected Papers from KICSS'2013, 2013

A Classification-Based Approach for Retake and Scene Detection in Rushes Video.
Proceedings of the Neural Information Processing - 20th International Conference, 2013

Person Re-identification Using Deformable Part Models.
Proceedings of the Neural Information Processing - 20th International Conference, 2013

Attribute-based learning for large scale object classification.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Evaluation of visual object retrieval datasets.
Proceedings of the IEEE International Conference on Image Processing, 2013

Efficient instance search from large video database via sparse filters in subspaces.
Proceedings of the IEEE International Conference on Image Processing, 2013

Bag of visual words model for videos segmentation into scenes.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013


Query-Adaptive Asymmetrical Dissimilarities for Visual Object Retrieval.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Bag-of-Words Against Nearest-Neighbor Search for Visual Object Retrieval.
Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, 2013

Estimation of Attentiveness of People Watching TV Based on Their Emotional Behaviors.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
The Future of Multimedia Analysis and Mining (NII Shonan Meeting 2012-9).
NII Shonan Meet. Rep., 2012

Structured learning of local features for human action classification and localization.
Image Vis. Comput., 2012

Efficient Tracking of News Topics Based on Chronological Semantic Structures in a Large-Scale News Video Archive.
IEICE Trans. Inf. Syst., 2012

Integrating local action elements for action analysis.
Comput. Vis. Image Underst., 2012

NTT Communication Science Laboratories and National Institute of Informatics at TRECVID 2012 Instance Search and Multimedia Event Detection Tasks.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

National Institute of Informatics, Japan at TRECVID 2012.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Q-CSLBP: Compression of CSLBP Descriptor.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Large vocabulary quantization for searching instances from videos.
Proceedings of the International Conference on Multimedia Retrieval, 2012

NII, Japan at MediaEval 2012 Violent Scenes Detection Affect Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Robust eye localization in video by combining eye detector and eye tracker.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

A Codeword Visualization Tool for Dense Trajectory Feature.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012

Auto face re-ranking by mining the web and video archives.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

An interactive exploration system that visually supports learning of country features.
Proceedings of the 11th International Conference on Interaction Design and Children, 2012

2011
Simple low-dimensional features approximating NCC-based image matching.
Pattern Recognit. Lett., 2011

NHK STRL at TRECVID 2011: Surveillance Event Detection and Semantic Indexing.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

National Institute of Informatics, Japan at TRECVID 2011.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

NTT Communication Science Laboratories and NII at TRECVID 2011 Instance Search Task.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

A Comprehensive Study of Feature Representations for Semantic Concept Detection.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

Summarizing Large News Video Archives by Event Ranking.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

NII-KAORI-PERSON-SEARCH: A General Framework for Indexing and Retrieving People's Appearance in Large Video Archives.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

Generalized Lasso based Approximation of Sparse Coding for Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

News Shot Cloud: Ranking TV News Shots by Cross TV-Channel Filtering for Efficient Browsing of Large-Scale News Video Archives.
Proceedings of the Advances in Multimedia Modeling, 2011

Knowledge propagation in large image databases using neighborhood information.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

NII, Japan at MediaEval 2011 Violent Scenes Detection Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Efficient quantization of color sift for image classification.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Commercial mining basedon temporal recurrence hashing algorithm and bag-of-fingerprints model.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Fast face sequence matching in large-scale video databases.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Boosting global scene classification accuracy by discriminative region localization.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Improving Image Categorization by Using Multiple Instance Learning with Spatial Relation.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

Improving Retake Detection by Adding Motion Feature.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

Indexing Faces in Broadcast News Video Archives.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Compact correlation coding for visual object categorization.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Temporal recurrence hashing algorithm for mining commercials from multimedia streams.
Proceedings of the IEEE International Conference on Acoustics, 2011

Human action recognition in crowded surveillance video sequences by using features taken from key-point trajectories.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

NII, Japan at ImageCLEF 2011 Photo Annotation Task.
Proceedings of the CLEF 2011 Labs and Workshop, 2011

2010
Automatic Pitch Type Recognition System from Single-View Video Sequences of Baseball Broadcast Videos.
Int. J. Multim. Data Eng. Manag., 2010

Robust Recognition of Specific Human Behaviors in Crowded Surveillance Video Sequences.
EURASIP J. Adv. Signal Process., 2010

National Institute of Informatics, Japan at TRECVID 2010.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

NTT Communication Science Laboratories and NII in TRECVID 2010 Instance Search Task.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

NHK STRL at TRECVID 2010: Semantic Indexing and Surveillance Event Detection.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Real Time Tunnel Based Video Summarization Using Direct Shift Collision Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

PageRank with Text Similarity and Video Near-Duplicate Constraints for News Story Re-ranking.
Proceedings of the Advances in Multimedia Modeling, 2010

Human gesture recognition using 3.5-dimensional trajectory features for hands-free user interface.
Proceedings of the first ACM international workshop on Analysis and retrieval of tracked events and motion in imagery streams, 2010

The python computer vision framework.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Construction of image retrieval systems focused on user knowledge interaction.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Weakly Supervised Action Recognition Using Implicit Shape Models.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Unified Approach to Detection and Identification of Commercial Films by Temporal Occurrence Pattern.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Detecting Screen Shot Images within Large-Scale Video Archive.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Video Retrieval Based on Tracked Features Quantization.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Building Compact Local Pairwise Codebook with Joint Feature Space Clustering.
Proceedings of the Computer Vision, 2010

An efficient method for face retrieval from large video datasets.
Proceedings of the 9th ACM International Conference on Image and Video Retrieval, 2010

Learning Directional Local Pairwise Bases with Sparse Coding.
Proceedings of the British Machine Vision Conference, 2010

Human Action Recognition and Localization in Video Using Structured Learning of Local Space-Time Features.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

2009
Editorial.
Vis. Comput., 2009

NHK STRL at TRECVID 2009: Surveillance Event Detection and High-Level Feature Extraction.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

ISM TRECVID2009 High-level Feature Extraction.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

National Institute of Informatics, Japan at TRECVID 2009.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Visually Guided Summary Based on Roles of Shots for Understanding News Topics.
Proceedings of the Fifth International Conference on Signal-Image Technology & Internet-Based Systems, 2009

Efficient concept detection by fusing simple visual features.
Proceedings of the 2009 ACM Symposium on Applied Computing (SAC), 2009

News Topic Tracking and Re-ranking with Query Expansion Based on Near-Duplicate Detection.
Proceedings of the Advances in Multimedia Information Processing, 2009

A Novel Retake Detection Using LCS and SIFT Algorithm.
Proceedings of the Advances in Multimedia Information Processing, 2009

Personalized News Video Recommendation.
Proceedings of the Advances in Multimedia Modeling, 2009

Large-scale news topic tracking and key-scene ranking with video near-duplicate constraints.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Indexing local configurations of features for scalable content-based video copy detection.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

High-level feature extraction using SVM with walk-based graph kernel.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Video Near-duplicate Detection.
Proceedings of the Encyclopedia of Multimedia, 2nd Ed., 2008

Analyzing Person Information in News Video.
Proceedings of the Encyclopedia of Multimedia, 2nd Ed., 2008

Face Detection, Tracking, and Recognition for Broadcast Video.
Proceedings of the Encyclopedia of Multimedia, 2nd Ed., 2008

Integrating multi-modal content analysis and hyperbolic visualization for large-scale news video retrieval and exploration.
Signal Process. Image Commun., 2008

Media objects for user-centered similarity matching.
Multim. Tools Appl., 2008

ISM TRECVID2008 High-level Feature Extraction.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

National Institute of Informatics, Japan at TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Robust Face Track Finding in Video Using Tracked Points.
Proceedings of the 4th IEEE International Conference on Signal Image Technology and Internet Based Systems, 2008

A text segmentation based approach to video shot boundary detection.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

A Novel Approach for Filtering Junk Images from Google Search Results.
Proceedings of the Advances in Multimedia Modeling, 2008

New Approach for Hierarchical Classifier Training and Multi-level Image Annotation.
Proceedings of the Advances in Multimedia Modeling, 2008

Scene duplicate detection based on the pattern of discontinuities in feature point trajectories.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Image-based quiz generation from news video archives based on principal object.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Video rushes summarization utilizing retake characteristics.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Rushes summarization using different redundancy elimination approaches.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Unsupervised Face Annotation by Mining the Web.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008

2007
Ent-Boost: Boosting using entropy measures for robust object detection.
Pattern Recognit. Lett., 2007

Mining Large-Scale News Video Database Via Knowledge Visualization.
Proceedings of the Advances in Visual Information Systems, 9th International Conference, 2007

NII-ISM, Japan at TRECVID 2007: High Level Feature Extraction.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Large scale news video database browsing and retrieval via information visualization.
Proceedings of the 2007 ACM Symposium on Applied Computing (SAC), 2007

National institute of informatics, japan at TRECVID 2007: BBC rushes summarization.
Proceedings of the 1st ACM Workshop on Video Summarization, 2007

mediaWalker: a video archive explorer based on time-series semantic structure.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Scene duplicate detection from videos based on trajectories of feature points.
Proceedings of the 9th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2007

Analyzing Large-Scale News Video Databases to Support Knowledge Visualization and Intuitive Retrieval.
Proceedings of the 2nd IEEE Symposium on Visual Analytics Science and Technology, 2007

Image-Based Quizzes from News Video Archives.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Boosting Face Retrieval by using Relevant Set Correlation Clustering.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Finding Important People in Large News Video Databases Using Multimodal and Clustering Analysis.
Proceedings of the 23rd International Conference on Data Engineering Workshops, 2007

Using Visual-Textual Mutual Information and Entropy for Inter-modal Document Indexing.
Proceedings of the Advances in Information Retrieval, 2007

Video search by multi-modal and clustering analysis.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

A Study of Intra-Modal Association Rules for Visual Modality Representation.
Proceedings of the International Workshop on Content-Based Multimedia Indexing, 2007

2006
Analyzing Person Information in News Video.
Proceedings of the Encyclopedia of Multimedia, 2006

A Multi-Stage Approach to Fast Face Detection.
IEICE Trans. Inf. Syst., 2006

Shot Boundary Detection and High-Level Feature Extraction Experiments for TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Concept Detection Using Local Binary Patterns and SVM.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Large-scale news video retrieval via visualization.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Exploring Large-Scale Video News via Interactive Visualization.
Proceedings of the 1st IEEE Symposium On Visual Analytics Science And Technology, 2006

Ent-Boost: Boosting Using Entropy Measure for Robust Object Detection.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

A hybrid classifier for precise and robust eye detection.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Identification and Detection of the Same Scene Based on Flash Light Patterns.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Exploiting Topic Thread Structures in a News Video Archive for the Semi-Automatic Generation of Video Summaries.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Robust Object Detection using Fast Feature Selection from Huge Feature Sets.
Proceedings of the International Conference on Image Processing, 2006

Context-Based Conceptual Image Indexing.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Face Retrieval in Broadcasting News Video by Fusing Temporal and Intensity Information.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

Using Topic Concepts for Semantic Video Shots Classification.
Proceedings of the Image and Video Retrieval, 5th International Conference, 2006

2005
CLIPS-LSR-NII Experiments at TRECVID 2005.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Examination and enhancement of a ring-structured graphical search interface based on usability testing.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Cooking navi: assistant for daily cooking in kitchen.
Proceedings of the 13th ACM International Conference on Multimedia, 2005

An Efficient Feature Selection Method for Object Detection.
Proceedings of the Pattern Recognition and Data Mining, 2005

<i>trackThem</i>: Exploring a Large-Scale News Video Archive by Tracking Human Relations.
Proceedings of the Information Retrieval Technology, 2005

2004
Subject region segmentation in disparity maps for image retrieval.
Syst. Comput. Jpn., 2004

Person X Detector.
Proceedings of the 2004 TREC Video Retrieval Evaluation, 2004

Semantic Retrieval in a Large-Scale Video Database by Using Both Image and Text Feature.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A News Video Browser Using Identical Video Segment Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Key Image Extraction from a News Video Archive for Visualizing Its Semantic Structure.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Mining Large-Scale Broadcast Video Archives Towards Inter-video Structuring.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Multimedia Integration for Cooking Video Indexing.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

A Novel Adaptive Image Enhancement Algorithm for Face Detection.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Generalized Histogram: Empirical Optimization of Low Dimensional Features for Image Matching.
Proceedings of the Computer Vision, 2004

Topic Threading for Structuring a Large-Scale News Video Archive.
Proceedings of the Image and Video Retrieval: Third International Conference, 2004

2003
Topic-based inter-video structuring of a large-scale news video corpus.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

2001
Distinctiveness-Sensitive Nearest Neighbor Search for Efficient Similarity Retrieval of Multimedia Information.
Proceedings of the 17th International Conference on Data Engineering, 2001

2000
Application of multidimensional indexing methods to massive processing of multimedia information.
Syst. Comput. Jpn., 2000

Toward the MEdiaSys VIdeo Search Engine (MEVISE).
Proceedings of the Advances in Visual Information Management, 2000

Face Sequence Matching with Certainty Factor Evaluation.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2000), 2000

Comparative Evaluation of Face Sequence Matching for Content-Based Video Access.
Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2000), 2000

1999
Video OCR: Indexing Digital News Libraries by Recognition of Superimposed Captions.
Multim. Syst., 1999

Name-It: Naming and Detecting Faces in News Videos.
IEEE Multim., 1999

an actor/actress annotation system for drama videos.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Towards actor/actress identification in drama videos.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

An Efficient Implementation and Evaluation of Robust Face Sequence Matching.
Proceedings of the 1oth International Conference on Image Analysis and Processing (ICIAP 1999), 1999

Experimental evaluation of disk-based data structures for nearest neighbor searching.
Proceedings of the Data Structures, 1999

Toward Multimedia Document Support inside the AHYDS Platform.
Proceedings of the 1999 International Symposium on Database Applications in Non-Traditional Environments (DANTE '99), 1999

1998
SR-tree: An index structure for nearest-neighbor searching of high-dimensional point data.
Syst. Comput. Jpn., 1998

1997
The SR-tree: An Index Structure for High-Dimensional Nearest Neighbor Queries.
Proceedings of the SIGMOD 1997, 1997

Name-It: Naming and Detecting Faces in Video by the Integration of Image and Natural Language Processing.
Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, 1997

Name-It: Association of Face and Name in Video.
Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97), 1997

1996
A New Type of Video Scene Classification System Based on Typical Model Database.
Proceedings of IAPR Workshop on Machine Vision Applications, 1996

1995
A rule learning method for academic document image processing.
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

An automated generation of an electronic library based on document image understanding.
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

1994
Robust Line Drawing Understanding Incorporating Efficient Closed Symbols Extraction.
Proceedings of IAPR Workshop on Machine Vision Applications, 1994

A document understanding method for database construction of an electronic library.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

1993
A collaborative supporting method between document processing and hypertext construction.
Proceedings of the 2nd International Conference Document Analysis and Recognition, 1993

Drawing image understanding system with capability of rule learning.
Proceedings of the 2nd International Conference Document Analysis and Recognition, 1993

1992
Data model generation in image database systems.
Syst. Comput. Jpn., 1992

A Syntactical Approach to the Database Construction Method from Document Images.
Proceedings of IAPR Workshop on Machine Vision Applications, 1992

Understanding Rule Generation Supporting System for Drawing Understanding using Interaction with User.
Proceedings of IAPR Workshop on Machine Vision Applications, 1992

One method of structural description rule extraction based on graphical and spatial relations.
Proceedings of the 11th IAPR International Conference on Pattern Recognition, 1992

1990
Descriptive Ability of Drawing Image Understanding Framework Using State Transition Models.
Proceedings of IAPR Workshop on Machine Vision Applications, 1990

Drawing image understanding framework using state transition models.
Proceedings of the 10th IAPR International Conference on Pattern Recognition, 1990


  Loading...