2025
Controllable Hand Grasp Generation for HOI and Efficient Evaluation Methods.
CoRR, January, 2025
NeuroViG - Integrating Event Cameras for Resource-Efficient Video Grounding.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
Decreasing Word Error Rates in Paragraph Handwritten Text Recognition with Synthetic Data.
Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, 2025
Visual Prompting for One-shot Controllable Video Editing without Inversion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
SPASCA: Social Presence and Support with Conversational Agent for Persons Living with Dementia.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
CAS: Fusing DNN Optimization & Adaptive Sensing for Energy-Efficient Multi-Modal Inference.
IEEE Robotics Autom. Lett., November, 2024
Enhancing Representation Learning With Spatial Transformation and Early Convolution for Reinforcement Learning-Based Small Object Detection.
IEEE Trans. Circuits Syst. Video Technol., January, 2024
Controllable Video Generation With Text-Based Instructions.
IEEE Trans. Multim., 2024
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering.
IEEE Trans. Multim., 2024
Bridging the Intent Gap: Knowledge-Enhanced Visual Generation.
CoRR, 2024
MVGamba: Unify 3D Content Generation as State Space Sequence Modeling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Poster: Towards Efficient Spatio-Temporal Video Grounding in Pervasive Mobile Devices.
Proceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services, 2024
Diffusion Time-step Curriculum for One Image to 3D Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Learning by Imagination: A Joint Framework for Text-Based Image Manipulation and Change Captioning.
IEEE Trans. Multim., 2023
Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos.
CoRR, 2023
An Overview of Challenges in Egocentric Text-Video Retrieval.
CoRR, 2023
Data Augmentation Using Corner CutMix and an Auxiliary Self-Supervised Loss.
Proceedings of the IEEE International Conference on Image Processing, 2023
Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention.
Proceedings of the 34th British Machine Vision Conference 2023, 2023
Counterfactual Dynamics Forecasting - a New Setting of Quantitative Reasoning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Align R-CNN: A Pairwise Head Network for Visual Relationship Detection.
IEEE Trans. Multim., 2022
Image Understanding With Reinforcement Learning: Auto-Tuning Image Attributes and Model Parameters for Object Detection and Segmentation.
IEEE Trans. Circuits Syst. Video Technol., 2022
EEG-Video Emotion-Based Summarization: Learning With EEG Auxiliary Signals.
IEEE Trans. Affect. Comput., 2022
Self-Supervised Reinforcement Learning for Active Object Detection.
IEEE Robotics Autom. Lett., 2022
Entropy guided attention network for weakly-supervised action localization.
Pattern Recognit., 2022
Is Bio-Inspired Learning Better than Backprop? Benchmarking Bio Learning vs. Backprop.
CoRR, 2022
Reason from Context with Self-supervised Learning.
CoRR, 2022
On the Robustness, Generalization, and Forgetting of Shape-Texture Debiased Continual Learning.
CoRR, 2022
Combined CNN Transformer Encoder for Enhanced Fine-grained Human Action Recognition.
CoRR, 2022
Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.
CoRR, 2022
RoME: Role-aware Mixture-of-Expert Transformer for Text-to-Video Retrieval.
CoRR, 2022
Portmanteauing Features for Scene Text Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Hierarchical Defect Detection Based On Reinforcement Learning.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
Improving Generalization of Reinforcement Learning Using a Bilinear Policy Network.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
Identifying Hard Noise in Long-Tailed Sample Distribution.
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
Lifelog Image Retrieval Based on Semantic Relevance Mapping.
ACM Trans. Multim. Comput. Commun. Appl., 2021
Single-Image Dehazing via Compositional Adversarial Network.
IEEE Trans. Cybern., 2021
A comprehensive survey of procedural video datasets.
Comput. Vis. Image Underst., 2021
FashionSearchNet-v2: Learning Attribute Representations with Localization for Image Retrieval with Attribute Manipulation.
CoRR, 2021
Towards a Programming-Free Robotic System for Assembly Tasks Using Intuitive Interactions.
Proceedings of the Social Robotics - 13th International Conference, 2021
Predicting Event Memorability from Contextual Visual Semantics.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Towards Efficient Multiview Object Detection with Adaptive Action Prediction.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021
Semantic Role Aware Correlation Transformer For Text To Video Retrieval.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Joint Learning on the Hierarchy Representation for Fine-Grained Human Action Recognition.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
A Diagnostic Study Of Visual Question Answering With Analogical Reasoning.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Enhancing Multi-Step Action Prediction for Active Object Detection.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Action Relational Graph for Weakly-Supervised Temporal Action Localization.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
Robust Multi-Frame Future Prediction By Leveraging View Synthesis.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021
TAILOR: Teaching with Active and Incremental Learning for Object Registration.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
Combining Faster R-CNN and Model-Driven Clustering for Elongated Object Detection.
IEEE Trans. Image Process., 2020
Semantically consistent text to fashion image synthesis with an enhanced attentional generative adversarial network.
Pattern Recognit. Lett., 2020
A novel hybrid approach for crack detection.
Pattern Recognit., 2020
Detecting Objects with High Object Region Percentage.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
6D Pose Estimation with Correlation Fusion.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Gesture Enhanced Comprehension of Ambiguous Human-to-Robot Instructions.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020
Active Image Sampling on Canonical Views for Novel Object Detection.
Proceedings of the IEEE International Conference on Image Processing, 2020
Task-Oriented Multi-Modal Question Answering For Collaborative Applications.
Proceedings of the IEEE International Conference on Image Processing, 2020
Learning Cross-Modal Representations for Language-Based Image Manipulation.
Proceedings of the IEEE International Conference on Image Processing, 2020
EEG-based Emotion Recognition Using Spatial-Temporal Representation via Bi-GRU.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
2019
Anticipating Where People will Look Using Adversarial Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2019
6D Pose Estimation with Correlation Fusion.
CoRR, 2019
Prototype Reminding for Continual Learning.
CoRR, 2019
Lift-the-Flap: Context Reasoning Using Object-Centered Graphs.
CoRR, 2019
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019
Self-Teaching Strategy for Learning to Recognize Novel Objects in Collaborative Robots.
Proceedings of the ICRAI 2019: 5th International Conference on Robotics and Artificial Intelligence, 2019
Enhance the Efficacy of Deep CNN with Auxiliary Labels.
Proceedings of the ICRAI 2019: 5th International Conference on Robotics and Artificial Intelligence, 2019
Towards Robust Retrieval for Imperfectly Scanned Point Cloud Objects.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
An Adaptive Fitting Approach for the Visual Detection and Counting of Small Circular Objects in Manufacturing Applications.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Towards Real-Time Crack Detection Using a Deep Neural Network With a Bayesian Fusion Algorithm.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Semantically Consistent Hierarchical Text to Fashion Image Synthesis with an Enhanced-Attentional Generative Adversarial Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019
Attribute Manipulation Generative Adversarial Networks for Fashion Images.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Single Image Rain Removal with Unpaired Information: A Differentiable Programming Perspective.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
A Probabilistic Model of Social Working Memory for Information Retrieval in Social Interactions.
IEEE Trans. Cybern., 2018
Which shirt for my first date? Towards a flexible attribute-based fashion query system.
Pattern Recognit. Lett., 2018
Finding any Waldo: zero-shot invariant and efficient visual search.
CoRR, 2018
Efficient Multi-attribute Similarity Learning Towards Attribute-Based Fashion Search.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018
Personalized Serious Games for Cognitive Intervention with Lifelog Visual Analytics.
Proceedings of the 2018 ACM Multimedia Conference, 2018
Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams.
Proceedings of the 2018 ACM Multimedia Conference, 2018
Egocentric Spatial Memory.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018
DehazeGAN: When Image Dehazing Meets Differential Programming.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
FashionSearchNet: Fashion Search with Attribute Manipulation.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Learning Attribute Representations With Localization for Flexible Fashion Search.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
2017
Summarization of Egocentric Videos: A Comprehensive Survey.
IEEE Trans. Hum. Mach. Syst., 2017
A Wearable Virtual Usher for Vision-Based Cognitive Indoor Navigation.
IEEE Trans. Cybern., 2017
The effect of different types of navigation assistance on indoor scene memorability.
Behav. Inf. Technol., 2017
Visualizing Personal Lifelog Data for Deeper Insights at the NTCIR-13 Lifelog-2 Task.
Proceedings of the 13th NTCIR Conference, 2017
VCI2R at the NTCIR-13 Lifelog-2 Lifelog Semantic Access Task.
Proceedings of the 13th NTCIR Conference, 2017
Foveated neural network: Gaze prediction on egocentric videos.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Multi-layer linear model for top-down modulation of visual attention in natural egocentric vision.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Principal curvature of point cloud for 3D shape recognition.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
VC-I2R@ImageCLEF2017: Ensemble of Deep Learned Features for Lifelog Video Summarization.
Proceedings of the Working Notes of CLEF 2017, 2017
Analysis of Human Attentions for Face Recognition on Natural Videos and Comparison with CV Algorithm on Performance.
Proceedings of the 2017 AAAI Spring Symposia, 2017
Active Video Summarization: Customized Summaries via On-line Interaction with the User.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Describing Lifelogs with Convolutional Neural Networks: A Comparative Study.
Proceedings of the first Workshop on Lifelogging Tools and Applications, 2016
Design and Evaluation of Vibrating Footwear for Navigation Assistance to Visually Impaired People.
Proceedings of the 2016 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData), 2016
Multimodal Multi-Stream Deep Learning for Egocentric Activity Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016
Exploring the Use of Visual Annotations in a Remote Assistance Platform.
Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 2016
2015
Scene text extraction based on edges and support vector regression.
Int. J. Document Anal. Recognit., 2015
Enhancing Social Interaction with Seamless Face Recognition on Google Glass: Leveraging opportunistic multi-tasking on smart phones.
Proceedings of the 17th International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct, 2015
Organizing and retrieving episodic memories from first person view.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015
Whole space subclass discriminant analysis for face recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Exploring users' attitudes towards social interaction assistance on Google Glass.
Proceedings of the 6th Augmented Human International Conference, 2015
2014
Learning Deep Hierarchical Visual Feature Coding.
IEEE Trans. Neural Networks Learn. Syst., 2014
Extended Spectral Regression for efficient scene recognition.
Pattern Recognit., 2014
Robust and Efficient Saliency Modeling from Image Co-Occurrence Histograms.
IEEE Trans. Pattern Anal. Mach. Intell., 2014
Recovering Social Interaction Spatial Structure from Multiple First-Person Views.
Proceedings of the 3rd International Workshop on Socially-Aware Multimedia, 2014
Neurosphere segmentation in brightfield images.
Proceedings of the Medical Imaging 2014: Image Processing, 2014
A wearable virtual guide for context-aware cognitive indoor navigation.
Proceedings of the 16th international conference on Human-computer interaction with mobile devices & services, 2014
Character Recognition in Natural Scenes Using Convolutional Co-occurrence HOG.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014
Incremental Graph Clustering for Efficient Retrieval from Streaming Egocentric Video Data.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014
Understanding the Nature of First-Person Videos: Characterization and Classification Using Low-Level Features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014
Efficient Retrieval from Large-Scale Egocentric Visual Data Using a Sparse Graph Representation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014
Activity Recognition in Egocentric Life-Logging Videos.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014
A Wearable Face Recognition System on Google Glass for Assisting Social Interactions.
Proceedings of the Computer Vision - ACCV 2014 Workshops, 2014
A Three-Color Coupled Level-Set Algorithm for Simultaneous Multiple Cell Segmentation and Tracking.
Proceedings of the Computer Vision - ACCV 2014, 2014
2013
Top-Down Regularization of Deep Belief Networks.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Classification of marine organisms in underwater images using CQ-HMAX biologically inspired color approach.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013
The use of optical and sonar images in the human and dolphin brain for image classification.
Proceedings of the 2013 International Joint Conference on Neural Networks, 2013
A Wearable Cognitive Vision System for Navigation Assistance in Indoor Environment.
Proceedings of the Neural Information Processing - 20th International Conference, 2013
An analysis-synthesis approach for neurosphere modelisation under phase-contrast microscopy.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013
Encoding Co-occurrence of Features in the HMAX Model.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013
Visual Recognition using a Combination of Shape and Color Features.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013
Adaptive picture-in-picture technology based on visual saliency.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013
2012
A non-parametric visual-sense model of images - extending the cluster hypothesis beyond text.
Multim. Tools Appl., 2012
Visual graph modeling for scene recognition and mobile robot localization.
Multim. Tools Appl., 2012
Topic Based Query Suggestions for Video Search.
Proceedings of the Advances in Multimedia Modeling - 18th International Conference, 2012
Autonomous Viewpoint Control from Saliency.
Proceedings of the Biomimetic and Biohybrid Systems - First International Conference, 2012
Neurosphere fate prediction: An analysis-synthesis approach for feature extraction.
Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012
Clustering and use of spatial and frequency information in a biologically inspired approach to image classification.
Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012
Segmentation of neural stem cells/neurospheres in unevenly illuminated brightfield images with shading reduction.
Proceedings of the 21st International Conference on Pattern Recognition, 2012
Segmentation of neural stem cells/neurospheres in high content brightfield microscopy images using localized level sets.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Saliency Modeling from Image Histograms.
Proceedings of the Computer Vision - ECCV 2012, 2012
Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines.
Proceedings of the Computer Vision - ECCV 2012, 2012
Visual Attention is Attracted by Text Features Even in Scenes without Text.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012
A Review of Cognitive Architectures for Visual Memory.
Proceedings of the Biologically Inspired Cognitive Architectures 2012 - Proceedings of the Third Annual Meeting of the BICA Society, Palermo, Sicily, Italy, October 31, 2012
2011
A Computer Assisted Method for Nuclear Cataract Grading From Slit-Lamp Images Using Ranking.
IEEE Trans. Medical Imaging, 2011
Automatic Optic Disc Detection From Retinal Images by a Line Operator.
IEEE Trans. Biomed. Eng., 2011
Advertisement Image Recognition for a Location-Based Reminder System.
Proceedings of the Advances in Multimedia Modeling, 2011
Automated nuclei clump decomposition for image analysis in neuronal cell fluorescent microscopy.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Learning invariant color features with sparse topographic restricted Boltzmann machines.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Computer-aided cataract detection using enhanced texture features on retro-illumination lens images.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Automatic detection of cortical and PSC cataracts using texture and intensity analysis on retro-illumination lens images.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011
Detection of unstained living neurospheres from phase contrast images with very large illumination variations.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011
Extended Visual Memory for Computer-Aided Vision.
Proceedings of the 33rd Annual Meeting of the Cognitive Science Society, 2011
2010
Automatic Area Classification in Peripheral Blood Smears.
IEEE Trans. Biomed. Eng., 2010
Automated Layer Segmentation of Optical Coherence Tomography Images.
IEEE Trans. Biomed. Eng., 2010
A Computer-Aided Diagnosis System of Nuclear Cataract.
IEEE Trans. Biomed. Eng., 2010
Automatic optic disc segmentation based on image brightness and contrast.
Proceedings of the Medical Imaging 2010: Image Processing, 2010
Content-Based Medical Image Retrieval with Metric Learning via Rank Correlation.
Proceedings of the Machine Learning in Medical Imaging, First International Workshop, 2010
Epitomized Summarization of Wireless Capsule Endoscopic Videos for Efficient Visualization.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2010
Enhancement of optic cup detection through an improved vessel kink detection framework.
Proceedings of the Medical Imaging 2010: Computer-Aided Diagnosis, San Diego, 2010
Classification of left and right eye retinal images.
Proceedings of the Medical Imaging 2010: Computer-Aided Diagnosis, San Diego, 2010
Automatic classification of pathological myopia in retinal fundus images using PAMELA.
Proceedings of the Medical Imaging 2010: Computer-Aided Diagnosis, San Diego, 2010
A Recursive and Model-Constrained Region Splitting Algorithm for Cell Clump Decomposition.
Proceedings of the 20th International Conference on Pattern Recognition, 2010
Dictionary of Features in a Biologically Inspired Approach to Image Classification.
Proceedings of the Neural Information Processing. Models and Applications, 2010
Faceted topic retrieval of news video using joint topic modeling of visual features and speech transcripts.
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Learning cell geometry models for cell image simulation: An unbiased approach.
Proceedings of the International Conference on Image Processing, 2010
Automatic cell classification and population estimation in blastocystis autophagy images.
Proceedings of the International Conference on Image Processing, 2010
Automatic macula detection from retinal images by a line operator.
Proceedings of the International Conference on Image Processing, 2010
Automatic optic disc detection through background estimation.
Proceedings of the International Conference on Image Processing, 2010
Intermedia Conceptual Indexing.
Proceedings of the ImageCLEF, Experimental Evaluation in Visual Information Retrieval, 2010
2009
Mobile phone-based mixed reality: the Snap2Play game.
Vis. Comput., 2009
Fuzzy Associative Conjuncted Maps Network.
IEEE Trans. Neural Networks, 2009
A Computer-Aided Diagnosis System of Nuclear Cataract via Ranking.
Proceedings of the Medical Image Computing and Computer-Assisted Intervention, 2009
A Bayesian approach integrating regional and global features for image semantic learning.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Showroom introduction using mobile phone based on scene image recognition.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009
Photometric correction of retinal images by polynomial interpolation.
Proceedings of the International Conference on Image Processing, 2009
Selecting representative and distinctive descriptors for efficient landmark recognition.
Proceedings of the International Conference on Image Processing, 2009
I2R at ImageCLEF Photo Retrieval 2009.
Proceedings of the Working Notes for CLEF 2009 Workshop co-located with the 13th European Conference on Digital Libraries (ECDL 2009), Corfù, Greece, September 30, 2009
A Latent Model for Visual Disambiguation of Keyword-based Image Search.
Proceedings of the British Machine Vision Conference, 2009
Neuro-Retinal Optic Cup Detection in Glaucoma Diagnosis.
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009
Cell Clumping Quantification and Automatic Area Classification in Peripheral Blood Smear Images.
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009
Computerized Systems for Cataract Grading.
Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics, 2009
2008
Semantic Image Representation and Indexing.
Encyclopedia of Multimedia, 2nd Ed., 2008
Unsupervised learning for image classification based on distribution of hierarchical feature tree.
Proceedings of the 2008 IEEE International Conference on Research, Innovation and Vision for the Future in Computing & Communication Technologies, 2008
Rich representation and ranking for photographic image retrieval in ImageCLEF 2007.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008
Bi-modal Conceptual Indexing for Medical Image Retrieval.
Proceedings of the Advances in Multimedia Modeling, 2008
Snap2Play: A Mixed-Reality Game Based on Scene Identification.
Proceedings of the Advances in Multimedia Modeling, 2008
CALM: cascading system with leaking detection mechanism for medical image segmentation.
Proceedings of the Medical Imaging 2008: Image Processing, 2008
Image based grading of nuclear cataract by SVM regression.
Proceedings of the Medical Imaging 2008: Computer-Aided Diagnosis, San Diego, 2008
Deploying and evaluating a mixed reality mobile treasure hunt: Snap2Play.
Proceedings of the 10th Conference on Human-Computer Interaction with Mobile Devices and Services, 2008
Learning associations of conjuncted fuzzy sets for data prediction.
Proceedings of the International Joint Conference on Neural Networks, 2008
Automatic working area classification in peripheral blood smears using spatial distribution features across scales.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008
Cascaded classification with optimal candidate selection for effective place recognition.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Automatic opacity detection in retro-illumination images for cortical cataract diagnosis.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008
Similarity Learning for Nearest Neighbor Classification.
Proceedings of the 8th IEEE International Conference on Data Mining (ICDM 2008), 2008
Using densely recorded scenes for place recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2008
Boosting descriptors condensed from video sequences for place recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2008
Fusion de multi-modalités et réduction par sémantique latente.
Proceedings of the COnférence en Recherche d'Informations et Applications, 2008
IPAL at CLEF 2008: Mixed-Modality based Image Search, Novelty based Re-ranking and Extended Matching.
Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008), 2008
2007
Medical-Image Retrieval Based on Knowledge-Assisted Text and Image Indexing.
IEEE Trans. Circuits Syst. Video Technol., 2007
Object identification and retrieval from efficient image matching. Snap2Tell with the STOIC dataset.
Inf. Process. Manag., 2007
Using Camera Settings Templates ("Scene Modes") for Image Scene Classification of Photographs Taken on Manual/Expert Settings.
Proceedings of the Advances in Multimedia Information Processing, 2007
Metadata Management, Reuse, Inference and Propagation in a Collection-Oriented Metadata Framework for Digital Images.
Proceedings of the Advances in Multimedia Modeling, 2007
Outdoor place recognition using compact local descriptors and multiple queries with user verification.
Proceedings of the 15th International Conference on Multimedia 2007, 2007
An image-based outdoor place recognition and information retrieval system.
Proceedings of the 15th International Conference on Multimedia 2007, 2007
An integrated statistical model for multimedia evidence combination.
Proceedings of the 15th International Conference on Multimedia 2007, 2007
Scene Recognition with Camera Phones for Tourist Information Access.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Knowledge-Assisted Medical Image Retrieval.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Hidden Maximum Entropy Approach for Visual Concept Modeling.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007
Propagating Image-Level Part Statistics to Enhance Object Detection.
Proceedings of the International Conference on Image Processing, 2007
Outlier Detection from Pooled Data for Image Retrieval System Evaluation.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007
Using Bayesian Network for Conceptual Indexing: Application to Medical Document Indexing with UMLS Metathesaurus.
Proceedings of the Advances in Multilingual and Multimodal Information Retrieval, 2007
IPAL at ImageClef 2007 Mixing Features, Models and Knowledge.
Proceedings of the Working Notes for CLEF 2007 Workshop co-located with the 11th European Conference on Digital Libraries (ECDL 2007), 2007
Latent semantic fusion model for image retrieval and annotation.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007
Domain knowledge conceptual inter-media indexing: application to multilingual multimedia medical reports.
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management, 2007
2006
Semantic Image Representation and Indexing.
Encyclopedia of Multimedia, 2006
Discovering Recurrent Image Semantics from Class Discrimination.
EURASIP J. Adv. Signal Process., 2006
Scene Identification using Discriminative Patterns.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
An ensemble classifier learning approach to ROC optimization.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Combining Textual and Visual Ontologies to Solve Medical Multimodal Queries.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
A Collection-Oriented Metadata Framework for Digital Images.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Towards Automatic Mobile Blogging.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006
Une Indexation conceptuelle pour un filtrage par dimensions.
Proceedings of the COnférence en Recherche d'Informations et Applications, 2006
IPAL Inter-Media Pseudo-Relevance Feedback Approach to ImageCLEF 2006 Photo Retrieval.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006
Inter-media Pseudo-relevance Feedback Application to ImageCLEF 2006 Photo Retrieval.
Proceedings of the Evaluation of Multilingual and Multi-modal Information Retrieval, 2006
IPAL Knowledge-based Medical Image Retrieval in ImageCLEFmed 2006.
Proceedings of the Working Notes for CLEF 2006 Workshop co-located with the 10th European Conference on Digital Libraries (ECDL 2006), 2006
Inter-media Concept-Based Medical Image Indexing and Retrieval with UMLS at IPAL.
Proceedings of the Evaluation of Multilingual and Multi-modal Information Retrieval, 2006
Concept Propagation Based on Visual Similarity.
Proceedings of the Information Retrieval Technology, 2006
2005
Combining intra-image and inter-class semantics for consumer image retrieval.
Pattern Recognit., 2005
A structured learning framework for content-based image indexing and visual query.
Multim. Syst., 2005
SnapToTell: A Singapore Image Test Bed for Ubiquitous Information Access from Camera.
Proceedings of the Advances in Information Retrieval, 2005
SnapToTell Accès ubiquitaire à de l'information multimédia à partir d'un téléphone portable.
Proceedings of the COnférence en Recherche d'Informations et Applications, 2005
A Structured Learning Approach for Medical Image Indexing and Retrieval.
Proceedings of the Working Notes for CLEF 2005 Workshop co-located with the 9th European Conference on Digital Libraries (ECDL 2005), 2005
Using Ontology Dimensions and Negative Expansion to solve Precise Queries in the ImageCLEF Medical Task.
Proceedings of the Working Notes for CLEF 2005 Workshop co-located with the 9th European Conference on Digital Libraries (ECDL 2005), 2005
A Structured Visual Learning Approach Mixed with Ontology Dimensions for Medical Queries.
Proceedings of the Accessing Multilingual Information Repositories, 2005
A Structured Learning Approach to Semantic Photo Indexing and Query.
Proceedings of the Information Retrieval Technology, 2005
VisMed: A Visual Vocabulary Approach for Medical Image Indexing and Retrieval.
Proceedings of the Information Retrieval Technology, 2005
2004
Bridging semantic gap: learning and integrating semantics for content-based retrieval.
PhD thesis, 2004
Learning and Integrating Semantics for Image Indexing.
Proceedings of the PRICAI 2004: Trends in Artificial Intelligence, 2004
Cascading Classifiers for Consumer Image Indexing.
Proceedings of the 17th International Conference on Pattern Recognition, 2004
Visual Keywords Labeling in Soccer Video.
Proceedings of the 17th International Conference on Pattern Recognition, 2004
Image retrieval using spatial icons.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004
A generic mid-level representation for semantic video analysis.
Proceedings of the 2004 International Conference on Image Processing, 2004
Unifying local and global content-based similarities for home photo retrieval.
Proceedings of the 2004 International Conference on Image Processing, 2004
Combining local class patterns and discovered semantics for image retrieval.
Proceedings of the 2004 International Conference on Image Processing, 2004
Goal detection in soccer video using audio/visual keywords.
Proceedings of the 2004 International Conference on Image Processing, 2004
Semantics Discovery for Image Indexing.
Proceedings of the Computer Vision, 2004
2003
Home Photo Content Modeling for Personalized Event-Based Retrieval.
IEEE Multim., 2003
Using Dual Cascading Learning Frameworks for Image Indexing.
Proceedings of the Visual Information Processing 2003, 2003
Content-Based Summarization for Personal Image Library.
Proceedings of the ACM/IEEE 2003 Joint Conference on Digital Libraries (JCDL 2003), 2003
Learning Consumer Photo Categories for Semantic Retrieval.
Proceedings of the IJCAI-03, 2003
Event-based home photo retrieval.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
Support regions and images for photo event retrieval.
Proceedings of the 2003 International Conference on Image Processing, 2003
Real-time camera field-view tracking in soccer video.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
Home Photo Retrieval: Time Matters.
Proceedings of the Image and Video Retrieval, Second International Conference, 2003
2002
Image Mosaics Based on Homogenous Coordinates.
Proceedings of the Visualisation 2002, 2002
Home Photo Indexing using Learned Visual Keywords.
Proceedings of the Visualisation 2002, 2002
Image indexing and retrieval using visual keyword histograms.
Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002
Semantic indexing and retrieval of home photos.
Proceedings of the Seventh International Conference on Control, 2002
Symbolic photograph content-based retrieval.
Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, 2002
2001
Learning Similarity Matching in Multimedia Content-Based Retrieval.
IEEE Trans. Knowl. Data Eng., 2001
Building Visual Vocabulary for Image Indexation and Query Formulation.
Pattern Anal. Appl., 2001
Fuzzy Object Patterns for Visual Indexing and Segmentation.
Proceedings of the 10th IEEE International Conference on Fuzzy Systems, 2001
2000
Perspectives on Content-Based Multimedia Systems.
The Kluwer International Series on Information Retrieval 9, Kluwer, ISBN: 978-0-306-47033-2, 2000
Photograph Retrieval and Classification by Visual Keywords and Thesaurus.
New Gener. Comput., 2000
Explicit query formulation with visual keywords.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000
1999
Categorizing Visual Contents by Matching Visual "Keywords".
Proceedings of the Visual Information and Information Systems, 1999
Learning Visual Keywords for Content-Based Retrieval.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999
Learnable Visual Keywords for Image Classification.
Proceedings of the Fourth ACM conference on Digital Libraries, 1999
1996
Stochastic topology with elastic matching for off-line handwritten character recognition.
Pattern Recognit. Lett., 1996
An Application of Hierarchical Knowledge Integration in Hand-Written Form Processing.
Proceedings of the PRICAI'96: Topics in Artificial Intelligence, 1996
1995
Fitting elastic maps to recognize handwritten digits.
Proceedings of International Conference on Neural Networks (ICNN'95), Perth, WA, Australia, November 27, 1995
1992
A Deductive Neural-Logic System.
Proceedings of the 22nd IEEE International Symposium on Multiple-Valued Logic, 1992
A Framework for Integrating Fault Diagnosis and Incremental Knowledge Acquisition in Connectionist Expert Systems.
Proceedings of the 10th National Conference on Artificial Intelligence, 1992