Matthieu Cord

Orcid: 0000-0002-0627-5844

According to our database1, Matthieu Cord authored at least 263 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Semantic augmentation by mixing contents for semi-supervised learning.
Pattern Recognit., January, 2024

MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments.
Trans. Mach. Learn. Res., 2024

Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval.
Comput. Vis. Image Underst., 2024

GradPaint: Gradient-guided inpainting with diffusion models.
Comput. Vis. Image Underst., 2024

Skipping Computations in Multimodal LLMs.
CoRR, 2024

LLM-wrapper: Black-Box Semantic-Aware Adaptation of Vision-Language Foundation Models.
CoRR, 2024

Annealed Winner-Takes-All for Motion Forecasting.
CoRR, 2024

ReGentS: Real-World Safety-Critical Driving Scenario Generation Made Stable.
CoRR, 2024

Valeo4Cast: A Modular Approach to End-to-End Forecasting.
CoRR, 2024

A Concept-Based Explainability Framework for Large Multimodal Models.
CoRR, 2024

Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features.
CoRR, 2024

Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs.
CoRR, 2024

What matters when building vision-language models?
CoRR, 2024

Mind-to-Image: Projecting Visual Mental Imagination of the Brain from fMRI.
CoRR, 2024

FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models.
CoRR, 2024

UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction.
CoRR, 2024

Improved Baselines for Data-efficient Perceptual Augmentation of LLMs.
CoRR, 2024

Towards Motion Forecasting with Real-World Perception Inputs: Are End-to-End Approaches Competitive?
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Reliability in Semantic Segmentation: Can We Use Synthetic Data?
Proceedings of the Computer Vision - ECCV 2024, 2024

PointBeV: A Sparse Approach to BeV Predictions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

What Makes Multimodal In-Context Learning Work?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

RED++ : Data-Free Pruning of Deep Neural Networks via Input Splitting and Output Merging.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

LiDARTouch: Monocular metric depth estimation with a few-beam LiDAR.
Comput. Vis. Image Underst., January, 2023

UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.
Trans. Mach. Learn. Res., 2023

Manipulating Trajectory Prediction with Backdoors.
CoRR, 2023

ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation.
CoRR, 2023

ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model.
CoRR, 2023

Unified Model for Image, Video, Audio and Language Tasks.
CoRR, 2023

OBELISC: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
CoRR, 2023

Challenges of Using Real-World Sensory Inputs for Motion Forecasting in Autonomous Driving.
CoRR, 2023

SPIQ: Data-Free Per-Channel Static Input Quantization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

REx: Data-Free Residual Quantization Error Expansion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization.
Proceedings of the International Conference on Machine Learning, 2023

PowerQuant: Automorphism Search for Non-Uniform Quantization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DiffEdit: Diffusion-based semantic image editing with mask guidance.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

eP-ALM: Efficient Perceptual Augmentation of Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Zero-shot spatial layout conditioning for text-to-image diffusion models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

OCTET: Object-aware Counterfactual Explanations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Co-training 2L Submodels for Visual Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improving Selective Visual Question Answering by Learning from Your Peers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CoMFormer: Continual Learning in Semantic and Panoptic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Detecting 32 Pedestrian Attributes for Autonomous Vehicles.
IEEE Trans. Intell. Transp. Syst., 2022

Driving behavior explanation with multi-level fusion.
Pattern Recognit., 2022

Confidence Estimation via Auxiliary Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Explainability of Deep Vision-Based Autonomous Driving Systems: Review and Challenges.
Int. J. Comput. Vis., 2022

Recycling diverse models for out-of-distribution generalization.
CoRR, 2022

Co-training 2<sup>L</sup> Submodels for Visual Recognition.
CoRR, 2022

Structured Vision-Language Pretraining for Computational Cooking.
CoRR, 2022

Dynamic Query Selection for Fast Visual Perceiver.
CoRR, 2022

SInGE: Sparsity via Integrated Gradients Estimation of Neuron Relevance.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Diverse Weight Averaging for Out-of-Distribution Generalization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Swapping Semantic Contents for Mixing Images.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Fishr: Invariant Gradient Variances for Out-of-Distribution Generalization.
Proceedings of the International Conference on Machine Learning, 2022

DeiT III: Revenge of the ViT.
Proceedings of the Computer Vision, 2022

Three Things Everyone Should Know About Vision Transformers.
Proceedings of the Computer Vision, 2022

STEEX: Steering Counterfactual Explanations with Semantics.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards efficient feature sharing in MIMO architectures.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Multi-Head Distillation for Continual Unsupervised Domain Adaptation in Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

CSG0: Continual Urban Scene Generation with Zero Forgetting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FlexIT: Towards Flexible Semantic Image Translation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Embedding Arithmetic of Multimodal Queries for Image Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Raising context awareness in motion forecasting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation.
Proceedings of the Conference on Robot Learning, 2022

Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Handling new target classes in semantic segmentation with domain adaptation.
Comput. Vis. Image Underst., 2021

Augmenting Convolutional networks with attention-based aggregation.
CoRR, 2021

Embedding Arithmetic for Text-driven Image Transformation.
CoRR, 2021

Tackling Catastrophic Forgetting and Background Shift in Continual Semantic Segmentation.
CoRR, 2021

RED : Looking for Redundancies for Data-Free Structured Compression of Deep Neural Networks.
CoRR, 2021

ResMLP: Feedforward networks for image classification with data-efficient training.
CoRR, 2021

Explainability of vision-based autonomous driving systems: Review and challenges.
CoRR, 2021

RED : Looking for Redundancies for Data-FreeStructured Compression of Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Look at the Variance! Efficient Black-box Explanations with Sobol-based Sensitivity Analysis.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Training data-efficient image transformers & distillation through attention.
Proceedings of the 38th International Conference on Machine Learning, 2021

DICE: Diversity in Deep Ensembles via Conditional Redundancy Adversarial Estimation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Grafit: Learning fine-grained image representations with coarse labels.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Going deeper with Image Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Semantic Palette: Guiding Scene Generation With Class Proportions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Insights From the Future for Continual Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

PLOP: Learning Without Forgetting for Continual Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

MAGECally invert images for realistic editing.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Deep Learning and Information Geometry for Time-Series Classification. (Apprentissage Profond et G om trie de L'Information pour la Classification de Signaux Temporels).
PhD thesis, 2020

SEMEDA: Enhancing segmentation precision with semantic edge aware loss.
Pattern Recognit., 2020

Online Bag-of-Visual-Words Generation for Unsupervised Representation Learning.
CoRR, 2020

Powers of layers for image-to-image translation.
CoRR, 2020

Overcoming Statistical Shortcuts for Open-ended Visual Counting.
CoRR, 2020

ESL: Entropy-guided Self-supervised Learning for Domain Adaptation in Semantic Segmentation.
CoRR, 2020

Small-Task Incremental Learning.
CoRR, 2020

BUDA: Boundless Unsupervised Domain Adaptation in Semantic Segmentation.
CoRR, 2020

This Dataset Does Not Exist: Training Models from Generated Images.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Entwined Learning Head Pose and Face Alignment Inside an Attentional Cascade with Doubly-Conditional fusion.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

QuEST: Quantized Embedding Space for Transferring Knowledge.
Proceedings of the Computer Vision - ECCV 2020, 2020

PODNet: Pooled Outputs Distillation for Small-Tasks Incremental Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning Representations by Predicting Bags of Visual Words.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

The Missing Data Encoder: Cross-Channel Image Completion with Hide-and-Seek Adversarial Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Exploiting Negative Evidence for Deep Latent Structured Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Distributed optimization for deep learning with gossip exchange.
Neurocomputing, 2019

End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection.
Int. J. Comput. Vis., 2019

DualDis: Dual-Branch Disentangling with Adversarial Learning.
CoRR, 2019

Addressing Failure Prediction by Learning Model Confidence.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

RUBi: Reducing Unimodal Biases for Visual Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Zero-Shot Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Riemannian batch normalization for SPD neural networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Reve: Regularizing Deep Learning with Variational Entropy Bound.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Delving Deep into Interpreting Neural Nets with Piece-Wise Affine Representation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

DADA: Depth-Aware Domain Adaptation in Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DiscoNet: Shapes Learning on Disconnected Manifolds for 3D Editing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Boosting Few-Shot Visual Learning With Self-Supervision.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DeCaFA: Deep Convolutional Cascade for Face Alignment in the Wild.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Exploring Complex Time-series Representations for Riemannian Machine Learning of Radar Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

Second-Order Networks in PyTorch.
Proceedings of the Geometric Science of Information - 4th International Conference, 2019

ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

SoDeep: A Sorting Deep Net to Learn Ranking Loss Surrogates.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

MUREL: Multimodal Relational Reasoning for Visual Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
SyMIL: MinMax Latent SVM for Weakly Labeled Data.
IEEE Trans. Neural Networks Learn. Syst., 2018

Classifying low-resolution images by integrating privileged information in deep CNNs.
Pattern Recognit. Lett., 2018

Images & Recipes: Retrieval in the cooking context.
CoRR, 2018

Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018

Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Shade: Information-Based Regularization for Deep Learning.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Images and Recipes: Retrieval in the Cooking Context.
Proceedings of the 34th IEEE International Conference on Data Engineering Workshops, 2018

HybridNet: Classification and Reconstruction Cooperation for Semi-supervised Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

Exploring deep vision models for acoustic scene classification.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

Manifold Learning in Quotient Spaces.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Finding Beans in Burgers: Deep Semantic-Visual Embedding With Localization.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Gaze latent support vector machine for image classification improved by weakly supervised region selection.
Pattern Recognit., 2017

Learning a Distance Metric from Relative Comparisons between Quadruplets of Images.
Int. J. Comput. Vis., 2017

MUTAN: Multimodal Tucker Fusion for Visual Question Answering.
Proceedings of the IEEE International Conference on Computer Vision, 2017

WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deformable Part-based Fully Convolutional Network for Object Detection.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Master's Thesis : Deep Learning for Visual Recognition.
CoRR, 2016

M2CAI Workflow Challenge: Convolutional Neural Networks with Time Smoothing and Hidden Markov Model for Video Frames Classification.
CoRR, 2016

Gossip training for deep learning.
CoRR, 2016

Maxmin convolutional neural networks for image classification.
CoRR, 2016

Gaze latent support vector machine for image classification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Deep Neural Networks Under Stress.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Max-min convolutional neural networks for image classification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Closed-Form Training of Mahalanobis Distance for Supervised Clustering.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Streaming Graph-Based Hierarchical Video Segmentation by a Simple Label Propagation.
Proceedings of the 28th SIBGRAPI Conference on Graphics, Patterns and Images, 2015

Recipe recognition with large multimodal food dataset.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015

LR-CNN for fine-grained classification with varying resolution.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Exemplar based metric learning for robust visual localization.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Absolute geo-localization thanks to Hidden Markov Model and exemplar-based metric learning.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Apprentissage de métrique appliqué à la détection de changement de page Web et aux attributs relatifs.
Proceedings of the CORIA 2015 - Conférence en Recherche d'Infomations et Applications, 2015

2014
Bag-of-Words Image Representation: Key Ideas and Further Insight.
Proceedings of the Fusion in Computer Vision - Understanding Complex Visual Content, 2014

Learning Deep Hierarchical Visual Feature Coding.
IEEE Trans. Neural Networks Learn. Syst., 2014

Model-Based Analysis-Synthesis for Realistic Tree Reconstruction and Growth Simulation.
IEEE Trans. Geosci. Remote. Sens., 2014

Perceptual Principles for Video Classification With Slow Feature Analysis.
IEEE J. Sel. Top. Signal Process., 2014

SnooperText: A text detection system for automatic indexing of urban scenes.
Comput. Vis. Image Underst., 2014

Sequentially Generated Instance-Dependent Image Representations for Classification.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Incremental learning of latent structural SVM for weakly supervised image classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Semantic pooling for image categorization using multiple kernel learning.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Fantope Regularization in Metric Learning.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Extended Coding and Pooling in the HMAX Model.
IEEE Trans. Image Process., 2013

T-HOG: An effective gradient-based descriptor for single line text regions.
Pattern Recognit., 2013

Text detection in street level images.
Pattern Anal. Appl., 2013

JKernelMachines: a simple framework for kernel machine.
J. Mach. Learn. Res., 2013

Pooling in image representation: The visual codeword point of view.
Comput. Vis. Image Underst., 2013

Top-Down Regularization of Deep Belief Networks.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013

Image classification using object detectors.
Proceedings of the IEEE International Conference on Image Processing, 2013

Quadruplet-Wise Image Similarity Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Dynamic Scene Classification: Learning Motion Descriptors with Slow Features Analysis.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Locality-Sensitive Hashing for Chi2 Distance.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

An application of swarm intelligence to distributed image retrieval.
Inf. Sci., 2012


Classification of Urban Scenes from Geo-referenced Images in Urban Street-View Context.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Contextual detection of drawn symbols in old maps.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Learning geometric combinations of Gaussian kernels with alternating Quasi-Newton algorithm.
Proceedings of the 20th European Symposium on Artificial Neural Networks, 2012

Hybrid Pooling Fusion in the BoW Pipeline.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines.
Proceedings of the Computer Vision - ECCV 2012, 2012

Structural and visual comparisons for web page archiving.
Proceedings of the ACM Symposium on Document Engineering, 2012

BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012

2011
SALSAS: Sub-linear active learning strategy with approximate k-NN search.
Pattern Recognit., 2011

Spatio-Temporal Tube data representation and Kernel design for SVM-based video object retrieval system.
Multim. Tools Appl., 2011


HMAX-S: Deep scale representation for biologically inspired image categorization.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Efficient Bag-of-Feature kernel representation for image similarity search.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Snoopertrack: Text detection and tracking for outdoor videos.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Learning invariant color features with sparse topographic restricted Boltzmann machines.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

BOSSA: Extended bow formalism for image classification.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Text detection and recognition in urban scenes.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Using LIDO to handle 3D cultural heritage documentation data provenance.
Proceedings of the 9th International Workshop on Content-Based Multimedia Indexing, 2011

2010
Indexing personal image collections: a flexible, scalable solution.
IEEE Trans. Consumer Electron., 2010

STTK-based video object recognition.
Proceedings of the International Conference on Image Processing, 2010

An efficient system for combining complementary kernels in complex visual categorization tasks.
Proceedings of the International Conference on Image Processing, 2010

Snoopertext: A multiresolution system for text detection in complex visual scenes.
Proceedings of the International Conference on Image Processing, 2010

Scalable active learning strategy for object category retrieval.
Proceedings of the International Conference on Image Processing, 2010

2009
<i>Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval</i>.
J. Electronic Imaging, 2009


Advanced Techniques in CBIR: Local Descriptors, Visual Dictionaries and Bags of Features.
Proceedings of the Tutorials of the XXII Brazilian Symposium on Computer Graphics and Image Processing, 2009

Similarity Search and Indexing for High-Dimensional Data.
Proceedings of the XXIV Simpósio Brasileiro de Banco de Dados, 2009

Spatio-Temporal Tube Kernel for actor retrieval.
Proceedings of the International Conference on Image Processing, 2009

Optimization on active learning strategy for object category retrieval.
Proceedings of the International Conference on Image Processing, 2009

Text segmentation in natural scenes using Toggle-Mapping.
Proceedings of the International Conference on Image Processing, 2009

Geometric consistency checking for local-descriptor based document retrieval.
Proceedings of the 2009 ACM Symposium on Document Engineering, 2009

2008
Supervised Learning.
Proceedings of the Machine Learning Techniques for Multimedia, 2008

Online Content-Based Image Retrieval Using Active Learning.
Proceedings of the Machine Learning Techniques for Multimedia, 2008

Image Retrieval Over Networks: Active Learning Using Ant Algorithm.
IEEE Trans. Multim., 2008

Active Learning Methods for Interactive Image Retrieval.
IEEE Trans. Image Process., 2008

Detection, Characterization, and Modeling Vegetation in Urban Areas From High-Resolution Aerial Imagery.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2008

Combining visual dictionary, kernel-based similarity and learning strategy for image category retrieval.
Comput. Vis. Image Underst., 2008

VSUMM: An Approach for Automatic Video Summarization and Quantitative Evaluation.
Proceedings of the SIBGRAPI 2008, 2008

Rushes summarization by IRIM consortium: redundancy removal and multi-feature fusion.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Summarization scheme based on near-duplicate analysis.
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008

Fast approximate kernel-based similarity search for image retrieval task.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Long term learning for image retrieval over networks.
Proceedings of the International Conference on Image Processing, 2008

Actor retrieval system based on kernels on bags of bags.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

Fast identification of visual documents using local descriptors.
Proceedings of the 2008 ACM Symposium on Document Engineering, 2008

High-dimensional descriptor indexing for large multimedia databases.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

Image retrieval over networks: Ant algorithm for long term active learning.
Proceedings of the International Workshop on Content-Based Multimedia Indexing, 2008

2007
Stochastic exploration and active learning for image retrieval.
Image Vis. Comput., 2007

Automatic Extraction and Classification of Vegetation Areas from High Resolution Images in Urban Areas.
Proceedings of the Image Analysis, 15th Scandinavian Conference, 2007

3-Way-Trees: A Similarity Search Method for High-Dimensional Descriptor Matching.
Proceedings of the International Conference on Image Processing, 2007

Kernels on Bags of Fuzzy Regions for Fast Object retrieval.
Proceedings of the International Conference on Image Processing, 2007

Matching Local Descriptors for Image Identification on Cultural Databases.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

Kernels on bags for multi-object database retrieval.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

RETIN: a smart interactive digital media retrieval system.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

2006
Feature-based approach to semi-supervised similarity learning.
Pattern Recognit., 2006

Performances of Mobile-Agents for Interactive Image Retrieval.
Proceedings of the 2006 IEEE / WIC / ACM International Conference on Web Intelligence (WI 2006), 2006

Shot Boundary Detection at TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

Video Segmentation by Supervised Learning.
Proceedings of the 19th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2006), 2006

CBIR in Distributed Databases using a Multi-Agent System.
Proceedings of the International Conference on Image Processing, 2006

Precision-Oriented Active Selection for Interactive Image Retrieval.
Proceedings of the International Conference on Image Processing, 2006

Image Retrieval using Long-Term Semantic Learning.
Proceedings of the International Conference on Image Processing, 2006

Content-Based Retrieval of Images for Cultural Institutions Using Local Descriptors.
Proceedings of the 2006 International Conference on Geometric Modeling and Imaging, 2006

Robust scene cut detection by supervised learning.
Proceedings of the 14th European Signal Processing Conference, 2006

2005
Interactive Exploration for Image Retrieval.
EURASIP J. Adv. Signal Process., 2005

Semantic kernel learning for interactive image retrieval.
Proceedings of the 2005 International Conference on Image Processing, 2005

Semantic Learning Methods: Application to Image Retrieval.
Proceedings of the Actes de CAP 05, Conférence francophone sur l'apprentissage automatique, 2005

2004
Approche interactive de la recherche d'images par le contenu.
Tech. Sci. Informatiques, 2004

Smooth Surface Reconstruction Using Tensor Fields as Structuring Elements.
Comput. Graph. Forum, 2004

Semantic kernel updating for content-based image retrieval.
Proceedings of the Sixth IEEE International Symposium on Multimedia Software Engineering, 2004

Retin al: an active learning strategy for image category retrieval.
Proceedings of the 2004 International Conference on Image Processing, 2004

A Comparison of Active Classification Methods for ContentBased Image Retrieval.
Proceedings of the First International Workshop on Computer Vision meets Databases, 2004

2003
Reconstruction Using Surface Dedicated Tensorial Fields.
Proceedings of the 16th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2003), 2003

Exploration and Search-by-Similarity in CBIR.
Proceedings of the 16th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2003), 2003

2002
Filtering Sparse Data with 3D Tensorial Structuring Elements.
Proceedings of the 15th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2002), 2002

Navegador3D: An Internet Based Flight Simulator of Urban Centers.
Proceedings of the 15th Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI 2002), 2002

A Flexible Search-by-Similarity Algorithm for Content-Based Image Retrieval.
Proceedings of the 6th Joint Conference on Information Science, 2002

Long-term similarity learning in content-based image retrieval.
Proceedings of the 2002 International Conference on Image Processing, 2002

Terrain surface modeling from altimetric data.
Proceedings of the 2002 International Conference on Image Processing, 2002

2001
Three-dimensional building detection and modeling using a statistical approach.
IEEE Trans. Image Process., 2001

RETIN: A Content-Based Image Indexing and Retrieval System.
Pattern Anal. Appl., 2001

Accurate Building Structure Recovery from High Resolution Aerial Imagery.
Comput. Vis. Image Underst., 2001

Back-propagation algorithm for relevance feedback in image retrieval.
Proceedings of the 2001 International Conference on Image Processing, 2001

1999
Accurate Building Structure Recovery from Aerial Imagery.
Proceedings of the XII Brazilian Symposium on Computer Graphics and Image Processing (SIBGRAPI '99), 1999

Bayesian Model Identification: Application to Building Reconstruction in Aerial Imagery.
Proceedings of the 1999 International Conference on Image Processing, 1999

1998
Building Detection and Reconstruction from Mid- and High-Resolution Aerial Imagery.
Comput. Vis. Image Underst., 1998

Combining intensity and stereo data to improve satellite urban scenes modeling.
Proceedings of the 9th European Signal Processing Conference, 1998


  Loading...