2025
Global registration of kidneys in 3D ultrasound and CT images.
Int. J. Comput. Assist. Radiol. Surg., January, 2025
2024
Semantic augmentation by mixing contents for semi-supervised learning.
Pattern Recognit., January, 2024
ITEM: Improving Training and Evaluation of Message-Passing based GNNs for top-k recommendation.
Trans. Mach. Learn. Res., 2024
VidEdit: Zero-Shot and Spatially Aware Text-Driven Video Editing.
Trans. Mach. Learn. Res., 2024
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval.
Comput. Vis. Image Underst., 2024
MERLIN-Seg: Self-supervised despeckling for label-efficient semantic segmentation.
Comput. Vis. Image Underst., 2024
Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting.
CoRR, 2024
Supra-Laplacian Encoding for Transformer on Dynamic Graphs.
CoRR, 2024
Temporal receptive field in dynamic graph learning: A comprehensive analysis.
CoRR, 2024
Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features.
CoRR, 2024
Energy Correction Model in the Feature Space for Out-of-Distribution Detection.
CoRR, 2024
Physics-Informed Model and Hybrid Planning for Efficient Dyna-Style Reinforcement Learning.
RLJ, 2024
Supra-Laplacian Encoding for Transformer on Dynamic Graphs.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
GalLoP: Learning Global and Local Prompts for Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
Deep Time Series Forecasting With Shape and Temporal Criteria.
IEEE Trans. Pattern Anal. Mach. Intell., 2023
TRUSTED: The Paired 3D Transabdominal Ultrasound and CT Human Data for Kidney Segmentation and Registration Research.
CoRR, 2023
Optimization of Rank Losses for Image Retrieval.
CoRR, 2023
Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks.
CoRR, 2023
Full Contextual Attention for Multi-resolution Transformers in Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection.
Proceedings of the International Conference on Machine Learning, 2023
EAGLE: Large-scale Learning of Turbulent Fluid Dynamics with Mesh Transformers.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
Confidence Estimation via Auxiliary Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Structured Vision-Language Pretraining for Computational Cooking.
CoRR, 2022
Take One Gram of Neural Features, Get Enhanced Group Robustness.
CoRR, 2022
3D spatial priors for semi-supervised organ segmentation with deep convolutional neural networks.
Int. J. Comput. Assist. Radiol. Surg., 2022
Memory Transformers for Full Context and High-Resolution 3D Medical Segmentation.
Proceedings of the Machine Learning in Medical Imaging - 13th International Workshop, 2022
Swapping Semantic Contents for Mixing Images.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Diverse Probabilistic Trajectory Forecasting with Admissibility Constraints.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Hierarchical Average Precision Training for Pertinent Image Retrieval.
Proceedings of the Computer Vision - ECCV 2022, 2022
Complementing Brightness Constancy with Deep Networks for Optical Flow Prediction.
Proceedings of the Computer Vision - ECCV 2022, 2022
Towards efficient feature sharing in MIMO architectures.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
2021
U-Net Transformer: Self and Cross Attention for Medical Image Segmentation.
CoRR, 2021
Iterative confidence relabeling with deep ConvNets for organ segmentation with partial labels.
Comput. Medical Imaging Graph., 2021
Robust and Decomposable Average Precision for Image Retrieval.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
U-Net Transformer: Self and Cross Attention for Medical Image Segmentation.
Proceedings of the Machine Learning in Medical Imaging - 12th International Workshop, 2021
Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting.
Proceedings of the 9th International Conference on Learning Representations, 2021
Beyond Full Supervision in Deep Learning.
Proceedings of the Multi-faceted Deep Learning - Models and Data, 2021
2020
Probabilistic Time Series Forecasting with Structured Shape and Temporal Diversity.
CoRR, 2020
Probabilistic Time Series Forecasting with Shape and Temporal Diversity.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020
Disentangling Physical Dynamics From Unknown Factors for Unsupervised Video Prediction.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
A Deep Physical Model for Solar Irradiance Forecasting with Fisheye Images.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Exploiting Negative Evidence for Deep Latent Structured Models.
IEEE Trans. Pattern Anal. Mach. Intell., 2019
Distributed optimization for deep learning with gossip exchange.
Neurocomputing, 2019
End-to-End Learning of Latent Deformable Part-Based Representations for Object Detection.
Int. J. Comput. Vis., 2019
DualDis: Dual-Branch Disentangling with Adversarial Learning.
CoRR, 2019
Shape and Time Distortion Loss for Training Deep Time Series Forecasting Models.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Addressing Failure Prediction by Learning Model Confidence.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
DiscoNet: Shapes Learning on Disconnected Manifolds for 3D Editing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
MUREL: Multimodal Relational Reasoning for Visual Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
SyMIL: MinMax Latent SVM for Weakly Labeled Data.
IEEE Trans. Neural Networks Learn. Syst., 2018
Classifying low-resolution images by integrating privileged information in deep CNNs.
Pattern Recognit. Lett., 2018
Cross-Modal Retrieval in the Cooking Context: Learning Semantic Text-Image Embeddings.
Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, 2018
Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018
Handling Missing Annotations for Semantic Segmentation with Deep ConvNets.
Proceedings of the Deep Learning in Medical Image Analysis - and - Multimodal Learning for Clinical Decision Support, 2018
Shade: Information-Based Regularization for Deep Learning.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
HybridNet: Classification and Reconstruction Cooperation for Semi-supervised Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018
Manifold Learning in Quotient Spaces.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
2017
Gaze latent support vector machine for image classification improved by weakly supervised region selection.
Pattern Recognit., 2017
Learning a Distance Metric from Relative Comparisons between Quadruplets of Images.
Int. J. Comput. Vis., 2017
MUTAN: Multimodal Tucker Fusion for Visual Question Answering.
Proceedings of the IEEE International Conference on Computer Vision, 2017
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Deformable Part-based Fully Convolutional Network for Object Detection.
Proceedings of the British Machine Vision Conference 2017, 2017
2016
Master's Thesis : Deep Learning for Visual Recognition.
CoRR, 2016
M2CAI Workflow Challenge: Convolutional Neural Networks with Time Smoothing and Hidden Markov Model for Video Frames Classification.
CoRR, 2016
Gossip training for deep learning.
CoRR, 2016
Maxmin convolutional neural networks for image classification.
CoRR, 2016
Gaze latent support vector machine for image classification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
Deep Neural Networks Under Stress.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
Max-min convolutional neural networks for image classification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016
WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
2015
Recipe recognition with large multimodal food dataset.
Proceedings of the 2015 IEEE International Conference on Multimedia & Expo Workshops, 2015
LR-CNN for fine-grained classification with varying resolution.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
Exemplar based metric learning for robust visual localization.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015
Absolute geo-localization thanks to Hidden Markov Model and exemplar-based metric learning.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015
Apprentissage de métrique appliqué à la détection de changement de page Web et aux attributs relatifs.
Proceedings of the CORIA 2015 - Conférence en Recherche d'Informations et Applications, 2015
2014
Bag-of-Words Image Representation: Key Ideas and Further Insight.
Proceedings of the Fusion in Computer Vision - Understanding Complex Visual Content, 2014
Learning Deep Hierarchical Visual Feature Coding.
IEEE Trans. Neural Networks Learn. Syst., 2014
Perceptual Principles for Video Classification With Slow Feature Analysis.
IEEE J. Sel. Top. Signal Process., 2014
SnooperText: A text detection system for automatic indexing of urban scenes.
Comput. Vis. Image Underst., 2014
Sequentially Generated Instance-Dependent Image Representations for Classification.
Proceedings of the 2nd International Conference on Learning Representations, 2014
Incremental learning of latent structural SVM for weakly supervised image classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Semantic pooling for image categorization using multiple kernel learning.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014
Fantope Regularization in Metric Learning.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014
2013
Extended Coding and Pooling in the HMAX Model.
IEEE Trans. Image Process., 2013
T-HOG: An effective gradient-based descriptor for single line text regions.
Pattern Recognit., 2013
JKernelMachines: a simple framework for kernel machine.
J. Mach. Learn. Res., 2013
Pooling in image representation: The visual codeword point of view.
Comput. Vis. Image Underst., 2013
Top-Down Regularization of Deep Belief Networks.
Proceedings of the Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8, 2013
Image classification using object detectors.
Proceedings of the IEEE International Conference on Image Processing, 2013
Quadruplet-Wise Image Similarity Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013
Dynamic Scene Classification: Learning Motion Descriptors with Slow Features Analysis.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013
2012
Classification of Urban Scenes from Geo-referenced Images in Urban Street-View Context.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012
Contextual detection of drawn symbols in old maps.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Learning geometric combinations of Gaussian kernels with alternating Quasi-Newton algorithm.
Proceedings of the 20th European Symposium on Artificial Neural Networks, 2012
Hybrid Pooling Fusion in the BoW Pipeline.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012
Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines.
Proceedings of the Computer Vision - ECCV 2012, 2012
Structural and visual comparisons for web page archiving.
Proceedings of the ACM Symposium on Document Engineering, 2012
BossaNova at ImageCLEF 2012 Flickr Photo Annotation Task.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012
Structural and visual similarity learning for Web page archiving.
Proceedings of the 10th International Workshop on Content-Based Multimedia Indexing, 2012
2011
A cognitive and video-based approach for multinational License Plate Recognition.
Mach. Vis. Appl., 2011
Pedestrian Head Detection and Tracking Using Skeleton Graph for People Counting in Crowded Environments.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011
HMAX-S: Deep scale representation for biologically inspired image categorization.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Efficient Bag-of-Feature kernel representation for image similarity search.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Snoopertrack: Text detection and tracking for outdoor videos.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Learning invariant color features with sparse topographic restricted Boltzmann machines.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
BOSSA: Extended bow formalism for image classification.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011
Text detection and recognition in urban scenes.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011
2010
An efficient system for combining complementary kernels in complex visual categorization tasks.
Proceedings of the International Conference on Image Processing, 2010
Snoopertext: A multiresolution system for text detection in complex visual scenes.
Proceedings of the International Conference on Image Processing, 2010
Fast People Counting Using Head Detection From Skeleton Graph.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010
2008
A Real-Time, Multiview Fall Detection System: A LHMM-Based Approach.
IEEE Trans. Circuits Syst. Video Technol., 2008
Learning articulated appearance models for tracking humans: A spectral graph matching approach.
Signal Process. Image Commun., 2008
A bottom-up, view-point invariant human detector.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008
2007
A Combined Statistical-Structural Strategy for Alphanumeric Recognition.
Proceedings of the Advances in Visual Computing, Third International Symposium, 2007
2006
A HHMM-Based Approach for Robust Fall Detection.
Proceedings of the Ninth International Conference on Control, 2006
Human Body Part Labeling and Tracking Using Graph Matching Theory.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006
2005
A robust appearance model for tracking human motions.
Proceedings of the Advanced Video and Signal Based Surveillance, 2005