2025
EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models.
CoRR, June, 2025
Guest Editorial: Special Issue on Multimodal Learning.
Int. J. Comput. Vis., May, 2025
Visual Text Processing: A Comprehensive Review and Unified Evaluation.
CoRR, April, 2025
On Large Multimodal Models as Open-World Image Classifiers.
CoRR, March, 2025
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis.
CoRR, March, 2025
2024
Simplifying open-set video domain adaptation with contrastive learning.
Comput. Vis. Image Underst., 2024
Automatic benchmarking of large multimodal models via iterative experiment programming.
CoRR, 2024
Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling.
CoRR, 2024
Vocabulary-free Image Classification and Semantic Segmentation.
CoRR, 2024
Socially Pertinent Robots in Gerontological Healthcare.
CoRR, 2024
Text-Enhanced Zero-Shot Action Recognition: A Training-Free Approach.
Proceedings of the Pattern Recognition - 27th International Conference, 2024
Test-Time Zero-Shot Temporal Action Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Deep Unsupervised Key Frame Extraction for Efficient Video Classification.
ACM Trans. Multim. Comput. Commun. Appl., 2023
Continual Attentive Fusion for Incremental Learning in Semantic Segmentation.
IEEE Trans. Multim., 2023
Uncertainty-Aware Contrastive Distillation for Incremental Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2023
Editorial for the Special Issue on Industrial Machine Learning Applications.
J. Imaging, 2023
Vocabulary-free Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Rotation Synchronization via Deep Matrix Factorization.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
The Unreasonable Effectiveness of Large Language-Vision Models for Source-free Video Domain Adaptation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
AutoLabel: CLIP-based framework for Open-Set Video Domain Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Curriculum Learning: A Survey.
Int. J. Comput. Vis., 2022
Low-budget label query through domain alignment enforcement.
Comput. Vis. Image Underst., 2022
Dual-Head Contrastive Domain Adaptation for Video Action Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022
Unsupervised Domain Adaptation for Video Transformers in Action Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Cluster-level pseudo-labelling for source-free cross-domain facial expression recognition.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022
2021
Curriculum self-paced learning for cross-domain object detection.
Comput. Vis. Image Underst., 2021
Variational Structured Attention Networks for Deep Visual Representation Learning.
CoRR, 2021
2020
Deep Learning for Classification and Localization of COVID-19 Markers in Point-of-Care Lung Ultrasound.
IEEE Trans. Medical Imaging, 2020
Low-Budget Unsupervised Label Query through Domain Alignment Enforcement.
CoRR, 2020
Class-Aware Modality Mix and Center-Guided Metric Learning for Visible-Thermal Person Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Real-Time Cross-Dataset Quality Production Assessment in Industrial Laser Cutting Machines.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020
An Online Deep Learning Based System for Defects Detection in Glass Panels.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020
2019
Cut Quality Estimation in Industrial Laser Cutting Machines: A Machine Learning Approach.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019
2018
Filling the Gaps: Predicting Missing Joints of Human Poses Using Denoising Autoencoders.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
2017
The S-Hock dataset: A new benchmark for spectator crowd analysis.
Comput. Vis. Image Underst., 2017
Indirect Match Highlights Detection with Deep Convolutional Neural Networks.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2017, 2017
2016
Clustering of cell populations in flow cytometry data using a combination of Gaussian mixtures.
Pattern Recognit., 2016
The Role of Machine Learning in Medical Data Analysis. A Case Study: Flow Cytometry.
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016
Bad teacher or unruly student: Can deep learning say something in Image Forensics analysis?
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Flow Cytometry based automatic MRD assessment in Acute Lymphoblastic Leukaemia: Longitudinal evaluation of time-specific cell population models.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016
2015
Social interaction analysis in videos, from wide to close perspective.
PhD thesis, 2015
Human interaction recognition in the wild: Analyzing trajectory clustering from multiple-instance-learning perspective.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015
Real-life violent social interaction detection.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015
The S-HOCK dataset: Analyzing crowds at the stadium.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
On automated Flow Cytometric analysis for MRD estimation of Acute Lymphoblastic Leukaemia: A comparison among different approaches.
Proceedings of the 2015 IEEE International Conference on Bioinformatics and Biomedicine, 2015
2014
Collaborative creativity: The Music Room.
Pers. Ubiquitous Comput., 2014
2013
Particles cross-influence for entity grouping.
Proceedings of the 21st European Signal Processing Conference, 2013
Recognition of two-person interaction in multi-view surveillance video via proxemics cues and spatio-temporal interest points.
Proceedings of the Video Surveillance and Transportation Imaging Applications 2013, 2013
Exploiting visual search theory to infer social interactions.
Proceedings of the Multimedia Content and Mobile Devices 2013, 2013
Proceedings of the 2013 ACM SIGCHI Conference on Human Factors in Computing Systems, 2013
2012
Real Time Detection of Social Interactions in Surveillance Video.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012
2010
Exploiting DCT masking effect to improve the perceptual quality of data hiding.
Proceedings of the Image Processing: Algorithms and Systems VIII, 2010