Rita Cucchiara

Orcid: 0000-0002-2239-283X

Affiliations:
  • University of Modena and Reggio Emilia, Italy


According to our database1, Rita Cucchiara authored at least 497 papers between 1992 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Towards Retrieval-Augmented Architectures for Image Captioning.
ACM Trans. Multim. Comput. Commun. Appl., August, 2024

Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets.
Int. J. Comput. Vis., May, 2024

Trustworthy AI - Part III.
Computer, March, 2024

Video Surveillance and Privacy: A Solvable Paradox?
Computer, March, 2024

Unveiling the Truth: Exploring Human Gaze Patterns in Fake Images.
IEEE Signal Process. Lett., 2024

Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization.
IEEE Intell. Syst., 2024

Is Multiple Object Tracking a Matter of Specialization?
CoRR, 2024

TPP-Gaze: Modelling Gaze Dynamics in Space and Time with Neural Temporal Point Processes.
CoRR, 2024

Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments.
CoRR, 2024

Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training.
CoRR, 2024

Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection.
CoRR, 2024

KRONC: Keypoint-based Robust Camera Optimization for 3D Car Reconstruction.
CoRR, 2024

μgat: Improving Single-Page Document Parsing by Providing Multi-Page Context.
CoRR, 2024

Alfie: Democratising RGBA Image Generation With No $$$.
CoRR, 2024

Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization.
CoRR, 2024

UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation.
CoRR, 2024

Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities.
CoRR, 2024

Bringing Masked Autoencoders Explicit Contrastive Properties for Point Cloud Self-Supervised Learning.
CoRR, 2024

Mask and Compress: Efficient Skeleton-based Action Recognition in Continual Learning.
CoRR, 2024

Sharing Key Semantics in Transformer Makes Efficient Image Restoration.
CoRR, 2024

A Second-Order perspective on Compositionality and Incremental Learning.
CoRR, 2024

Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs.
CoRR, 2024

AIGeN: An Adversarial Approach for Instruction Generation in VLN.
CoRR, 2024

Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing.
CoRR, 2024

The (R)Evolution of Multimodal Large Language Models: A Survey.
CoRR, 2024

VATr++: Choose Your Words Wisely for Handwritten Text Generation.
CoRR, 2024

Key-Graph Transformer for Image Restoration.
CoRR, 2024

DistFormer: Enhancing Local and Global Features for Monocular Per-Object Distance Estimation.
CoRR, 2024

What's Outside the Intersection? Fine-grained Error Analysis for Semantic Segmentation Beyond IoU.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Large-Scale Transformer models for Transactional Data.
Proceedings of the Ital-IA Intelligenza Artificiale, 2024

Trends, Applications, and Challenges in Human Attention Modelling.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Mapping High-level Semantic Regions in Indoor Environments without Object Recognition.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Fluent and Accurate Image Captioning with a Self-trained Reward Model.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Trajectory Forecasting Through Low-Rank Adaptation of Discrete Latent Codes.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Binarizing Documents by Leveraging both Space and Frequency.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues.
Proceedings of the Computer Vision - ECCV 2024, 2024

Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas.
Proceedings of the Computer Vision - ECCV 2024, 2024

Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities.
Proceedings of the Computer Vision - ECCV 2024, 2024

AIGeN: An Adversarial Approach for Instruction Generation in VLN.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

The Revolution of Multimodal Large Language Models: A Survey.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Fully-attentive iterative networks for region-based controllable image and video captioning.
Comput. Vis. Image Underst., December, 2023

Evaluating synthetic pre-Training for handwriting processing tasks.
Pattern Recognit. Lett., August, 2023

Predicting gene and protein expression levels from DNA and protein sequences with Perceiver.
Comput. Methods Programs Biomed., June, 2023

Trustworthy AI - Part II.
Computer, May, 2023

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates.
Sensors, February, 2023

Trustworthy AI - Part 1.
Computer, February, 2023

Depth-based 3D human pose refinement: Evaluating the refinet framework.
Pattern Recognit. Lett., 2023

From Show to Tell: A Survey on Deep Learning-Based Image Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation.
CoRR, 2023

Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation.
CoRR, 2023

Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training.
CoRR, 2023

Multi-Class Explainable Unlearning for Image Classification via Weight Filtering.
CoRR, 2023

Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images.
CoRR, 2023

Positive-Augmented Constrastive Learning for Image and Video Captioning Evaluation.
CoRR, 2023

One Transformer for All Time Series: Representing and Training with Time-Dependent Heterogeneous Tabular Data.
CoRR, 2023

LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Deep Learning and Large Scale Models for Bank Transactions.
Proceedings of the Italia Intelligenza Artificiale, 2023

Where Research meets Industry: New Challenges and Opportunities at AImageLab.
Proceedings of the Italia Intelligenza Artificiale, 2023

Embodied Agents for Efficient Exploration and Smart Scene Description.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Input Perturbation Reduces Exposure Bias in Diffusion Models.
Proceedings of the International Conference on Machine Learning, 2023

Towards Explainable Navigation and Recounting.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

Enhancing Open-Vocabulary Semantic Segmentation with Prototype Retrieval.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023

How to Choose Pretrained Handwriting Recognition Models for Single Writer Fine-Tuning.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Volumetric Fast Fourier Convolution for Detecting Ink on the Carbonized Herculaneum Papyri.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

TrackFlow: Multi-Object Tracking with Normalizing Flows.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Handwritten Text Generation from Visual Archetypes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

HWD: A Novel Evaluation Score for Styled Handwritten Text Generation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Superpixel Positional Encoding to Improve ViT-based Semantic Segmentation Models.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Special Section on AI-empowered Multimedia Data Analytics for Smart Healthcare.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Transform, Warp, and Dress: A New Transformation-guided Model for Virtual Try-on.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Fine-grained Human Analysis under Occlusions and Perspective Constraints in Multimedia Surveillance.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Wind Turbine Power Curve Monitoring Based on Environmental and Operational Data.
IEEE Trans. Ind. Informatics, 2022

A computational approach for progressive architecture shrinkage in action recognition.
Softw. Pract. Exp., 2022

Focus on Impact: Indoor Exploration With Intrinsic Motivation.
IEEE Robotics Autom. Lett., 2022

Warp and Learn: Novel Views Generation for Vehicles and Other Objects.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Information fusion as an integrative cross-cutting enabler to achieve robust, explainable, and trustworthy medical artificial intelligence.
Inf. Fusion, 2022

Boosting modern and historical handwritten text recognition with deformable convolutions.
Int. J. Document Anal. Recognit., 2022

Explaining transformer-based image captioning models: An empirical analysis.
AI Commun., 2022

Maximum Class Separation as Inductive Bias in One Matrix.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Spot the Difference: A Novel Task for Embodied Agents in Changing Environments.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

CaMEL: Mean Teacher Learning for Image Captioning.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

SeeFar: Vehicle Speed Estimation and Flow Analysis from a Moving UAV.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

First Steps Towards 3D Pedestrian Detection and Tracking from Single Image.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Investigating Bidimensional Downsampling in Vision Transformer Models.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Embodied Navigation at the Art Gallery.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Consistency-Based Self-supervised Learning for Temporal Anomaly Localization.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Dress Code: High-Resolution Multi-Category Virtual Try-On.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dual-Branch Collaborative Transformer for Virtual Try-On.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Goal-driven Self-Attentive Recurrent Networks for Trajectory Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

The Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Retrieval-Augmented Transformer for Image Captioning.
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022

ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval.
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022

2021
A Systematic Comparison of Depth Map Representations for Face Recognition.
Sensors, 2021

Working Memory Connections for LSTM.
Neural Networks, 2021

Unifying tensor factorization and tensor nuclear norm approaches for low-rank tensor completion.
Neurocomputing, 2021

Video action detection by learning graph-based spatio-temporal interactions.
Comput. Vis. Image Underst., 2021

Multimodal attention networks for low-level vision-and-language navigation.
Comput. Vis. Image Underst., 2021

AC-VRNN: Attentive Conditional-VRNN for multi-future trajectory prediction.
Comput. Vis. Image Underst., 2021

Universal Captioner: Long-Tail Vision-and-Language Model Training through Content-Style Separation.
CoRR, 2021

From Show to Tell: A Survey on Image Captioning.
CoRR, 2021

SHREC 2021: Track on Skeleton-based Hand Gesture Recognition in the Wild.
CoRR, 2021

SHREC 2021: Skeleton-based hand gesture recognition in the wild.
Comput. Graph., 2021

Learning to Select: A Fully Attentive Approach for Novel Object Captioning.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Improving Indoor Semantic Segmentation with Boundary-Level Objectives.
Proceedings of the Advances in Computational Intelligence, 2021

FashionSearch++: Improving Consumer-to-Shop Clothes Retrieval with Hard Negatives.
Proceedings of the 11th Italian Information Retrieval Workshop 2021, 2021

MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking?
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Estimating (and Fixing) the Effect of Face Obfuscation in Video Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Revisiting the Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data.
Proceedings of the Computer Analysis of Images and Patterns, 2021

Out of the Box: Embodied Navigation in the Real World.
Proceedings of the Computer Analysis of Images and Patterns, 2021

Assessing the Role of Boundary-Level Objectives in Indoor Semantic Segmentation.
Proceedings of the Computer Analysis of Images and Patterns, 2021

Multi-Category Mesh Reconstruction From Image Collections.
Proceedings of the International Conference on 3D Vision, 2021

2020
Explaining digital humanities by aligning images and textual descriptions.
Pattern Recognit. Lett., 2020

Face-from-Depth for Head Pose Estimation on Depth Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

A unified cycle-consistent neural model for text and image retrieval.
Multim. Tools Appl., 2020

Multimodal Hand Gesture Classification for the Human-Car Interaction.
Informatics, 2020

Inter-Homines: Distance-Based Risk Estimation for Human Safety.
CoRR, 2020

Mercury: A Vision-Based Framework for Driver Monitoring.
Proceedings of the Intelligent Human Systems Integration 2020, 2020

SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

RMS-Net: Regression and Masking for Soccer Event Spotting.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

A Novel Attention-based Aggregation Function to Combine Vision and Language.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Future Urban Scenes Generation Through Vehicles Synthesis.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

DAG-Net: Double Attentive Graph Neural Network for Trajectory Forecasting.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Anomaly Detection, Localization and Classification for Railway Inspection.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

VITON-GT: An Image-based Virtual Try-On Model with Geometric Transformations.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

RefiNet: 3D Human Pose Refinement with Depth Maps.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Explore and Explain: Self-supervised Navigation and Recounting.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Baracca: a Multimodal Dataset for Anthropometric Measurements in Automotive.
Proceedings of the 2020 IEEE International Joint Conference on Biometrics, 2020

Anomaly Detection for Vision-Based Railway Inspection.
Proceedings of the Dependable Computing - EDCC 2020 Workshops, 2020

Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Meshed-Memory Transformer for Image Captioning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Conditional Channel Gated Networks for Task-Aware Continual Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Transformer-Based Network for Dynamic Hand Gesture Recognition.
Proceedings of the 8th International Conference on 3D Vision, 2020

2019
Self-Supervised Optical Flow Estimation by Projective Bootstrap.
IEEE Trans. Intell. Transp. Syst., 2019

Driver Face Verification with Depth Maps.
Sensors, 2019

Predicting the Driver's Focus of Attention: The DR(eye)VE Project.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

M-VAD names: a dataset for video captioning with naming.
Multim. Tools Appl., 2019

Can adversarial networks hallucinate occluded people with a plausible aspect?
Comput. Vis. Image Underst., 2019

M<sup>2</sup>: Meshed-Memory Transformer for Image Captioning.
CoRR, 2019

STAGE: Spatio-Temporal Attention on Graph Entities for Video Action Detection.
CoRR, 2019

Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation.
CoRR, 2019

Semi-parametric Object Synthesis.
CoRR, 2019

Anomaly Locality in Video Surveillance.
CoRR, 2019

Image-to-Image Translation to Unfold the Reality of Artworks: An Empirical Analysis.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Video Synthesis from Intensity and Event Frames.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Hand Gestures for the Human-Car Interaction: The Briareo Dataset.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Manual Annotations on Depth Maps for Human Pose Estimation.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-To-Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Latent Space Autoregression for Novelty Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Classifying Signals on Irregular Domains via Convolutional Cluster Pooling.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019


2018
Guest Editorial: Special Section on "Multimedia Understanding via Multimodal Analytics".
ACM Trans. Multim. Comput. Commun. Appl., 2018

Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention.
ACM Trans. Multim. Comput. Commun. Appl., 2018

Predicting Human Eye Fixations via an LSTM-Based Saliency Attentive Model.
IEEE Trans. Image Process., 2018

Attentive models in vision: Computing saliency maps in the deep learning era.
Intelligenza Artificiale, 2018

Learn to See by Events: RGB Frame Synthesis from Event Cameras.
CoRR, 2018

AND: Autoregressive Novelty Detectors.
CoRR, 2018

A Graph Transduction Game for Multi-target Tracking.
CoRR, 2018

Head Detection with Depth Images in the Wild.
Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018), 2018

Automatic Image Cropping and Selection Using Saliency: An Application to Historical Manuscripts.
Proceedings of the Digital Libraries and Multimedia Archives, 2018

Human Behaviour Understanding for Automotive and Surveillance.
Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods, 2018

Domain Translation with Conditional GANs: from Depth to RGB Face-to-Face.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Aligning Text and Document Illustrations: Towards Visually Explainable Digital Humanities.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Fully Convolutional Network for Head Detection with Depth Images.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Hands on the wheel: A Dataset for Driver Hand Detection and Tracking.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

What Was Monet Seeing While Painting? Translating Artworks to Photo-Realistic Images.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

End-to-End 6-DoF Object Pose Estimation Through Differentiable Rasterization.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World.
Proceedings of the Computer Vision - ECCV 2018, 2018

Towards Cycle-Consistent Models for Text and Image Retrieval.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Unsupervised Vehicle Re-Identification Using Triplet Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

SAM: Pushing the Limits of Saliency Prediction Models.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

LAMV: Learning to Align and Match Videos With Kernelized Temporal Layers.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Face Verification from Depth using Privileged Information.
Proceedings of the British Machine Vision Conference 2018, 2018

Learning to Generate Facial Depth Maps.
Proceedings of the 2018 International Conference on 3D Vision, 2018

2017
Affective level design for a role-playing videogame evaluated by a brain-computer interface and machine learning methods.
Vis. Comput., 2017

Personalized Egocentric Video Summarization of Cultural Tour on User Preferences Input.
IEEE Trans. Multim., 2017

Recognizing and Presenting the Storytelling Video Structure With Deep Multimodal Networks.
IEEE Trans. Multim., 2017

Guest Editorial Special Issue on Wearable and Ego-Vision Systems for Augmented Experience.
IEEE Trans. Hum. Mach. Syst., 2017

Tracking Social Groups Within and Across Cameras.
IEEE Trans. Circuits Syst. Video Technol., 2017

Segmentation models diversity for object proposals.
Comput. Vis. Image Underst., 2017

Video registration in egocentric vision under day and night illumination changes.
Comput. Vis. Image Underst., 2017

From Depth Data to Head Pose Estimation: A Siamese Approach.
Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) - Volume 5: VISAPP, Porto, Portugal, February 27, 2017

Learning where to attend like a human driver.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Embedded recurrent network for head pose estimation in car.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

A Video Library System Using Scene Detection and Automatic Tagging.
Proceedings of the Digital Libraries and Archives, 2017

Modeling multimodal cues in a deep learning-based framework for emotion recognition in the wild.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

Visual saliency for image captioning in new multimedia services.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Towards Video Captioning with Naming: A Novel Dataset and a Multi-modal Approach.
Proceedings of the Image Analysis and Processing - ICIAP 2017, 2017

Learning to Map Vehicles into Bird's Eye View.
Proceedings of the Image Analysis and Processing - ICIAP 2017, 2017

Fast and Accurate Facial Landmark Localization in Depth Images for In-Car Applications.
Proceedings of the Image Analysis and Processing - ICIAP 2017, 2017

POSEidon: Face-from-Depth for Driver Pose Estimation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Hierarchical Boundary-Aware Neural Encoder for Video Captioning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use.
Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, 2017

Generative adversarial models for people attribute recognition in surveillance.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

From Groups to Leaders and Back: Exploring Mutual Predictability Between Social Groups and Their Leaders.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016
Transductive People Tracking in Unconstrained Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2016

Exploring Architectural Details Through a Wearable Egocentric Vision Device.
Sensors, 2016

Socially Constrained Structural Learning for Groups Detection in Crowd.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Layout analysis and content enrichment of digitized books.
Multim. Tools Appl., 2016

An Indoor Location-Aware System for an IoT-Based Smart Museum.
IEEE Internet Things J., 2016

A Location-Aware Architecture for an IoT-Based Smart Museum.
Int. J. Electron. Gov. Res., 2016

Where Should You Attend While Driving?
CoRR, 2016

Shot, Scene and Keyframe Ordering for Interactive Video Re-use.
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016

Bridging the experiential gap in cultural visits with computer vision.
Proceedings of the 2nd IEEE International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow, 2016

A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Motion Segmentation using Visual and Bio-mechanical Features.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features.
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016

Layout Analysis and Content Classification in Digitized Books.
Proceedings of the Digital Libraries and Multimedia Archives, 2016

Deep Head Pose Estimation from Depth Data for In-Car Automotive Applications.
Proceedings of the Understanding Human Activities Through 3D Sensors, 2016

A deep multi-level network for saliency prediction.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Historical document digitization through layout analysis and deep content classification.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Fast gesture recognition with Multiple Stream Discrete HMMs on 3D skeletons.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Spotting prejudice with nonverbal behaviours.
Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2016

Performance Measures and a Data Set for Multi-target, Multi-camera Tracking.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Multi-level Net: A Visual Saliency Prediction Model.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Body Part Based Re-Identification from an Egocentric Perspective.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

DR(eye)VE: A Dataset for Attention-Based Tasks with Applications to Autonomous and Assisted Driving.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2016

Optimizing Image Registration for Interactive Applications.
Proceedings of the Augmented Reality, Virtual Reality, and Computer Graphics, 2016

2015
A General-Purpose Sensing Floor Architecture for Human-Environment Interaction.
ACM Trans. Interact. Intell. Syst., 2015

Active query process for digital video surveillance forensic applications.
Signal Image Video Process., 2015

Understanding social relationships in egocentric vision.
Pattern Recognit., 2015

Mapping Appearance Descriptors on 3D Body Models for People Re-identification.
Int. J. Comput. Vis., 2015

GOLD: Gaussians of Local Descriptors for image representation.
Comput. Vis. Image Underst., 2015

Innovative IoT-aware Services for a Smart Museum.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Classification of Affective Data to Evaluate the Level Design in a Role-Playing Videogame.
Proceedings of the 7th International Conference on Games and Virtual Worlds for Serious Applications, 2015

Egocentric Video Summarization of Cultural Tour based on User Preferences.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

A Deep Siamese Network for Scene Detection in Broadcast Videos.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Personalized Egocentric Video Summarization for Cultural Experience.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

Analysis and Re-Use of Videos in Educational Digital Libraries with Automatic Scene Detection.
Proceedings of the Digital Libraries on the Move, 2015

Wearable vision for retrieving architectural details in augmented tourist experiences.
Proceedings of the 7th International Conference on Intelligent Technologies for Interactive Entertainment, 2015

Scene segmentation using temporal clustering for accessing and re-using broadcast video.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Egocentric Video Personalization in Cultural Experiences Scenarios.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

Detection of Human Movements with Pressure Floor Sensors.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

Egocentric Object Tracking: An Odometry-Based Solution.
Proceedings of the Image Analysis and Processing - ICIAP 2015, 2015

Learning to Divide and Conquer for Online Multi-target Tracking.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Measuring Scene Detection Performance.
Proceedings of the Pattern Recognition and Image Analysis - 7th Iberian Conference, 2015

Learning to identify leaders in crowd.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video.
Proceedings of the Computer Analysis of Images and Patterns, 2015

Automatic configuration and calibration of modular sensing floors.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

Towards the evaluation of reproducible robustness in tracking-by-detection.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014
Benchmarking for Person Re-identification.
Proceedings of the Person Re-Identification, 2014

Detection of static groups and crowds gathered in open spaces by texture classification.
Pattern Recognit. Lett., 2014

Pattern recognition and crowd analysis.
Pattern Recognit. Lett., 2014

A fast and effective ellipse detector for embedded vision applications.
Pattern Recognit., 2014

Visual Tracking: An Experimental Survey.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

A complete system for garment segmentation and color classification.
Mach. Vis. Appl., 2014

3D Hough transform for sphere recognition on point clouds - A systematic study and a new method proposal.
Mach. Vis. Appl., 2014

Miniature illustrations retrieval and innovative interaction for digital illuminated manuscripts.
Multim. Syst., 2014

Visions for Augmented Cultural Heritage Experience.
IEEE Multim., 2014

Covariance of Covariance Features for Image Classification.
Proceedings of the International Conference on Multimedia Retrieval, 2014

On detection of novel categories and subcategories of images using incongruence.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Illustrations Segmentation in Digitized Documents Using Local Correlation Features.
Proceedings of the Pushing the Boundaries of the Digital Libraries Field, 2014

Kernelized Structural Classification for 3D Dogs Body Parts Detection.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Learning Graph Cut Energy Functions for Image Segmentation.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Head Pose Estimation in First-Person Camera Views.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Learning superpixel relations for supervised image segmentation.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Gesture Recognition in Ego-centric Videos Using Dense Trajectories and Hand Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

From Ego to Nos-Vision: Detecting Social Relationships in First-Person Views.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
People reidentification in surveillance and forensics: A survey.
ACM Comput. Surv., 2013

An automated picking workstation for healthcare applications.
Comput. Ind. Eng., 2013

Beyond Bag of Words for Concept Detection and Search of Cultural Heritage Archives.
Proceedings of the Similarity Search and Applications - 6th International Conference, 2013

Video surveillance online repository (ViSOR): www.openvisor.org.
Proceedings of the Multimedia Systems Conference 2013, 2013

Modeling local descriptors with multivariate gaussians for object and scene recognition.
Proceedings of the ACM Multimedia Conference, 2013

Hand segmentation for gesture recognition in EGO-vision.
Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices, 2013

Learning articulated body models for people re-identification.
Proceedings of the ACM Multimedia Conference, 2013

Automatic Single-Image People Segmentation and Removal for Cultural Heritage Imaging.
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2013, 2013

Image Classification with Multivariate Gaussian Descriptors.
Proceedings of the Image Analysis and Processing - ICIAP 2013, 2013

Lightweight sign recognition for mobile devices.
Proceedings of the Seventh International Conference on Distributed Smart Cameras, 2013

Human Behavior Understanding with Wide Area Sensing Floors.
Proceedings of the Human Behavior Understanding - 4th International Workshop, 2013

A fast approach for integrating ORB descriptors in the bag of words model.
Proceedings of the Multimedia Content and Mobile Devices 2013, 2013

A Mobile Vision System for Fast and Accurate Ellipse Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

UNIMORE at ImageCLEF 2013: Scalable Concept Image Annotation.
Proceedings of the Working Notes for CLEF 2013 Conference , 2013

Structured learning for detection of social groups in crowd.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

A people counting system for business analytics.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

Sensing floors for privacy-compliant surveillance of wide areas.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

Intelligent Video Surveillance as a Service.
Proceedings of the Intelligent Multimedia Surveillance - Current Trends and Research, 2013

2012
Multistage Particle Windows for Fast and Accurate Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Guest Editorial.
J. Multim., 2012

Real-time object detection and localization with SIFT-based clustering.
Image Vis. Comput., 2012

Relevance Feedback as an Interactive Navigation Tool.
Proceedings of the VISAPP 2012, 2012

Veiling Luminance estimation on FPGA-based embedded smart camera.
Proceedings of the 2012 IEEE Intelligent Vehicles Symposium, 2012

Learning non-target items for interesting clothes segmentation in fashion images.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

2D images map warping for improved user interaction.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Class-Based Color Bag of Words for Fashion Retrieval.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

A Human vs. Machine Challenge in Fashion Color Classification.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

People Orientation Recognition by Mixtures of Wrapped Distributions on Random Trees.
Proceedings of the Computer Vision - ECCV 2012, 2012

Understanding dyadic interactions applying proxemic theory on videosurveillance trajectories.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011
Mixtures of von Mises Distributions for People Trajectory Shape Analysis.
IEEE Trans. Circuits Syst. Video Technol., 2011

Probabilistic people tracking with appearance models and occlusion classification: The AD-HOC system.
Pattern Recognit. Lett., 2011

Vision based smoke detection system using image energy and color information.
Mach. Vis. Appl., 2011

Automatic segmentation of digitalized historical manuscripts.
Multim. Tools Appl., 2011

Contextual Information and Covariance Descriptors for People Surveillance: An Application for Safety of Construction Workers.
EURASIP J. Image Video Process., 2011

Detecting anomalies in people's trajectories using spectral graph analysis.
Comput. Vis. Image Underst., 2011

Multimedia for Cultural Heritage: Key Issues.
Proceedings of the Multimedia for Cultural Heritage - First International Workshop, 2011

Towards Artistic Collections Navigation Tools Based on Relevance Feedback.
Proceedings of the Multimedia for Cultural Heritage - First International Workshop, 2011

Joint ACM workshop on human gesture and behavior understanding: (J-HGBU'11).
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

3DPeS: 3D people dataset for surveillance and forensics.
Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011

Relevance feedback strategies for artistic image collections tagging.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

Optimal Decision Trees Generation from OR-Decision Tables.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

SARC3D: A New 3D Body Model for People Tracking and Re-identification.
Proceedings of the Image Analysis and Processing - ICIAP 2011, 2011

Identification of intruders in groups of people using cameras and RFIDs.
Proceedings of the 2011 Fifth ACM/IEEE International Conference on Distributed Smart Cameras, 2011

Energy-efficient feedback tracking on embedded smart cameras by hardware-level optimization.
Proceedings of the 2011 Fifth ACM/IEEE International Conference on Distributed Smart Cameras, 2011

Iterative active querying for surveillance data retrieval in crime detection and forensics.
Proceedings of the 4th International Conference on Imaging for Crime Detection and Prevention, 2011

People appearance tracing in video by spectral graph transduction.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

Multi-view people surveillance using 3D information.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

An Evidential Fusion Architecture for People Surveillance in Wide Open Areas.
Proceedings of the Hybrid Artificial Intelligent Systems - 6th International Conference, 2011

A real-time embedded solution for skew correction in banknote analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

A reasoning engine for intruders' localization in wide open areas using a network of cameras and RFIDs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

Energy-efficient foreground object detection on embedded smart cameras by hardware-level operations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2011

A multi-stage pedestrian detection using monolithic classifiers.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Appearance tracking by transduction in surveillance scenarios.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Feature Space Warping Relevance Feedback with Transductive Learning.
Proceedings of the Advances Concepts for Intelligent Vision Systems, 2011

2010
Moving Pixels in Static Cameras: Detecting Dangerous Situations due to Environment or People.
Proceedings of the Intelligent Multimedia Analysis for Security Applications, 2010

Optimized Block-Based Connected Components Labeling With Decision Trees.
IEEE Trans. Image Process., 2010

Video Surveillance Online Repository (ViSOR): an integrated framework.
Multim. Tools Appl., 2010

Markerless body part tracking for action recognition.
Int. J. Multim. Intell. Secur., 2010

Bag-of-words classification of miniature illustrations.
Proceedings of the 11th International Workshop on Image Analysis for Multimedia Interactive Services, 2010

When multimedia meets surveillance and forensics in people security.
Proceedings of the 2nd ACM workshop on Multimedia in forensics, security and intelligence, 2010

Rerum novarum: interactive exploration of illuminated manuscripts.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Surfing on artistic documents with visually assisted tagging.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

A videosurveillance data browsing software architecture for forensics: from trajectories similarities to video fragments.
Proceedings of the 2nd ACM workshop on Multimedia in forensics, security and intelligence, 2010

Improving Classification and Retrieval of Illuminated Manuscript with Semantic Information.
Proceedings of the Digital Libraries - 6th Italian Research Conference, 2010

HMM Based Action Recognition with Projection Histogram Features.
Proceedings of the Recognizing Patterns in Signals, Speech, Images and Videos, 2010

Decision Trees for Fast Thinning Algorithms.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Alignment-Based Similarity of People Trajectories Using Semi-directional Statistics.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Mutual calibration of camera motes and RFIDs for people localization and identification.
Proceedings of the 2010 Fourth ACM/IEEE International Conference on Distributed Smart Cameras, Atlanta, GA, USA - August 31, 2010

3D Body Model Construction and Matching for Real Time People Re-Identification.
Proceedings of the Eurographics Italian Chapter Conference 2010, Genova, Italy, 2010, 2010

Multi-stage Sampling with Boosting Cascades for Pedestrian Detection in Images and Videos.
Proceedings of the Computer Vision - ECCV 2010, 2010

High Performance Connected Components Labeling on FPGA.
Proceedings of the Database and Expert Systems Applications, 2010

Event driven software architecture for multi-camera and distributed surveillance research systems.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Perspective and appearance context for people surveillance in open areas.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

People trajectory mining with statistical pattern recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010

Unsupervised learning in body-area networks.
Proceedings of the 5th International ICST Conference on Body Area Networks, 2010

Fast Background Initialization with Recursive Hadamard Transform.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

2009
Dynamic Pictorially Enriched Ontologies for Digital Video Libraries.
IEEE Multim., 2009

A fast multi-model approach for object duplicate extraction.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2009), 2009

Automatic Analysis of Historical Manuscripts.
Proceedings of the Pattern Recognition in Information Systems, 2009

Multiple Object Detection for Pick-and-Place Applications.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2009), 2009

Multimedia in forensics.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Video surveillance and multimedia forensics: an application to trajectory analysis.
Proceedings of the First ACM workshop on Multimedia in forensics, 2009

An efficient Bayesian framework for on-line action recognition.
Proceedings of the International Conference on Image Processing, 2009

Fast block based connected components labeling.
Proceedings of the International Conference on Image Processing, 2009

Pathnodes Integration of Standalone Particle Filters for People Tracking on Distributed Surveillance Systems.
Proceedings of the Image Analysis and Processing, 2009

Connected Component Labeling Techniques on Modern Architectures.
Proceedings of the Image Analysis and Processing, 2009

Color Features Performance Comparison for Image Retrieval.
Proceedings of the Image Analysis and Processing, 2009

Covariance descriptors on moving regions for human detection in very complex outdoor scenes.
Proceedings of the Third ACM/IEEE International Conference on Distributed Smart Cameras, 2009

A real-time system for abnormal path detection.
Proceedings of the 3rd International Conference on Imaging for Crime Detection and Prevention, 2009

Picture extraction from digitized historical manuscripts.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

Learning People Trajectories Using Semi-directional Statistics.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

Statistical Pattern Recognition for Multi-Camera Detection, Tracking, and Trajectory Analysis.
Proceedings of the Multi-Camera Networks, 2009

2008
Video Streaming for Mobile Video Surveillance.
IEEE Trans. Multim., 2008

Bayesian-Competitive Consistent Labeling for People Surveillance.
IEEE Trans. Pattern Anal. Mach. Intell., 2008

HECOL: Homography and epipolar-based consistent labeling for outdoor park surveillance.
Comput. Vis. Image Underst., 2008

Pervasive Self-Learning with Multi-modal Distributed Sensors.
Proceedings of the Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems, 2008

Enabling technologies on hybrid camera networks for behavioral analysis of unattended indoor environments and their surroundings.
Proceedings of the 1st ACM Workshop on Vision Networks for Behavior Analysis, 2008

"Inside the bible": segmentation, annotation and retrieval for a new browsing experience.
Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, 2008

Smoke Detection in Video Surveillance: A MoG Model in the Wavelet Domain.
Proceedings of the Computer Vision Systems, 6th International Conference, 2008

Describing texture directions with Von Mises distributions.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Mobile Video Surveillance Systems: An Architectural Overview.
Proceedings of the Mobile Multimedia Processing: Fundamentals, 2008

ViSOR: VIdeo Surveillance On-line Repository for annotation retrieval.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Reliable smoke detection in the domains of image energy and color.
Proceedings of the International Conference on Image Processing, 2008

A markerless approach for consistent action recognition in a multi-camera system.
Proceedings of the 2008 Second ACM/IEEE International Conference on Distributed Smart Cameras, 2008

Using circular statistics for trajectory shape analysis.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

Smoke detection in video surveillance: the use of ViSOR (video surveillance on-line repository).
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008

Annotation Collection and Online Performance Evaluation for Video Surveillance: The ViSOR Project.
Proceedings of the Fifth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2008

Action Signature: A Novel Holistic Representation for Action Recognition.
Proceedings of the Fifth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2008

2007
Linear Transition Detection as a Unified Shot Detection Approach.
IEEE Trans. Circuits Syst. Video Technol., 2007

Expert environments: machine intelligence methods for ambient intelligence.
Expert Syst. J. Knowl. Eng., 2007

A multi-camera vision system for fall detection and alarm generation.
Expert Syst. J. Knowl. Eng., 2007

Compressed Domain Features Extraction for Shot Characterization.
Proceedings of the 1st International Workshop on Knowledge Acquisition from Multimedia Content KAMC'07, 2007

Mobile video surveillance with low-bandwidth low-latency video streaming.
Proceedings of the international workshop on Workshop on mobile video, 2007

Dynamic pictorial ontologies for video digital libraries annotation.
Proceedings of the 1st ACM Workshop on The Many Faces of Multimedia Semantics, 2007

Network patterns recognition for automatic dermatologic images classification.
Proceedings of the Medical Imaging 2007: Image Processing, 2007

An Open Source Architecture for Low-Latency Video Streaming on PDAs.
Proceedings of the Ninth IEEE International Symposium on Multimedia, 2007

Efficient Stereo Vision for Obstacle Detection and AGV Navigation.
Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

A Dynamic Programming Technique for Classifying Trajectories.
Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

A Distributed Outdoor Video Surveillance System for Detection of Abnormal People Trajectories.
Proceedings of the 2007 First ACM/IEEE International Conference on Distributed Smart Cameras, 2007

Video Shots Comparison using the Mallows Distance.
Proceedings of the 18th International Workshop on Database and Expert Systems Applications (DEXA 2007), 2007

Video Transcoding and Streaming for Mobile Applications.
Proceedings of the Digital Libraries: Research and Development, 2007

Prototypes Selection with Context Based Intra-class Clustering for Video Annotation with Mpeg7 Features.
Proceedings of the Digital Libraries: Research and Development, 2007

Similarity-Based Retrieval with MPEG-7 3D Descriptors: Performance Evaluation on the Princeton Shape Benchmark.
Proceedings of the Digital Libraries: Research and Development, 2007

Enhancing HSV histograms with achromatic points detection for video retrieval.
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007

Detection of abnormal behaviors using a mixture of Von Mises distributions.
Proceedings of the Fourth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2007

2006
Semantic adaptation of sport videos with user-centred performance analysis.
IEEE Trans. Multim., 2006

A system for automatic face obscuration for privacy purposes.
Pattern Recognit. Lett., 2006

A semi-automatic system for segmentation of cardiac M-mode images.
Pattern Anal. Appl., 2006

Special issue on multimedia Surveillance systems: guest editorial.
Multim. Syst., 2006

University of Modena and Reggio Emilia at TRECVID 2006.
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006

PEANO: pictorial enriched annotation of video.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Reliable background suppression for complex scenes.
Proceedings of the 4th ACM International Workshop on Video Surveillance and Sensor Networks, 2006

Multimedia surveillance: content-based retrieval with multicamera people tracking.
Proceedings of the 4th ACM International Workshop on Video Surveillance and Sensor Networks, 2006

MOM: multimedia ontology manager. A framework for automatic annotation and semantic retrieval of video sequences.
Proceedings of the 14th ACM International Conference on Multimedia, 2006

Comparison of color clustering algorithms for segmentation of dermatological images.
Proceedings of the Medical Imaging 2006: Image Processing, 2006

Distance transform for automatic dermatologic images composition.
Proceedings of the Medical Imaging 2006: Image Processing, 2006

A Semi-Automatic Video Annotation tool with MPEG-7 Content Collections.
Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006

Low-Latency Live Video Streaming over Low-Capacity Networks.
Proceedings of the Eigth IEEE International Symposium on Multimedia (ISM 2006), 2006

Sub-Shot Summarization for MPEG-7 based Fast Browsing.
Proceedings of the Post-proceedings of the Second Italian Research Conference on Digital Library Management Systems (IRCDL 2006), 2006

Performance of the MPEG-7 Shape Spectrum Descriptor for 3D Objects Retrieval.
Proceedings of the Post-proceedings of the Second Italian Research Conference on Digital Library Management Systems (IRCDL 2006), 2006

Fast Dynamic Mosaicing and Person Following.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Estimating Geospatial Trajectory of a Moving Camera.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Line Detection and Texture Characterization of Network Patterns.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

FaceMouse: A Human-Computer Interface for Tetraplegic People.
Proceedings of the Computer Vision in Human-Computer Interaction, 2006

3-D Virtual Environments on Mobile Devices for Remote Surveillance.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

Group Detection at Camera Handoff for Collecting People Appearance in Multi-camera Systems.
Proceedings of the Advanced Video and Signal Based Surveillance, 2006

2005
Probabilistic posture classification for Human-behavior analysis.
IEEE Trans. Syst. Man Cybern. Part A, 2005

Guest Editorial: Special Issue on Video Segmentation for Semantic Annotation and Transcoding.
Multim. Tools Appl., 2005

An Integrated Framework for Semantic Annotation and Adaptation.
Multim. Tools Appl., 2005

Predictive and Probabilistic Tracking to Detect Stopped Vehicles.
Proceedings of the 7th IEEE Workshop on Applications of Computer Vision / IEEE Workshop on Motion and Video Computing (WACV/MOTION 2005), 2005

Assessing Temporal Coherence for Posture Classification with Large Occlusions.
Proceedings of the 7th IEEE Workshop on Applications of Computer Vision / IEEE Workshop on Motion and Video Computing (WACV/MOTION 2005), 2005

Video Understanding and Content-Based Retrieval.
Proceedings of the 2005 TREC Video Retrieval Evaluation, 2005

Multimedia surveillance systems.
Proceedings of the Third ACM International Workshop on Video Surveillance & Sensor Networks, 2005

On the usefulness of object shape coding with MPEG-4.
Proceedings of the Seventh IEEE International Symposium on Multimedia (ISM 2005), 2005

MPEG-7 Compliant Shot Detection in Sport Videos.
Proceedings of the Seventh IEEE International Symposium on Multimedia (ISM 2005), 2005

Video Annotation with Pictorially Enriched Ontologies.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Posture classification in a multi-camera indoor environment.
Proceedings of the 2005 International Conference on Image Processing, 2005

Shot Detection and Motion Analysis for Automatic MPEG-7 Annotation of Sports Videos.
Proceedings of the Image Analysis and Processing, 2005

Consistent Labeling for Multi-camera Object Tracking.
Proceedings of the Image Analysis and Processing, 2005

Domain Knowledge Extension with Pictorially Enriched Ontologies.
Proceedings of the Computer Analysis of Images and Patterns, 11th International Conference, 2005

Entry edge of field of view for multi-camera tracking in distributed video surveillance.
Proceedings of the Advanced Video and Signal Based Surveillance, 2005

2004
Introduction to the Special Section on In-Vehicle Computer Vision Systems.
IEEE Trans. Veh. Technol., 2004

Neighbor cache prefetching for multimedia image and video processing.
IEEE Trans. Multim., 2004

Real-time motion segmentation from moving cameras.
Real Time Imaging, 2004

An Intelligent Surveillance System for Dangerous Situation Detection in Home Environments.
Intelligenza Artificiale, 2004

Track-based and object-based occlusion for people tracking refinement in indoor surveillance.
Proceedings of the ACM 2nd International Workshop on Video Surveillance & Sensor Networks, 2004

Semantic video adaptation based on automatic annotation of sport videos.
Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2004

Color Calibration for a Dermatological Video Camera System.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Probabilistic People Tracking for Occlusion Handling.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Object-Based and Event-Based Semantic Video Adaptation.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Content-based video adaptation with user's preferences.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003
A New Algorithm for Border Description of Polarized Light Surface Microscopic Images of Pigmented Skin Lesions.
IEEE Trans. Medical Imaging, 2003

Detecting Moving Shadows: Algorithms and Evaluation.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Detecting Moving Objects, Ghosts, and Shadows in Video Streams.
IEEE Trans. Pattern Anal. Mach. Intell., 2003

Improving Data Prefetching Efficacy in Multimedia Applications.
Multim. Tools Appl., 2003

Semantic Video Transcoding Using Classes of Relevance.
Int. J. Image Graph., 2003

Tuning Range Image Segmentation by Genetic Algorithm.
EURASIP J. Adv. Signal Process., 2003

Object and event detection for semantic annotation and transcoding.
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003

A machine learning approach for human posture detection in domotics applications.
Proceedings of the 12th International Conference on Image Analysis and Processing (ICIAP 2003), 2003

A Hough transform-based method for radial lens distortion correction.
Proceedings of the 12th International Conference on Image Analysis and Processing (ICIAP 2003), 2003

Object Segmentation in Videos from Moving Camera with MRFs on Color and Motion Features.
Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), 2003

2002
Semantic transcoding for live video server.
Proceedings of the 10th ACM International Conference on Multimedia 2002, 2002

Building the Topological Tree by Recursive FCM Color Clustering.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

Improvement in Range Segmentation Parameters Tuning.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

2001
From Eager to Lazy Constrained Data Acquisition: A General Framework.
New Gener. Comput., 2001

An application of machine learning and statistics to defect detection.
Intell. Data Anal., 2001

Detecting Objects, Shadows and Ghosts in Video Streams by Exploiting Color and Motion Information.
Proceedings of the 11th International Conference on Image Analysis and Processing (ICIAP 2001), 2001

Analysis and Detection of Shadows in Video Streams: A Comparative Evaluation.
Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2001), 2001

2000
Image analysis and rule-based reasoning for a traffic monitoring system.
IEEE Trans. Intell. Transp. Syst., 2000

Computational models for image processing for shared-memory multiprocessors.
Integr. Comput. Aided Eng., 2000

Optimal Range Segmentation Parameters through Genetic Algorithms.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Hardware Prefetching Techniques for Cache Memories in Multimedia Applications.
Proceedings of the Fifth International Workshop on Computer Architectures for Machine Perception (CAMP 2000), 2000

Focus based Feature Extraction for Pallets Recognition.
Proceedings of the British Machine Vision Conference 2000, 2000

1999
Eliciting visual primitives for detection of elongated shapes.
Image Vis. Comput., 1999

Constraint Propagation and Value Acquisition: Why we should do it Interactively.
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

Vehicle Detection under Day and Night Illumination.
Proceedings of the Third ICSC Symposia on Intelligent Industrial Automation (IIA'99) and Soft Computing (SOCO'99), 1999

Exploiting Cache in Multimedia.
Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Real-Time Detection of Moving Vehicles.
Proceedings of the 1oth International Conference on Image Analysis and Processing (ICIAP 1999), 1999

3D Object Recognition by VC-Graphs and Interactive Constraint Satisfaction.
Proceedings of the 1oth International Conference on Image Analysis and Processing (ICIAP 1999), 1999

1998
The Vector-Gradient Hough Transform.
IEEE Trans. Pattern Anal. Mach. Intell., 1998

Genetic algorithms for clustering in machine vision.
Mach. Vis. Appl., 1998

A real-time hardware implementation of the hough transform.
J. Syst. Archit., 1998

Exploiting image processing locality in cache pre-fetching.
Proceedings of the 5th International Conference On High Performance Computing, 1998

Interactive Constraint Satisfaction and its Application to Visual Object Recognition.
Proceedings of the 1998 Joint Conference on Declarative Programming, 1998

1997
The GIOTTO System: a Parallel Computer for Image Processing.
Real Time Imaging, 1997

Block processing on multiprocessor DSPs for multimedia applications.
Proceedings of the First IEEE Workshop on Multimedia Signal Processing, 1997

An Interactive Constraint-Based System for Selective Attention in Visual Search.
Proceedings of the Foundations of Intelligent Systems, 10th International Symposium, 1997

Exploiting Symbolic Learning in Visual Inspection.
Proceedings of the Advances in Intelligent Data Analysis, 1997

Learning for Feature Selection and Shape Detection.
Proceedings of the Image Analysis and Processing, 9th International Conference, 1997

1996
The vector-gradient Hough transform for identifying straight-translation generated shapes.
Proceedings of the 13th International Conference on Pattern Recognition, 1996

Detection of luminosity profiles of elongated shapes.
Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

1995
Detection of Circular Objects by Wave Propagation on a Mesh-Connected Computer.
J. Parallel Distributed Comput., 1995

A Highly Selective HT based Algorithm for Detecting Extended, Almost Rectilinear Shapes.
Proceedings of the Computer Analysis of Images and Patterns, 6th International Conference, 1995

1993
Reconfiguring the boundaries of a mesh-connected array of processors with run-time programmable logic.
Microprocess. Microsystems, 1993

Processing of variable size images on a cellular array: Performance analysis with the Abingdon Cross Benchmark.
Proceedings of the International Conference on Application-Specific Array Processors, 1993

1992
Analysis of Design Methodology with Logic Cell Arrays.
Proceedings of the Fifth International Conference on VLSI Design, 1992


  Loading...