Lorenzo Baraldi
Orcid: 0000-0001-5125-4957Affiliations:
- University of Pisa, Pisa, Toscana, Italy - professor
According to our database1,
Lorenzo Baraldi
authored at least 133 papers
between 2013 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
ACM Trans. Multim. Comput. Commun. Appl., August, 2024
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets.
Int. J. Comput. Vis., May, 2024
IEEE Intell. Syst., 2024
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization.
IEEE Intell. Syst., 2024
CoRR, 2024
Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation.
CoRR, 2024
Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering.
CoRR, 2024
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments.
CoRR, 2024
Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection.
CoRR, 2024
CoRR, 2024
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities.
CoRR, 2024
CoRR, 2024
What's Outside the Intersection? Fine-grained Error Analysis for Semantic Segmentation Beyond IoU.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024
Proceedings of the Pattern Recognition - 27th International Conference, 2024
Proceedings of the Pattern Recognition - 27th International Conference, 2024
Proceedings of the Pattern Recognition - 27th International Conference, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities.
Proceedings of the Computer Vision - ECCV 2024, 2024
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Multimodal Technol. Interact., December, 2023
Fully-attentive iterative networks for region-based controllable image and video captioning.
Comput. Vis. Image Underst., December, 2023
Pattern Recognit. Lett., August, 2023
Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates.
Sensors, February, 2023
IEEE Trans. Pattern Anal. Mach. Intell., 2023
Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation.
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
CoRR, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the Italia Intelligenza Artificiale, 2023
Proceedings of the IEEE International Conference on Robotics and Automation, 2023
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023
Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis.
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023
Proceedings of the Image Analysis and Processing - ICIAP 2023, 2023
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the 34th British Machine Vision Conference 2023, 2023
2022
Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach.
ACM Trans. Multim. Comput. Commun. Appl., 2022
A computational approach for progressive architecture shrinkage in action recognition.
Softw. Pract. Exp., 2022
IEEE Robotics Autom. Lett., 2022
Boosting modern and historical handwritten text recognition with deformable convolutions.
Int. J. Document Anal. Recognit., 2022
AI Commun., 2022
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
The Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval.
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022
2021
Comput. Vis. Image Underst., 2021
Comput. Vis. Image Underst., 2021
Universal Captioner: Long-Tail Vision-and-Language Model Training through Content-Style Separation.
CoRR, 2021
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021
Proceedings of the Advances in Computational Intelligence, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
Revisiting the Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021
Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data.
Proceedings of the Computer Analysis of Images and Patterns, 2021
Proceedings of the Computer Analysis of Images and Patterns, 2021
Proceedings of the Computer Analysis of Images and Patterns, 2021
2020
Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling.
IEEE Trans. Image Process., 2020
Pattern Recognit. Lett., 2020
Multim. Tools Appl., 2020
Toward reliable experiments on the performance of Connected Components Labeling algorithms.
J. Real Time Image Process., 2020
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions.
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the Augmented Reality, Virtual Reality, and Computer Graphics, 2020
2019
CoRR, 2019
Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation.
CoRR, 2019
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019
Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-To-Image Translation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the 9th International Conference on Cloud Computing and Services Science, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
2018
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention.
ACM Trans. Multim. Comput. Commun. Appl., 2018
IEEE Trans. Image Process., 2018
Intelligenza Artificiale, 2018
Proceedings of the Reproducible Research in Pattern Recognition, 2018
Automatic Image Cropping and Selection Using Saliency: An Application to Historical Manuscripts.
Proceedings of the Digital Libraries and Multimedia Archives, 2018
Proceedings of the IEEE International Conference on Image Processing, 2018
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Aligning Text and Document Illustrations: Towards Visually Explainable Digital Humanities.
Proceedings of the 24th International Conference on Pattern Recognition, 2018
What Was Monet Seeing While Painting? Translating Artworks to Photo-Realistic Images.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
2017
Recognizing and Presenting the Storytelling Video Structure With Deep Multimodal Networks.
IEEE Trans. Multim., 2017
Proceedings of the Digital Libraries and Archives, 2017
Modeling multimodal cues in a deep learning-based framework for emotion recognition in the wild.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017
Proceedings of the Image Analysis and Processing - ICIAP 2017, 2017
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing, 2017
2016
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016
A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
Proceedings of the Digital Libraries and Multimedia Archives, 2016
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Historical document digitization through layout analysis and deep content classification.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2016
2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Analysis and Re-Use of Videos in Educational Digital Libraries with Automatic Scene Detection.
Proceedings of the Digital Libraries on the Move, 2015
Scene segmentation using temporal clustering for accessing and re-using broadcast video.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015
Proceedings of the Pattern Recognition and Image Analysis - 7th Iberian Conference, 2015
Proceedings of the Computer Analysis of Images and Patterns, 2015
2014
Gesture Recognition in Ego-centric Videos Using Dense Trajectories and Hand Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014
2013
Proceedings of the 3rd ACM international workshop on Interactive multimedia on mobile & portable devices, 2013