Mustafa Shukor

Victor Guilherme Turrisi da Costa

Comput. Vis. Image Underst., 2024

Multimodal Autoregressive Pre-training of Large Vision Encoders.

[BibT_eX]

[DOI]

CoRR, 2024

Skipping Computations in Multimodal LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features.

[BibT_eX]

[DOI]

Paul Couairon

Jean-Emmanuel Haugeard

Barbara Toniella Corradini

CoRR, 2024

FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2024

Improved Baselines for Data-efficient Perceptual Augmentation of LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

A Concept-Based Explainability Framework for Large Multimodal Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut.

[BibT_eX]

[DOI]

Paul Couairon

Jean-Emmanuel Haugeard

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

What Makes Multimodal In-Context Learning Work?

[BibT_eX]

[DOI]

Folco Bertini Baldassini

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Sci-Net: scale-invariant model for buildings segmentation from aerial imagery.

[BibT_eX]

[DOI]

Hasan Nasrallah

Ali J. Ghandour

Signal Image Video Process., 2023

Zero-Shot Refinement of Buildings' Segmentation Models using SAM.

[BibT_eX]

[DOI]

CoRR, 2023

Extending CAM-based XAI methods for Remote Sensing Imagery Segmentation.

[BibT_eX]

[DOI]

Abdul Karim Gizzini

Ali J. Ghandour

CoRR, 2023

Empirical Study of PEFT techniques for Winter Wheat Segmentation.

[BibT_eX]

[DOI]

CoRR, 2023

Unified Model for Image, Video, Audio and Language Tasks.

[BibT_eX]

[DOI]

CoRR, 2023

Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

eP-ALM: Efficient Perceptual Augmentation of Language Models.

[BibT_eX]

[DOI]

Corentin Dancette

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

Structured Vision-Language Pretraining for Computational Cooking.

[BibT_eX]

[DOI]

CoRR, 2022

Video Coding using Learned Latent GAN Compression.

[BibT_eX]

[DOI]

Bharath Bhushan Damodaran

Xu Yao

Pierre Hellier

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Semantic Unfolding of Stylegan Latent Space.

[BibT_eX]

[DOI]

Bharath Bhushan Damodaran

Xu Yao

Pierre Hellier

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment.

[BibT_eX]

[DOI]

Guillaume Couairon

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021

Buildings Classification using Very High Resolution Satellite Imagery.

[BibT_eX]

[DOI]

CoRR, 2021

Semantic and Geometric Unfolding of StyleGAN Latent Space.

[BibT_eX]

[DOI]