Mustafa Shukor

According to our database, Mustafa Shukor authored at least 25 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval.
Comput. Vis. Image Underst., 2024

Skipping Computations in Multimodal LLMs.
CoRR, 2024

A Concept-Based Explainability Framework for Large Multimodal Models.
CoRR, 2024

Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features.
CoRR, 2024

Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs.
CoRR, 2024

FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models.
CoRR, 2024

Improved Baselines for Data-efficient Perceptual Augmentation of LLMs.
CoRR, 2024

Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

What Makes Multimodal In-Context Learning Work?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.
Trans. Mach. Learn. Res., 2023

Sci-Net: scale-invariant model for buildings segmentation from aerial imagery.
Signal Image Video Process., 2023

Zero-Shot Refinement of Buildings' Segmentation Models using SAM.
CoRR, 2023

Extending CAM-based XAI methods for Remote Sensing Imagery Segmentation.
CoRR, 2023

Empirical Study of PEFT techniques for Winter Wheat Segmentation.
CoRR, 2023

Unified Model for Image, Video, Audio and Language Tasks.
CoRR, 2023

Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

eP-ALM: Efficient Perceptual Augmentation of Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Structured Vision-Language Pretraining for Computational Cooking.
CoRR, 2022

Video Coding using Learned Latent GAN Compression.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Semantic Unfolding of StyleGAN Latent Space.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Buildings Classification using Very High Resolution Satellite Imagery.
CoRR, 2021

Semantic and Geometric Unfolding of StyleGAN Latent Space.
CoRR, 2021

Synthetic training data generation for deep learning based quality inspection.
CoRR, 2021
