Muzammal Naseer

Orcid: 0000-0001-7663-7161

According to our database1, Muzammal Naseer authored at least 65 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024
Guidance Through Surrogate: Toward a Generic Diagnostic Attack.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models.
CoRR, 2024

UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities.
CoRR, 2024

Y-CA-Net: A Convolutional Attention Based Network for Volumetric Medical Image Segmentation.
CoRR, 2024

CDChat: A Large Multimodal Model for Remote Sensing Change Description.
CoRR, 2024

Distillation-free Scaling of Large SSMs for Images and Videos.
CoRR, 2024

STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models.
CoRR, 2024

Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors.
CoRR, 2024

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs.
CoRR, 2024

Towards Evaluating the Robustness of Visual State Space Models.
CoRR, 2024

On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models.
CoRR, 2024

Multi-Granularity Language-Guided Multi-Object Tracking.
CoRR, 2024

Multi-modal Generation via Cross-Modal In-Context Learning.
CoRR, 2024

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs.
CoRR, 2024

VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding.
CoRR, 2024

Learning to Prompt with Text Only Supervision for Vision-Language Models.
CoRR, 2024

Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.
CoRR, 2024

Learnable weight initialization for volumetric medical image segmentation.
Artif. Intell. Medicine, 2024

Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

PromptSmooth: Certifying Robustness of Medical Vision-Language Models via Prompt Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

BAPLe: Backdoor Attacks on Medical Foundational Models Using Prompt Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024 Workshops, 2024

Language Guided Domain Generalized Medical Image Segmentation.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

VideoGrounding-DINO: Towards Open-Vocabulary Spatio- Temporal Video Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Composed Video Retrieval via Enriched Context and Discriminative Embeddings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GeoChat: Grounded Large Vision-Language Model for Remote Sensing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes.
Proceedings of the Computer Vision - ACCV 2024, 2024

S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Stylized Adversarial Defense.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Enhancing Novel Object Detection via Cooperative Foundational Models.
CoRR, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.
CoRR, 2023

Videoprompter: an ensemble of foundational models for zero-shot video understanding.
CoRR, 2023

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.
CoRR, 2023

Foundational Models Defining a New Era in Vision: A Survey and Outlook.
CoRR, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Boosting Adversarial Transferability using Dynamic Cues.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FLIP: Cross-domain Face Anti-spoofing with Language Guidance.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-regulating Prompts: Foundational Model Adaptation without Forgetting.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Transformers in Vision: A Survey.
ACM Comput. Surv., January, 2022

Guidance Through Surrogate: Towards a Generic Diagnostic Attack.
CoRR, 2022

On Improving Adversarial Transferability of Vision Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Self-supervised Video Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

How to Train Vision Transformer on Small-scale Datasets?
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Self-distilled Vision Transformer for Domain Generalization.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Intriguing Properties of Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Orthogonal Projection Loss.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Generating Transferable Targeted Perturbations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rich Semantics Improve Few-Shot Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
A Self-supervised Approach for Adversarial Robustness.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey.
IEEE Access, 2019

Local Gradients Smoothing: Defense Against Localized Adversarial Attacks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Cross-Domain Transferability of Adversarial Perturbations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Distorting Neural Representations to Generate Highly Transferable Adversarial Examples.
CoRR, 2018

Indoor Scene Understanding in 2.5/3D: A Survey.
CoRR, 2018


  Loading...