Salman H. Khan

Ming-Hsuan Yang

IEEE Trans. Medical Imaging, September, 2024

Guidance Through Surrogate: Toward a Generic Diagnostic Attack.

IEEE Trans. Neural Networks Learn. Syst., February, 2024

Robust Perception and Precise Segmentation for Scribble-Supervised RGB-D Saliency Detection.

IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

CT-VOS: Cutout prediction and tagging for self-supervised video object segmentation.

Comput. Vis. Image Underst., January, 2024

Remote Sensing Change Detection With Transformers Trained From Scratch.

IEEE Trans. Geosci. Remote. Sens., 2024

ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection.

IEEE Trans. Geosci. Remote. Sens., 2024

Discriminative Image Generation with Diffusion Models for Zero-Shot Learning.

CoRR, 2024

EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues.

Muhammad Sohail Danish

CoRR, 2024

UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities.

CoRR, 2024

Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation.

CoRR, 2024

BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities.

Mohammed Irfan Kurpath

CoRR, 2024

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks.

Muhammad Sohail Danish

Syed Roshaan Ali Shah

CoRR, 2024

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages.

Henok Biadglign Ademtew

Feno Heriniaina Rabevohitra

Mike Zhang

Mahardika Krisna Ihsani

Fadillah Adamsyah Maani

Amirpouya Ghasemaghaei

Johan S. Obando-Ceron

CoRR, 2024

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos.

CoRR, 2024

COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes.

CoRR, 2024

CAMEL-Bench: A Comprehensive Arabic LMM Benchmark.

Sara Ghaboura

Ahmed Heakl

Omkar Thawakar

Ali Husain Salem Abdulla Alharthi

CoRR, 2024

How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?

CoRR, 2024

Frontiers in Intelligent Colonoscopy.

CoRR, 2024

Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking.

Ayesha Ishaq

CoRR, 2024

CDChat: A Large Multimodal Model for Remote Sensing Change Description.

CoRR, 2024

Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region.

CoRR, 2024

Connecting Dreams with Visual Brainstorming Instruction.

CoRR, 2024

Underwater Object Detection Enhancement via Channel Stabilization.

CoRR, 2024

GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model.

CoRR, 2024

FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background.

CoRR, 2024

CPT: Consistent Proxy Tuning for Black-box Optimization.

CoRR, 2024

Open-Vocabulary Temporal Action Localization using Multimodal Guidance.

CoRR, 2024

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs.

CoRR, 2024

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding.

CoRR, 2024

Towards Evaluating the Robustness of Visual State Space Models.

CoRR, 2024

On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models.

CoRR, 2024

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation.

CoRR, 2024

Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging.

CoRR, 2024

Multi-modal Generation via Cross-Modal In-Context Learning.

CoRR, 2024

Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning.

CoRR, 2024

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs.

Muhammad Ferjad Naeem

CoRR, 2024

Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration.

CoRR, 2024

Efficient Video Object Segmentation via Modulated Cross-Attention Memory.

CoRR, 2024

VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding.

CoRR, 2024

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation.

CoRR, 2024

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT.

CoRR, 2024

PALO: A Polyglot Large Multimodal Model for 5B People.

CoRR, 2024

Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.

CoRR, 2024

Learnable weight initialization for volumetric medical image segmentation.

Shahina K. Kunhimon

Artif. Intell. Medicine, 2024

A New Perspective to Boost Performance Fairness For Medical Federated Learning.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

BAPLe: Backdoor Attacks on Medical Foundational Models Using Prompt Learning.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Language Guided Domain Generalized Medical Image Segmentation.

Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Long-Tailed 3D Semantic Segmentation with Adaptive Weight Constraint and Sampling.

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Enhanced Segmentation of Deformed Waste Objects in Cluttered Environments.

Omar Alsuwaidi

Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods, 2024

Bidirectional Reciprocative Information Communication for Few-Shot Semantic Segmentation.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Modulate Your Spectrum in Self-Supervised Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Sentence-level Prompts Benefit Composed Image Retrieval.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Diabetic Retinopathy Detection and Grading AI for Mobile and Hand-held Devices: A Readiness Survey.

Proceedings of the 11th International Conference on Future Internet of Things and Cloud, 2024

BiMediX: Bilingual Medical Mixture of Experts LLM.

Sara Pieri

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

CONDA: Condensed Deep Association Learning for Co-salient Object Detection.

Proceedings of the Computer Vision - ECCV 2024, 2024

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning.

Proceedings of the Computer Vision - ECCV 2024, 2024

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-distillation.

Proceedings of the Computer Vision - ECCV 2024, 2024

VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Composed Video Retrieval via Enriched Context and Discriminative Embeddings.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GLaMM: Pixel Grounding Large Multimodal Model.

Omkar Chakradhar Thawakar

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GeoChat: Grounded Large Vision-Language Model for Remote Sensing.

Kartik Kuckreja

Muhammad Sohail Danish

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models.

Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models.

Fahad Khan

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes.

Proceedings of the Computer Vision - ACCV 2024, 2024

S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Generative Multi-Label Zero-Shot Learning.

IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Guest Editorial Introduction to the Special Section on Transformer Models in Vision.

IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges.

Mach. Intell. Res., October, 2023

Transformers in medical imaging: A survey.

Medical Image Anal., August, 2023

Stylized Adversarial Defense.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Cascaded structure tensor for robust baggage threat detection.

Neural Comput. Appl., May, 2023

Transformers in Remote Sensing: A Survey.

Abdulaziz Amer Aleissaee

Remote. Sens., April, 2023

Unsupervised anomaly instance segmentation for baggage threat recognition.

J. Ambient Intell. Humaniz. Comput., March, 2023

Learning Enriched Features for Fast Image Restoration and Enhancement.

IEEE Trans. Pattern Anal. Mach. Intell., 2023

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering.

CoRR, 2023

How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation.

CoRR, 2023

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models.

Shehan Munasinghe

Rusiru Thushara

Mubarak Shah

CoRR, 2023

Enhancing Novel Object Detection via Cooperative Foundational Models.

CoRR, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.

Jameel Hassan

Hanan Gani

Noor Hussein

CoRR, 2023

Videoprompter: an ensemble of foundational models for zero-shot video understanding.

CoRR, 2023

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.

CoRR, 2023

Foundational Models Defining a New Era in Vision: A Survey and Outlook.

CoRR, 2023

PromptIR: Prompting for All-in-One Blind Image Restoration.

CoRR, 2023

XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.

Omkar Thawakar

CoRR, 2023

FlexPooling with Simple Auxiliary Classifiers in Deep Networks.

Omar Alsuwaidi

Proceedings of the 18th International Joint Conference on Computer Vision, 2023

BGD: Generalization Using Large Step Sizes to Attract Flat Minima.

Omar Alsuwaidi

Proceedings of the 18th International Joint Conference on Computer Vision, 2023

Hardware Resilience Properties of Text-Guided Image Classifiers.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.

Jameel Abdul Samadh

Hanan Gani

Noor Hussein

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PromptIR: Prompting for All-in-One Image Restoration.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cal-DETR: Calibrated Detection Transformer.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D Indoor Instance Segmentation in an Open-World.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Accelerated MRI Reconstruction via Dynamic Deformable Alignment Based Transformer.

Proceedings of the Machine Learning in Medical Imaging - 14th International Workshop, 2023

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation.

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Boosting Adversarial Transferability using Dynamic Cues.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

CLIP-Decoder: ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representations.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.

Syed Talal Wasim

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications.

Ming-Hsuan Yang

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Generative Multiplane Neural Radiance for 3D-Aware Image Generation.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-regulating Prompts: Foundational Model Adaptation without Forgetting.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Instance-adaptive Inference for Federated Learning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Arabic Mini-ClimateGPT: A Climate Change and Sustainability Tailored Arabic LLM.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fine-tuned CLIP Models are Efficient Video Learners.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection.

Muhammad Haris Khan

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

D³Former: Debiased Dual Distilled Transformer for Incremental Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MaPLe: Multi-modal Prompt Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Burstormer: Burst Image Restoration and Enhancement Transformer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Person Image Synthesis via Denoising Diffusion Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Landmark Discovery Using Consistency-Guided Bottleneck.

Mamona Awan

Muhammad Haris Khan

Sanoojan Baliah

Muhammad Ahmad Waseem

Arif Mahmood

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022

Transformers in Vision: A Survey.

ACM Comput. Surv., January, 2022

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items.

IEEE Trans. Syst. Man Cybern. Syst., 2022

Weakly Supervised Visual Saliency Prediction.

IEEE Trans. Image Process., 2022

Incremental Object Detection via Meta-Learning.

K. J. Joseph

Vineeth N. Balasubramanian

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Towards Partial Supervision for Generic Object Counting in Natural Scenes.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Tensor pooling-driven instance segmentation framework for baggage threat recognition.

Neural Comput. Appl., 2022

Learning discriminative representations for multi-label image recognition.

J. Vis. Commun. Image Represent., 2022

Visual Affordance and Function Understanding: A Survey.

Mohammed Hassanin

Murat Tahtali

ACM Comput. Surv., 2022

Guidance Through Surrogate: Towards a Generic Diagnostic Attack.

CoRR, 2022

CLIP model is an Efficient Continual Learner.

CoRR, 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.

CoRR, 2022

3D Vision with Transformers: A Survey.

CoRR, 2022

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging.

CoRR, 2022

An Investigation into Whitening Loss for Self-supervised Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Improving Adversarial Transferability of Vision Transformers.

Proceedings of the Tenth International Conference on Learning Representations, 2022

Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer.

Proceedings of the Computer Vision - ECCV 2022, 2022

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning.

Proceedings of the Computer Vision - ECCV 2022, 2022

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications.

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Class-Agnostic Object Detection with Multi-modal Transformer.

Vineeth N. Balasubramanian

Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Disentanglement with Decoupled Labels for Vision-Language Navigation.

Proceedings of the Computer Vision - ECCV 2022, 2022

DoodleFormer: Creative Sketch Drawing with Transformers.

Proceedings of the Computer Vision - ECCV 2022, 2022

Restormer: Efficient Transformer for High-Resolution Image Restoration.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Spatio-temporal Relation Modeling for Few-shot Action Recognition.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-supervised Video Transformer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adaptive Feature Consolidation Network for Burst Super-Resolution.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Energy-based Latent Aligner for Incremental Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OW-DETR: Open-world Detection Transformer.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Burst Image Restoration and Enhancement.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

NTIRE 2022 Burst Super-Resolution Challenge.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Robust normalizing flows using Bernstein-type polynomials.

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations.

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Self-distilled Vision Transformer for Domain Generalization.

Proceedings of the Computer Vision - ACCV 2022, 2022

2021

Understanding More About Human and Machine Attention in Deep Neural Networks.

IEEE Trans. Multim., 2021

Accuracy vs. complexity: A trade-off in visual question answering models.

Pattern Recognit., 2021

Deeply Supervised Discriminative Learning for Adversarial Defense.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

Historical document image binarization via style augmentation and atrous convolutions.

Hanif Rasyidi

Neural Comput. Appl., 2021

Learning digital camera pipeline for extreme low-light imaging.

Neurocomputing, 2021

A Deep Journey into Super-resolution: A Survey.

Saeed Anwar

ACM Comput. Surv., 2021

Multi-modal Transformers Excel at Class-agnostic Object Detection.

CoRR, 2021

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Intriguing Properties of Vision Transformers.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Conditional Generative Modeling via Learning the Latent Space.

Kanchana Nisal Ranasinghe

Stephen Gould

Proceedings of the 9th International Conference on Learning Representations, 2021

Orthogonal Projection Loss.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Generating Transferable Targeted Perturbations.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Discriminative Region-based Multi-Label Zero-Shot Learning.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Handwriting Transformers.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Stage Progressive Image Restoration.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Open World Object Detection.

K. J. Joseph

Vineeth N. Balasubramanian

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Meta-learning the Learning Trends Shared Across Tasks.

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Self-supervised Knowledge Distillation for Few-shot Learning.

Syed Muhammad Talha Zaidi

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Mode-Guided Feature Augmentation for Domain Generalization.

Muhammad Haris Khan

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Rich Semantics Improve Few-Shot Learning.

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Spatiotemporal Deformable Scene Graphs for Complex Activity Detection.

Fabio Cuzzolin

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging.

IEEE Trans. Multim., 2020

Image Super-Resolution as a Defense Against Adversarial Attacks.

IEEE Trans. Image Process., 2020

Meta-Transfer Learning Driven Tensor-Shot Detector for the Autonomous Localization and Recognition of Concealed Baggage Threats.

Sensors, 2020

Feature mask network for person re-identification.

Pattern Recognit. Lett., 2020

From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts.

Image Vis. Comput., 2020

Representation Learning on Unit Ball with 3D Roto-translational Equivariance.

Int. J. Comput. Vis., 2020

Zero-Shot Object Detection: Joint Recognition and Localization of Novel Concepts.

Int. J. Comput. Vis., 2020

How to train your conditional GAN: An approach using geometrically structured latent manifolds.

CoRR, 2020

Attention Guided Semantic Relationship Parsing for Visual Question Answering.

CoRR, 2020

Trainable Structure Tensors for Autonomous Baggage Threat Detection Under Extreme Occlusion.

CoRR, 2020

Cascaded Structure Tensor Framework for Robust Identification of Heavily Occluded Baggage Items from X-ray Scans.

CoRR, 2020

Incremental Object Detection via Meta-Learning.

K. J. Joseph

Vineeth Balasubramanian

Ling Shao

CoRR, 2020

Filling the Gaps in Atrous Convolution: Semantic Segmentation With a Better Context.

IEEE Access, 2020

A New Localization Objective for Accurate Fine-Grained Affordance Segmentation Under High-Scale Variations.

Mohammed Hassanin

Murat Tahtali

IEEE Access, 2020

Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Spectral-GANs for High-Resolution 3D Point-cloud Generation.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Learned and Hand-crafted Feature Fusion in Unit Ball for 3D Object Classification.

Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods, 2020

Question-Agnostic Attention for Visual Question Answering.

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Detecting Prohibited Items in X-Ray Images: a Contour Proposal Learning Approach.

Proceedings of the IEEE International Conference on Image Processing, 2020

Learning Enriched Features for Real Image Restoration and Enhancement.

Proceedings of the Computer Vision - ECCV 2020, 2020

AIM 2020 Challenge on Real Image Super-Resolution: Methods and Results.

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Fixing Localization Errors to Improve Image Classification.

Proceedings of the Computer Vision - ECCV 2020, 2020

CycleISP: Real Image Restoration via Improved Data Synthesis.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semi-Supervised Learning for Few-Shot Image-to-Image Translation.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

iTAML: An Incremental Task-Agnostic Meta-learning Approach.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Self-supervised Approach for Adversarial Robustness.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Geometry to the Rescue: 3D Instance Reconstruction from a Cluttered Scene.

Lin Li

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AnimalWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces.

Georgios Tzimiropoulos

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Any-Shot Object Detection.

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Synthesizing the Unseen for Zero-Shot Object Detection.

Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Fine-Grained Recognition: Accounting for Subtle Differences between Similar Classes.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Improved Visual-Semantic Alignment for Zero-Shot Object Detection.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Feature Affinity-Based Pseudo Labeling for Semi-Supervised Person Re-Identification.

IEEE Trans. Multim., 2019

Regularization of deep neural networks with spectral dropout.

Neural Networks, 2019

Deep CMST Framework for the Autonomous Recognition of Heavily Occluded and Cluttered Baggage Items from Multivendor Security Radiographs.

CoRR, 2019

Human vs Machine Attention in Neural Networks: A Comparative Study.

CoRR, 2019

Towards better Validity: Dispersion based Clustering for Unsupervised Person Re-identification.

CoRR, 2019

Random Path Selection for Incremental Learning.

CoRR, 2019

Max-margin Class Imbalanced Learning with Gaussian Affinity.

CoRR, 2019

Striking the Right Balance with Uncertainty.

CoRR, 2019

Volumetric Convolution: Automatic Representation Learning in Unit Ball.

CoRR, 2019

Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey.

IEEE Access, 2019

Local Gradients Smoothing: Defense Against Localized Adversarial Attacks.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Random Path Selection for Continual Learning.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Cross-Domain Transferability of Adversarial Perturbations.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Sentinel-1, WW3 and Buoy Spectral Comparisons in the Southern Ocean.

Emilio Echevarria

Mark Hemer

Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

Silhouette-Assisted 3D Object Instance Reconstruction from a Cluttered Scene.

Lin Li

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Transductive Learning for Zero-Shot Object Detection.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Gaussian Affinity for Max-Margin Class Imbalanced Learning.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Ground-to-Aerial Image Geo-Localization With a Hard Exemplar Reweighting Triplet Loss.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Historical Document Text Binarization using Atrous Convolution and Multi-Scale Feature Decoder.

Hanif Rasyidi

Pablo Navarrete Michelini

Proceedings of the 2019 Digital Image Computing: Techniques and Applications, 2019

iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

NTIRE 2019 Challenge on Video Deblurring: Methods and Results.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Striking the Right Balance With Uncertainty.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Unsupervised Primitive Discovery for Improved 3D Generative Modeling.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

NTIRE 2019 Challenge on Image Enhancement: Methods and Results.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

NTIRE 2019 Challenge on Real Image Denoising: Methods and Results.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Dispersion based Clustering for Unsupervised Person Re-identification.

Guodong Ding

Zhenmin Tang

Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018

A Guide to Convolutional Neural Networks for Computer Vision

Synthesis Lectures on Computer Vision, Morgan & Claypool Publishers, ISBN: 978-3-031-01821-3, 2018

Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data.

IEEE Trans. Neural Networks Learn. Syst., 2018

A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning.

Salman Hameed Khan

IEEE Trans. Image Process., 2018

Distorting Neural Representations to Generate Highly Transferable Adversarial Examples.

CoRR, 2018

Polarity Loss for Zero-shot Object Detection.

CoRR, 2018

Indoor Scene Understanding in 2.5/3D: A Survey.

Salman Hameed Khan

CoRR, 2018

Adversarial Training of Variational Auto-Encoders for High Fidelity Image Generation.

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Unsupervised Learning of Endoscopy Video Frames' Correspondences from Global and Local Transformation.

Proceedings of the OR 2.0 Context-Aware Operating Theaters, Computer Assisted Robotic Endoscopy, Clinical Image-Based Procedures, and Skin Image Analysis, 2018

Center Based Pseudo-Labeling For Semi-Supervised Person Re-Identification.

Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

A Context-Aware Capsule Network for Multi-label Classification.

C. D. Athuraliya

Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Reciprocal Attention Fusion for Visual Question Answering.

Proceedings of the British Machine Vision Conference 2018, 2018

Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts.

Proceedings of the Computer Vision - ACCV 2018, 2018

Deep Multiple Instance Learning for Zero-Shot Image Tagging.

Proceedings of the Computer Vision - ACCV 2018, 2018

2017

Forest Change Detection in Incomplete Satellite Images With Deep Neural Networks.

IEEE Trans. Geosci. Remote. Sens., 2017

Empowering Simple Binary Classifiers for Image Set Based Face Recognition.

Mohammed Bennamoun

Int. J. Comput. Vis., 2017

Let Features Decide for Themselves: Feature Mask Network for Person Re-identification.

CoRR, 2017

Learning deep structured network for weakly supervised change detection.

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Scene Categorization with Spectral Features.

Proceedings of the IEEE International Conference on Computer Vision, 2017

Joint Registration and Representation Learning for Unconstrained Face Identification.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

A Discriminative Representation of Convolutional Features for Indoor Scene Recognition.

IEEE Trans. Image Process., 2016

A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification.

IEEE Trans. Image Process., 2016

Automatic Shadow Detection and Removal from a Single Image.

IEEE Trans. Pattern Anal. Mach. Intell., 2016

Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images.

Int. J. Comput. Vis., 2016

Weakly Supervised Change Detection in a Pair of Images.

CoRR, 2016

2015

Secure biometric template generation for multi-factor authentication.

Pattern Recognit., 2015

Cost Sensitive Learning of Deep Feature Representations from Imbalanced Data.

CoRR, 2015

Contractive Rectifier Networks for Nonlinear Maximum Margin Classification.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Multi-Factor Authentication on Cloud.

Muhammad Ali Akbar

Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Separating objects and clutter in indoor scenes.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Optimized Reconfigurable Autopilot Design for an Aerospace CPS.

Arsalan H. Khan

Zeashan H. Khan

Proceedings of the Computational Intelligence for Decision Support in Cyber-Physical Systems, 2014

Geometry Driven Semantic Labeling of Indoor Scenes.

Proceedings of the Computer Vision - ECCV 2014, 2014

Automatic Feature Learning for Robust Shadow Detection.

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013

Trial-by-Trial Adaptation of Movements during Mental Practice under Force Field.

Muhammad Nabeel Anwar

Salman Hameed Khan

Comput. Math. Methods Medicine, 2013

Can Signature Biometrics Address Both Identification and Verification Problems?
