Salman H. Khan

ORCID: 0000-0002-9502-1749

Affiliations:
  • Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, UAE
  • Australian National University, Canberra, Australia
  • Data61, Commonwealth Scientific and Industrial Research Organization (CSIRO), Canberra, Australia
  • University of Western Australia, Crawley, Australia (PhD 2016)

According to our database, Salman H. Khan authored at least 268 papers between 2013 and 2024.

Bibliography

2024
Understanding Whitening Loss in Self-Supervised Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

UNETR++: Delving Into Efficient and Accurate 3D Medical Image Segmentation.
IEEE Trans. Medical Imaging, September, 2024

Guidance Through Surrogate: Toward a Generic Diagnostic Attack.
IEEE Trans. Neural Networks Learn. Syst., February, 2024

Robust Perception and Precise Segmentation for Scribble-Supervised RGB-D Saliency Detection.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

CT-VOS: Cutout prediction and tagging for self-supervised video object segmentation.
Comput. Vis. Image Underst., January, 2024

Remote Sensing Change Detection With Transformers Trained From Scratch.
IEEE Trans. Geosci. Remote. Sens., 2024

ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes.
CoRR, 2024

CAMEL-Bench: A Comprehensive Arabic LMM Benchmark.
CoRR, 2024

How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?
CoRR, 2024

Frontiers in Intelligent Colonoscopy.
CoRR, 2024

Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking.
CoRR, 2024

AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment.
CoRR, 2024

CDChat: A Large Multimodal Model for Remote Sensing Change Description.
CoRR, 2024

Efficient Localized Adaptation of Neural Weather Forecasting: A Case Study in the MENA Region.
CoRR, 2024

Connecting Dreams with Visual Brainstorming Instruction.
CoRR, 2024

Underwater Object Detection Enhancement via Channel Stabilization.
CoRR, 2024

GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model.
CoRR, 2024

FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background.
CoRR, 2024

CPT: Consistent Proxy Tuning for Black-box Optimization.
CoRR, 2024

Open-Vocabulary Temporal Action Localization using Multimodal Guidance.
CoRR, 2024

VANE-Bench: Video Anomaly Evaluation Benchmark for Conversational LMMs.
CoRR, 2024

VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding.
CoRR, 2024

Towards Evaluating the Robustness of Visual State Space Models.
CoRR, 2024

On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models.
CoRR, 2024

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation.
CoRR, 2024

Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging.
CoRR, 2024

Multi-modal Generation via Cross-Modal In-Context Learning.
CoRR, 2024

Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning.
CoRR, 2024

How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs.
CoRR, 2024

Dynamic Pre-training: Towards Efficient and Scalable All-in-One Image Restoration.
CoRR, 2024

Efficient Video Object Segmentation via Modulated Cross-Attention Memory.
CoRR, 2024

VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding.
CoRR, 2024

AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation.
CoRR, 2024

ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes.
CoRR, 2024

MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT.
CoRR, 2024

PALO: A Polyglot Large Multimodal Model for 5B People.
CoRR, 2024

Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.
CoRR, 2024

Learnable weight initialization for volumetric medical image segmentation.
Artif. Intell. Medicine, 2024

A New Perspective to Boost Performance Fairness For Medical Federated Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Hierarchical Text-to-Vision Self Supervised Alignment for Improved Histopathology Representation Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

BAPLe: Backdoor Attacks on Medical Foundational Models Using Prompt Learning.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Language Guided Domain Generalized Medical Image Segmentation.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Long-Tailed 3D Semantic Segmentation with Adaptive Weight Constraint and Sampling.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Enhanced Segmentation of Deformed Waste Objects in Cluttered Environments.
Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods, 2024

Bidirectional Reciprocative Information Communication for Few-Shot Semantic Segmentation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Modulate Your Spectrum in Self-Supervised Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Sentence-level Prompts Benefit Composed Image Retrieval.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Diabetic Retinopathy Detection and Grading AI for Mobile and Hand-held Devices: A Readiness Survey.
Proceedings of the 11th International Conference on Future Internet of Things and Cloud, 2024

BiMediX: Bilingual Medical Mixture of Experts LLM.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

CONDA: Condensed Deep Association Learning for Co-salient Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Continual Learning and Unknown Object Discovery in 3D Scenes via Self-distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

VideoGrounding-DINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Composed Video Retrieval via Enriched Context and Discriminative Embeddings.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GLaMM: Pixel Grounding Large Multimodal Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GeoChat: Grounded Large Vision-Language Model for Remote Sensing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Cross-Modal Self-Training: Aligning Images and Pointclouds to learn Classification without Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Progressive Semantic-Guided Vision Transformer for Zero-Shot Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models.
Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

S3A: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Generative Multi-Label Zero-Shot Learning.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Guest Editorial Introduction to the Special Section on Transformer Models in Vision.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges.
Mach. Intell. Res., October, 2023

Transformers in medical imaging: A survey.
Medical Image Anal., August, 2023

Stylized Adversarial Defense.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Cascaded structure tensor for robust baggage threat detection.
Neural Comput. Appl., May, 2023

Transformers in Remote Sensing: A Survey.
Remote. Sens., April, 2023

Unsupervised anomaly instance segmentation for baggage threat recognition.
J. Ambient Intell. Humaniz. Comput., March, 2023

Learning Enriched Features for Fast Image Restoration and Enhancement.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering.
CoRR, 2023

How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation.
CoRR, 2023

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models.
CoRR, 2023

Enhancing Novel Object Detection via Cooperative Foundational Models.
CoRR, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.
CoRR, 2023

Videoprompter: an ensemble of foundational models for zero-shot video understanding.
CoRR, 2023

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment.
CoRR, 2023

Foundational Models Defining a New Era in Vision: A Survey and Outlook.
CoRR, 2023

PromptIR: Prompting for All-in-One Blind Image Restoration.
CoRR, 2023

XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
CoRR, 2023

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models.
CoRR, 2023

Video Instance Segmentation in an Open-World.
CoRR, 2023

FlexPooling with Simple Auxiliary Classifiers in Deep Networks.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

BGD: Generalization Using Large Step Sizes to Attract Flat Minima.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

Hardware Resilience Properties of Text-Guided Image Classifiers.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PromptIR: Prompting for All-in-One Image Restoration.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Cal-DETR: Calibrated Detection Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D Indoor Instance Segmentation in an Open-World.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Accelerated MRI Reconstruction via Dynamic Deformable Alignment Based Transformer.
Proceedings of the Machine Learning in Medical Imaging - 14th International Workshop, 2023

Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

Boosting Adversarial Transferability using Dynamic Cues.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CLIP-Decoder: ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Generative Multiplane Neural Radiance for 3D-Aware Image Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Self-regulating Prompts: Foundational Model Adaptation without Forgetting.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Towards Instance-adaptive Inference for Federated Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Arabic Mini-ClimateGPT: A Climate Change and Sustainability Tailored Arabic LLM.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fine-tuned CLIP Models are Efficient Video Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

D³Former: Debiased Dual Distilled Transformer for Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MaPLe: Multi-modal Prompt Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Burstormer: Burst Image Restoration and Enhancement Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Person Image Synthesis via Denoising Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Landmark Discovery Using Consistency-Guided Bottleneck.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Transformers in Vision: A Survey.
ACM Comput. Surv., January, 2022

A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items.
IEEE Trans. Syst. Man Cybern. Syst., 2022

Weakly Supervised Visual Saliency Prediction.
IEEE Trans. Image Process., 2022

Incremental Object Detection via Meta-Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Towards Partial Supervision for Generic Object Counting in Natural Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Tensor pooling-driven instance segmentation framework for baggage threat recognition.
Neural Comput. Appl., 2022

Learning discriminative representations for multi-label image recognition.
J. Vis. Commun. Image Represent., 2022

Visual Affordance and Function Understanding: A Survey.
ACM Comput. Surv., 2022

Guidance Through Surrogate: Towards a Generic Diagnostic Attack.
CoRR, 2022

CLIP model is an Efficient Continual Learner.
CoRR, 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.
CoRR, 2022

3D Vision with Transformers: A Survey.
CoRR, 2022

Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging.
CoRR, 2022

An Investigation into Whitening Loss for Self-supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On Improving Adversarial Transferability of Vision Transformers.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Class-Agnostic Object Detection with Multi-modal Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Disentanglement with Decoupled Labels for Vision-Language Navigation.
Proceedings of the Computer Vision - ECCV 2022, 2022

DoodleFormer: Creative Sketch Drawing with Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

Restormer: Efficient Transformer for High-Resolution Image Restoration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Spatio-temporal Relation Modeling for Few-shot Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-supervised Video Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Adaptive Feature Consolidation Network for Burst Super-Resolution.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Energy-based Latent Aligner for Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OW-DETR: Open-world Detection Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Burst Image Restoration and Enhancement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Robust normalizing flows using Bernstein-type polynomials.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

AVisT: A Benchmark for Visual Object Tracking in Adverse Visibility.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Self-distilled Vision Transformer for Domain Generalization.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
Understanding More About Human and Machine Attention in Deep Neural Networks.
IEEE Trans. Multim., 2021

Accuracy vs. complexity: A trade-off in visual question answering models.
Pattern Recognit., 2021

Deeply Supervised Discriminative Learning for Adversarial Defense.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Historical document image binarization via style augmentation and atrous convolutions.
Neural Comput. Appl., 2021

Learning digital camera pipeline for extreme low-light imaging.
Neurocomputing, 2021

A Deep Journey into Super-resolution: A Survey.
ACM Comput. Surv., 2021

Multi-modal Transformers Excel at Class-agnostic Object Detection.
CoRR, 2021

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Intriguing Properties of Vision Transformers.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Conditional Generative Modeling via Learning the Latent Space.
Proceedings of the 9th International Conference on Learning Representations, 2021

Orthogonal Projection Loss.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

On Generating Transferable Targeted Perturbations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Discriminative Region-based Multi-Label Zero-Shot Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Handwriting Transformers.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Multi-Stage Progressive Image Restoration.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Towards Open World Object Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Meta-learning the Learning Trends Shared Across Tasks.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Self-supervised Knowledge Distillation for Few-shot Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Mode-Guided Feature Augmentation for Domain Generalization.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Rich Semantics Improve Few-Shot Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Spatiotemporal Deformable Scene Graphs for Complex Activity Detection.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging.
IEEE Trans. Multim., 2020

Image Super-Resolution as a Defense Against Adversarial Attacks.
IEEE Trans. Image Process., 2020

Meta-Transfer Learning Driven Tensor-Shot Detector for the Autonomous Localization and Recognition of Concealed Baggage Threats.
Sensors, 2020

Feature mask network for person re-identification.
Pattern Recognit. Lett., 2020

From known to the unknown: Transferring knowledge to answer questions about novel visual and semantic concepts.
Image Vis. Comput., 2020

Representation Learning on Unit Ball with 3D Roto-translational Equivariance.
Int. J. Comput. Vis., 2020

Zero-Shot Object Detection: Joint Recognition and Localization of Novel Concepts.
Int. J. Comput. Vis., 2020

How to train your conditional GAN: An approach using geometrically structured latent manifolds.
CoRR, 2020

Attention Guided Semantic Relationship Parsing for Visual Question Answering.
CoRR, 2020

Trainable Structure Tensors for Autonomous Baggage Threat Detection Under Extreme Occlusion.
CoRR, 2020

Cascaded Structure Tensor Framework for Robust Identification of Heavily Occluded Baggage Items from X-ray Scans.
CoRR, 2020

Incremental Object Detection via Meta-Learning.
CoRR, 2020

Filling the Gaps in Atrous Convolution: Semantic Segmentation With a Better Context.
IEEE Access, 2020

A New Localization Objective for Accurate Fine-Grained Affordance Segmentation Under High-Scale Variations.
IEEE Access, 2020

Blended Convolution and Synthesis for Efficient Discrimination of 3D Shapes.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Spectral-GANs for High-Resolution 3D Point-cloud Generation.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Learned and Hand-crafted Feature Fusion in Unit Ball for 3D Object Classification.
Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods, 2020

Question-Agnostic Attention for Visual Question Answering.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Detecting Prohibited Items in X-Ray Images: a Contour Proposal Learning Approach.
Proceedings of the IEEE International Conference on Image Processing, 2020

Learning Enriched Features for Real Image Restoration and Enhancement.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fixing Localization Errors to Improve Image Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

CycleISP: Real Image Restoration via Improved Data Synthesis.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Semi-Supervised Learning for Few-Shot Image-to-Image Translation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

iTAML: An Incremental Task-Agnostic Meta-learning Approach.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Self-supervised Approach for Adversarial Robustness.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Geometry to the Rescue: 3D Instance Reconstruction from a Cluttered Scene.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

AnimalWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Any-Shot Object Detection.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Synthesizing the Unseen for Zero-Shot Object Detection.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Fine-Grained Recognition: Accounting for Subtle Differences between Similar Classes.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Improved Visual-Semantic Alignment for Zero-Shot Object Detection.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Feature Affinity-Based Pseudo Labeling for Semi-Supervised Person Re-Identification.
IEEE Trans. Multim., 2019

Regularization of deep neural networks with spectral dropout.
Neural Networks, 2019

Deep CMST Framework for the Autonomous Recognition of Heavily Occluded and Cluttered Baggage Items from Multivendor Security Radiographs.
CoRR, 2019

Human vs Machine Attention in Neural Networks: A Comparative Study.
CoRR, 2019

Towards better Validity: Dispersion based Clustering for Unsupervised Person Re-identification.
CoRR, 2019

Random Path Selection for Incremental Learning.
CoRR, 2019

Max-margin Class Imbalanced Learning with Gaussian Affinity.
CoRR, 2019

Striking the Right Balance with Uncertainty.
CoRR, 2019

Volumetric Convolution: Automatic Representation Learning in Unit Ball.
CoRR, 2019

Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey.
IEEE Access, 2019

Local Gradients Smoothing: Defense Against Localized Adversarial Attacks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Random Path Selection for Continual Learning.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Cross-Domain Transferability of Adversarial Perturbations.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Sentinel-1, WW3 and Buoy Spectral Comparisons in the Southern Ocean.
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019

Silhouette-Assisted 3D Object Instance Reconstruction from a Cluttered Scene.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Transductive Learning for Zero-Shot Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Gaussian Affinity for Max-Margin Class Imbalanced Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Ground-to-Aerial Image Geo-Localization With a Hard Exemplar Reweighting Triplet Loss.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Historical Document Text Binarization using Atrous Convolution and Multi-Scale Feature Decoder.
Proceedings of the 2019 Digital Image Computing: Techniques and Applications, 2019

iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Striking the Right Balance With Uncertainty.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Unsupervised Primitive Discovery for Improved 3D Generative Modeling.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Dispersion based Clustering for Unsupervised Person Re-identification.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
A Guide to Convolutional Neural Networks for Computer Vision.
Synthesis Lectures on Computer Vision, Morgan & Claypool Publishers, ISBN: 978-3-031-01821-3, 2018

Cost-Sensitive Learning of Deep Feature Representations From Imbalanced Data.
IEEE Trans. Neural Networks Learn. Syst., 2018

A Unified Approach for Conventional Zero-Shot, Generalized Zero-Shot, and Few-Shot Learning.
IEEE Trans. Image Process., 2018

Distorting Neural Representations to Generate Highly Transferable Adversarial Examples.
CoRR, 2018

Polarity Loss for Zero-shot Object Detection.
CoRR, 2018

Indoor Scene Understanding in 2.5/3D: A Survey.
CoRR, 2018

Adversarial Training of Variational Auto-Encoders for High Fidelity Image Generation.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Unsupervised Learning of Endoscopy Video Frames' Correspondences from Global and Local Transformation.
Proceedings of the OR 2.0 Context-Aware Operating Theaters, Computer Assisted Robotic Endoscopy, Clinical Image-Based Procedures, and Skin Image Analysis, 2018

Center Based Pseudo-Labeling For Semi-Supervised Person Re-Identification.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

A Context-Aware Capsule Network for Multi-label Classification.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Reciprocal Attention Fusion for Visual Question Answering.
Proceedings of the British Machine Vision Conference 2018, 2018

Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts.
Proceedings of the Computer Vision - ACCV 2018, 2018

Deep Multiple Instance Learning for Zero-Shot Image Tagging.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Forest Change Detection in Incomplete Satellite Images With Deep Neural Networks.
IEEE Trans. Geosci. Remote. Sens., 2017

Empowering Simple Binary Classifiers for Image Set Based Face Recognition.
Int. J. Comput. Vis., 2017

Let Features Decide for Themselves: Feature Mask Network for Person Re-identification.
CoRR, 2017

Learning deep structured network for weakly supervised change detection.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Scene Categorization with Spectral Features.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Joint Registration and Representation Learning for Unconstrained Face Identification.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
A Discriminative Representation of Convolutional Features for Indoor Scene Recognition.
IEEE Trans. Image Process., 2016

A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification.
IEEE Trans. Image Process., 2016

Automatic Shadow Detection and Removal from a Single Image.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images.
Int. J. Comput. Vis., 2016

Weakly Supervised Change Detection in a Pair of Images.
CoRR, 2016

2015
Secure biometric template generation for multi-factor authentication.
Pattern Recognit., 2015

Cost Sensitive Learning of Deep Feature Representations from Imbalanced Data.
CoRR, 2015

Contractive Rectifier Networks for Nonlinear Maximum Margin Classification.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Multi-Factor Authentication on Cloud.
Proceedings of the 2015 International Conference on Digital Image Computing: Techniques and Applications, 2015

Separating objects and clutter in indoor scenes.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Optimized Reconfigurable Autopilot Design for an Aerospace CPS.
Proceedings of the Computational Intelligence for Decision Support in Cyber-Physical Systems, 2014

Geometry Driven Semantic Labeling of Indoor Scenes.
Proceedings of the Computer Vision - ECCV 2014, 2014

Automatic Feature Learning for Robust Shadow Detection.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Trial-by-Trial Adaptation of Movements during Mental Practice under Force Field.
Comput. Math. Methods Medicine, 2013

Can Signature Biometrics Address Both Identification and Verification Problems?
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

