Qi Zhao

Orcid: 0000-0003-3054-8934

Affiliations:
  • University of Minnesota, Department of Computer Science & Engineering, Minneapolis, MN, USA


According to our database1, Qi Zhao authored at least 128 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Deep Learning to Interpret Autism Spectrum Disorder Behind the Camera.
IEEE Trans. Cogn. Dev. Syst., October, 2024

Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review.
Remote. Sens., July, 2024

A Self-Distillation Embedded Supervised Affinity Attention Model for Few-Shot Segmentation.
IEEE Trans. Cogn. Dev. Syst., February, 2024

SWIN-TOD: Smooth Wasserstein Distance and Instance-Level Neighboring Enhancement for Remote Sensing Tiny Object Detection.
IEEE Trans. Geosci. Remote. Sens., 2024

Every Problem, Every Step, All in Focus: Learning to Solve Vision-Language Problems With Integrated Attention.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

OV-VG: A benchmark for open-vocabulary visual grounding.
Neurocomputing, 2024

OVGNet: A Unified Visual-Linguistic Framework for Open-Vocabulary Robotic Grasping.
CoRR, 2024

BACON: Bayesian Optimal Condensation Framework for Dataset Distillation.
CoRR, 2024

Self-training guided disentangled adaptation for cross-domain remote sensing image semantic segmentation.
Int. J. Appl. Earth Obs. Geoinformation, 2024

Self-Training and Curriculum Learning Guided Dynamic Refined Network for Remote Sensing Class-Incremental Semantic Segmentation.
Proceedings of the IGARSS 2024, 2024

GRACE: Graph-Based Contextual Debiasing for Fair Visual Question Answering.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning.
Proceedings of the Computer Vision - ECCV 2024, 2024

GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths.
Proceedings of the Computer Vision - ECCV 2024, 2024

Beyond Average: Individualized Visual Scanpath Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Learn by Oneself: Exploiting Weight-Sharing Potential in Knowledge Distillation Guided Ensemble Network.
IEEE Trans. Circuits Syst. Video Technol., November, 2023

MGML: Multigranularity Multilevel Feature Ensemble Network for Remote Sensing Scene Classification.
IEEE Trans. Neural Networks Learn. Syst., May, 2023

TPH-YOLOv5++: Boosting Object Detection on Drone-Captured Scenarios with Cross-Layer Asymmetric Transformer.
Remote. Sens., March, 2023

Feature reconstruction and metric based network for few-shot object detection.
Comput. Vis. Image Underst., January, 2023

Learning to Minimize the Remainder in Supervised Learning.
IEEE Trans. Multim., 2023

Emotional Attention: From Eye Tracking to Computational Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review.
CoRR, 2023

Self-Training Guided Disentangled Adaptation for Cross-Domain Remote Sensing Image Semantic Segmentation.
CoRR, 2023

What Do Deep Saliency Models Learn about Visual Attention?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Fully Integrated W-Band Four-Channel Silicon-Based Radiometer Array in 65-nm CMOS.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

Enhancing Spatial Consistency and Class-Level Diversity for Segmenting Fine-Grained Objects.
Proceedings of the Neural Information Processing - 30th International Conference, 2023

Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Decision Boundary Optimization for Few-shot Class-Incremental Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Toward Multi-Granularity Decision-Making: Explicit Visual Reasoning with Hierarchical Knowledge.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Embedded Self-Distillation in Compact Multibranch Ensemble Network for Remote Sensing Scene Classification.
IEEE Trans. Geosci. Remote. Sens., 2022

Semantic Segmentation With Attention Mechanism for Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Artificial Intelligence Enables Real-Time and Intuitive Control of Prostheses via Nerve Interface.
IEEE Trans. Biomed. Eng., 2022

A feature consistency driven attention erasing network for fine-grained image retrieval.
Pattern Recognit., 2022

Attention in Reasoning: Dataset, Analysis, and Modeling.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Look in Different Views: Multi-Scheme Regression Guided Cell Instance Segmentation.
CoRR, 2022

A Multi-Modality Ovarian Tumor Ultrasound Image Dataset for Unsupervised Cross-Domain Semantic Segmentation.
CoRR, 2022

Learning to Predict Gradients for Semi-Supervised Continual Learning.
CoRR, 2022

ML-FDA: Meta-Learning via Feature Distribution Alignment for Few-Shot Learning.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Using Guided Self-Attention with Local Information for Polyp Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

A Similarity Distillation Guided Feature Refinement Network for Few-Shot Semantic Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

New Datasets and Models for Contextual Reasoning in Visual Dialog.
Proceedings of the Computer Vision - ECCV 2022, 2022

Query and Attention Augmentation for Knowledge-Based Explainable Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

VisualHow: Multimodal Problem Solving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

REX: Reasoning-aware and Grounded Explanation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Direction Concentration Learning: Enhancing Congruency in Machine Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Attention-based Feature Decomposition-Reconstruction Network for Scene Text Detection.
CoRR, 2021

A Self-Distillation Embedded Supervised Affinity Attention Model for Few-Shot Segmentation.
CoRR, 2021

Embedded Self-Distillation in Compact Multi-Branch Ensemble Network for Remote Sensing Scene Classification.
CoRR, 2021

A Portable, Self-Contained Neuroprosthetic Hand with Deep Learning-Based Finger Control.
CoRR, 2021

Saliency Prediction with External Knowledge.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Self-Distillation for Few-Shot Image Captioning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Learning to Predict Trustworthiness with Steep Slope Loss.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Leveraging Human Attention in Novel Object Captioning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021


Explicit Knowledge Incorporation for Visual Reasoning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Predicting Human Scanpaths in Visual Question Answering.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Make Baseline Model Stronger: Embedded Knowledge Distillation in Weight-Sharing Based Ensemble Network.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Attention to Action: Leveraging Attention for Object Navigation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
G-Softmax: Improving Intraclass Compactness and Interclass Separability of Features.
IEEE Trans. Neural Networks Learn. Syst., 2020

Interact as You Intend: Intention-Driven Human-Object Interaction Detection.
IEEE Trans. Multim., 2020

Video Storytelling: Textual Summaries for Events.
IEEE Trans. Multim., 2020

A Deeper Look at Human Visual Perception of Images.
SN Comput. Sci., 2020

A structure-guided approach to the prediction of natural image saliency.
Neurocomputing, 2020

Visual Social Relationship Recognition.
Int. J. Comput. Vis., 2020

MM-FSOD: Meta and metric integrated few-shot object detection.
CoRR, 2020

MGML: Multi-Granularity Multi-Level Feature Ensemble Network for Remote Sensing Scene Classification.
CoRR, 2020

Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection.
CoRR, 2020

Long-term real time object tracking based on multi-scale local correlation filtering and global re-detection.
Computing, 2020

Knowledge distilling based model compression and feature learning in fault diagnosis.
Appl. Soft Comput., 2020

Rapidly Learning Bayesian Networks for Complex System Diagnosis: A Reinforcement Learning Directed Greedy Search Approach.
IEEE Access, 2020

GradMix: Multi-source Transfer across Domains and Tasks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Predicting Core Characteristics of ASD Through Facial Emotion Recognition and Eye Tracking in Youth.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

n-Reference Transfer Learning for Saliency Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

AiR: Attention with Reasoning Capability.
Proceedings of the Computer Vision - ECCV 2020, 2020

Fantastic Answers and Where to Find Them: Immersive Question-Directed Visual Attention.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
CapVis: Toward Better Understanding of Visual-Verbal Saliency Consistency.
ACM Trans. Intell. Syst. Technol., 2019

Anticipating Where People will Look Using Adversarial Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Shallowing Deep Networks: Layer-Wise Pruning Based on Feature Representations.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Advancing System Performance with Redundancy: From Biological to Artificial Designs.
Neural Comput., 2019

Interpretable Relative Squeezing bottleneck design for compact convolutional neural networks model.
Image Vis. Comput., 2019

G-softmax: Improving Intra-class Compactness and Inter-class Separability of Features.
CoRR, 2019

RSNet: A Compact Relative Squeezing Net for Image Recognition.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019

Diagnosing Strong-fault Models with a Two-step A* Search Method.
Proceedings of the 2019 IEEE International Conference on Prognostics and Health Management, 2019

Attention-Based Autism Spectrum Disorder Screening With Privileged Modality.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Classifying Individuals with ASD Through Facial Emotion Recognition and Eye-Tracking.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Learning to Detect Human-Object Interactions With Knowledge.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Visual Attention in Multi-Label Image Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Learning to Learn From Noisy Labeled Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Block-Sparse Modeling for Compressed Sensing of Neural Action Potentials and Local Field Potentials.
Proceedings of the 53rd Asilomar Conference on Signals, Systems, and Computers, 2019

2018
Multiactivation Pooling Method in Convolutional Neural Networks for Image Recognition.
Wirel. Commun. Mob. Comput., 2018

Diagnosing a Strong-Fault Model by Conflict and Consistency.
Sensors, 2018

A CNN-SIFT Hybrid Pedestrian Navigation Method Based on First-Person Vision.
Remote. Sens., 2018

Image Visual Realism: From Human Perception to Machine Computation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

A novel prediction method based on the support vector regression for the remaining useful life of lithium-ion batteries.
Microelectron. Reliab., 2018

AlphaMEX: A smarter global pooling method for convolutional neural networks.
Neurocomputing, 2018

What am I searching for?
CoRR, 2018

Finding any Waldo: zero-shot invariant and efficient visual search.
CoRR, 2018

Video Storytelling.
CoRR, 2018

Unsupervised Learning of View-invariant Action Representations.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Egocentric Spatial Memory.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Boosted Attention: Leveraging Human Attention for Image Captioning.
Proceedings of the Computer Vision - ECCV 2018, 2018

Emotional Attention: A Study of Image Sentiment and Visual Attention.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Determining child orientation from overhead video: A multiple kernel learning approach.
Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics, 2017

Visual-verbal consistency of image saliency.
Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics, 2017

Saliency prediction with scene structural guidance.
Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics, 2017

Attention Transfer from Web Images for Video Recognition.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

The Role of Visual Attention in Sentiment Prediction.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Prognostics of remaining useful life for lithium-ion batteries based on a feature vector selection and relevance vector machine approach.
Proceedings of the 2017 IEEE International Conference on Prognostics and Health Management, 2017

Foveated neural network: Gaze prediction on egocentric videos.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Multi-layer linear model for top-down modulation of visual attention in natural egocentric vision.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Dual-Glance Model for Deciphering Social Relationships.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Visual Attention to Identify People with Autism Spectrum Disorder.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Learning to Predict Sequences of Human Visual Fixations.
IEEE Trans. Neural Networks Learn. Syst., 2016

A Paradigm for Building Generalized Models of Human Image Perception through Data Fusion.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
State Tracking and Fault Diagnosis for Dynamic Systems Using Labeled Uncertainty Graph.
Sensors, 2015

Multi-Camera Saliency.
IEEE Trans. Pattern Anal. Mach. Intell., 2015

Research on Ionospheric Scintillation with Beidou Satellite Signal.
IEICE Trans. Commun., 2015

Foveation-based Mechanisms Alleviate Adversarial Examples.
CoRR, 2015

SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Label Consistent Quadratic Surrogate model for visual saliency prediction.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

SALICON: Saliency in Context.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Saliency Prediction with Active Semantic Segmentation.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Saliency in Crowd.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
Leveraging Human Fixations in Sparse Coding: Learning a Discriminative Dictionary for Saliency Prediction.
Proceedings of the IEEE International Conference on Systems, 2013

2009
Noise Characterization, Modeling, and Reduction for In Vivo Neural Recording.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009


  Loading...