Jungong Han

Orcid: 0000-0003-4361-956X

According to our database1, Jungong Han authored at least 360 papers between 2002 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Adversarial diffusion for few-shot scene adaptive video anomaly detection.
Neurocomputing, 2025

2024
Manipulating Identical Filter Redundancy for Efficient Pruning on Deep and Complicated CNN.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

AMNet: Learning to Align Multi-Modality for RGB-T Tracking.
IEEE Trans. Circuits Syst. Video Technol., August, 2024

Virtual Category Learning: A Semi-Supervised Learning Method for Dense Prediction With Extremely Limited Labels.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2024

Mitigating Modality Discrepancies for RGB-T Semantic Segmentation.
IEEE Trans. Neural Networks Learn. Syst., July, 2024

TCGNet: Type-Correlation Guidance for Salient Object Detection.
IEEE Trans. Intell. Transp. Syst., July, 2024

Pedestrian Attribute Recognition via Spatio-temporal Relationship Learning for Visual Surveillance.
ACM Trans. Multim. Comput. Commun. Appl., June, 2024

A Coarse-to-Fine Cell Division Approach for Hyperspectral Remote Sensing Image Classification.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Self-Prompting Perceptual Edge Learning for Dense Prediction.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Co-segmentation assisted cross-modality person re-identification.
Inf. Fusion, April, 2024

Zero-Shot Learning With Attentive Region Embedding and Enhanced Semantics.
IEEE Trans. Neural Networks Learn. Syst., March, 2024

Feature Calibrating and Fusing Network for RGB-D Salient Object Detection.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Weakly Supervised Joint Transfer and Regression of Textures for 3-D Human Reconstruction.
IEEE Trans. Consumer Electron., February, 2024

Learning Foreground Information Bottleneck for few-shot semantic segmentation.
Pattern Recognit., February, 2024

On Exploring Shape and Semantic Enhancements for RGB-X Semantic Segmentation.
IEEE Trans. Intell. Veh., January, 2024

Dynamic contrastive learning guided by class confidence and confusion degree for medical image segmentation.
Pattern Recognit., January, 2024

Supervised biadjacency networks for stereo matching.
Multim. Tools Appl., January, 2024

DCMSTRD: End-to-end Dense Captioning via Multi-Scale Transformer Decoding.
IEEE Trans. Multim., 2024

Binocular Image Dehazing via a Plain Network Without Disparity Estimation.
IEEE Trans. Multim., 2024

Lightweight Multiperson Pose Estimation With Staggered Alignment Self-Distillation.
IEEE Trans. Multim., 2024

Exploring Multi-Modal Spatial-Temporal Contexts for High-Performance RGB-T Tracking.
IEEE Trans. Image Process., 2024

Model Attention Expansion for Few-Shot Class-Incremental Learning.
IEEE Trans. Image Process., 2024

Salient Object Detection From Arbitrary Modalities.
IEEE Trans. Image Process., 2024

Confidence-Guided Centroids for Unsupervised Person Re-Identification.
IEEE Trans. Inf. Forensics Secur., 2024

MDCGA-Net: Multiscale Direction Context-Aware Network With Global Attention for Building Extraction From Remote Sensing Images.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

Exploring target-related information with reliable global pixel relationships for robust RGB-T tracking.
Pattern Recognit., 2024

Transductive zero-shot learning with generative model-driven structure alignment.
Pattern Recognit., 2024

Zero-shot sketch-based image retrieval via adaptive relation-aware metric learning.
Pattern Recognit., 2024

Adaptive Relation-Aware Network for zero-shot classification.
Neural Networks, 2024

Tolerant Self-Distillation for image classification.
Neural Networks, 2024

ECMEE: Expert Constrained Multi-Expert Ensembles with Category Entropy Minimization for Long-tailed Visual Recognition.
Neurocomputing, 2024

Dense affinity matching for Few-Shot Segmentation.
Neurocomputing, 2024

Lightweight cross-modal transformer for RGB-D salient object detection.
Comput. Vis. Image Underst., 2024

A Fresh Look at Generalized Category Discovery through Non-negative Matrix Factorization.
CoRR, 2024

Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding.
CoRR, 2024

LLMI3D: Empowering LLM with 3D Perception from a Single 2D Image.
CoRR, 2024

Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective.
CoRR, 2024

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results.
CoRR, 2024

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation.
CoRR, 2024

VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model.
CoRR, 2024

YOLOv10: Real-Time End-to-End Object Detection.
CoRR, 2024

Modality Prompts for Arbitrary Modality Salient Object Detection.
CoRR, 2024

Raformer: Redundancy-Aware Transformer for Video Wire Inpainting.
CoRR, 2024

On Exploring PDE Modeling for Point Cloud Video Representation Learning.
CoRR, 2024

Pixel Sentence Representation Learning.
CoRR, 2024

Pixel Matching Network for Cross-Domain Few-Shot Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Eliminate Before Align: A Remote Sensing Image-Text Retrieval Framework with Keyword Explicit Reasoning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

TaD: A Plug-and-Play Task-Aware Decoding Method to Better Adapt LLMs on Downstream Tasks.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

PYRA: Parallel Yielding Re-activation for Training-Inference Efficient Task Adaptation.
Proceedings of the Computer Vision - ECCV 2024, 2024

On the Approximation Risk of Few-Shot Class-Incremental Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence.
Proceedings of the Computer Vision - ECCV 2024, 2024

Pseudo-labelling Should Be Aware of Disguising Channel Activations.
Proceedings of the Computer Vision - ECCV 2024, 2024

Context Enhancement with Reconstruction as Sequence for Unified Unsupervised Anomaly Detection.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Rep ViT: Revisiting Mobile CNN From ViT Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

WaveFace: Authentic Face Restoration with Efficient Frequency Recovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Multilayer Evolving Fuzzy Neural Networks.
IEEE Trans. Fuzzy Syst., December, 2023

On exploring pose estimation as an auxiliary learning task for Visible-Infrared Person Re-identification.
Neurocomputing, November, 2023

LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context Propagation in Transformers.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Decoupling Multimodal Transformers for Referring Video Object Segmentation.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Margin-aware rectified augmentation for long-tailed recognition.
Pattern Recognit., September, 2023

2.75D: Boosting learning by representing 3D Medical imaging to 2D features for small data.
Biomed. Signal Process. Control., July, 2023

Knowledge Distillation Classifier Generation Network for Zero-Shot Learning.
IEEE Trans. Neural Networks Learn. Syst., June, 2023

A Discriminative Cross-Aligned Variational Autoencoder for Zero-Shot Learning.
IEEE Trans. Cybern., June, 2023

Boosting Variational Inference With Margin Learning for Few-Shot Scene-Adaptive Anomaly Detection.
IEEE Trans. Circuits Syst. Video Technol., June, 2023

Perception consistency ultrasound image super-resolution via self-supervised CycleGAN.
Neural Comput. Appl., June, 2023

Hierarchical Regression and Classification for Accurate Object Detection.
IEEE Trans. Neural Networks Learn. Syst., May, 2023

Hybrid routing transformer for zero-shot learning.
Pattern Recognit., May, 2023

Filter pruning with uniqueness mechanism in the frequency domain for efficient neural networks.
Neurocomputing, April, 2023

Semi-Supervised Unpaired Medical Image Segmentation Through Task-Affinity Consistency.
IEEE Trans. Medical Imaging, March, 2023

Textual Context-Aware Dense Captioning With Diverse Words.
IEEE Trans. Multim., 2023

Latent Feature Pyramid Network for Object Detection.
IEEE Trans. Multim., 2023

Progressive Recurrent Neural Network for Multispectral Remote Sensing Image Destriping.
IEEE Trans. Geosci. Remote. Sens., 2023

Exploring modality-shared appearance features and modality-invariant relation features for cross-modality person Re-IDentification.
Pattern Recognit., 2023

Video Object Segmentation using Point-based Memory Network.
Pattern Recognit., 2023

Deep learning for visible-infrared cross-modality person re-identification: A comprehensive review.
Inf. Fusion, 2023

Re-parameterized Low-rank Prompt: Generalize a Vision-Language Model within 0.5K Parameters.
CoRR, 2023

RepViT-SAM: Towards Real-Time Segmenting Anything.
CoRR, 2023

CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs.
CoRR, 2023

Filter Pruning for Efficient CNNs via Knowledge-driven Differential Filter Sampler.
CoRR, 2023

SegGPT Meets Co-Saliency Scene.
CoRR, 2023

Autonomous learning for fuzzy systems: a review.
Artif. Intell. Rev., 2023

Deep learning for video object segmentation: a review.
Artif. Intell. Rev., 2023

Re-parameterizing Your Optimizers rather than Architectures.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Beyond One-to-One: Rethinking the Referring Image Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficient RGB-T Tracking via Cross-Modality Distillation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Memory Attention Networks for Skeleton-Based Action Recognition.
IEEE Trans. Neural Networks Learn. Syst., 2022

Employing Bilinear Fusion and Saliency Prior Information for RGB-D Salient Object Detection.
IEEE Trans. Multim., 2022

Guest Editorial Artificial Intelligence in Pre-DICOM.
IEEE J. Biomed. Health Informatics, 2022

Disentangled Capsule Routing for Fast Part-Object Relational Saliency.
IEEE Trans. Image Process., 2022

Information Symmetry Matters: A Modal-Alternating Propagation Network for Few-Shot Learning.
IEEE Trans. Image Process., 2022

Middle-Level Feature Fusion for Lightweight RGB-D Salient Object Detection.
IEEE Trans. Image Process., 2022

Solo-to-Collaborative Dual-Attention Network for One-Shot Object Detection in Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Variational Self-Distillation for Remote Sensing Scene Classification.
IEEE Trans. Geosci. Remote. Sens., 2022

SAENet: Self-Supervised Adversarial and Equivariant Network for Weakly Supervised Object Detection in Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Multiview Subspace Clustering by an Enhanced Tensor Nuclear Norm.
IEEE Trans. Cybern., 2022

DGIG-Net: Dynamic Graph-in-Graph Networks for Few-Shot Human-Object Interaction.
IEEE Trans. Cybern., 2022

SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text Retrieval.
IEEE Trans. Cybern., 2022

SiamCDA: Complementarity- and Distractor-Aware RGB-T Tracking Based on Siamese Network.
IEEE Trans. Circuits Syst. Video Technol., 2022

Engaging Part-Whole Hierarchies and Contrast Cues for Salient Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Bi-Directional Progressive Guidance Network for RGB-D Salient Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022

Stereo Refinement Dehazing Network.
IEEE Trans. Circuits Syst. Video Technol., 2022

Revisiting Modality-Specific Feature Compensation for Visible-Infrared Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2022

Improving Synthetic to Realistic Semantic Segmentation With Parallel Generative Ensembles for Autonomous Urban Driving.
IEEE Trans. Cogn. Dev. Syst., 2022

Column-Spatial Correction Network for Remote Sensing Image Destriping.
Remote. Sens., 2022

Editorial for the special issue on deep learning for precise and efficient object detection.
Pattern Recognit. Lett., 2022

Zero-shot learning via a specific rank-controlled semantic autoencoder.
Pattern Recognit., 2022

Discriminative unimodal feature selection and fusion for RGB-D salient object detection.
Pattern Recognit., 2022

Cross-modality person re-identification via multi-task learning.
Pattern Recognit., 2022

Part-Object Relational Visual Saliency.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Multi-view graph embedding clustering network: Joint self-supervision and block diagonal representation.
Neural Networks, 2022

A Self-Training Hierarchical Prototype-based Ensemble Framework for Remote Sensing Scene Classification.
Inf. Fusion, 2022

Meta hyperbolic networks for zero-shot learning.
Neurocomputing, 2022

Long-tailed visual recognition with deep models: A methodological survey and evaluation.
Neurocomputing, 2022

Real-time facial expression recognition based on iterative transfer learning and efficient attention network.
IET Image Process., 2022

Ground Plane Matters: Picking Up Ground Plane Prior in Monocular 3D Object Detection.
CoRR, 2022

Semi-supervised Object Detection via Virtual Category Learning.
CoRR, 2022

Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs.
CoRR, 2022

Dual Attention with the Self-Attention Alignment for Efficient Video Super-resolution.
Cogn. Comput., 2022

Onfocus detection: identifying individual-camera eye contact from unconstrained images.
Sci. China Inf. Sci., 2022

Densely nested top-down flows for salient object detection.
Sci. China Inf. Sci., 2022

Laplacian Regularized Variational Few-Shot Learning for Image Classification.
Proceedings of the Advances in Computational Intelligence Systems, 2022

Physically-Based Face Rendering for NIR-VIS Face Recognition.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Boosting Video-Text Retrieval with Explicit High-Level Semantics.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Temporal Saliency Query Network for Efficient Video Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semi-supervised Object Detection via VC Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

FMCNet: Feature-Level Modality Compensation for Visible-Infrared Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scaling Up Your Kernels to 31×31: Revisiting Large Kernel Design in CNNs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

ReMoNet: Recurrent Multi-Output Network for Efficient Video Denoising.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Alignment Enhancement Network for Fine-grained Visual Categorization.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Deep Attentive Video Summarization With Distribution Consistency Learning.
IEEE Trans. Neural Networks Learn. Syst., 2021

Joint Cross-Modal and Unimodal Features for RGB-D Salient Object Detection.
IEEE Trans. Multim., 2021

Learning Transformation-Invariant Local Descriptors With Low-Coupling Binary Codes.
IEEE Trans. Image Process., 2021

Where to Prune: Using LSTM to Guide Data-Dependent Soft Pruning.
IEEE Trans. Image Process., 2021

Integrating Part-Object Relationship and Contrast for Camouflaged Object Detection.
IEEE Trans. Inf. Forensics Secur., 2021

Revisiting Feature Fusion for RGB-T Salient Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2021

Efficient Selective Context Network for Accurate Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2021

Automatic pancreas segmentation based on lightweight DCNN modules and spatial prior propagation.
Pattern Recognit., 2021

Exploring a unified low rank representation for multi-focus image fusion.
Pattern Recognit., 2021

Cross-modality deep feature learning for brain tumor segmentation.
Pattern Recognit., 2021

Learning modulation filter networks for weak signal detection in noise.
Pattern Recognit., 2021

Relation-based Discriminative Cooperation Network for Zero-Shot Classification.
Pattern Recognit., 2021

Cascaded hierarchical atrous spatial pyramid pooling module for semantic segmentation.
Pattern Recognit., 2021

Graph embedding clustering: Graph attention auto-encoder with cluster-specificity distribution.
Neural Networks, 2021

Deep image compression with multi-stage representation.
J. Vis. Commun. Image Represent., 2021

Lightweight facial expression recognition method based on attention mechanism and key region fusion.
J. Electronic Imaging, 2021

LODE: Deep Local Deblurring and A New Benchmark.
CoRR, 2021

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition.
CoRR, 2021

Middle-level Fusion for Lightweight RGB-D Salient Object Detection.
CoRR, 2021

Exploring Modality-shared Appearance Features and Modality-invariant Relation Features for Cross-modality Person Re-Identification.
CoRR, 2021

Image Captioning with Memorized Knowledge.
Cogn. Comput., 2021

Ultrasound tissue classification: a review.
Artif. Intell. Rev., 2021

Zero-Shot Learning via Discriminative Dual Semantic Auto-Encoder.
IEEE Access, 2021

ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RepVGG: Making VGG-Style ConvNets Great Again.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Diverse Branch Block: Building a Convolution as an Inception-Like Unit.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ABMDRNet: Adaptive-Weighted Bi-Directional Modality Difference Reduction Network for RGB-T Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Increasing Oversampling Diversity for Long-Tailed Visual Recognition.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

2020
ACMNet: Adaptive Confidence Matching Network for Human Behavior Analysis via Cross-modal Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2020

Attribute-Guided Network for Cross-Modal Zero-Shot Hashing.
IEEE Trans. Neural Networks Learn. Syst., 2020

Using Generative Adversarial Networks to Break and Protect Text Captchas.
ACM Trans. Priv. Secur., 2020

The Structure Transfer Machine Theory and Applications.
IEEE Trans. Image Process., 2020

Exploring Task Structure for Brain Tumor Segmentation From Multi-Modality MR Images.
IEEE Trans. Image Process., 2020

RGB-T Salient Object Detection via Fusing Multi-Level CNN Features.
IEEE Trans. Image Process., 2020

On Aggregation of Unsupervised Deep Binary Descriptor With Weak Bits.
IEEE Trans. Image Process., 2020

Deep Salient Object Detection With Contextual Information Guidance.
IEEE Trans. Image Process., 2020

Aggregation Signature for Small Object Tracking.
IEEE Trans. Image Process., 2020

Taking a Look at Small-Scale Pedestrians and Occluded Pedestrians.
IEEE Trans. Image Process., 2020

Discrete Probability Distribution Prediction of Image Emotions with Shared Sparse Learning.
IEEE Trans. Affect. Comput., 2020

Pedestrian attribute recognition based on multiple time steps attention.
Pattern Recognit. Lett., 2020

Multi-focus image fusion based on non-negative sparse representation and patch-level consistency rectification.
Pattern Recognit., 2020

Multi-layer Attention Based CNN for Target-Dependent Sentiment Classification.
Neural Process. Lett., 2020

Label-activating framework for zero-shot learning.
Neural Networks, 2020

Indoor scene understanding via RGB-D image segmentation employing depth-based CNN and CRFs.
Multim. Tools Appl., 2020

Fast simultaneous image super-resolution and motion deblurring with decoupled cooperative learning.
J. Real Time Image Process., 2020

Semantic segmentation with hybrid pyramid pooling and stacked pyramid structure.
Neurocomputing, 2020

Pixelated Semantic Colorization.
Int. J. Comput. Vis., 2020

Lossless CNN Channel Pruning via Gradient Resetting and Convolutional Re-parameterization.
CoRR, 2020

Few-Cost Salient Object Detection with Adversarial-Paced Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Exploring Multi-scale Deep Encoder-Decoder and PatchGAN for Perceptual Ultrasound Image Super-Resolution.
Proceedings of the Neural Computing for Advanced Applications, 2020

Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-Tailed Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

NAS-Count: Counting-by-Density with Neural Architecture Search.
Proceedings of the Computer Vision - ECCV 2020, 2020

Episode-Based Prototype Generating Network for Zero-Shot Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

BidNet: Binocular Image Dehazing Without Explicit Disparity Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

IMRAM: Iterative Matching With Recurrent Attention Memory for Cross-Modal Image-Text Retrieval.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Shallow Feature Based Dense Attention Network for Crowd Counting.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Heterogeneous Transfer Learning with Weighted Instance-Correspondence Data.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Personalized Emotion Recognition by Personality-Aware High-Order Learning of Physiological Signals.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Unsupervised Deep Video Hashing via Balanced Code for Large-Scale Video Retrieval.
IEEE Trans. Image Process., 2019

Deep Manifold Structure Transfer for Action Recognition.
IEEE Trans. Image Process., 2019

DECODE: Deep Confidence Network for Robust Image Classification.
IEEE Trans. Image Process., 2019

JCS-Net: Joint Classification and Super-Resolution Network for Small-Scale Pedestrian Detection in Surveillance Images.
IEEE Trans. Inf. Forensics Secur., 2019

Joint Image-Text Hashing for Fast Large-Scale Cross-Media Retrieval Using Self-Supervised Deep Learning.
IEEE Trans. Ind. Electron., 2019

Salient Object Detection via Two-Stage Graphs.
IEEE Trans. Circuits Syst. Video Technol., 2019

Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection.
Sensors, 2019

ST-CNN: Spatial-Temporal Convolutional Neural Network for crowd counting in videos.
Pattern Recognit. Lett., 2019

Are mid-air dynamic gestures applicable to user identification?
Pattern Recognit. Lett., 2019

Optimized projection for hashing.
Pattern Recognit. Lett., 2019

Salient object detection employing a local tree-structured low-rank representation and foreground consistency.
Pattern Recognit., 2019

Video Synchronization Based on Projective-Invariant Descriptor.
Neural Process. Lett., 2019

Flexible unsupervised feature extraction for image classification.
Neural Networks, 2019

Adaptive robust principal component analysis.
Neural Networks, 2019

Guest editorial: Automatic facial and bodily expression perception for human behaviour understanding.
Multim. Tools Appl., 2019

Attention module-based spatial-temporal graph convolutional networks for skeleton-based action recognition.
J. Electronic Imaging, 2019

Single image super-resolution using multi-scale deep encoder-decoder with phase congruency edge map guidance.
Inf. Sci., 2019

Hyperspectral image denoising via minimizing the partial sum of singular values and superpixel segmentation.
Neurocomputing, 2019

Class-specific synthesized dictionary model for Zero-Shot Learning.
Neurocomputing, 2019

Survey on GAN-based face hallucination with its model development.
IET Image Process., 2019

SAR image change detection based on deep denoising and CNN.
IET Image Process., 2019

Zero-shot multi-label learning via label factorisation.
IET Comput. Vis., 2019

Meta-Transfer Networks for Zero-Shot Learning.
CoRR, 2019

Complex Deep Learning and Evolutionary Computing Models in Computer Vision.
Complex., 2019

Neural Image Caption Generation with Weighted Training and Reference.
Cogn. Comput., 2019

Taylor Convolutional Networks for Image Classification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Deep Feature-Preserving Based Face Hallucination: Feature Discrimination Versus Pixels Approximation.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Global Sparse Momentum SGD for Pruning Very Deep Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Cross-Modal Image-Text Retrieval with Semantic Consistency.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Incremental Few-Shot Learning for Pedestrian Attribute Recognition.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Low Shot Box Correction for Weakly Supervised Object Detection.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Graph and Autoencoder Based Feature Extraction for Zero-shot Learning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Landmark Selection for Zero-shot Learning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Zero-shot Learning with Many Classes by High-rank Deep Embedding Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Approximated Oracle Filter Pruning for Destructive CNN Width Optimization.
Proceedings of the 36th International Conference on Machine Learning, 2019

Complementary Features with Reasonable Receptive Field for Road Scene 3D Object Detection.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Saliency-Guided Attention Network for Image-Sentence Matching.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Hierarchical Shot Detector.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Employing Deep Part-Object Relationships for Salient Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Centripetal SGD for Pruning Very Deep Convolutional Networks With Complicated Structure.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Recurrent Attention Model for Pedestrian Attribute Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Attentive Temporal Pyramid Network for Dynamic Scene Classification.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Dual-View Ranking with Hardness Assessment for Zero-Shot Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Projection Convolutional Neural Networks for 1-bit CNNs via Discrete Back Propagation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Learning Object Context for Dense Captioning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
End-to-End Feature-Aware Label Space Encoding for Multilabel Classification With Many Classes.
IEEE Trans. Neural Networks Learn. Syst., 2018

Real-Time Scalable Visual Tracking via Quadrangle Kernelized Correlation Filters.
IEEE Trans. Intell. Transp. Syst., 2018

Action Recognition From Arbitrary Views Using Transferable Dictionary Learning.
IEEE Trans. Image Process., 2018

Latent Constrained Correlation Filter.
IEEE Trans. Image Process., 2018

Gabor Convolutional Networks.
IEEE Trans. Image Process., 2018

Discriminant Analysis via Joint Euler Transform and ℓ<sub>2, 1</sub>-Norm.
IEEE Trans. Image Process., 2018

Robust Quantization for General Similarity Search.
IEEE Trans. Image Process., 2018

Unconstrained Face Recognition Using a Set-to-Set Distance Measure on Deep Learned Features.
IEEE Trans. Circuits Syst. Video Technol., 2018

Dense Invariant Feature-Based Support Vector Ranking for Cross-Camera Person Reidentification.
IEEE Trans. Circuits Syst. Video Technol., 2018

Secure and privacy-preserving data sharing in the cloud based on lossless image coding.
Signal Process., 2018

Automatic Modulation Classification Based on Deep Learning for Unmanned Aerial Vehicles.
Sensors, 2018

High-Fidelity Inhomogeneous Ground Clutter Simulation of Airborne Phased Array PD Radar Aided by Digital Elevation Model and Digital Land Classification Data.
Sensors, 2018

Robust sparse representation based multi-focus image fusion with dictionary construction and local spatial consistency.
Pattern Recognit., 2018

Deep Fisher discriminant learning for mobile hand gesture recognition.
Pattern Recognit., 2018

End-to-end video background subtraction with 3d convolutional neural networks.
Multim. Tools Appl., 2018

Single satellite imagery simultaneous super-resolution and colorization using multi-task deep neural networks.
J. Vis. Commun. Image Represent., 2018

Fast hyperspectral band selection based on spatial feature extraction.
J. Real Time Image Process., 2018

Salient object detection employing robust sparse representation and local consistency.
Image Vis. Comput., 2018

Sparse representation based multi-sensor image fusion for multi-focus and multi-modality images: A review.
Inf. Fusion, 2018

Single image super-resolution using a deep encoder-decoder symmetrical network with iterative back projection.
Neurocomputing, 2018

Memory Attention Networks for Skeleton-based Action Recognition.
CoRR, 2018

One-Two-One Networks for Compression Artifacts Reduction in Remote Sensing.
CoRR, 2018

The Structure Transfer Machine Theory and Applications.
CoRR, 2018

Attribute-Guided Network for Cross-Modal Zero-Shot Hashing.
CoRR, 2018

Gabor Convolutional Networks.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Where to Prune: Using LSTM to Guide End-to-end Pruning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Personality-Aware Personalized Emotion Recognition from Physiological Signals.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Memory Attention Networks for Skeleton-based Action Recognition.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Unsupervised Deep Hashing via Binary Latent Factor Models for Large-scale Cross-modal Retrieval.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Zero Shot Learning via Low-rank Embedded Semantic AutoEncoder.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Implicit Non-linear Similarity Scoring for Recognizing Unseen Classes.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Show, Observe and Tell: Attribute-driven Attention Model for Image Captioning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018


Modulated Convolutional Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Pixel-level Semantics Guided Image Colorization.
Proceedings of the British Machine Vision Conference 2018, 2018

Attend to Knowledge: Memory-Enhanced Attention Network for Image Captioning.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2018

Euler Sparse Representation for Image Classification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

On Trivial Solution and High Correlation Problems in Deep Supervised Hashing.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Zero-Shot Learning With Attribute Selection.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Auto-Balanced Filter Pruning for Efficient Convolutional Neural Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Temporal-Difference Learning With Sampling Baseline for Image Captioning.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Action Recognition Using 3D Histograms of Texture and A Multi-Class Boosting Classifier.
IEEE Trans. Image Process., 2017

LLE Score: A New Filter-Based Unsupervised Feature Selection Method Based on Nonlinear Manifold Embedding and Its Application to Image Recognition.
IEEE Trans. Image Process., 2017

Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval.
IEEE Trans. Image Process., 2017

Learning to Hash With Optimized Anchor Embedding for Scalable Retrieval.
IEEE Trans. Image Process., 2017

Zero-Shot Learning With Transferred Samples.
IEEE Trans. Image Process., 2017

Cross-View Retrieval via Probability-Based Semantics-Preserving Hashing.
IEEE Trans. Cybern., 2017

Guest Editorial: Feature Learning from RGB-D Data for Multimedia Applications.
Multim. Tools Appl., 2017

RGB-D datasets using microsoft kinect or similar sensors: a survey.
Multim. Tools Appl., 2017

Hyperspectral Band Selection Using Improved Classification Map.
IEEE Geosci. Remote. Sens. Lett., 2017

Image Reconstruction via Manifold Constrained Convolutional Sparse Coding for Image Sets.
IEEE J. Sel. Top. Signal Process., 2017

Large-scale image retrieval with Sparse Embedded Hashing.
Neurocomputing, 2017

Attribute-based supervised deep learning model for action recognition.
Frontiers Comput. Sci., 2017

Salient object detection based on super-pixel clustering and unified low-rank representation.
Comput. Vis. Image Underst., 2017

Sparse Representation based Multi-sensor Image Fusion: A Review.
CoRR, 2017

Gabor Convolutional Networks.
CoRR, 2017

Deep Spatio-temporal Manifold Network for Action Recognition.
CoRR, 2017

Multi-Temporal Depth Motion Maps-Based Local Binary Patterns for 3-D Human Action Recognition.
IEEE Access, 2017

Learning Visual Emotion Distributions via Multi-Modal Features Fusion.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

TUCH: Turning Cross-view Hashing into Single-view Hashing via Generative Adversarial Nets.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Approximating Discrete Probability Distribution of Image Emotions by Multi-Modal Features Fusion.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Dynamic Multi-View Hashing for Online Image Retrieval.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Unsupervised Deep Video Hashing with Balanced Rotation.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Synthesizing Samples for Zero-shot Learning.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

SitNet: Discrete Similarity Transfer Network for Zero-shot Hashing.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Zero-Shot Recognition via Direct Classifier Learning with Transferred Samples and Pseudo Labels.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Active Learning with Cross-Class Similarity Transfer.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Reference Based LSTM for Image Captioning.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Cosaliency Detection Based on Intrasaliency Prior Transfer and Deep Intersaliency Mining.
IEEE Trans. Neural Networks Learn. Syst., 2016

Guest Editorial Special Section on Visual Saliency Computing and Learning.
IEEE Trans. Neural Networks Learn. Syst., 2016

Robust object representation by boosting-like deep learning architecture.
Signal Process. Image Commun., 2016

An improved Fisher discriminant vector employing updated between-scatter matrix.
Neurocomputing, 2016

Adaptive Multi-class Correlation Filters.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Robust Iterative Quantization for Efficient ℓ<sub>p</sub>-norm Similarity Search.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

3D Action Recognition Using Multi-Temporal Depth Motion Maps and Fisher Vector.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

2015
Learning Computational Models of Video Memorability from fMRI Brain Imaging.
IEEE Trans. Cybern., 2015

Analysis of music/speech via integration of audio content and functional brain response.
Inf. Sci., 2015

Saliency-aware image-to-class distances for image classification.
Neurocomputing, 2015

Auto-encoder-based shared mid-level visual dictionary learning for scene classification using very high resolution remote sensing images.
IET Comput. Vis., 2015

2014
Efficient highlight removal of metal surfaces.
Signal Process., 2014

Image visual attention computation and application via the learning of object attributes.
Mach. Vis. Appl., 2014

Video abstraction based on fMRI-driven visual attention model.
Inf. Sci., 2014

Guest Editorial: Special issue on advanced computing for image-guided intervention.
Neurocomputing, 2014

A subset method for improving Linear Discriminant Analysis.
Neurocomputing, 2014

Spatial and temporal visual attention prediction in videos using eye movement data.
Neurocomputing, 2014

Clustering and retrieval of video shots based on natural stimulus fMRI.
Neurocomputing, 2014

Feature-based motion compensated interpolation for frame rate up-conversion.
Neurocomputing, 2014

2013
Computer vision for RGB-D sensors: Kinect and its applications [special issue intro.].
IEEE Trans. Cybern., 2013

Enhanced Computer Vision With Microsoft Kinect Sensor: A Review.
IEEE Trans. Cybern., 2013

Visible and infrared image registration in man-made environments employing hybrid visual features.
Pattern Recognit. Lett., 2013

Extracting semantics from multi-spectrum video.
Pattern Recognit. Lett., 2013

Fast saliency-aware multi-modality image fusion.
Neurocomputing, 2013

2012
Employing a RGB-D sensor for real-time tracking of humans across multiple re-entries in a smart environment.
IEEE Trans. Consumer Electron., 2012

Intelligent trainee behavior assessment system for medical training employing video analysis.
Pattern Recognit. Lett., 2012

Multimodality and Multiresolution Image Fusion.
Proceedings of the VISAPP 2012, 2012

2011
Real-time multiple people tracking for automatic group-behavior evaluation in delivery simulation training.
Multim. Tools Appl., 2011

A Mixed-Reality System for Broadcasting Sports Video to Mobile Devices.
IEEE Multim., 2011

Analysis and retargeting of ball sports video.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Visible and Infrared Image Registration Employing Line-Based Geometric Analysis.
Proceedings of the Computational Intelligence for Multimedia Understanding, 2011

Multimodal monitoring of cultural heritage sites and the FIRESENSE project.
Proceedings of the 4th International Symposium on Applied Sciences in Biomedical and Communication Technologies, 2011

2010
Flexible Human Behavior Analysis Framework for Video Surveillance Applications.
Int. J. Digit. Multim. Broadcast., 2010

Video Analysis, Abstraction, and Retrieval: Techniques and Applications.
Int. J. Digit. Multim. Broadcast., 2010

2009
Automatic video-based human motion analyzer for consumer surveillance system.
IEEE Trans. Consumer Electron., 2009

Behavioral State Detection of Newborns Based on Facial Expression Analysis.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2009

2008
Broadcast Court-Net Sports Video Analysis Using Fast 3-D Camera Modeling.
IEEE Trans. Circuits Syst. Video Technol., 2008

A real-time video surveillance system with human occlusion handling using nonlinear regression.
Proceedings of the 2008 IEEE International Conference on Multimedia and Expo, 2008

Fast Detection and Modeling of Human-Body Parts from Monocular Video.
Proceedings of the Articulated Motion and Deformable Objects, 5th International Conference, 2008

Video-Based Fall Detection in the Home Using Principal Component Analysis.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2008

2007
A Matching-Based Approach for Human Motion Analysis.
Proceedings of the Advances in Multimedia Modeling, 2007

Generic 3-D Modeling for Content Analysis of Court-Net Sports Sequences.
Proceedings of the Advances in Multimedia Modeling, 2007

A real-time augmented-reality system for sports broadcast video enhancement.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

3-D Camera Modeling and Its Applications in Sports Broadcast Video Analysis.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

High-Level Traffic-Violation Detection for Embedded Traffic Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Real-time video content analysis tool for consumer media storage system.
IEEE Trans. Consumer Electron., 2006

Automatic Sports Video Analysis using Audio Clues and Context Knowledge.
Proceedings of the IASTED International Conference on Internet and Multimedia Systems and Applications, 2006

Scene-Level Analysis for Tennis Sports Video using Weighted Linear Combination of Visual Cues.
Proceedings of the IASTED International Conference on Internet and Multimedia Systems and Applications, 2006

Content-Based Model Template Adaptation and Real-Time System for Behavior Interpretation in Sports Video.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2006

2005
Real-Time and Distributed AV Content Analysis System for Consumer Electronics Networks.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Fast camera calibration for the analysis of sport sequences.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

2004
A novel stereo image coding algorithm based on delaunay triangulation mesh.
Proceedings of the Visual Communications and Image Processing 2004, 2004

Variable block-size transform and entropy coding at the enhancement layer of FGS.
Proceedings of the 2004 International Conference on Image Processing, 2004

2002
Novel image retrieval technique using salient edges.
Proceedings of the Storage and Retrieval for Media Databases 2002, 2002


  Loading...