Jing Zhang

Orcid: 0000-0001-6595-7661

Affiliations:
  • University of Sydney, School of Computer Science, Faculty of Engineering, UBTECH Sydney Artificial Intelligence Centre, NSW, Australia
  • Hangzhou Dianzi University, School of Automation, China (2017 - 2018)
  • iFLYTEK Research, Hefei, China (2016 - 2017)
  • ZTE Shanghai R&D Center, Shanghai, China (2015 - 2016)
  • University of Science and Technology of China, Department of Automation, Hefei, China (PhD 2015)


According to our database1, Jing Zhang authored at least 249 papers between 2013 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Empowering Agrifood System with Artificial Intelligence: A Survey of the Progress, Challenges and Opportunities.
ACM Comput. Surv., February, 2025

2024
A Survey on Self-Supervised Learning: Algorithms, Applications, and Future Trends.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Learning Visual Affordance Grounding From Demonstration Videos.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

VNAS: Variational Neural Architecture Search.
Int. J. Comput. Vis., September, 2024

SPH-Net: Hyperspectral Image Super-Resolution via Smoothed Particle Hydrodynamics Modeling.
IEEE Trans. Cybern., July, 2024

Grounded Affordance from Exocentric View.
Int. J. Comput. Vis., June, 2024

On Robust Cross-view Consistency in Self-supervised Monocular Depth Estimation.
Mach. Intell. Res., June, 2024

Vision Transformer With Quadrangle Attention.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Pruning Self-Attentions Into Convolutional Layers in Single Path.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

ProtoSimi: label correction for fine-grained visual categorization.
Mach. Learn., April, 2024

ViTPose++: Vision Transformer for Generic Body Pose Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

On Exploring Multiplicity of Primitives and Attributes for Texture Recognition in the Wild.
IEEE Trans. Pattern Anal. Mach. Intell., January, 2024

Expanding and Refining Hybrid Compressors for Efficient Object Re-Identification.
IEEE Trans. Image Process., 2024

Looking Into Gait for Perceiving Emotions via Bilateral Posture and Movement Graph Convolutional Networks.
IEEE Trans. Affect. Comput., 2024

MTP: Advancing Remote Sensing Foundation Model via Multitask Pretraining.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

IoUformer: Pseudo-IoU prediction with transformer for visual tracking.
Neural Networks, 2024

SoccerNet 2024 Challenges Results.
CoRR, 2024

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation.
CoRR, 2024

Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection.
CoRR, 2024

BEVNav: Robot Autonomous Navigation Via Spatial-Temporal Contrastive Learning in Bird's-Eye View.
CoRR, 2024

SiamMo: Siamese Motion-Centric 3D Object Tracking.
CoRR, 2024

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results.
CoRR, 2024

PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions.
CoRR, 2024

Is Your HD Map Constructor Reliable under Sensor Corruptions?
CoRR, 2024

Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset.
CoRR, 2024

HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model.
CoRR, 2024

Technique Report of CVPR 2024 PBDL Challenges.
CoRR, 2024

3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation.
CoRR, 2024

Team Samsung-RAL: Technical Report for 2024 RoboDrive Challenge-Robust Map Segmentation Track.
CoRR, 2024

The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition.
CoRR, 2024

Contact-aware Human Motion Generation from Textual Descriptions.
CoRR, 2024

MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining.
CoRR, 2024

Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation.
CoRR, 2024

GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching.
CoRR, 2024

From Pixels to Preservation: The Power of Large Vision Models in Heritage Content Understanding.
Proceedings of the 6th workshop on the analySis, 2024

Instance-Level Scaling and Dynamic Margin-Alignment Knowledge Distillation.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Multi-Granularity Hand Action Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Unleashing the Power of Generic Segmentation Model: A Simple Baseline for Infrared Small Target Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

SAR-SLAM: Self-Attentive Rendering-based SLAM with Neural Point Cloud Encoding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

HandRefiner: Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

LeMeViT: Efficient Vision Transformer with Learnable Meta Tokens for Remote Sensing Image Interpretation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

MapDistill: Boosting Efficient Camera-Based HD Map Construction via Camera-LiDAR Fusion Model Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

IRPruneDet: Efficient Infrared Small Target Detection via Wavelet Structure-Regularized Soft Channel Pruning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

SurgicalSAM: Efficient Class Promptable Surgical Instrument Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Decomposing Semantic Shifts for Composed Image Retrieval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Curvature Consistent Network for Microscope Chip Image Super-Resolution.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

End-to-End One-Shot Human Parsing.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Inter-layer transition in neural architecture search.
Pattern Recognit., November, 2023

Unifying Flow, Stereo and Depth Estimation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Spatiotemporal Fusion Model of Remote Sensing Images Combining Single-Band and Multi-Band Prediction.
Remote. Sens., October, 2023

Transformer-Based Context Condensation for Boosting Feature Pyramids in Object Detection.
Int. J. Comput. Vis., October, 2023

SiamDF: Tracking training data-free siamese tracker.
Neural Networks, August, 2023

Online intervention siamese tracking.
Inf. Sci., August, 2023

Rethinking Portrait Matting with Privacy Preserving.
Int. J. Comput. Vis., August, 2023

IC9600: A Benchmark Dataset for Automatic Image Complexity Assessment.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

ViTAEv2: Vision Transformer Advanced by Exploring Inductive Bias for Image Recognition and Beyond.
Int. J. Comput. Vis., May, 2023

DCN-T: Dual Context Network With Transformer for Hyperspectral Image Classification.
IEEE Trans. Image Process., 2023

Learning to Purification for Unsupervised Person Re-Identification.
IEEE Trans. Image Process., 2023

SIR: Self-Supervised Image Rectification via Seeing the Same Scene From Multiple Different Lenses.
IEEE Trans. Image Process., 2023

Dim2Clear Network for Infrared Small Target Detection.
IEEE Trans. Geosci. Remote. Sens., 2023

Advancing Plain Vision Transformer Toward Remote Sensing Foundation Model.
IEEE Trans. Geosci. Remote. Sens., 2023

An Empirical Study of Remote Sensing Pretraining.
IEEE Trans. Geosci. Remote. Sens., 2023

Fluid Micelle Network for Image Super-Resolution Reconstruction.
IEEE Trans. Cybern., 2023

ST$^{2}$: Spatial-Temporal State Transformer for Crowd-Aware Autonomous Navigation.
IEEE Robotics Autom. Lett., 2023

Deep Corner.
Int. J. Comput. Vis., 2023

A Comprehensive Survey and Taxonomy on Single Image Dehazing Based on Deep Learning.
ACM Comput. Surv., 2023

APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond.
CoRR, 2023

Part to Whole: Collaborative Prompting for Surgical Instrument Segmentation.
CoRR, 2023

DA-STC: Domain Adaptive Video Semantic Segmentation via Spatio-Temporal Consistency.
CoRR, 2023

Decompose Semantic Shifts for Composed Image Retrieval.
CoRR, 2023

BEVTrack: A Simple Baseline for 3D Single Object Tracking in Bird's-Eye-View.
CoRR, 2023

PartSeg: Few-shot Part Segmentation via Part-aware Prompt Learning.
CoRR, 2023

The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT.
CoRR, 2023

RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation.
CoRR, 2023

FHA-Kitchens: A Novel Dataset for Fine-Grained Hand Action Recognition in Kitchen Scenes.
CoRR, 2023

Human-imperceptible, Machine-recognizable Images.
CoRR, 2023

Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model.
CoRR, 2023

Revolutionizing Agrifood Systems with Artificial Intelligence: A Survey.
CoRR, 2023

Scalable Mask Annotation for Video Text Spotting.
CoRR, 2023

Event-based Simultaneous Localization and Mapping: A Comprehensive Survey.
CoRR, 2023

UVA: Towards Unified Volumetric Avatar for View Synthesis, Pose rendering, Geometry and Texture Editing.
CoRR, 2023

Deep Image Matting: A Comprehensive Survey.
CoRR, 2023

BEVSimDet: Simulated Multi-modal Distillation in Bird's-Eye View for Multi-view 3D Object Detection.
CoRR, 2023

Deep Learning for Camera Calibration and Beyond: A Survey.
CoRR, 2023

Sensitivity-Aware Visual Parameter-Efficient Tuning.
CoRR, 2023

ESceme: Vision-and-Language Navigation with Episodic Scene Memory.
CoRR, 2023

OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System.
CoRR, 2023


SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

AniPixel: Towards Animatable Pixel-Aligned Human Avatar.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

OSP2B: One-Stage Point-to-Box Network for 3D Siamese Tracking.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

ESSAformer: Efficient Transformer for Hyperspectral Image Super-resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Domain Specified Optimization for Deployment Authorization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

MOEF: Modeling Occasion Evolution in Frequency Domain for Promotion-Aware Click-Through Rate Prediction.
Proceedings of the Database Systems for Advanced Applications, 2023

Cold-Start Based Multi-scenario Ranking Model for Click-Through Rate Prediction.
Proceedings of the Database Systems for Advanced Applications, 2023

CLAMP: Prompt-based Contrastive Learning for Connecting Language and Animal Pose.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Leverage Interactive Affinity for Affordance Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Referring Image Matting.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Dynamic Focus-aware Positional Queries for Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

GLT-T: Global-Local Transformer Voting for 3D Single Object Tracking in Point Clouds.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Learning to Learn Better for Video Object Segmentation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
DUT: Learning Video Stabilization by Simply Watching Unstable Videos.
IEEE Trans. Image Process., 2022

Robust Object Detection via Adversarial Novel Style Exploration.
IEEE Trans. Image Process., 2022

Stagewise Unsupervised Domain Adaptation With Adversarial Self-Training for Road Segmentation of Remote-Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2022

Multibranch Adversarial Regression for Domain Adaptative Hand Pose Estimation.
IEEE Trans. Circuits Syst. Video Technol., 2022

AFA: adversarial frequency alignment for domain generalized lung nodule detection.
Neural Comput. Appl., 2022

PRPN: Progressive region prediction network for natural scene text detection.
Knowl. Based Syst., 2022

Information-Theoretic Odometry Learning.
Int. J. Comput. Vis., 2022

One-Shot Object Affordance Detection in the Wild.
Int. J. Comput. Vis., 2022

CODON: On Orchestrating Cross-Domain Attentions for Depth Super-Resolution.
Int. J. Comput. Vis., 2022

Bridging Composite and Real: Towards End-to-End Deep Image Matting.
Int. J. Comput. Vis., 2022

Wide-Angle Image Rectification: A Survey.
Int. J. Comput. Vis., 2022

I3CL: Intra- and Inter-Instance Collaborative Learning for Arbitrary-Shaped Scene Text Detection.
Int. J. Comput. Vis., 2022

Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning.
IEEE CAA J. Autom. Sinica, 2022

Localizing Scan Targets from Human Pose for Autonomous Lung Ultrasound Imaging.
CoRR, 2022

ViTPose+: Vision Transformer Foundation Model for Generic Body Pose Estimation.
CoRR, 2022

1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results.
CoRR, 2022

Rethinking Hierarchies in Pre-trained Plain Vision Transformer.
CoRR, 2022

Advancing Plain Vision Transformer Towards Remote Sensing Foundation Model.
CoRR, 2022

PromptPose: Language Prompt Helps Animal Pose Estimation.
CoRR, 2022

Toward Real-world Single Image Deraining: A New Benchmark and Beyond.
CoRR, 2022

Multi-Task Learning with Multi-query Transformer for Dense Prediction.
CoRR, 2022

A Comprehensive Survey on Data-Efficient GANs in Image Generation.
CoRR, 2022

Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

MetaCVR: Conversion Rate Prediction via Meta Learning in Small-Scale Recommendation Scenarios.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Re-weighting Negative Samples for Model-Agnostic Matching.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Exploring Figure-Ground Assignment Mechanism in Perceptual Organization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Watermarking for Out-of-distribution Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring Feature Compensation and Cross-level Correlation for Infrared Small Target Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

RKformer: Runge-Kutta Transformer with Random-Connection Attention for Infrared Small Target Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

GT-MUST: Gated Try-on by Learning the Mannequin-Specific Transformation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SAR-to-Optical Image Translation via Neural Partial Differential Equations.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Two-Layers Super-Resolution Based Generation Adversarial Spatiotemporal Fusion Model.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2022

Towards Scale Consistent Monocular Visual Odometry by Learning from the Virtual World.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

FP-DETR: Detection Transformer Advanced by Fully Pre-training.
Proceedings of the Tenth International Conference on Learning Representations, 2022

JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics.
Proceedings of the Computer Vision - ECCV 2022, 2022

VSA: Learning Varied-Size Window Attention in Vision Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

RegionCL: Exploring Contrastive Region Pairs for Self-supervised Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Data-Efficient Detection Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

ReAct: Temporal Action Detection with Relational Queries.
Proceedings of the Computer Vision - ECCV 2022, 2022

BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2022, 2022

MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis.
Proceedings of the Computer Vision - ECCV 2022, 2022

FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs.
Proceedings of the Computer Vision - ECCV 2022, 2022

ISNet: Shape Matters for Infrared Small Target Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

GMFlow: Learning Optical Flow via Global Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Learning Affordance Grounding from Exocentric Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

RU-Net: Regularized Unrolling Network for Scene Graph Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FIBA: Frequency-Injection based Backdoor Attack in Medical Image Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Recurrent Glimpse-based Decoder for Detection with Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Hierarchically Fusing Long and Short-Term User Interests for Click-Through Rate Prediction in Product Search.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Attentive Cascaded Pyramid Network for Online Video Stabilization.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Siamese Network with Interactive Transformer for Video Object Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

SASA: Semantics-Augmented Set Abstraction for Point-Based 3D Object Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Leveraging Deep Statistics for Underwater Image Enhancement.
ACM Trans. Multim. Comput. Commun. Appl., 2021

A Human-Like Dual-Forklift Collaborative Mechanism for Container Handling.
IEEE Trans. Ind. Electron., 2021

Recovery of image and video based on compressive sensing via tensor approximation and Spatio-temporal correlation.
Multim. Tools Appl., 2021

Empowering Things With Intelligence: A Survey of the Progress, Challenges, and Opportunities in Artificial Intelligence of Things.
IEEE Internet Things J., 2021

Terra: A Smart and Sensible Digital Twin Framework for Robust Robot Deployment in Challenging Environments.
IEEE Internet Things J., 2021

Towards High Performance Human Keypoint Detection.
Int. J. Comput. Vis., 2021

Recursive Context Routing for Object Detection.
Int. J. Comput. Vis., 2021

Generative domain adaptation for chest X-ray image analysis.
IET Image Process., 2021

Conversion Rate Prediction via Meta Learning in Small-Scale Recommendation Scenarios.
CoRR, 2021

SAME: Scenario Adaptive Mixture-of-Experts for Promotion-Aware Click-Through Rate Prediction.
CoRR, 2021

RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?
CoRR, 2021

Hierarchically Modeling Micro and Macro Behaviors via Multi-Task Learning for Conversion Rate Prediction.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

AP-10K: A Benchmark for Animal Pose Estimation in the Wild.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Privacy-Preserving Portrait Matting.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

DSP: Dual Soft-Paste for Unsupervised Domain Adaptive Semantic Segmentation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Transfer Learning for KiTS21 Challenge.
Proceedings of the Kidney and Kidney Tumor Segmentation - MICCAI 2021 Challenge, 2021

One-Shot Affordance Detection.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Deep Automatic Natural Image Matting.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

A Comprehensive Survey on Image Dehazing Based on Deep Learning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Cascade Network for Hyperspectral Image Classification.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2021

Multi-Label Hyperspectral Classification with Discriminative Features.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2021

Adaptive Channel Attention and Feature Super-Resolution for Remote Sensing Images Spatiotemporal Fusion.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2021

Multi-label Classification of Hyperspectral Images Based on Label-Specific Feature Fusion.
Proceedings of the Neural Information Processing - 28th International Conference, 2021

Out-of-boundary View Synthesis Towards Full-Frame Video Stabilization.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

SAR-Net: A Scenario-Aware Ranking Network for Personalized Fair Recommendation in Hundreds of Travel Scenarios.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Progressive One-shot Human Parsing.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
FAMED-Net: A Fast and Accurate Multi-Scale End-to-End Dehazing Network.
IEEE Trans. Image Process., 2020

Heart sound classification based on improved MFCC features and convolutional recurrent neural networks.
Neural Networks, 2020

Deep time-frequency representation and progressive decision fusion for ECG classification.
Knowl. Based Syst., 2020

Human gait recognition based on deterministic learning and knowledge fusion through multiple walking views.
J. Frankl. Inst., 2020

Specific category region proposal network for text detection in natural scene.
IET Image Process., 2020

End-to-end Animal Image Matting.
CoRR, 2020

Condensing Two-stage Detection with Automatic Object Key Part Discovery.
CoRR, 2020

Entire Space Multi-Task Modeling via Post-Click Behavior Decomposition for Conversion Rate Prediction.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Auto Learning Attention.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Nighttime Dehazing with a Synthetic Benchmark.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

ANAS: Sentence Similarity Calculation Based on Automatic Neural Architecture Search.
Proceedings of the Intelligence Science III: 4th IFIP TC 12 International Conference, 2020

Pan-Sharpening Based On Parallel Pyramid Convolutional Neural Network.
Proceedings of the IEEE International Conference on Image Processing, 2020

Deep Degradation Prior for Low-Quality Image Classification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Grapy-ML: Graph Pyramid Mutual Learning for Cross-Dataset Human Parsing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Adaptive anchor box mechanism to improve the accuracy in the object detection system.
Multim. Tools Appl., 2019

Progressive LiDAR adaptation for road detection.
IEEE CAA J. Autom. Sinica, 2019

Reconstruction for block-based compressive sensing of image with reweighted double sparse constraint.
EURASIP J. Image Video Process., 2019

Human Keypoint Detection by Progressive Context Refinement.
CoRR, 2019

Conversion Rate Prediction via Post-Click Behaviour Modeling.
CoRR, 2019

Fine-grained ECG Classification Based on Deep CNN and Online Decision Fusion.
CoRR, 2019

Scale and Gradient Aware Image Smoothing.
IEEE Access, 2019

Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Progressive Retinex: Mutually Reinforced Illumination-Noise Perception Network for Low-Light Image Enhancement.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Deep Multiple-Attribute-Perceived Network for Real-World Texture Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

MirrorGAN: Learning Text-To-Image Generation by Redescription.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Multi-Level Deep Cascade Trees for Conversion Rate Prediction in Recommendation System.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Multi-Level Deep Cascade Trees for Conversion Rate Prediction.
CoRR, 2018

Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images.
CoRR, 2018

Particle Swarm Programming-Based Interactive Content-Based Image Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2018, 2018

Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Modelling Spatial Correlations by Using Deep CNN and LSTM for Texture Image Classification.
Proceedings of the 27th IEEE International Symposium on Industrial Electronics, 2018

Fault Diagnosis for Rotating Machinery with Scarce Labeled Samples: A Deep CNN Method Based on Knowledge-Transferring from Shallow Models.
Proceedings of the 2018 International Conference on Control, 2018

An Online-Updating Deep CNN Method Based on Kalman Filter for Illumination-Drifting Road Damage Classification.
Proceedings of the 2018 International Conference on Control, 2018

Radar Emitter Identification Based on Deep Convolutional Neural Network.
Proceedings of the 2018 International Conference on Control, 2018

2017
Image guided depth enhancement via deep fusion and local linear regularizaron.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

A deep CNN method for underwater image enhancement.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
A Unified Scheme for Super-Resolution and Depth Estimation From Asymmetric Stereoscopic Video.
IEEE Trans. Circuits Syst. Video Technol., 2016

Salient object detection and classification for stereoscopic images.
Multim. Tools Appl., 2016

Automatic tag saliency ranking for stereo images.
Neurocomputing, 2016

Nighttime Haze Removal with Illumination Correction.
CoRR, 2016

Fast response aggregation for depth estimation using light field camera.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Depth map super-resolution using stereo-vision-assisted model.
Neurocomputing, 2015

Simultaneously Retargeting and Super-Resolution for Stereoscopic Video.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015

2014
A new closed loop method of super-resolution for multi-view images.
Mach. Vis. Appl., 2014

Depth map super-resolution via local and nonlocal priors.
J. Electronic Imaging, 2014

A novel segmentation based video-denoising method with noise level estimation.
Inf. Sci., 2014

Background segmentation of dynamic scenes based on dual model.
IET Comput. Vis., 2014

Underwater stereo image enhancement using a new physical model.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Nighttime haze removal based on a new imaging model.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

A new image filtering method: Nonlocal image guided averaging.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A Novel Segmentation-Based Video Denoising Method with Noise Level Estimation.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

A New Closed Loop Method of Super-Resolution for Multi-view Images.
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013

A simultaneous method for 3D video super-resolution and high-quality depth estimation.
Proceedings of the IEEE International Conference on Image Processing, 2013


  Loading...