Zhaoxiang Zhang

Orcid: 0000-0003-2648-3875

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, Center for Research on Intelligent Perception and Computing (CRIPAC), National Laboratory of Pattern Recognition (NLPR), Beijing, China
  • University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China
  • Chinese Academy of Sciences, Hong Kong Institute of Science & Innovation, Centre for Artificial Intelligence and Robotics, Hong Kong
  • Beihang University, School of Computer Science and Engineering, Laboratory of Intelligent Recognition and Image Processing, Beijing, China (2009 - 2015)
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China (PhD 2009)


According to our database1, Zhaoxiang Zhang authored at least 338 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A Curriculum-Style Self-Training Approach for Source-Free Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Enhancing Sound Source Localization via False Negative Elimination.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Large-Scale Object Detection in the Wild With Imbalanced Data Distribution, and Multi-Labels.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Visually Guided Sound Source Separation With Audio-Visual Predictive Coding.
IEEE Trans. Neural Networks Learn. Syst., November, 2024

Fully Sparse Fusion for 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2024

GRAMO: geometric resampling augmentation for monocular 3D object detection.
Frontiers Comput. Sci., October, 2024

Reusable Architecture Growth for Continual Stereo Matching.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Depth-Guided Vision Transformer With Normalizing Flows for Monocular 3D Object Detection.
IEEE CAA J. Autom. Sinica, March, 2024

MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection.
IEEE Trans. Image Process., 2024

Joint optimization of preventive maintenance and triggering mechanism for k-out-of-n: F systems with protective devices based on periodic inspection.
Reliab. Eng. Syst. Saf., 2024

Large-scale continual learning for ancient Chinese character recognition.
Pattern Recognit., 2024

Learnable Graph Matching: A Practical Paradigm for Data Association.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization.
CoRR, 2024

CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes.
CoRR, 2024

OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction.
CoRR, 2024

FreeVS: Generative View Synthesis on Free Driving Trajectory.
CoRR, 2024

MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models.
CoRR, 2024

DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model.
CoRR, 2024

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models.
CoRR, 2024

SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality.
CoRR, 2024

CityX: Controllable Procedural Content Generation for Unbounded 3D Cities.
CoRR, 2024

Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection.
CoRR, 2024

Enhancing End-to-End Autonomous Driving with Latent World Model.
CoRR, 2024

Trim 3D Gaussian Splatting for Accurate Geometry Representation.
CoRR, 2024

Towards Flexible Interactive Reflection Removal with Human Guidance.
CoRR, 2024

Learning Object-Centric Representation via Reverse Hierarchy Guidance.
CoRR, 2024

A Survey on Personalized Content Synthesis with Diffusion Models.
CoRR, 2024

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond.
CoRR, 2024

Robust Depth Enhancement via Polarization Prompt Fusion Tuning.
CoRR, 2024

CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians.
CoRR, 2024

SceneX: Procedural Controllable Large-scale Scene Generation via Large-language Models.
CoRR, 2024

Continual Forgetting for Pre-trained Vision Models.
CoRR, 2024

Open-world Machine Learning: A Review and New Outlooks.
CoRR, 2024

DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer.
CoRR, 2024

Semantic Anything in 3D Gaussians.
CoRR, 2024

Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering.
CoRR, 2024

FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes.
CoRR, 2024

Generative Active Learning for Image Synthesis Personalization.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

General Geometry-Aware Weakly Supervised 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

CSOT: Cross-scan Object Transfer for Semi-Supervised LiDAR Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

Monocular Occupancy Prediction for Scalable Indoor Scenes.
Proceedings of the Computer Vision - ECCV 2024, 2024

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers.
Proceedings of the Computer Vision - ECCV 2024, 2024

CityGaussian: Real-Time High-Quality Large-Scale Scene Rendering with Gaussians.
Proceedings of the Computer Vision - ECCV 2024, 2024

Point-Supervised Panoptic Segmentation via Estimating Pseudo Labels from Learnable Distance.
Proceedings of the Computer Vision - ECCV 2024, 2024

Expanding Scene Graph Boundaries: Fully Open-Vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention.
Proceedings of the Computer Vision - ECCV 2024, 2024

RCL: Reliable Continual Learning for Unified Failure Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing Visual Continual Learning with Language-Guided Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

HardMo: A Large-Scale Hardcase Dataset for Motion Capture.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

MemoNav: Working Memory Model for Visual Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual Forgetting for Pre-Trained Vision Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Driving Into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Compositional Inversion for Stable Diffusion Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Fully Data-Driven Pseudo Label Estimation for Pointly-Supervised Panoptic Segmentation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
MMT: Cross Domain Few-Shot Learning via Meta-Memory Transfer.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Weakly Supervised Semantic Segmentation via Box-Driven Masking and Filling Rate Shifting.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2023

Toward Practical Weakly Supervised Semantic Segmentation via Point-Level Supervision.
Int. J. Comput. Vis., December, 2023

Object Affinity Learning: Towards Annotation-Free Instance Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

GAIA-Universe: Everything is Super-Netify.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Super Sparse 3D Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Extracting Semantic Knowledge From GANs With Unsupervised Learning.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Memory-Based Cross-Image Contexts for Weakly Supervised Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Adversarial Learning Guided Task Relatedness Refinement for Multi-Task Deep Learning.
IEEE Trans. Multim., 2023

Coarse Mask Guided Interactive Object Segmentation.
IEEE Trans. Image Process., 2023

Learning to Adapt Across Dual Discrepancy for Cross-Domain Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2023

Bootstrap Masked Visual Modeling via Hard Patches Mining.
CoRR, 2023

GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives.
CoRR, 2023

Visual Commonsense based Heterogeneous Graph Contrastive Learning.
CoRR, 2023

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models.
CoRR, 2023

FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels.
CoRR, 2023

DiffusePast: Diffusion-based Generative Replay for Class Incremental Semantic Segmentation.
CoRR, 2023

2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction.
CoRR, 2023

Tracking Objects with 3D Representation from Videos.
CoRR, 2023

Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation.
CoRR, 2023

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory.
CoRR, 2023

Pulling Target to Source: A New Perspective on Domain Adaptive Semantic Segmentation.
CoRR, 2023

Real-Aug: Realistic Scene Synthesis for LiDAR Augmentation in 3D Object Detection.
CoRR, 2023

Fully Sparse Fusion for 3D Object Detection.
CoRR, 2023

A Survey of Deep Visual Cross-Domain Few-Shot Learning.
CoRR, 2023

LMR: A Large-Scale Multi-Reference Dataset for Reference-based Super-Resolution.
CoRR, 2023

Fairly Adaptive Negative Sampling for Recommendations.
Proceedings of the ACM Web Conference 2023, 2023

DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Echoes Beyond Points: Unleashing the Power of Raw Radar Data in Multi-modality Fusion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LMR: A Large-Scale Multi-Reference Dataset for Reference-based Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SSF: Accelerating Training of Spiking Neural Networks with Stabilized Spiking Flow.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Informative Data Mining for One-shot Cross-Domain Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Once Detected, Never Lost: Surpassing Human Performance in Offline LiDAR based 3D Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Graphics Capsule: Learning Hierarchical 3D Face Representations from 2D Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Sharpness-Aware Gradient Matching for Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hard Patches Mining for Masked Image Modeling.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Intrinsic Physical Concepts Discovery with Object-Centric Predictive Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

BAEFormer: Bi-Directional and Early Interaction Transformers for Bird's Eye View Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Blind Video Deflickering by Neural Filtering with a Flawed Atlas.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D Video Object Detection with Learnable Object-Centric Global Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Robust Feature Rectification of Pretrained Vision Models for Object Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Alleviating Modality Bias Training for Infrared-Visible Person Re-Identification.
IEEE Trans. Multim., 2022

Multimodal channel-wise attention transformer inspired by multisensory integration mechanisms of the brain.
Pattern Recognit., 2022

Context-aware co-supervision for accurate object detection.
Pattern Recognit., 2022

MonoPoly: A practical monocular 3D object detector.
Pattern Recognit., 2022

Enhanced task attention with adversarial learning for dynamic multi-task CNN.
Pattern Recognit., 2022

Identifying the key frames: An attention-aware sampling method for action recognition.
Pattern Recognit., 2022

Delving into the Effectiveness of Receptive Fields: Learning Scale-Transferrable Architectures for Practical Object Detection.
Int. J. Comput. Vis., 2022

From Individual to Whole: Reducing Intra-class Variance by Feature Aggregation.
Int. J. Comput. Vis., 2022

Improving Image Segmentation with Boundary Patch Refinement.
Int. J. Comput. Vis., 2022

Editorial for Special Issue on Brain-inspired Machine Learning.
Int. J. Autom. Comput., 2022

Toward few-shot domain adaptation with perturbation-invariant representation and transferable prototypes.
Frontiers Comput. Sci., 2022

MemoNav: Selecting Informative Memories for Visual Navigation.
CoRR, 2022

Pro-tuning: Unified Prompt Tuning for Vision Tasks.
CoRR, 2022

Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes.
CoRR, 2022

Emergence of Machine Language: Towards Symbolic Intelligence with Neural Networks.
CoRR, 2022

4D Unsupervised Object Discovery.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Fully Sparse 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Interact with Open Scenes: A Life-long Evolution Framework for Interactive Segmentation Models.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Object Dynamics Distillation for Scene Decomposition and Representation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Stereo Depth Estimation with Echoes.
Proceedings of the Computer Vision - ECCV 2022, 2022

RRSR: Reciprocal Reference-Based Image Super-Resolution with Progressive Feature Alignment and Selection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Densely Constrained Depth Estimator for Monocular 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Pointly-Supervised Panoptic Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

The Devil Is in the Details: Window-based Attention for Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Implicit Sample Extension for Unsupervised Person Re-Identification.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Remember the Difference: Cross-Domain Few-Shot Semantic Segmentation via Meta-Memory Transfer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards Noiseless Object Contours for Weakly Supervised Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Embracing Single Stride 3D Object Detector with Sparse Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Sparse Instance Activation for Real-Time Instance Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

DATA: Domain-Aware and Task-Aware Self-supervised Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Deconfounding Physical Dynamics with Global Causal Relation and Confounder Transmission for Counterfactual Prediction.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Image Inpainting by End-to-End Cascaded Refinement With Mask Awareness.
IEEE Trans. Image Process., 2021

Attention Guided Multiple Source and Target Domain Adaptation.
IEEE Trans. Image Process., 2021

Multi-Domain Image-to-Image Translation via a Unified Circular Framework.
IEEE Trans. Image Process., 2021

Joint Multisource Saliency and Exemplar Mechanism for Weakly Supervised Video Object Segmentation.
IEEE Trans. Image Process., 2021

Unsupervised Domain Adaptation with Background Shift Mitigating for Person Re-Identification.
Int. J. Comput. Vis., 2021

Biologically inspired visual computing: the state of the art.
Frontiers Comput. Sci., 2021

Immortal Tracker: Tracklet Never Dies.
CoRR, 2021

Give Me Your Trained Model: Domain Adaptive Semantic Segmentation without Source Data.
CoRR, 2021

Uncertainty-aware Pseudo Label Refinery for Domain Adaptive Semantic Segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Clothing Status Awareness for Long-Term Person Re-Identification.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT Philosophy.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

RefineMask: Towards High-Quality Instance Segmentation With Fine-Grained Features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Unsupervised Object Detection With LIDAR Clues.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Look Closer To Segment Better: Boundary Patch Refinement for Instance Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Learnable Graph Matching: Incorporating Graph Partitioning With Deep Feature Learning for Multiple Object Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

GAIA: A Transfer Learning System of Object Detection That Fits Your Needs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Beyond Scalar Neuron: Adopting Vector-Neuron Capsules for Long-Term Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., 2020

SARPNET: Shape attention regional proposal network for liDAR-based 3D object detection.
Neurocomputing, 2020

Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates.
CoRR, 2020

Manual-Label Free 3D Detection via An Open-Source Simulator.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Attentive Part-aware Networks for Partial Person Re- identification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Generalizing Person Re-Identification by Camera-Aware Invariance Learning and Cross-Domain Mixup.
Proceedings of the Computer Vision - ECCV 2020, 2020

Employing Multi-estimations for Weakly-Supervised Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Boosting Decision-Based Black-Box Adversarial Attacks with Random Sign Flip.
Proceedings of the Computer Vision - ECCV 2020, 2020

Context-Aware Attention Network for Image-Text Retrieval.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Large-Scale Object Detection in the Wild From Imbalanced Multi-Labels.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Integral Objects With Intra-Class Discriminator for Weakly-Supervised Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Bi-Directional Interaction Network for Person Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Instance Guided Proposal Network for Person Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Cascading Convolutional Color Constancy.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Guest Editorial: Special Issue on Machine Learning Implementations.
J. Signal Process. Syst., 2019

Dynamic Collaborative Tracking.
IEEE Trans. Neural Networks Learn. Syst., 2019

Multi-Pseudo Regularized Label for Generated Data in Person Re-Identification.
IEEE Trans. Image Process., 2019

Deep Learning for Pattern Recognition.
Pattern Recognit. Lett., 2019

Generative adversarial dehaze mapping nets.
Pattern Recognit. Lett., 2019

Image Caption Generation with Part of Speech Guidance.
Pattern Recognit. Lett., 2019

Semi-supervised domain adaptation via Fredholm integral based kernel methods.
Pattern Recognit., 2019

Spatiotemporal distilled dense-connectivity network for video action recognition.
Pattern Recognit., 2019

SimpleDet: A Simple and Versatile Distributed Framework for Object Detection and Instance Recognition.
J. Mach. Learn. Res., 2019

Efficient Neural Architecture Transformation Searchin Channel-Level for Object Detection.
CoRR, 2019

Revisiting Feature Alignment for One-stage Object Detection.
CoRR, 2019

Uncertainty-optimized deep learning model for small-scale person re-identification.
Sci. China Inf. Sci., 2019

Air Quality Measurement Based on Double-Channel Convolutional Neural Network Ensemble Learning.
IEEE Access, 2019

SAS: Painting Detection and Recognition via Smart Art System With Mobile Devices.
IEEE Access, 2019

Efficient Neural Architecture Transformation Search in Channel-Level for Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Relational Network for Skeleton-Based Action Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

CASIA-AHCDB: A Large-Scale Chinese Ancient Handwritten Characters Database.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Sequence Level Semantics Aggregation for Video Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

POD: Practical Object Detection With Scale-Sensitive Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Spectral Feature Transformation for Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Scale-Aware Trident Networks for Object Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Human-Like Delicate Region Erasing Strategy for Weakly Supervised Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Attention-Aware Sampling via Deep Reinforcement Learning for Action Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
GII Representation-Based Cross-View Gait Recognition by Discriminative Projection With List-Wise Constraints.
IEEE Trans. Cybern., 2018

Efficient auto-refocusing for light field camera.
Pattern Recognit., 2018

Improving context-sensitive similarity via smooth neighborhood for object retrieval.
Pattern Recognit., 2018

On the role of sparsity in feature selection and an innovative method LRMI.
Neurocomputing, 2018

CIAN: Cross-Image Affinity Net for Weakly Supervised Semantic Segmentation.
CoRR, 2018

Accelerating Deep Neural Networks with Spatial Bottleneck Modules.
CoRR, 2018

Skeleton-Based Relational Modeling for Action Recognition.
CoRR, 2018

Multi-pseudo Regularized Label for Generated Samples in Person Re-Identification.
CoRR, 2018

Special Issue of BICS 2016.
Cogn. Comput., 2018

Hierarchical Convolutional Neural Networks for EEG-Based Emotion Recognition.
Cogn. Comput., 2018

End-to-End Lifelong Learning: a Framework to Achieve Plasticities of both the Feature and Classifier Constructions.
Cogn. Comput., 2018

A Primal Neural Network for Online Equality-Constrained Quadratic Programming.
Cogn. Comput., 2018

Weakly-Supervised Object Localization by Cutting Background with Deep Reinforcement Learning.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

View Decomposition and Adversarial for Semantic Segmentation.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Conditional Expression Synthesis with Face Parsing Transformation.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Deep Convolutional Neural Networks with Merge-and-Run Mappings.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Multi-task Layout Analysis for Historical Handwritten Documents Using Fully Convolutional Networks.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Accelerating the Classification of Very Deep Convolutional Network by A Cascading Approach.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Rethinking ReLU to Train Better CNNs.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Deep Temporal Feature Encoding for Action Recognition.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Inception Donut Convolution for Top-down Semantic Segmentation.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Diversified Dual Domain-Adversarial Neural Networks.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Hard-Aware Point-to-Set Deep Metric for Person Re-identification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Integrating Both Visual and Audio Cues for Enhanced Video Caption.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

CMCGAN: A Uniform Framework for Cross-Modal Visual-Audio Mutual Generation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

DarkRank: Accelerating Deep Metric Learning via Cross Sample Similarities Transfer.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Pedestrian Counting With Back-Propagated Information and Target Drift Remedy.
IEEE Trans. Syst. Man Cybern. Syst., 2017

GIFT: Towards Scalable 3D Shape Retrieval.
IEEE Trans. Multim., 2017

Learning to Classify Fine-Grained Categories with Privileged Visual-Semantic Misalignment.
IEEE Trans. Big Data, 2017

Spectral attribute learning for visual regression.
Pattern Recognit., 2017

Local structured representation for generic object detection.
Frontiers Comput. Sci., 2017

Training Better CNNs Requires to Rethink ReLU.
CoRR, 2017

Diverse Neuron Type Selection for Convolutional Neural Networks.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Random Shifting for CNN: a Solution to Reduce Information Loss in Down-Sampling Layers.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Dynamic Multi-Task Learning with Convolutional Neural Network.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

2016
Compressive Sequential Learning for Action Similarity Labeling.
IEEE Trans. Image Process., 2016

Corrections to "Relevance Metric Learning for Person Re-Identification by Exploiting Listwise Similarities".
IEEE Trans. Image Process., 2016

Facial Age Estimation Using Robust Label Distribution.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Smooth Neighborhood Structure Mining on Multiple Affinity Graphs with Applications to Context-Sensitive Similarity.
Proceedings of the Computer Vision - ECCV 2016, 2016

GIFT: A Real-Time and Scalable 3D Shape Search Engine.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Visual Information Processing Mechanism Revealed by fMRI Data.
Proceedings of the Brain Informatics and Health - International Conference, 2016

Implementation of EEG Emotion Recognition System Based on Hierarchical Convolutional Neural Networks.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2016

Incremental PCANet: A Lifelong Learning Framework to Achieve the Plasticity of both Feature and Classifier Constructions.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2016

An Improved Recurrent Network for Online Equality-Constrained Quadratic Programming.
Proceedings of the Advances in Brain Inspired Cognitive Systems, 2016

2015
Relevance Metric Learning for Person Re-Identification by Exploiting Listwise Similarities.
IEEE Trans. Image Process., 2015

Patch-wise skin segmentation of human body parts via deep neural networks.
J. Electronic Imaging, 2015

Crowd counting in public video surveillance by label distribution learning.
Neurocomputing, 2015

Enhancing person re-identification by integrating gait biometric.
Neurocomputing, 2015

Dither modulation of significant amplitude difference for wavelet based robust watermarking.
Neurocomputing, 2015

2014
On-line signature verification based on spatio-temporal correlation.
Multim. Tools Appl., 2014

Incremental learning patch-based bag of facial words representation for face recognition in videos.
Multim. Tools Appl., 2014

Secure multimodal biometric authentication with wavelet quantization based fingerprint watermarking.
Multim. Tools Appl., 2014

Face synthesis from low-resolution near-infrared to high-resolution visual light spectrum based on tensor analysis.
Neurocomputing, 2014

Pan-sharpening based on weighted red black wavelets.
IET Image Process., 2014

Object Classification in Traffic Scene Surveillance Based on Online Semi-supervised Active Learning.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Enhanced Human Parsing with Multiple Feature Fusion and Augmented Pose Model.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Relevance Metric Learning for Person Re-identification by Exploiting Global Similarities.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Learning visual categories through a sparse representation classifier based cross-category knowledge transfer.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

A Segmentation Based Change Detection Method for High Resolution Remote Sensing Image.
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014

Pan-Sharpening Based on Improvement of Panchromatic Image to Minimize Spectral Distortion.
Proceedings of the Pattern Recognition - 6th Chinese Conference, 2014

Enhancing Human Pose Estimation with Temporal Clues.
Proceedings of the Biometric Recognition - 9th Chinese Conference, 2014

2013
Learning the Spherical Harmonic Features for 3-D Face Recognition.
IEEE Trans. Image Process., 2013

Transferring Training Instances for Convenient Cross-View Object Classification in Surveillance.
IEEE Trans. Inf. Forensics Secur., 2013

View-Invariant Discriminative Projection for Multi-View Gait-Based Human Identification.
IEEE Trans. Inf. Forensics Secur., 2013

Incremental Learning for Video-Based Gait Recognition With LBP Flow.
IEEE Trans. Cybern., 2013

Practical Camera Calibration From Moving Objects for Traffic Scene Surveillance.
IEEE Trans. Circuits Syst. Video Technol., 2013

Estimation of view angles for gait using a robust regression method.
Multim. Tools Appl., 2013

Multi-block dependency based fragile watermarking scheme for fingerprint images protection.
Multim. Tools Appl., 2013

Cross-View Gait Recognition with Short Probe Sequences: from View Transformation Model to View-Independent stance-Independent Identity Vector.
Int. J. Pattern Recognit. Artif. Intell., 2013

View independent object classification by exploring scene consistency information for traffic scene surveillance.
Neurocomputing, 2013

Pixel-wise skin colour detection based on flexible neural tree.
IET Image Process., 2013

Cross-View Gait-Based Gender Classification by Transfer Learning.
Proceedings of the Advances in Multimedia Information Processing - PCM 2013, 2013

Cross-view action recognition via transductive transfer learning.
Proceedings of the IEEE International Conference on Image Processing, 2013

Semi-supervised learning in traffic scene surveillance based on label-propagation.
Proceedings of the IEEE International Conference on Image Processing, 2013

Enhancing Person Re-identification by Robust Structural Metric Learning.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

Face Tracking and Recognition via Incremental Local Sparse Representation.
Proceedings of the Seventh International Conference on Image and Graphics, 2013

Biometric information hiding: Promoting multimedia security with content and identity authentication.
Proceedings of the 2013 IEEE China Summit and International Conference on Signal and Information Processing, 2013

2012
Combining Tensor Space Analysis and Active Appearance Models for Aging Effect Simulation on Face Images.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Three-Dimensional Deformable-Model-Based Localization and Recognition of Road Vehicles.
IEEE Trans. Image Process., 2012

Spam Short Messages Detection via Mining Social Networks.
J. Comput. Sci. Technol., 2012

Representing 3D Face from Point Cloud to Face-Aligned spherical Depth Map.
Int. J. Pattern Recognit. Artif. Intell., 2012

Maximisation of mutual information for gait-based soft biometric classification using Gabor features.
IET Biom., 2012

Automatic object classification using motion blob based local feature fusion for traffic scene surveillance.
Frontiers Comput. Sci., 2012

Tamper detection and self-recovery of biometric images using salient region-based authentication watermarking scheme.
Comput. Stand. Interfaces, 2012

Active Learning for Transferrable Object Classification in Cross-View Traffic Scene Surveillance.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Towards Independent Color Space Selection for Human Skin Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Incremental Learning of Patch-Based Bag of Facial Words Representation for Online Face Recognition in Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

A Robust Watermarking Scheme Based on Dual Quantization of Wavelet Significant Difference.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

A Local Texture-Constrained Super-Resolution Method.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

A Secure Semi-fragile Self-recoverable Watermarking Algorithm Using Group-Based Wavelet Quantization.
Proceedings of the Advances in Multimedia Information Processing - PCM 2012, 2012

Robust mobile spamming detection via graph patterns.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Enhancing cross-view object classification by feature-based transfer learning.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Enhancing biometric security with wavelet quantization watermarking based two-stage multimodal authentication.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Pan-sharpening using weighted red-black wavelet.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Locally linear embedding based example learning for pan-sharpening.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Moving Object Detection in Aerial Video.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

A Hybrid Transfer Learning Mechanism for Object Classification across View.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

Recognizing Occluded 3D Faces Using an Efficient ICP Variant.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Cross-view object classification in traffic scene surveillance based on transductive transfer learning.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Model-Based Multi-view Face Construction and Recognition in Videos.
Proceedings of the Intelligent Computing Theories and Applications, 2012

Ethnicity classification based on fusion of face and gait.
Proceedings of the 5th IAPR International Conference on Biometrics, 2012

A hierarchical framework for image-based human age estimation by weighted and OHRanked Sparse Representation-based classification.
Proceedings of the 5th IAPR International Conference on Biometrics, 2012

Ground Plane Rectification Based on Rich Line Representation of Vehicle in Surveillance.
Proceedings of the Pattern Recognition - Chinese Conference, 2012

Ethnicity Classification Based on a Hierarchical Fusion.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012

Patch-Based Bag of Features for Face Recognition in Videos.
Proceedings of the Biometric Recognition - 7th Chinese Conference, 2012

Efficient Human Parsing Based on Sketch Representation.
Proceedings of the Computer Vision - ACCV 2012, 2012

2011
Gait-Based Gender Classification Using Mixed Conditional Random Field.
IEEE Trans. Syst. Man Cybern. Part B, 2011

A novel self-recovery fragile watermarking scheme based on dual-redundant-ring structure.
Comput. Electr. Eng., 2011

Face synthesis from near-infrared to visual light spectrum using quotient image and kernel-based multifactor analysis.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Multi-view multi-stance gait identification.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

Codebook Reconstruction with Word Correlation Feedback Mechanism.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

On-line Signature Verification Using Graph Representation.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

Visual Saliency Based Aerial Video Summarization by Online Scene Classification.
Proceedings of the Sixth International Conference on Image and Graphics, 2011

On-line Signature Verification Using Segment-to-Segment Graph Matching.
Proceedings of the 2011 International Conference on Document Analysis and Recognition, 2011

Face synthesis from near-infrared to visual light via sparse representation.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011

On-line signature verification using wavelet packet.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011

A Survey of On-line Signature Verification.
Proceedings of the Biometric Recognition - 6th Chinese Conference, 2011

Video-Based Face Recognition: State of the Art.
Proceedings of the Biometric Recognition - 6th Chinese Conference, 2011

A Survey of Advances in Biometric Gait Recognition.
Proceedings of the Biometric Recognition - 6th Chinese Conference, 2011

Sparse Reconstruction Based Watermarking for Secure Biometric Authentication.
Proceedings of the Biometric Recognition - 6th Chinese Conference, 2011

View-invariant action recognition in surveillance videos.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

A real time object tracking approach for mobile robot visual servo control.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

Saliency model based head pose estimation by sparse optical flow.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

Interesting region detection in aerial video using Bayesian topic models.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

Multi-view moving objects classification via transfer learning.
Proceedings of the First Asian Conference on Pattern Recognition, 2011

2010
Unstructured Point Cloud Surface Denoising and Decimation Using Distance RBF K-Nearest Neighbor Kernel.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Protecting Biometric Templates Using Authentication Watermarking.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

3D Model Based Vehicle Tracking Using Gradient Based Fitness Evaluation under Particle Filter Framework.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Block Pyramid Based Adaptive Quantization Watermarking for Multimodal Biometric Authentication.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Combining Spatial and Temporal Information for Gait Based Gender Classification.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Automatic and robust 3D face registration using multiresolution Spherical Depth Map.
Proceedings of the International Conference on Image Processing, 2010

2009
Robust visual tracking based on simplified biologically inspired features.
Proceedings of the International Conference on Image Processing, 2009

Rapid and robust human detection and tracking based on omega-shape features.
Proceedings of the International Conference on Image Processing, 2009

Pyramidal Statistics of Oriented Filtering for robust pedestrian detection.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

2008
3D model based vehicle localization by optimizing local gradient based fitness evaluation.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Boosting local feature descriptors for automatic objects classification in traffic scene surveillance.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Estimating the number of people in crowded scenes by MID based foreground segmentation and head-shoulder detection.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Robust automated ground plane rectification based on moving vehicles for traffic scene surveillance.
Proceedings of the International Conference on Image Processing, 2008

Practical camera auto-calibration based on object appearance and motion for traffic scene visual surveillance.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Real-Time Moving Object Classification with Automatic Scene Division.
Proceedings of the International Conference on Image Processing, 2007

EDA Approach for Model Based Localization and Recognition of Vehicles.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007


  Loading...