Xiao Wang

Orcid: 0000-0001-6117-6745

Affiliations:
  • Peng Cheng Laboratory, Shenzhen, China
  • Anhui University, School of Computer Science, Hefei, China (PhD 2019)


According to our database1, Xiao Wang authored at least 94 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Temporal adaptive bidirectional bridging for RGB-D tracking.
Pattern Recognit., 2025

Semantic-aware frame-event fusion based pattern recognition via large vision-language models.
Pattern Recognit., 2025

2024
Learning Graph Attentions via Replicator Dynamics.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

RGBT Tracking via Progressive Fusion Transformer With Dynamically Guided Learning.
IEEE Trans. Circuits Syst. Video Technol., September, 2024

MutualFormer: Multi-modal Representation Learning via Cross-Diffusion Attention.
Int. J. Comput. Vis., September, 2024

Tiny Object Tracking: A Large-Scale Dataset and a Baseline.
IEEE Trans. Neural Networks Learn. Syst., August, 2024

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows.
IEEE Trans. Cybern., March, 2024

Prompt-Based Learning for Unpaired Image Captioning.
IEEE Trans. Multim., 2024

Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer Based Approach.
IEEE Trans. Multim., 2024

AMatFormer: Efficient Feature Matching via Anchor Matching Transformer.
IEEE Trans. Multim., 2024

HeGraphAdapter: Tuning Multi-Modal Vision-Language Models with Heterogeneous Graph Adapter.
CoRR, 2024

CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset.
CoRR, 2024

VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models.
CoRR, 2024

Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New Algorithm.
CoRR, 2024

MambaEVT: Event Stream based Visual Object Tracking using State Space Model.
CoRR, 2024

Event Stream based Human Action Recognition: A High-Definition Benchmark Dataset and Algorithms.
CoRR, 2024

R2GenCSR: Retrieving Context Samples for Large Language Model based X-ray Medical Report Generation.
CoRR, 2024

Treat Stillness with Movement: Remote Sensing Change Detection via Coarse-grained Temporal Foregrounds Mining.
CoRR, 2024

An Empirical Study of Mamba-based Pedestrian Attribute Recognition.
CoRR, 2024

Retain, Blend, and Exchange: A Quality-aware Spatial-Stereo Fusion Approach for Event Stream Recognition.
CoRR, 2024

Aligning Large Language Models from Self-Reference AI Feedback with one General Principle.
CoRR, 2024

Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition.
CoRR, 2024

Pre-training on High Definition X-ray Images: An Experimental Study.
CoRR, 2024

State Space Model for New-Generation Network Alternative to Transformers: A Survey.
CoRR, 2024

Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline.
CoRR, 2024

Two Heads Are Better Than One: Integrating Knowledge from Knowledge Graphs and Large Language Models for Entity Alignment.
CoRR, 2024

Uncertainty-aware Bridge based Mobile-Former Network for Event-based Pattern Recognition.
CoRR, 2024

CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event Cameras.
CoRR, 2024

Mamba-FETrack: Frame-Event Tracking via State Space Model.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

A Reinforced Passage Interactive Retrieval Framework Incorporating Implicit Knowledge for KB-VQA.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction.
Proceedings of the Computer Vision - ECCV 2024, 2024

Event Stream-Based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Finding Visual Saliency in Continuous Spike Stream.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Structural Information Guided Multimodal Pre-training for Vehicle-Centric Perception.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Deep Triply Attention Network for RGBT Tracking.
Cogn. Comput., November, 2023

Learning Spatial-Frequency Transformer for Visual Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey.
Mach. Intell. Res., August, 2023

Transformer vision-language tracking via proxy token guided cross-modal fusion.
Pattern Recognit. Lett., April, 2023

Parallel Learning: Overview and Perspective for Computational Learning Across Syn2Real and Sim2Real.
IEEE CAA J. Autom. Sinica, March, 2023

Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition.
IEEE Trans. Multim., 2023

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking.
IEEE Trans. Multim., 2023

VcT: Visual Change Transformer for Remote Sensing Image Change Detection.
IEEE Trans. Geosci. Remote. Sens., 2023

Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition.
CoRR, 2023

Pedestrian Attribute Recognition via CLIP based Prompt Vision-Language Fusion.
CoRR, 2023

SequencePAR: Understanding Pedestrian Attributes via A Sequence Generation Paradigm.
CoRR, 2023

SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition.
CoRR, 2023

Point-Voxel Absorbing Graph Representation Learning for Event Stream based Recognition.
CoRR, 2023

ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation.
CoRR, 2023

Learning Invariant Molecular Representation in Latent Discrete Space.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Aligning Contrastive Clusters for Cross-Network Node Classification.
Proceedings of the IEEE International Conference on Data Mining, 2023

Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Tracking by Joint Local and Global Search: A Target-Aware Attention-Based Approach.
IEEE Trans. Neural Networks Learn. Syst., 2022

Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-Based Beam Search.
IEEE Trans. Image Process., 2022

Large-Scale Spatio-Temporal Person Re-Identification: Algorithms and Benchmark.
IEEE Trans. Circuits Syst. Video Technol., 2022

Criteria Comparative Learning for Real-Scene Image Super-Resolution.
IEEE Trans. Circuits Syst. Video Technol., 2022

Pedestrian attribute recognition: A survey.
Pattern Recognit., 2022

Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric.
CoRR, 2022

Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer based Approach.
CoRR, 2022

Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search.
CoRR, 2022

See Finer, See More: Implicit Modality Alignment for Text-Based Person Retrieval.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Event-based Video Reconstruction via Potential-assisted Spiking Neural Network.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Retinomorphic Object Detection in Asynchronous Visual Streams.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
cmSalGAN: RGB-D Salient Object Detection With Cross-View Generative Adversarial Networks.
IEEE Trans. Multim., 2021

Dynamic Attention Guided Multi-Trajectory Analysis for Single Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2021

Semantic-Guided Pixel Sampling for Cloth-Changing Person Re-Identification.
IEEE Signal Process. Lett., 2021

RGBT tracking via cross-modality message passing.
Neurocomputing, 2021

MutualFormer: Multi-Modality Representation Learning via Mutual Transformer.
CoRR, 2021

Large-Scale Spatio-Temporal Person Re-identification: Algorithm and Benchmark.
CoRR, 2021

Learn to Match: Automatic Matching Network Design for Visual Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

NeuSpike-Net: High Speed Video Reconstruction via Bio-inspired Neuromorphic Cameras.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Towards More Flexible and Accurate Object Tracking With Natural Language: Algorithms and Benchmark.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Multi-modal foreground detection via inter- and intra-modality-consistent low-rank separation.
Neurocomputing, 2020

SaADB: A Self-attention Guided ADB Network for Person Re-identification.
CoRR, 2020

3R: Word and Phoneme Edition based Data Augmentation for Lexical Punctuation Prediction.
Proceedings of the 16th International Conference on Computational Intelligence and Security, 2020

2019
Quality-aware dual-modal saliency detection via deep reinforcement learning.
Signal Process. Image Commun., 2019

FMT: fusing multi-task convolutional neural network for person search.
Multim. Tools Appl., 2019

cmSalGAN: RGB-D Salient Object Detection with Cross-View Generative Adversarial Networks.
CoRR, 2019

Improved Hard Example Mining by Discovering Attribute-based Hard Person Identity.
CoRR, 2019

Pedestrian Attribute Recognition: A Survey.
CoRR, 2019

A Novel Method for Thermal Image Based Electrical-Equipment Detection.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Dense Feature Aggregation and Pruning for RGBT Tracking.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Learning Target-Oriented Dual Attention for Robust RGB-T Tracking.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Learning Target-aware Attention for Robust Tracking with Conditional Adversarial Network.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Deep Co-Space: Sample Mining Across Feature Transformation for Semi-Supervised Learning.
IEEE Trans. Circuits Syst. Video Technol., 2018

Moving object detection via robust background modeling with recurring patterns voting.
Multim. Tools Appl., 2018

Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning.
CoRR, 2018

Describe and Attend to Track: Learning Natural Language guided Structural Representation and Visual Attention for Object Tracking.
CoRR, 2018

SINT++: Robust Visual Tracking via Adversarial Positive Instance Generation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Noise-Aware Correlation Filter for Visual Tracking.
Proceedings of the Big Data - 6th CCF Conference, 2018

2017
Grayscale-Thermal Object Tracking via Multitask Laplacian Sparse Representation.
IEEE Trans. Syst. Man Cybern. Syst., 2017

Weighted Low-Rank Decomposition for Robust Grayscale-Thermal Foreground Detection.
IEEE Trans. Circuits Syst. Video Technol., 2017

End-to-End View-Aware Vehicle Classification via Progressive CNN Learning.
Proceedings of the Computer Vision - Second CCF Chinese Conference, 2017


  Loading...