Xiatian Zhu

Orcid: 0000-0002-9284-2955

According to our database1, Xiatian Zhu authored at least 188 papers between 2012 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Multi-Scale Convolutional Neural Networks optimized by elite strategy dung beetle optimization algorithm for encrypted traffic classification.
Expert Syst. Appl., 2025

2024
Vision Transformers: From Semantic Segmentation to Dense Prediction.
Int. J. Comput. Vis., December, 2024

Illumination Distribution-Aware Thermal Pedestrian Detection.
IEEE Trans. Intell. Transp. Syst., November, 2024

Softmax-Free Linear Transformers.
Int. J. Comput. Vis., August, 2024

Compressed-SDR to HDR Video Reconstruction.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Semi-Supervised and Unsupervised Deep Visual Learning: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Source-free domain adaptation with Class Prototype Discovery.
Pattern Recognit., January, 2024

Uncertainty-aware pseudo-label filtering for source-free unsupervised domain adaptation.
Neurocomputing, 2024

Source-Free Domain Adaptation via Target Prediction Distribution Searching.
Int. J. Comput. Vis., 2024

Shooting condition insensitive unmanned aerial vehicle object detection.
Expert Syst. Appl., 2024

A hybrid approach for Android malware detection using improved multi-scale convolutional neural networks and residual networks.
Expert Syst. Appl., 2024

Motion Forecasting in Continuous Driving.
CoRR, 2024

Rethinking Weak-to-Strong Augmentation in Source-Free Domain Adaptive Object Detection.
CoRR, 2024

Single Image, Any Face: Generalisable 3D Face Generation.
CoRR, 2024

FrozenSeg: Harmonizing Frozen Foundation Models for Open-Vocabulary Segmentation.
CoRR, 2024

MacFormer: Semantic Segmentation with Fine Object Boundaries.
CoRR, 2024

DeepInteraction++: Multi-Modality Interaction for Autonomous Driving.
CoRR, 2024

Few-Shot Medical Image Segmentation with High-Fidelity Prototypes.
CoRR, 2024

AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis.
CoRR, 2024

Gaussian Splatting with Localized Points Management.
CoRR, 2024

Proxy Denoising for Source-Free Domain Adaptation.
CoRR, 2024

Tetrahedron Splatting for 3D Generation.
CoRR, 2024

Automating the Diagnosis of Human Vision Disorders by Cross-modal 3D Generation.
CoRR, 2024

Diffusion Deepfake.
CoRR, 2024

Unsupervised Audio-Visual Segmentation with Modality Alignment.
CoRR, 2024

Unified Source-Free Domain Adaptation.
CoRR, 2024

OmniCount: Multi-label Object Counting with Semantic-Geometric Priors.
CoRR, 2024

Fast Dynamic 3D Object Generation from a Single-view Video.
CoRR, 2024

Adversarial Experts Model for Black-box Domain Adaptation.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Bayesian Detector Combination for Object Detection with Crowdsourced Annotations.
Proceedings of the Computer Vision - ECCV 2024, 2024

PartCraft: Crafting Creative Objects by Parts.
Proceedings of the Computer Vision - ECCV 2024, 2024

ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Source-Free Domain Adaptation with Frozen Multimodal Foundation Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DiffSED: Sound Event Detection with Denoising Diffusion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Adaptive Mutual Learning for Unsupervised Domain Adaptation.
IEEE Trans. Circuits Syst. Video Technol., November, 2023

Multimodal Learning With Transformers: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Neural operator search.
Pattern Recognit., April, 2023

Periodic Vibration Gaussian: Dynamic Urban Scene Reconstruction and Real-time Rendering.
CoRR, 2023

Typhoon Intensity Prediction with Vision Transformer.
CoRR, 2023

DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination.
CoRR, 2023

Adaptive-Labeling for Enhancing Remote Sensing Cloud Understanding.
CoRR, 2023

Recognize Any Regions.
CoRR, 2023

Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting.
CoRR, 2023

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection.
CoRR, 2023

Leveraging Foundation models for Unsupervised Audio-Visual Segmentation.
CoRR, 2023

MSQNet: Actor-agnostic Action Recognition with Multi-modal Query.
CoRR, 2023

PersonalTailor: Personalizing 2D Pattern Design from 3D Garment Point Clouds.
CoRR, 2023

Unsupervised Hashing via Similarity Distribution Calibration.
CoRR, 2023

Preconditioned Score-based Generative Models.
CoRR, 2023

HeadSculpt: Crafting 3D Head Avatars with Text.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Independent Feature Decomposition and Instance Alignment for Unsupervised Domain Adaptation.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Vision-Language Assisted Attribute Learning.
Proceedings of the 8th IEEE International Conference on Network Intelligence and Digital Content, 2023

Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Actor-agnostic Multi-label Action Recognition with Multi-modal Query.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Homeomorphism Alignment for Unsupervised Domain Adaptation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DeepChange: A Long-Term Person Re-Identification Benchmark with Clothes Change.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Controllable Person Image Synthesis with Pose-Constrained Latent Diffusion.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Post-Processing Temporal Action Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Generative Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Unsupervised Hashing with Similarity Distribution Calibration.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Learning hybrid ranking representation for person re-identification.
Pattern Recognit., 2022

Low-resolution human pose estimation.
Pattern Recognit., 2022

Unsupervised cross-domain person re-identification by instance and distribution alignment.
Pattern Recognit., 2022

Few-shot Website Fingerprinting attack with Meta-Bias Learning.
Pattern Recognit., 2022

Towards Uncovering the Intrinsic Data Structures for Unsupervised Domain Adaptation Using Structurally Regularized Deep Clustering.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

A Comprehensive Survey on Single-Person Pose Estimation in Social Robotics.
Int. J. Soc. Robotics, 2022

Joint Bilateral-Resolution Identity Modeling for Cross-Resolution Person Re-Identification.
Int. J. Comput. Vis., 2022

Multi-Modal Few-Shot Temporal Action Detection via Vision-Language Meta-Adaptation.
CoRR, 2022

Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders.
CoRR, 2022

Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective.
CoRR, 2022

Temporal Action Detection with Global Segmentation Mask Learning.
CoRR, 2022

Softmax-free Linear Transformers.
CoRR, 2022

PolarFormer: Multi-camera 3D Object Detection with Polar Transformers.
CoRR, 2022

ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning for Action Recognition.
CoRR, 2022

Multimodal Learning with Transformers: A Survey.
CoRR, 2022

Accelerating Score-based Generative Models for High-Resolution Image Synthesis.
CoRR, 2022

Knowledge Distillation Meets Open-Set Semi-Supervised Learning.
CoRR, 2022

End-to-End Multi-Tab Website Fingerprinting Attack: A Detection Perspective.
CoRR, 2022

Unsupervised Long-Term Person Re-Identification with Clothes Change.
CoRR, 2022

DeepInteraction: 3D Object Detection via Modality Interaction.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MetaTeacher: Coordinating Multi-Model Domain Adaptation for Medical Image Classification.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Class Discriminative Adversarial Learning for Unsupervised Domain Adaptation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Structure-Preserving Motion Estimation for Learned Video Compression.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

KUNet: Imaging Knowledge-Inspired Single HDR Image Reconstruction.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers.
Proceedings of the Computer Vision - ECCV 2022, 2022

Zero-Shot Temporal Action Detection via Vision-Language Prompting.
Proceedings of the Computer Vision - ECCV 2022, 2022

Semi-supervised Temporal Action Detection with Proposal-Free Masking.
Proceedings of the Computer Vision - ECCV 2022, 2022

Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling.
Proceedings of the Computer Vision - ECCV 2022, 2022

Learning Ego 3D Representation as Ray Tracing.
Proceedings of the Computer Vision - ECCV 2022, 2022

FashionViL: Fashion-Focused Vision-and-Language Representation Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

SOS! Self-supervised Learning over Sets of Handled Objects in Egocentric Action Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Source-Free Object Detection by Learning to Overlook Domain Style.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Global Aggregation Then Local Distribution for Scene Parsing.
IEEE Trans. Image Process., 2021

Few-Shot Website Fingerprinting Attack with Data Augmentation.
Secur. Commun. Networks, 2021

Hierarchical distillation learning for scalable person search.
Pattern Recognit., 2021

Intra-Camera Supervised Person Re-Identification.
Int. J. Comput. Vis., 2021

Multi-perspective cross-class domain adaptation for open logo detection.
Comput. Vis. Image Underst., 2021

Single Person Pose Estimation: A Survey.
CoRR, 2021

Long-term Person Re-identification: A Benchmark.
CoRR, 2021

Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization.
CoRR, 2021

Few-Shot Website Fingerprinting Attack.
CoRR, 2021

Unsupervised Noisy Tracklet Person Re-identification.
CoRR, 2021

Few-shot website fingerprinting attack.
Comput. Networks, 2021

Low-Fidelity Video Encoder Optimization for Temporal Action Localization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

SOFT: Softmax-free Transformer with Linear Complexity.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Boundary-sensitive Pre-training for Temporal Localization in Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Few-shot Action Recognition with Prototype-centered Attentive Learning.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Knowing What, Where and When to Look: Video Action modelling with Attention.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Few-Shot Temporal Action Localization with Query Adaptive Transformer.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Scalable logo detection by self co-learning.
Pattern Recognit., 2020

Face re-identification challenge: Are face recognition models good enough?
Pattern Recognit., 2020

Unsupervised Tracklet Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Scalable Person Re-Identification by Harmonious Attention.
Int. J. Comput. Vis., 2020

Egocentric Action Recognition by Video Attention and Temporal Context.
CoRR, 2020

Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention.
CoRR, 2020

Joint COCO and Mapillary Workshop at ICCV 2019 Keypoint Detection Challenge Track Technical Report: Distribution-Aware Coordinate Representation for Human Pose Estimation.
CoRR, 2020

Characteristic Regularisation for Super-Resolving Face Images.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Distribution-Aware Coordinate Representation for Human Pose Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Incremental Few-Shot Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Stochastic Classifiers for Unsupervised Domain Adaptation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Deep Semantic Clustering by Partition Confidence Maximisation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Inter-Task Association Critic for Cross-Resolution Person Re-Identification.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Tracklet Self-Supervised Learning for Unsupervised Person Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neural Graph Embedding for Neural Architecture Search.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Unsupervised Deep Learning via Affinity Diffusion.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Semi-Supervised Learning under Class Distribution Mismatch.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Imbalanced Deep Learning by Minority Class Incremental Rectification.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Universal Person Re-Identification.
CoRR, 2019

Efficient Human Pose Estimation in Hierarchical Context.
IEEE Access, 2019

Unsupervised Deep Learning by Neighbourhood Discovery.
Proceedings of the 36th International Conference on Machine Learning, 2019

Person Re-Identification by Ranking Ensemble Representations.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Intra-Camera Supervised Person Re-Identification: A New Benchmark.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Instance-Guided Context Rendering for Cross-Domain Person Re-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Person Search by Text Attribute Query As Zero-Shot Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Fast Human Pose Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Spatio-Temporal Associative Representation for Video Person Re-Identification.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Single-Label Multi-Class Image Classification by Deep Logistic Regression.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Fast Open-World Person Re-Identification.
IEEE Trans. Image Process., 2018

Person Re-Identification by Camera Correlation Aware Feature Augmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Combating the class imbalance problemin sparse representation learning.
J. Intell. Fuzzy Syst., 2018

Person Re-identification in Identity Regression Space.
Int. J. Comput. Vis., 2018

Surveillance Face Recognition Challenge.
CoRR, 2018

Scalable Deep Learning Logo Detection.
CoRR, 2018

Knowledge Distillation by On-the-Fly Native Ensemble.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Unsupervised Person Re-identification by Deep Learning Tracklet Association.
Proceedings of the Computer Vision - ECCV 2018, 2018

Person Search by Multi-Scale Matching.
Proceedings of the Computer Vision - ECCV 2018, 2018

Semi-supervised Deep Learning with Memory.
Proceedings of the Computer Vision - ECCV 2018, 2018

Vehicle Re-identification in Context.
Proceedings of the Pattern Recognition - 40th German Conference, 2018

Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Harmonious Attention Network for Person Re-Identification.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Open Logo Detection Challenge.
Proceedings of the British Machine Vision Conference 2018, 2018

Deep Association Learning for Unsupervised Video Person Re-identification.
Proceedings of the British Machine Vision Conference 2018, 2018

Self-Referenced Deep Learning.
Proceedings of the Computer Vision - ACCV 2018, 2018

Low-Resolution Face Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

Deep Low-Resolution Person Re-Identification.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Person re-identification by unsupervised video matching.
Pattern Recognit., 2017

Identity Alignment by Noisy Pixel Removal.
CoRR, 2017

Discovering visual concept structure with sparse and incomplete tags.
Artif. Intell., 2017

Deep Learning Logo Detection with Data Expansion by Synthesising Context.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Multi-task Curriculum Transfer Deep Learning of Clothing Attributes.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Person Re-Identification by Deep Joint Learning of Multi-Loss Classification.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Person Re-identification by Deep Learning Multi-scale Representations.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Attribute Recognition by Joint Recurrent Learning of Context and Correlation.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Class Rectification Hard Mining for Imbalanced Deep Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Deep Reinforcement Learning Attention Selection For Person Re-Identification.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Constrained Clustering With Imperfect Oracles.
IEEE Trans. Neural Networks Learn. Syst., 2016

Person Re-Identification by Discriminative Selection in Video Ranking.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

Learning from Multiple Sources for Video Summarisation.
Int. J. Comput. Vis., 2016

Towards unsupervised open-set person re-identification.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Human-in-the-Loop Person Re-identification.
Proceedings of the Computer Vision - ECCV 2016, 2016

Video Semantic Clustering with Sparse and Incomplete Tags.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Multi-Source Video Summarisation.
CoRR, 2015

2014
Balanced Neighborhood Classifiers for Imbalanced Data Sets.
IEICE Trans. Inf. Syst., 2014

Person Re-identification by Video Ranking.
Proceedings of the Computer Vision - ECCV 2014, 2014

Constructing Robust Affinity Graphs for Spectral Clustering.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Constrained Clustering: Effective Constraint Propagation with Imperfect Oracles.
Proceedings of the 2013 IEEE 13th International Conference on Data Mining, 2013

Video Synopsis by Heterogeneous Multi-source Correlation.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Comparing Visual Feature Coding for Learning Disjoint Camera Dependencies.
Proceedings of the British Machine Vision Conference, 2012


  Loading...