Zitong Yu

Orcid: 0000-0003-0422-6616

According to our database, Zitong Yu authored at least 132 papers between 2014 and 2025.

Bibliography

2025
Distilled transformers with locally enhanced global representations for face forgery detection.
Pattern Recognit., 2025

2024
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation.
ACM Trans. Multim. Comput. Commun. Appl., November, 2024

Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing.
Int. J. Comput. Vis., November, 2024

CG-FAS: Cross-label Generative Augmentation for Face Anti-Spoofing.
Int. J. Comput. Vis., November, 2024

3sG: Three-stage guidance for indoor human action recognition.
IET Image Process., June, 2024

Exploiting Multi-Scale Parallel Self-Attention and Local Variation via Dual-Branch Transformer-CNN Structure for Face Super-Resolution.
IEEE Trans. Multim., 2024

rPPG-MAE: Self-Supervised Pretraining With Masked Autoencoders for Remote Physiological Measurements.
IEEE Trans. Multim., 2024

Rethinking Few-Shot Class-Incremental Learning With Open-Set Hypothesis in Hyperbolic Geometry.
IEEE Trans. Multim., 2024

GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning.
IEEE Trans. Inf. Forensics Secur., 2024

Category-Conditional Gradient Alignment for Domain Adaptive Face Anti-Spoofing.
IEEE Trans. Inf. Forensics Secur., 2024

S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing With Statistical Tokens.
IEEE Trans. Inf. Forensics Secur., 2024

Benchmarking Joint Face Spoofing and Forgery Detection With Visual and Physiological Cues.
IEEE Trans. Dependable Secur. Comput., 2024

Fine-Grained Temporal-Enhanced Transformer for Dynamic Facial Expression Recognition.
IEEE Signal Process. Lett., 2024

Pose-Promote: Progressive Visual Perception for Activities of Daily Living.
IEEE Signal Process. Lett., 2024

Discovering attention-guided cross-modality correlation for visible-infrared person re-identification.
Pattern Recognit., 2024

Exposing image splicing traces in scientific publications via uncertainty-guided refinement.
Patterns, 2024

Face anti-spoofing with cross-stage relation enhancement and spoof material perception.
Neural Networks, 2024

BIG-MoE: Bypass Isolated Gating MoE for Generalized Multimodal Face Anti-Spoofing.
CoRR, 2024

EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities.
CoRR, 2024

CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing.
CoRR, 2024

PGD-Imp: Rethinking and Unleashing Potential of Classic PGD with Dual Strategies for Imperceptible Adversarial Attacks.
CoRR, 2024

scFusionTTT: Single-cell transcriptomics and proteomics fusion with Test-Time Training layers.
CoRR, 2024

SFDA-rPPG: Source-Free Domain Adaptive Remote Physiological Measurement with Spatio-Temporal Consistency.
CoRR, 2024

PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba.
CoRR, 2024

MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection.
CoRR, 2024

Towards Data-Centric Face Anti-Spoofing: Improving Cross-domain Generalization via Physics-based Data Synthesis.
CoRR, 2024

TRRG: Towards Truthful Radiology Report Generation With Cross-modal Disease Clue Enhanced Large Language Model.
CoRR, 2024

EMO-LLaMA: Enhancing Facial Emotion Understanding with Instruction Tuning.
CoRR, 2024

DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News Detection.
CoRR, 2024

G²V²former: Graph Guided Video Vision Transformer for Face Anti-Spoofing.
CoRR, 2024

Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter.
CoRR, 2024

Difflare: Removing Image Lens Flare with Latent Diffusion Model.
CoRR, 2024

GM-DF: Generalized Multi-Scenario Deepfake Detection.
CoRR, 2024

Identity-free Artificial Emotional Intelligence via Micro-Gesture Understanding.
CoRR, 2024

Benchmarking Cross-Domain Audio-Visual Deception Detection.
CoRR, 2024

CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization.
CoRR, 2024

FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba.
CoRR, 2024

Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations.
CoRR, 2024

Answering Diverse Questions via Text Attached with Key Audio-Visual Clues.
CoRR, 2024

A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection.
CoRR, 2024

SHIELD: An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models.
CoRR, 2024

GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning.
CoRR, 2024

Facial Physiological and Emotional Analysis.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024

HideMIA: Hidden Wavelet Mining for Privacy-Enhancing Medical Image Analysis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Multi-Modal Document Presentation Attack Detection with Forensics Trace Disentanglement.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

MCA-Net: A Lightweight Multi-order Context Aggregation Network for Low Dose CT Denoising.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024

DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models.
Proceedings of the IEEE International Joint Conference on Biometrics, 2024

Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter.
Proceedings of the IEEE International Joint Conference on Biometrics, 2024

Flexible-Modal Deception Detection with Audio-Visual Adapter.
Proceedings of the IEEE International Joint Conference on Biometrics, 2024

AUFormer: Vision Transformers Are Parameter-Efficient Facial Action Unit Detectors.
Proceedings of the Computer Vision - ECCV 2024, 2024

CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios.
Proceedings of the Computer Vision - ECCV 2024, 2024

MTaDCS: Moving Trace and Feature Density-Based Confidence Sample Selection Under Label Noise.
Proceedings of the Computer Vision - ECCV 2024, 2024

DiffFAS: Face Anti-spoofing via Generative Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Generalized Face Anti-Spoofing via Finer Domain Partition and Disentangling Liveness-Irrelevant Factors.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Suppress and Rebalance: Towards Generalized Multi-Modal Face Anti-Spoofing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Robust facial expression recognition with Transformer Block Enhancement Module.
Eng. Appl. Artif. Intell., November, 2023

PhysFormer++: Facial Video-Based Physiological Measurement with SlowFast Temporal Difference Transformer.
Int. J. Comput. Vis., June, 2023

Deep Learning for Face Anti-Spoofing: A Survey.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023

Consistency Regularization for Deep Face Anti-Spoofing.
IEEE Trans. Inf. Forensics Secur., 2023

FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing.
IEEE Trans. Inf. Forensics Secur., 2023

FFR-SSD: feature fusion and reconstruction single shot detector for multi-scale object detection.
Signal Image Video Process., 2023

Biomedical Image Splicing Detection using Uncertainty-Guided Refinement.
CoRR, 2023

Hyperbolic Face Anti-Spoofing.
CoRR, 2023

Multi-scale Promoted Self-adjusting Correlation Learning for Facial Action Unit Detection.
CoRR, 2023

Visual Prompt Flexible-Modal Face Anti-Spoofing.
CoRR, 2023

DEMIST: A deep-learning-based task-specific denoising approach for myocardial perfusion SPECT.
CoRR, 2023

rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for Remote Physiological Measurement.
CoRR, 2023

Rehearsal-Free Domain Continual Face Anti-Spoofing: Generalize More and Forget Less.
CoRR, 2023

Need for Objective Task-based Evaluation of Deep Learning-Based Denoising Methods: A Study in the Context of Myocardial Perfusion SPECT.
CoRR, 2023

Generalized Few-Shot Continual Learning with Contrastive Mixture of Adapters.
CoRR, 2023

Flexible-modal Deception Detection with Audio-Visual Adapter.
CoRR, 2023

A task-specific deep-learning-based denoising approach for myocardial perfusion SPECT.
Proceedings of the Medical Imaging 2023: Image Perception, 2023

Audio-Visual Deception Detection: DOLOS Dataset and Parameter-Efficient Crossmodal Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rehearsal-Free Domain Continual Face Anti-Spoofing: Generalize More and Forget Less.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Flexible-Modal Face Anti-Spoofing: A Benchmark.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Neuron Structure Modeling for Generalizable Remote Physiological Measurement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization.
Proceedings of the Biometric Recognition - 17th Chinese Conference, 2023

Learning Motion-Robust Remote Photoplethysmography through Arbitrary Resolution Videos.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Spatio-Temporal Pain Estimation Network With Measuring Pseudo Heart Rate Gain.
IEEE Trans. Multim., 2022

Contrastive Context-Aware Learning for 3D High-Fidelity Mask Face Presentation Attack Detection.
IEEE Trans. Inf. Forensics Secur., 2022

Self-supervised 2D face presentation attack detection via temporal sequence sampling.
Pattern Recognit. Lett., 2022

Adversarial learning and decomposition-based domain generalization for face anti-spoofing.
Pattern Recognit. Lett., 2022

Meta-Teacher For Face Anti-Spoofing.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Deep mutual attention network for acoustic scene classification.
Digit. Signal Process., 2022

Face Presentation Attack Detection.
CoRR, 2022

Boosting Binary Neural Networks via Dynamic Thresholds Learning.
CoRR, 2022

Forensicability Assessment of Questioned Images in Recapturing Detection.
CoRR, 2022

Rethinking Few-Shot Class-Incremental Learning with Open-Set Hypothesis in Hyperbolic Geometry.
CoRR, 2022

Flexible-Modal Face Anti-Spoofing: A Benchmark.
CoRR, 2022

Detection of Molecules Based on Enhanced Backscattering Effect in Microsphere Lens.
Proceedings of the 17th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2022

Investigating the limited performance of a deep-learning-based SPECT denoising approach: an observer-study-based characterization.
Proceedings of the Medical Imaging 2022: Image Perception, 2022

Ideal-Observer Computation with Anthropomorphic Phantoms using Markov Chain Monte Carlo.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022

IDPT: Interconnected Dual Pyramid Transformer for Face Super-Resolution.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Vitranspad: Video Transformer Using Convolution And Self-Attention For Face Presentation Attack Detection.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Benchmarking 3D Face De-Identification with Preserving Facial Attributes.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Geometry-Contrastive Transformer for Generalized 3D Pose Transfer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition.
IEEE Trans. Image Process., 2021

Revisiting Pixel-Wise Supervision for Face Anti-Spoofing.
IEEE Trans. Biom. Behav. Identity Sci., 2021

Facial-Video-Based Physiological Signal Measurement: Recent advances and affective applications.
IEEE Signal Process. Mag., 2021

TransRPPG: Remote Photoplethysmography Transformer for 3D Mask Face Presentation Attack Detection.
IEEE Signal Process. Lett., 2021

NAS-FAS: Static-Dynamic Central Difference Network Search for Face Anti-Spoofing.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Review of Face Presentation Attack Detection Competitions.
CoRR, 2021

Consistency Regularization for Deep Face Anti-Spoofing.
CoRR, 2021

Fluorescence Enhancement Utilizing Dielectric Microbeads with Semi-open Microwells.
Proceedings of the 16th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2021

Dual-Cross Central Difference Network for Face Anti-Spoofing.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Non-contact Pain Recognition from Video Sequences with Remote Physiological Measurements Prediction.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

3D High-Fidelity Mask Face Presentation Attack Detection Challenge.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Pixel Difference Networks for Efficient Edge Detection.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

iMiGUE: An Identity-Free Video Dataset for Micro-Gesture Understanding and Emotion Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Atrial Fibrillation Detection From Face Videos by Fusing Subtle Variations.
IEEE Trans. Circuits Syst. Video Technol., 2020

AutoHR: A Strong End-to-End Baseline for Remote Heart Rate Measurement With Neural Searching.
IEEE Signal Process. Lett., 2020

2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors Challenges: An Efficient Optical Flow Stream Guided Framework.
CoRR, 2020

Understanding Query Interfaces: Automatic Extraction of Data from Domain-specific Deep Web based on Ontology.
Proceedings of the 22nd International Conference on Enterprise Information Systems, 2020

Auto-Fas: Searching Lightweight Networks for Face Anti-Spoofing.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Revisiting motion-based respiration measurement from videos.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

Face Anti-Spoofing with Human Material Perception.
Proceedings of the Computer Vision - ECCV 2020, 2020

Video-Based Remote Physiological Measurement via Cross-Verified Feature Disentangling.
Proceedings of the Computer Vision - ECCV 2020, 2020

Searching Central Difference Convolutional Networks for Face Anti-Spoofing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-Modal Face Anti-Spoofing Based on Central Difference Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Deep Spatial Gradient and Temporal Depth Learning for Face Anti-Spoofing.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

The 1st Challenge on Remote Physiological Signal Sensing (RePSS).
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Meta Model for Zero- and Few-Shot Face Anti-Spoofing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Recovering remote Photoplethysmograph Signal from Facial videos Using Spatio-Temporal Convolutional Networks.
CoRR, 2019

Pedestrian re-Identification Based on Tree Branch Network with Local and Global Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Face Liveness Detection by rPPG Features and Contextual Patch-Based CNN.
Proceedings of the 3rd International Conference on Biometric Engineering and Applications, 2019

Remote Photoplethysmograph Signal Measurement from Facial Videos Using Spatio-Temporal Networks.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
The Role of Structure and Textural Information in Image Utility and Quality Assessment Tasks.
J. Percept. Imaging, 2018

2014
Camera identification for very low bit rate time varying quantization noise videos.
Proceedings of the 9th International Symposium on Communication Systems, 2014

