Zitong Yu
Orcid: 0000-0003-0422-6616
According to our database, Zitong Yu authored at least 132 papers between 2014 and 2025.
Bibliography
2025
Distilled transformers with locally enhanced global representations for face forgery detection.
Pattern Recognit., 2025
2024
From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation.
ACM Trans. Multim. Comput. Commun. Appl., November, 2024
Rethinking Vision Transformer and Masked Autoencoder in Multimodal Face Anti-Spoofing.
Int. J. Comput. Vis., November, 2024
Int. J. Comput. Vis., November, 2024
IET Image Process., June, 2024
Exploiting Multi-Scale Parallel Self-Attention and Local Variation via Dual-Branch Transformer-CNN Structure for Face Super-Resolution.
IEEE Trans. Multim., 2024
rPPG-MAE: Self-Supervised Pretraining With Masked Autoencoders for Remote Physiological Measurements.
IEEE Trans. Multim., 2024
Rethinking Few-Shot Class-Incremental Learning With Open-Set Hypothesis in Hyperbolic Geometry.
IEEE Trans. Multim., 2024
GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning.
IEEE Trans. Inf. Forensics Secur., 2024
IEEE Trans. Inf. Forensics Secur., 2024
S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing With Statistical Tokens.
IEEE Trans. Inf. Forensics Secur., 2024
Benchmarking Joint Face Spoofing and Forgery Detection With Visual and Physiological Cues.
IEEE Trans. Dependable Secur. Comput., 2024
Fine-Grained Temporal-Enhanced Transformer for Dynamic Facial Expression Recognition.
IEEE Signal Process. Lett., 2024
IEEE Signal Process. Lett., 2024
Discovering attention-guided cross-modality correlation for visible-infrared person re-identification.
Pattern Recognit., 2024
Exposing image splicing traces in scientific publications via uncertainty-guided refinement.
Patterns, 2024
Face anti-spoofing with cross-stage relation enhancement and spoof material perception.
Neural Networks, 2024
CoRR, 2024
EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities.
CoRR, 2024
CA-Edit: Causality-Aware Condition Adapter for High-Fidelity Local Facial Attribute Editing.
CoRR, 2024
PGD-Imp: Rethinking and Unleashing Potential of Classic PGD with Dual Strategies for Imperceptible Adversarial Attacks.
CoRR, 2024
scFusionTTT: Single-cell transcriptomics and proteomics fusion with Test-Time Training layers.
CoRR, 2024
SFDA-rPPG: Source-Free Domain Adaptive Remote Physiological Measurement with Spatio-Temporal Consistency.
CoRR, 2024
PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba.
CoRR, 2024
MFCLIP: Multi-modal Fine-grained CLIP for Generalizable Diffusion Face Forgery Detection.
CoRR, 2024
Towards Data-Centric Face Anti-Spoofing: Improving Cross-domain Generalization via Physics-based Data Synthesis.
CoRR, 2024
TRRG: Towards Truthful Radiology Report Generation With Cross-modal Disease Clue Enhanced Large Language Model.
CoRR, 2024
CoRR, 2024
CoRR, 2024
G²V²former: Graph Guided Video Vision Transformer for Face Anti-Spoofing.
CoRR, 2024
Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter.
CoRR, 2024
CoRR, 2024
CourseGPT-zh: an Educational Large Language Model Based on Knowledge Distillation Incorporating Prompt Optimization.
CoRR, 2024
CoRR, 2024
Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations.
CoRR, 2024
CoRR, 2024
A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection.
CoRR, 2024
SHIELD: An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models.
CoRR, 2024
GenFace: A Large-Scale Fine-Grained Face Forgery Benchmark and Cross Appearance-Edge Learning.
CoRR, 2024
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Multi-Modal Document Presentation Attack Detection with Forensics Trace Disentanglement.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
MCA-Net: A Lightweight Multi-order Context Aggregation Network for Low Dose CT Denoising.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024
Proceedings of the IEEE International Joint Conference on Biometrics, 2024
Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter.
Proceedings of the IEEE International Joint Conference on Biometrics, 2024
Proceedings of the IEEE International Joint Conference on Biometrics, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios.
Proceedings of the Computer Vision - ECCV 2024, 2024
MTaDCS: Moving Trace and Feature Density-Based Confidence Sample Selection Under Label Noise.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Generalized Face Anti-Spoofing via Finer Domain Partition and Disentangling Liveness-Irrelevant Factors.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Eng. Appl. Artif. Intell., November, 2023
PhysFormer++: Facial Video-Based Physiological Measurement with SlowFast Temporal Difference Transformer.
Int. J. Comput. Vis., June, 2023
IEEE Trans. Pattern Anal. Mach. Intell., May, 2023
IEEE Trans. Inf. Forensics Secur., 2023
IEEE Trans. Inf. Forensics Secur., 2023
FFR-SSD: feature fusion and reconstruction single shot detector for multi-scale object detection.
Signal Image Video Process., 2023
Multi-scale Promoted Self-adjusting Correlation Learning for Facial Action Unit Detection.
CoRR, 2023
DEMIST: A deep-learning-based task-specific denoising approach for myocardial perfusion SPECT.
CoRR, 2023
rPPG-MAE: Self-supervised Pre-training with Masked Autoencoders for Remote Physiological Measurement.
CoRR, 2023
CoRR, 2023
Need for Objective Task-based Evaluation of Deep Learning-Based Denoising Methods: A Study in the Context of Myocardial Perfusion SPECT.
CoRR, 2023
CoRR, 2023
A task-specific deep-learning-based denoising approach for myocardial perfusion SPECT.
Proceedings of the Medical Imaging 2023: Image Perception, 2023
Audio-Visual Deception Detection: DOLOS Dataset and Parameter-Efficient Crossmodal Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Detect Any Deepfakes: Segment Anything Meets Face Forgery Detection and Localization.
Proceedings of the Biometric Recognition - 17th Chinese Conference, 2023
Learning Motion-Robust Remote Photoplethysmography through Arbitrary Resolution Videos.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE Trans. Multim., 2022
Contrastive Context-Aware Learning for 3D High-Fidelity Mask Face Presentation Attack Detection.
IEEE Trans. Inf. Forensics Secur., 2022
Self-supervised 2D face presentation attack detection via temporal sequence sampling.
Pattern Recognit. Lett., 2022
Adversarial learning and decomposition-based domain generalization for face anti-spoofing.
Pattern Recognit. Lett., 2022
Digit. Signal Process., 2022
Rethinking Few-Shot Class-Incremental Learning with Open-Set Hypothesis in Hyperbolic Geometry.
CoRR, 2022
Proceedings of the 17th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2022
Investigating the limited performance of a deep-learning-based SPECT denoising approach: an observer-study-based characterization.
Proceedings of the Medical Imaging 2022: Image Perception, 2022
Ideal-Observer Computation with Anthropomorphic Phantoms using Markov Chain Monte Carlo.
Proceedings of the 19th IEEE International Symposium on Biomedical Imaging, 2022
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022
ViTransPAD: Video Transformer Using Convolution and Self-Attention for Face Presentation Attack Detection.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022
PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
Searching Multi-Rate and Multi-Modal Temporal Enhanced Networks for Gesture Recognition.
IEEE Trans. Image Process., 2021
IEEE Trans. Biom. Behav. Identity Sci., 2021
Facial-Video-Based Physiological Signal Measurement: Recent advances and affective applications.
IEEE Signal Process. Mag., 2021
TransRPPG: Remote Photoplethysmography Transformer for 3D Mask Face Presentation Attack Detection.
IEEE Signal Process. Lett., 2021
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Proceedings of the 16th IEEE International Conference on Nano/Micro Engineered and Molecular Systems, 2021
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Non-contact Pain Recognition from Video Sequences with Remote Physiological Measurements Prediction.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
iMiGUE: An Identity-Free Video Dataset for Micro-Gesture Understanding and Emotion Analysis.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
IEEE Trans. Circuits Syst. Video Technol., 2020
AutoHR: A Strong End-to-End Baseline for Remote Heart Rate Measurement With Neural Searching.
IEEE Signal Process. Lett., 2020
2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors Challenges: An Efficient Optical Flow Stream Guided Framework.
CoRR, 2020
Understanding Query Interfaces: Automatic Extraction of Data from Domain-specific Deep Web based on Ontology.
Proceedings of the 22nd International Conference on Enterprise Information Systems, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
Video-Based Remote Physiological Measurement via Cross-Verified Feature Disentangling.
Proceedings of the Computer Vision - ECCV 2020, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Recovering remote Photoplethysmograph Signal from Facial videos Using Spatio-Temporal Convolutional Networks.
CoRR, 2019
Pedestrian re-Identification Based on Tree Branch Network with Local and Global Learning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Proceedings of the 3rd International Conference on Biometric Engineering and Applications, 2019
Remote Photoplethysmograph Signal Measurement from Facial Videos Using Spatio-Temporal Networks.
Proceedings of the 30th British Machine Vision Conference 2019, 2019
2018
The Role of Structure and Textural Information in Image Utility and Quality Assessment Tasks.
J. Percept. Imaging, 2018
2014
Proceedings of the 9th International Symposium on Communication Systems, 2014