2025
When Every Millisecond Counts: Real-Time Anomaly Detection via the Multimodal Asynchronous Hybrid Network.
CoRR, June, 2025
LeanPO: Lean Preference Optimization for Likelihood Alignment in Video-LLMs.
CoRR, June, 2025
LongDWM: Cross-Granularity Distillation for Building a Long-Term Driving World Model.
CoRR, June, 2025
ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos.
CoRR, May, 2025
Temporal Triplane Transformers as Occupancy World Models.
CoRR, March, 2025
Fully Spiking Actor Network With Intralayer Connections for Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., February, 2025
A Global Visual Information Intervention Model for Medical Visual Question Answering.
Comput. Biol. Medicine, 2025
Exploiting Continuous Motion Clues for Vision-Based Occupancy Prediction.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Towards Building Human-like Smart Agents in Modern 3D Video Games (Student Abstract).
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Visual Reinforcement Learning with Residual Action.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Dual modality prompt learning for visual question-grounded answering in robotic surgery.
Vis. Comput. Ind. Biomed. Art, December, 2024
Eye Gaze Guided Cross-Modal Alignment Network for Radiology Report Generation.
IEEE J. Biomed. Health Informatics, December, 2024
Cascaded Attention: Adaptive and Gated Graph Attention Network for Multiagent Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., March, 2024
Sensitivity Decouple Learning for Image Compression Artifacts Reduction.
IEEE Trans. Image Process., 2024
CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection.
CoRR, 2024
MTLight: Efficient Multi-Task Reinforcement Learning for Traffic Signal Control.
CoRR, 2024
Noisy Spiking Actor Network for Exploration.
CoRR, 2024
Fully Spiking Actor Network with Intra-layer Connections for Reinforcement Learning.
CoRR, 2024
A fuzzy logic constrained particle swarm optimization algorithm for industrial design problems.
Appl. Soft Comput., 2024
Seek Commonality but Preserve Differences: Dissected Dynamics Modeling for Multi-modal Visual RL.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Prior-Posterior Knowledge Prompting-and-Reasoning for Surgical Visual Question Localized-Answering.
Proceedings of the International Joint Conference on Neural Networks, 2024
DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Density-Adaptive Model Based on Motif Matrix for Multi-Agent Trajectory Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Adaptive Discovering and Merging for Incremental Novel Class Discovery.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control.
IEEE Trans. Knowl. Data Eng., November, 2023
Picking Up Quantization Steps for Compressed Image Classification.
IEEE Trans. Circuits Syst. Video Technol., April, 2023
Population-Based Evolutionary Gaming for Unsupervised Person Re-identification.
Int. J. Comput. Vis., 2023
Training Full Spike Neural Networks via Auxiliary Accumulation Pathway.
CoRR, 2023
Hierarchical Adaptive Value Estimation for Multi-modal Visual Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Dynamic Belief for Decentralized Multi-Agent Cooperative Learning.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023
Reinforcement Learning-Based Consensus Reaching in Large-Scale Social Networks.
Proceedings of the Neural Information Processing - 30th International Conference, 2023
Learning Sparse Neural Networks with Identity Layers.
Proceedings of the Image and Graphics - 12th International Conference, 2023
Stabilizing Visual Reinforcement Learning via Asymmetric Interactive Cooperation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Simoun: Synergizing Interactive Motion-appearance Understanding for Vision-based Reinforcement Learning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Self-Guided Adaptation: Progressive Representation Alignment for Domain Adaptive Object Detection.
IEEE Trans. Multim., 2022
Adversarial Reciprocal Points Learning for Open Set Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022
Annotation Efficient Person Re-Identification with Diverse Cluster-Based Pair Selection.
CoRR, 2022
Deep Reinforcement Learning with Spiking Q-learning.
CoRR, 2022
Spectrum Random Masking for Generalization in Image-based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
2021
Model Latent Views With Multi-Center Metric Learning for Vehicle Re-Identification.
IEEE Trans. Intell. Transp. Syst., 2021
Variationally and Intrinsically motivated reinforcement learning for decentralized traffic signal control.
CoRR, 2021
Adaptive Multi-Scale Semantic Fusion Network For Zero-Shot Learning.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021
Generate And Adjust: A Novel Framework For Semi-Supervised Pedestrian Attribute Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia & Expo Workshops, 2021
Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Allocating DNN Layers Computation Between Front-End Devices and The Cloud Server for Video Big Data Processing.
Proceedings of the IEEE International Conference on Acoustics, 2021
Reducing Image Compression Artifacts for Deep Neural Networks.
Proceedings of the 31st Data Compression Conference, 2021
2020
Discriminative Spatial Feature Learning for Person Re-Identification.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Masked Face Recognition with Generative Data Augmentation and Domain Constrained Ranking.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Masked Face Recognition with Latent Part Detection.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
End-Edge-Cloud Collaborative System: A Video Big Data Processing and Analysis Architecture.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020
Hybrid Learning for Multi-agent Cooperation with Sub-optimal Demonstrations.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020
Learning Open Set Network with Discriminative Reciprocal Points.
Proceedings of the Computer Vision - ECCV 2020, 2020
Binary Representation and High Efficient Compression of 3D CNN Features for Action Recognition.
Proceedings of the Data Compression Conference, 2020
Domain Adaptive Attention Learning for Unsupervised Person Re-Identification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Domain Adaptive Attention Model for Unsupervised Cross-Domain Person Re-Identification.
CoRR, 2019
Joint Learning of Dictionary and Convolutional Network for Pedestrian Attribute Recognition.
Proceedings of the 2019 IEEE Visual Communications and Image Processing, 2019
Learning Deep Decentralized Policy Network by Collective Rewards for Real-Time Combat Game.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Multi-View Learning for Vehicle Re-Identification.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019
Vehicle Re-Identification by Multi-Grain Learni.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
Unsupervised Person Re-identification Based on Clustering and Domain-Invariant Network.
Proceedings of the Image and Graphics - 10th International Conference, 2019
2018
Joint Semantic and Latent Attribute Modelling for Cross-Class Transfer Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018
Cooperative Multi-Agent Policy Gradients with Sub-optimal Demonstration.
CoRR, 2018
Visual Tracking via Spatially Aligned Correlation Filters Network.
Proceedings of the Computer Vision - ECCV 2018, 2018
2016
Joint Learning of Semantic and Latent Attributes.
Proceedings of the Computer Vision - ECCV 2016, 2016
Unsupervised Cross-Dataset Transfer Learning for Person Re-identification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
2015
Robust multiple cameras pedestrian detection with multi-view Bayesian network.
Pattern Recognit., 2015
2012
Multi-camera Pedestrian Detection with Multi-view Bayesian Network Model.
Proceedings of the British Machine Vision Conference, 2012
Single and Multiple View Detection, Tracking and Video Analysis in Crowded Environments.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012
2011
PKU-NEC @TRECVID2011 SED: Sequence-Based Event Detection in Surveillance Video.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011