2025
Enabling Visual Object Detection With Object Sounds via Visual Modality Recalling Memory.
IEEE Trans. Neural Networks Learn. Syst., January, 2025
Multispectral Pedestrian Detection with Sparsely Annotated Label.
CoRR, January, 2025
Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection.
CoRR, January, 2025
2024
Robust Airway Generation Labeling With Airway Segmentation for Reliable Airway Assessment.
IEEE Access, 2024
Enhancing Audio-Visual Question Answering with Missing Modality via Trans-Modal Associative Learning.
Proceedings of the IEEE International Conference on Acoustics, 2024
Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality.
Proceedings of the Computer Vision - ECCV 2024, 2024
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024
Towards Model-Agnostic Dataset Condensation by Heterogeneous Models.
Proceedings of the Computer Vision - ECCV 2024, 2024
Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Stereoscopic Vision Recalling Memory for Monocular 3D Object Detection.
IEEE Trans. Image Process., 2023
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications.
CoRR, 2023
One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Robust Multispectral Pedestrian Detection Via Spectral Position-Free Feature Mapping.
Proceedings of the IEEE International Conference on Image Processing, 2023
Online Class Incremental Learning on Stochastic Blurry Task Boundary via Mask and Visual Prompt Tuning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Similarity Relation Preserving Cross-Modal Learning for Multispectral Pedestrian Detection Against Adversarial Attacks.
Proceedings of the IEEE International Conference on Acoustics, 2023
Towards Robust Audio-Based Vehicle Detection Via Importance-Aware Audio-Visual Learning.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection.
IEEE Trans. Circuits Syst. Video Technol., 2022
Robust Thermal Infrared Pedestrian Detection By Associating Visible Pedestrian Knowledge.
Proceedings of the IEEE International Conference on Acoustics, 2022
Towards Versatile Pedestrian Detector with Multisensory-Matching and Multispectral Recalling Memory.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022
2021
CUA Loss: Class Uncertainty-Aware Gradient Modulation for Robust Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2021
Robust Multispectral Pedestrian Detection via Uncertainty-Aware Cross-Modal Learning.
Proceedings of the MultiMedia Modeling - 27th International Conference, 2021
Robust Small-scale Pedestrian Detection with Cued Recall via Memory Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Towards Robust Training of Multi-Sensor Data Fusion Network Against Adversarial Examples in Semantic Segmentation.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
BBC Net: Bounding-Box Critic Network for Occlusion-Robust Object Detection.
IEEE Trans. Circuits Syst. Video Technol., 2020
Investigating Vulnerability to Adversarial Examples on Multimodal Data Fusion in Deep Learning.
CoRR, 2020
Towards Human-Like Interpretable Object Detection Via Spatial Relation Encoding.
Proceedings of the IEEE International Conference on Image Processing, 2020
Revisiting Role of Autoencoders in Adversarial Settings.
Proceedings of the IEEE International Conference on Image Processing, 2020
Class Incremental Learning With Task-Selection.
Proceedings of the IEEE International Conference on Image Processing, 2020
Learning Style Correlation for Elaborate Few-Shot Classification.
Proceedings of the IEEE International Conference on Image Processing, 2020
Comprehensive Facial Expression Synthesis Using Human-Interpretable Language.
Proceedings of the IEEE International Conference on Image Processing, 2020
Fake Video Detection With Certainty-Based Attention Network.
Proceedings of the IEEE International Conference on Image Processing, 2020
Towards High-Performance Object Detection: Task-Specific Design Considering Classification and Localization Separation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-Based Symptom Relation Embedding.
Proceedings of the Computer Vision - ECCV 2020, 2020
Structure Boundary Preserving Segmentation for Medical Image With Ambiguous Boundary.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Attentive Layer Separation for Object Classification and Object Localization in Object Detection.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019
2018
Object Bounding Box-Critic Networks for Occlusion-Robust Object Detection in Road Scene.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018
2017
Robust and Real-Time Visual Tracking with Triplet Convolutional Neural Network.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017
Iterative deep convolutional encoder-decoder network for medical image segmentation.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017