Taming vision transformers for clinical laryngoscopy assessment.
J. Biomed. Informatics, 2025
Counterfactual Debiasing for Physical Audiovisual Commonsense Reasoning.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Zero-Shot Human-Object Interaction Detection via Similarity Propagation.
IEEE Trans. Neural Networks Learn. Syst., December, 2024
Toward Explainable Physical Audiovisual Commonsense Reasoning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Balancing Multimodal Learning via Online Logit Modulation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
BGNN-XML: Bilateral Graph Neural Networks for Extreme Multi-Label Text Classification.
IEEE Trans. Knowl. Data Eng., July, 2023
Multimodal Sentiment Analysis via Efficient Multimodal Transformer and Modality-Aware Adaptive Training Strategy.
Proceedings of the 4th on Multimodal Sentiment Analysis Challenge and Workshop: Mimicked Emotions, 2023
Learning Aligned Audiovisual Representations for Multimodal Sentiment Analysis.
Proceedings of the 1st International Workshop on Multimodal and Responsible Affective Computing, 2023
Building Robust Multimodal Sentiment Recognition via a Simple yet Effective Multimodal Transformer.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
AcFormer: An Aligned and Compact Transformer for Multimodal Sentiment Analysis.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Stable Speech Emotion Recognition with Head-k-Pooling Loss.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
McOmet: Multimodal Fusion Transformer for Physical Audiovisual Commonsense Reasoning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
LCBM: A Multi-View Probabilistic Model for Multi-Label Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2021
A Sequential Contrastive Learning Framework for Robust Dysarthric Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
ASHF-Net: Adaptive Sampling and Hierarchical Folding Network for Robust Point Cloud Completion.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
GNN-XML: Graph Neural Networks for Extreme Multi-label Text Classification.
CoRR, 2020