2025
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models.
CoRR, May, 2025

AI Awareness.
CoRR, April, 2025

VideoVista-CulturalLingo: 360<sup>°</sup> Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension.
CoRR, April, 2025

Interpretable Dynamic Directed Graph Convolutional Network for Multi-Relational Prediction of Missense Mutation and Drug Response.
IEEE J. Biomed. Health Informatics, February, 2025

CSHNet: A Novel Information Asymmetric Image Translation Method.
CoRR, January, 2025

DRExplainer: Quantifiable interpretability in drug response prediction with directed graph convolutional network.
Artif. Intell. Medicine, 2025

2024
Reusability report: Uncovering associations in biomedical bipartite networks via a bilinear attention network with domain adaptation.
Nat. Mac. Intell., 2024

Controllable Edge-Type-Specific Interpretation in Multi-Relational Graph Neural Networks for Drug Response Prediction.
CoRR, 2024

DRExplainer: Quantifiable Interpretability in Drug Response Prediction with Directed Graph Convolutional Network.
CoRR, 2024

VideoVista: A Versatile Benchmark for Video Understanding and Reasoning.
CoRR, 2024

TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction.
CoRR, 2024

Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024

Separating Spectral and Spatial Feature Aggregation for Demosaicking.
Proceedings of the International Joint Conference on Neural Networks, 2024

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tracking Control of Mobile Robots for Moving Targets in Unknown Environments Based on Improved Dynamic Window Approach.
Proceedings of the 5th International Conference on Artificial Intelligence and Computer Engineering, 2024

KIE-STCformer: Key Information Enhanced Spatio-Temporal Correction Transformer for Time Series Forecasting.
Proceedings of the 2024 8th International Conference on Computer Science and Artificial Intelligence, 2024

Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
GRA-GCN: Dense Granule Protein Prediction in Apicomplexa Protozoa Through Graph Convolutional Network.
IEEE ACM Trans. Comput. Biol. Bioinform., 2023

Toward Moiré-Free and Detail-Preserving Demosaicking.
CoRR, 2023