Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, May, 2025
VideoVista-CulturalLingo: 360<sup>°</sup> Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension.
CoRR, April, 2025
Interpretable Dynamic Directed Graph Convolutional Network for Multi-Relational Prediction of Missense Mutation and Drug Response.
IEEE J. Biomed. Health Informatics, February, 2025
CSHNet: A Novel Information Asymmetric Image Translation Method.
CoRR, January, 2025
DRExplainer: Quantifiable interpretability in drug response prediction with directed graph convolutional network.
Artif. Intell. Medicine, 2025
Reusability report: Uncovering associations in biomedical bipartite networks via a bilinear attention network with domain adaptation.
Nat. Mac. Intell., 2024
Controllable Edge-Type-Specific Interpretation in Multi-Relational Graph Neural Networks for Drug Response Prediction.
CoRR, 2024
DRExplainer: Quantifiable Interpretability in Drug Response Prediction with Directed Graph Convolutional Network.
CoRR, 2024
VideoVista: A Versatile Benchmark for Video Understanding and Reasoning.
CoRR, 2024
TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction.
CoRR, 2024
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024
Separating Spectral and Spatial Feature Aggregation for Demosaicking.
Proceedings of the International Joint Conference on Neural Networks, 2024
VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Tracking Control of Mobile Robots for Moving Targets in Unknown Environments Based on Improved Dynamic Window Approach.
Proceedings of the 5th International Conference on Artificial Intelligence and Computer Engineering, 2024
KIE-STCformer: Key Information Enhanced Spatio-Temporal Correction Transformer for Time Series Forecasting.
Proceedings of the 2024 8th International Conference on Computer Science and Artificial Intelligence, 2024
Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
GRA-GCN: Dense Granule Protein Prediction in Apicomplexa Protozoa Through Graph Convolutional Network.
IEEE ACM Trans. Comput. Biol. Bioinform., 2023
Toward Moiré-Free and Detail-Preserving Demosaicking.
CoRR, 2023