2025
Computational evaluation of interactive dynamics for a full transcatheter aortic valve device in a patient-specific aortic root.
Comput. Biol. Medicine, 2025
High-Accuracy prediction and efficient adjustment of surface shape distortion in optical elements: Model correction based on uncertainty quantification-driven transfer learning.
Adv. Eng. Informatics, 2025
Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator.
Proceedings of the 31st International Conference on Computational Linguistics, 2025
2024
Unifying Structure Reasoning and Language Pre-Training for Complex Reasoning Tasks.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference.
CoRR, 2024
RSL-SQL: Robust Schema Linking in Text-to-SQL Generation.
CoRR, 2024
MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration.
CoRR, 2024
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis.
CoRR, 2024
An Iterative Framework for Document-Level Event Argument Extraction Assisted by Long Short-Term Memory.
Proceedings of the Natural Language Processing and Chinese Computing, 2024
Graph Interpretation of Image-Text Matching: Link Prediction on Concept-Enhanced Cross-Modal Graph.
Proceedings of the Natural Language Processing and Chinese Computing, 2024
ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
MIPI 2024 Challenge on Demosaic for Hybridevs Camera: Methods and Results.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
A flexible speller based on time-space frequency conversion SSVEP stimulation paradigm under dry electrode.
Frontiers Comput. Neurosci., February, 2023
Unifying Structure Reasoning and Language Model Pre-training for Complex Reasoning.
CoRR, 2023
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
DANDELION: An ASV Deployed Micro-Profiler Array for Air-Sea Observation.
IROS, 2023
Topic-Aware Modeling for Unsupervised Extractive Summarization.
Proceedings of the International Joint Conference on Neural Networks, 2023
Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise.
Proceedings of the International Conference on Machine Learning, 2023
MIPI 2023 Challenge on RGBW Remosaic: Methods and Results.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
OTST: A Two-Phase Framework for Joint Denoising and Remosaicing in RGBW CFA.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Query Structure Modeling for Inductive Logical Reasoning Over Knowledge Graphs.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Language Pre-training.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
GJTD-LR: A Trainable Grouped Joint Tensor Dictionary With Low-Rank Prior for Single Hyperspectral Image Super-Resolution.
IEEE Trans. Geosci. Remote. Sens., 2022
GENIE: Large Scale Pre-training for Text Generation with Diffusion Model.
CoRR, 2022
A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training.
CoRR, 2022
MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment.
CoRR, 2022
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022
MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
DRAGONFLY: a UAV Rapidly Deployed Micro-Profiler Array for Underwater Thermocline Observation.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022
Residual Feature Distillation Channel Spatial Attention Network for ISP on Smartphone.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
Learning to Joint Remosaic and Denoise in Quad Bayer CFA via Universal Multi-scale Channel Attention Network.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
NTIRE 2022 Spectral Demosaicing Challenge and Data Set.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022
Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering.
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022
Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
SGUNet: Style-guided UNet for adversely conditioned fundus image super-resolution.
Neurocomputing, 2021
Fusion of multi-source retinal fundus images via automatic registration for clinical diagnosis.
Neurocomputing, 2021
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval.
CoRR, 2021
Mask Attention Networks: Rethinking and Strengthen Transformer.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021
An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-level Structural Information.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
2020
PathQG: Neural Question Generation from Facts.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Single Fundus Image Super-Resolution Via Cascaded Channel-Wise Attention Network.
Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020
An Enhanced Knowledge Injection Model for Commonsense Generation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020
Reconstruction of 3D Retina from Multi-viewed Stereo Fundus Images via Dynamic Registration.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020
2019
Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
A Multi-Agent Communication Framework for Question-Worthy Phrase Extraction and Question Generation.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
ISCLAB at SemEval-2018 Task 1: UIR-Miner for Affect in Tweets.
Proceedings of The 12th International Workshop on Semantic Evaluation, 2018
A Question Type Driven Framework to Diversify Visual Question Generation.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators.
Proceedings of the 27th International Conference on Computational Linguistics, 2018