2025

Computational evaluation of interactive dynamics for a full transcatheter aortic valve device in a patient-specific aortic root.

[DOI]

,

,

,

,

,

,

,

Comput. Biol. Medicine, 2025

High-Accuracy prediction and efficient adjustment of surface shape distortion in optical elements: Model correction based on uncertainty quantification-driven transfer learning.

[DOI]

,

,

,

,

,

,

,

Adv. Eng. Informatics, 2025

Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation.

[DOI]

,

,

,

,

Proceedings of the 31st International Conference on Computational Linguistics, 2025

AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator.

[DOI]

,

,

,

,

,

,

Proceedings of the 31st International Conference on Computational Linguistics, 2025

2024

Unifying Structure Reasoning and Language Pre-Training for Complex Reasoning Tasks.

[DOI]

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference.

[DOI]

,

,

,

,

,

,

CoRR, 2024

RSL-SQL: Robust Schema Linking in Text-to-SQL Generation.

[DOI]

,

,

,

,

,

CoRR, 2024

MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration.

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Qwen2 Technical Report.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

AI Hospital: Interactive Evaluation and Collaboration of LLMs as Intern Doctors for Clinical Diagnosis.

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

An Iterative Framework for Document-Level Event Argument Extraction Assisted by Long Short-Term Memory.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Natural Language Processing and Chinese Computing, 2024

Graph Interpretation of Image-Text Matching: Link Prediction on Concept-Enhanced Cross-Modal Graph.

[DOI]

,

,

,

,

Proceedings of the Natural Language Processing and Chinese Computing, 2024

ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking.

[DOI]

,

,

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

MIPI 2024 Challenge on Demosaic for Hybridevs Camera: Methods and Results.

[DOI]

,

,

,

,

,

,

,

Shangcheng Zhou

,

,

,

,

Chen Change Loy

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning.

[DOI]

,

,

,

,

,

,

,

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

A flexible speller based on time-space frequency conversion SSVEP stimulation paradigm under dry electrode.

[DOI]

,

,

,

,

,

,

Frontiers Comput. Neurosci., February, 2023

Unifying Structure Reasoning and Language Model Pre-training for Complex Reasoning.

[DOI]

,

,

,

CoRR, 2023

AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DANDELION: An ASV Deployed Micro-Profiler Array for Air-Sea Observation.

[DOI]

,

,

IROS, 2023

Topic-Aware Modeling for Unsupervised Extractive Summarization.

[DOI]

,

,

,

Proceedings of the International Joint Conference on Neural Networks, 2023

Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise.

[DOI]

,

,

,

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2023

MIPI 2023 Challenge on RGBW Remosaic: Methods and Results.

[DOI]

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

OTST: A Two-Phase Framework for Joint Denoising and Remosaicing in RGBW CFA.

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Query Structure Modeling for Inductive Logical Reasoning Over Knowledge Graphs.

[DOI]

,

,

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Language Pre-training.

[DOI]

,

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

GJTD-LR: A Trainable Grouped Joint Tensor Dictionary With Low-Rank Prior for Single Hyperspectral Image Super-Resolution.

[DOI]

,

,

IEEE Trans. Geosci. Remote. Sens., 2022

GENIE: Large Scale Pre-training for Text Generation with Diffusion Model.

[DOI]

,

,

,

,

,

,

,

CoRR, 2022

A Unified Continuous Learning Framework for Multi-modal Knowledge Discovery and Pre-training.

[DOI]

,

,

,

,

,

,

CoRR, 2022

MVP: Multi-Stage Vision-Language Pre-Training via Multi-Level Semantic Alignment.

[DOI]

,

,

,

CoRR, 2022

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval.

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning.

[DOI]

,

,

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval.

[DOI]

,

,

,

,

,

,

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

DRAGONFLY: a UAV Rapidly Deployed Micro-Profiler Array for Underwater Thermocline Observation.

[DOI]

,

,

,

,

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Residual Feature Distillation Channel Spatial Attention Network for ISP on Smartphone.

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

MIPI 2022 Challenge on Quad-Bayer Re-mosaic: Dataset and Report.

[DOI]

,

,

,

,

,

,

,

,

Chen Change Loy

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Hiêp Quang Luong

,

,

Anh Minh Truong

,

Wilfried Philips

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Learning to Joint Remosaic and Denoise in Quad Bayer CFA via Universal Multi-scale Channel Attention Network.

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Marcos V. Conde

,

,

Georgy Perevozchikov

,

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

NTIRE 2022 Spectral Demosaicing Challenge and Data Set.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering.

[DOI]

,

,

,

,

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain Conversations.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

SGUNet: Style-guided UNet for adversely conditioned fundus image super-resolution.

[DOI]

,

,

,

,

,

Neurocomputing, 2021

Fusion of multi-source retinal fundus images via automatic registration for clinical diagnosis.

[DOI]

,

,

,

,

,

,

,

,

,

Neurocomputing, 2021

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval.

[DOI]

,

,

,

,

CoRR, 2021

Mask Attention Networks: Rethinking and Strengthen Transformer.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning.

[DOI]

,

,

,

,

,

,

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-level Structural Information.

[DOI]

,

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

PathQG: Neural Question Generation from Facts.

[DOI]

,

,

,

,

,

,

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Single Fundus Image Super-Resolution Via Cascaded Channel-Wise Attention Network.

[DOI]

,

,

,

,

Proceedings of the 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2020

An Enhanced Knowledge Injection Model for Commonsense Generation.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Reconstruction of 3D Retina from Multi-viewed Stereo Fundus Images via Dynamic Registration.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

2019

Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning.

[DOI]

,

,

,

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Multi-Agent Communication Framework for Question-Worthy Phrase Extraction and Question Generation.

[DOI]

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

ISCLAB at SemEval-2018 Task 1: UIR-Miner for Affect in Tweets.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of The 12th International Workshop on Semantic Evaluation, 2018

A Question Type Driven Framework to Diversify Visual Question Generation.

[DOI]

,

,

,

,

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators.

[DOI]

,

,

,

,

Proceedings of the 27th International Conference on Computational Linguistics, 2018