2025

Ming-Omni: A Unified Multimodal Model for Perception and Generation.

Chuanyang Zheng

CoRR, June, 2025

Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction.

CoRR, May, 2025

Merge then Realign: Simple and Effective Modality-Incremental Continual Learning for Multimodal LLMs.

CoRR, March, 2025

Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation.

IEEE Trans. Geosci. Remote. Sens., 2025

Towards Building Human-like Smart Agents in Modern 3D Video Games (Student Abstract).

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators.

Changzhou Zhang

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning.

IEEE Trans. Image Process., 2024

SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion.

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024

Preformer: Simple and Efficient Design for Precipitation Nowcasting With Transformers.

IEEE Geosci. Remote. Sens. Lett., 2024

GTPAN: Global Target Preference Attention Network for session-based recommendation.

Expert Syst. Appl., 2024

Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation.

CoRR, 2024

Local-to-Global Self-Consistency Learning for Temporal Action Localization.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception.

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Improved Loss Minimization Control Based on Time-Harmonic Equivalent Circuit for Linear Induction Motors Adopted to Linear Metro.

Yongdao Shangguan

IEEE Trans. Veh. Technol., July, 2023

WiDFF-ID: Device-Free Fast Person Identification Using Commodity WiFi.

IEEE Trans. Cogn. Commun. Netw., February, 2023

LSIAN: Exploiting interval interests for session-based recommendation via sparse attention network.

Inf. Sci., 2023

CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis.

CoRR, 2023

Adaptive Base-class Suppression and Prior Guidance Network for One-Shot Object Detection.

CoRR, 2023

Programmable Pressure Pneumatic System for Soft Robots.

Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2023

2022

Multi-interaction fusion collaborative filtering for social recommendation.

Expert Syst. Appl., 2022

CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis.

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Improving Graph Neural Network For Session-based Recommendation System Via Time Sessions.

Proceedings of the International Joint Conference on Neural Networks, 2022

Relational Graph Reasoning Transformer for Image Captioning.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Spatiotemporal Contextual Consistency Network for Precipitation Nowcasting.

Proceedings of the IEEE International Conference on Data Mining, 2022

2021

Extracting Effective Image Attributes with Refined Universal Detection.

Sensors, 2021

Relational Attention with Textual Enhanced Transformer for Image Captioning.

Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021

Reinforcement Stacked Learning with Semantic-Associated Attention for Visual Question Answering.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

Comprehensive Efficiency Optimization of Linear Induction Motors for Urban Transit.

IEEE Trans. Veh. Technol., 2020

2019

Deep Hierarchical Encoder-Decoder Network for Image Captioning.

IEEE Trans. Multim., 2019

Dense semantic embedding network for image captioning.

Pattern Recognit., 2019

Precipitation Forecasting via Multi-Scale Deconstructed ConvLSTM.

CoRR, 2019

DetNAS: Backbone Search for Object Detection.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

What and Where the Themes Dominate in Image.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Accelerating the Optimal Shape Design of Linear Machines by Transient Simulation Using Mesh Deformation and Mesh Connection Techniques.

IEEE Trans. Ind. Electron., 2018

2017

Important User Group Based Web Service Recommendation.

Proceedings of the 6th IIAI International Congress on Advanced Applied Informatics, 2017

PUED: A Social Spammer Detection Method Based on PU Learning and Ensemble Learning.

Proceedings of the Collaborative Computing: Networking, Applications and Worksharing, 2017