2025
Ming-Omni: A Unified Multimodal Model for Perception and Generation.
CoRR, June, 2025
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction.
CoRR, May, 2025
Merge then Realign: Simple and Effective Modality-Incremental Continual Learning for Multimodal LLMs.
CoRR, March, 2025
Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation.
IEEE Trans. Geosci. Remote. Sens., 2025
Towards Building Human-like Smart Agents in Modern 3D Video Games (Student Abstract).
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Unsupervised Modality-Transferable Video Highlight Detection With Representation Activation Sequence Learning.
IEEE Trans. Image Process., 2024
SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2024
Preformer: Simple and Efficient Design for Precipitation Nowcasting With Transformers.
IEEE Geosci. Remote. Sens. Lett., 2024
GTPAN: Global Target Preference Attention Network for session-based recommendation.
Expert Syst. Appl., 2024
Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation.
CoRR, 2024
Local-to-Global Self-Consistency Learning for Temporal Action Localization.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception.
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
Improved Loss Minimization Control Based on Time-Harmonic Equivalent Circuit for Linear Induction Motors Adopted to Linear Metro.
IEEE Trans. Veh. Technol., July, 2023
WiDFF-ID: Device-Free Fast Person Identification Using Commodity WiFi.
IEEE Trans. Cogn. Commun. Netw., February, 2023
LSIAN: Exploiting interval interests for session-based recommendation via sparse attention network.
Inf. Sci., 2023
CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis.
CoRR, 2023
Adaptive Base-class Suppression and Prior Guidance Network for One-Shot Object Detection.
CoRR, 2023
Programmable Pressure Pneumatic System for Soft Robots.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2023
2022
Multi-interaction fusion collaborative filtering for social recommendation.
Expert Syst. Appl., 2022
CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Improving Graph Neural Network For Session-based Recommendation System Via Time Sessions.
Proceedings of the International Joint Conference on Neural Networks, 2022
Relational Graph Reasoning Transformer for Image Captioning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022
Spatiotemporal Contextual Consistency Network for Precipitation Nowcasting.
Proceedings of the IEEE International Conference on Data Mining, 2022
2021
Extracting Effective Image Attributes with Refined Universal Detection.
Sensors, 2021
Relational Attention with Textual Enhanced Transformer for Image Captioning.
Proceedings of the Pattern Recognition and Computer Vision - 4th Chinese Conference, 2021
Reinforcement Stacked Learning with Semantic-Associated Attention for Visual Question Answering.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021
2020
Comprehensive Efficiency Optimization of Linear Induction Motors for Urban Transit.
IEEE Trans. Veh. Technol., 2020
2019
Deep Hierarchical Encoder-Decoder Network for Image Captioning.
IEEE Trans. Multim., 2019
Dense semantic embedding network for image captioning.
Pattern Recognit., 2019
Precipitation Forecasting via Multi-Scale Deconstructed ConvLSTM.
CoRR, 2019
DetNAS: Backbone Search for Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
Guiding the Flowing of Semantics: Interpretable Video Captioning via POS Tag.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
What and Where the Themes Dominate in Image.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Accelerating the Optimal Shape Design of Linear Machines by Transient Simulation Using Mesh Deformation and Mesh Connection Techniques.
IEEE Trans. Ind. Electron., 2018
2017
Important User Group Based Web Service Recommendation.
Proceedings of the 6th IIAI International Congress on Advanced Applied Informatics, 2017
PUED: A Social Spammer Detection Method Based on PU Learning and Ensemble Learning.
Proceedings of the Collaborative Computing: Networking, Applications and Worksharing, 2017