2025
RTBAgent: A LLM-based Agent System for Real-Time Bidding.
CoRR, February, 2025
MEDSQ: Towards personalized medical education via multi-form interaction guidance.
Expert Syst. Appl., 2025
2024
BiC-Net: Learning Efficient Spatio-temporal Relation for Text-Video Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., March, 2024
Contrastive topic-enhanced network for video captioning.
Expert Syst. Appl., March, 2024
Temporally Language Grounding With Multi-Modal Multi-Prompt Tuning.
IEEE Trans. Multim., 2024
Multi-level contrastive graph learning for academic abnormality prediction.
Neural Comput. Appl., 2024
VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool.
CoRR, 2024
FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model.
Companion Proceedings of the ACM on Web Conference 2024, 2024
RetrievalMMT: Retrieval-Constrained Multi-Modal Prompt Learning for Multi-Modal Machine Translation.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024
Energy-based Automated Model Evaluation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Multi-Prompts Learning with Cross-Modal Alignment for Attribute-Based Person Re-identification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Keyword-Based Diverse Image Retrieval With Variational Multiple Instance Graph.
IEEE Trans. Neural Networks Learn. Syst., December, 2023
Structural design and simulation analysis of fixed adjustable photovoltaic support.
J. Comput. Methods Sci. Eng., 2023
Interval-enhanced Graph Transformer solution for session-based recommendation.
Expert Syst. Appl., 2023
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models.
CoRR, 2023
Better Sign Language Translation with Monolingual Data.
CoRR, 2023
RewardTLG: Learning to Temporally Language Grounding from Flexible Reward.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Multi-Modal Knowledge Hypergraph for Diverse Image Retrieval.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Moment is Important: Language-Based Video Moment Retrieval via Adversarial Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2022
Video-guided machine translation via dual-level back-translation.
Knowl. Based Syst., 2022
A Spatiotemporal Graph Neural Network for session-based recommendation.
Expert Syst. Appl., 2022
Vision talks: Visual relationship-enhanced transformer for video-guided machine translation.
Expert Syst. Appl., 2022
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation.
CoRR, 2022
Social-path embedding-based transformer for graduation development prediction.
Appl. Intell., 2022
Point Prompt Tuning for Temporally Language Grounding.
Proceedings of SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
TriReID: Towards Multi-Modal Person Re-Identification via Descriptive Fusion Model.
Proceedings of ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment.
Proceedings of ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
Two-Stream Interactive Memory Network for Video Facial Expression Recognition.
Proceedings of Artificial Neural Networks and Machine Learning - ICANN 2022, 2022
Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
A Multi-Feature Fusion Slam System Attaching Semantic Invariant to Points and Lines.
Sensors, 2021
Visual Spatio-temporal Relation-enhanced Network for Cross-modal Text-Video Retrieval.
CoRR, 2021
Robust Stereo Visual SLAM for Dynamic Environments With Moving Object.
IEEE Access, 2021
Fine-grained Cross-modal Alignment Network for Text-Video Retrieval.
Proceedings of MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Elective future: The influence factor mining of students' graduation development based on hierarchical attention neural network model with graph.
Appl. Intell., 2020
HHA: An Attentive Prediction Model for Academic Abnormality.
IEEE Access, 2020
Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization.
Proceedings of MM '20: The 28th ACM International Conference on Multimedia, 2020
STRONG: Spatio-Temporal Reinforcement Learning for Cross-Modal Video Moment Localization.
Proceedings of MM '20: The 28th ACM International Conference on Multimedia, 2020
Dynamic facial expression recognition model based on BiLSTM-Attention.
Proceedings of the 15th International Conference on Computer Science & Education, 2020
2018
A-Stock Price Fluctuation Forecast Model Based on LSTM.
Proceedings of the 14th International Conference on Semantics, Knowledge and Grids, 2018