Yawen Zeng

Orcid: 0000-0003-1908-1157

According to our database1, Yawen Zeng authored at least 38 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
BiC-Net: Learning Efficient Spatio-temporal Relation for Text-Video Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., March, 2024

Contrastive topic-enhanced network for video captioning.
Expert Syst. Appl., March, 2024

Temporally Language Grounding With Multi-Modal Multi-Prompt Tuning.
IEEE Trans. Multim., 2024

Multi-level contrastive graph learning for academic abnormality prediction.
Neural Comput. Appl., 2024

VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool.
CoRR, 2024

FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

RetrievalMMT: Retrieval-Constrained Multi-Modal Prompt Learning for Multi-Modal Machine Translation.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Energy-based Automated Model Evaluation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multi-Prompts Learning with Cross-Modal Alignment for Attribute-Based Person Re-identification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Keyword-Based Diverse Image Retrieval With Variational Multiple Instance Graph.
IEEE Trans. Neural Networks Learn. Syst., December, 2023

Structural design and simulation analysis of fixed adjustable photovoltaic support.
J. Comput. Methods Sci. Eng., 2023

Interval-enhanced Graph Transformer solution for session-based recommendation.
Expert Syst. Appl., 2023

Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models.
CoRR, 2023

Better Sign Language Translation with Monolingual Data.
CoRR, 2023

RewardTLG: Learning to Temporally Language Grounding from Flexible Reward.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Multi-Modal Knowledge Hypergraph for Diverse Image Retrieval.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Moment is Important: Language-Based Video Moment Retrieval via Adversarial Learning.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Video-guided machine translation via dual-level back-translation.
Knowl. Based Syst., 2022

A Spatiotemporal Graph Neural Network for session-based recommendation.
Expert Syst. Appl., 2022

Vision talks: Visual relationship-enhanced transformer for video-guided machine translation.
Expert Syst. Appl., 2022

Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation.
CoRR, 2022

Social-path embedding-based transformer for graduation development prediction.
Appl. Intell., 2022

Point Prompt Tuning for Temporally Language Grounding.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

TriReID: Towards Multi-Modal Person Re-Identification via Descriptive Fusion Model.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Two-Stream Interactive Memory Network for Video Facial Expression Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
A Multi-Feature Fusion Slam System Attaching Semantic Invariant to Points and Lines.
Sensors, 2021

Visual Spatio-temporal Relation-enhanced Network for Cross-modal Text-Video Retrieval.
CoRR, 2021

Robust Stereo Visual SLAM for Dynamic Environments With Moving Object.
IEEE Access, 2021

Fine-grained Cross-modal Alignment Network for Text-Video Retrieval.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Elective future: The influence factor mining of students' graduation development based on hierarchical attention neural network model with graph.
Appl. Intell., 2020

HHA: An Attentive Prediction Model for Academic Abnormality.
IEEE Access, 2020

Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

STRONG: Spatio-Temporal Reinforcement Learning for Cross-Modal Video Moment Localization.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dynamic facial expression recognition model based on BiLSTM-Attention.
Proceedings of the 15th International Conference on Computer Science & Education, 2020

2018
A-Stock Price Fluctuation Forecast Model Based on LSTM.
Proceedings of the 14th International Conference on Semantics, Knowledge and Grids, 2018


  Loading...