We stand with Ukraine

We stand with Ukraine

Yawen Zeng

Orcid: 0000-0003-1908-1157

According to our database¹, Yawen Zeng authored at least 38 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

BiC-Net: Learning Efficient Spatio-temporal Relation for Text-Video Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., March, 2024

Contrastive topic-enhanced network for video captioning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Expert Syst. Appl., March, 2024

Temporally Language Grounding With Multi-Modal Multi-Prompt Tuning.

[BibT_eX]

[DOI]

,

,

,

IEEE Trans. Multim., 2024

Multi-level contrastive graph learning for academic abnormality prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

Neural Comput. Appl., 2024

VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool.

[BibT_eX]

[DOI]

,

,

Jingsheng Zheng

,

,

,

CoRR, 2024

FinReport: Explainable Stock Earnings Forecasting via News Factor Analyzing Model.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

RetrievalMMT: Retrieval-Constrained Multi-Modal Prompt Learning for Multi-Modal Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

Energy-based Automated Model Evaluation.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multi-Prompts Learning with Cross-Modal Alignment for Attribute-Based Person Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Keyword-Based Diverse Image Retrieval With Variational Multiple Instance Graph.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Neural Networks Learn. Syst., December, 2023

Structural design and simulation analysis of fixed adjustable photovoltaic support.

[BibT_eX]

[DOI]

,

,

,

,

J. Comput. Methods Sci. Eng., 2023

Interval-enhanced Graph Transformer solution for session-based recommendation.

[BibT_eX]

[DOI]

,

,

,

,

Expert Syst. Appl., 2023

Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models.

[BibT_eX]

[DOI]

,

CoRR, 2023

Better Sign Language Translation with Monolingual Data.

[BibT_eX]

[DOI]

,

,

CoRR, 2023

RewardTLG: Learning to Temporally Language Grounding from Flexible Reward.

[BibT_eX]

[DOI]

,

,

Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Multi-Modal Knowledge Hypergraph for Diverse Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Moment is Important: Language-Based Video Moment Retrieval via Adversarial Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., 2022

Video-guided machine translation via dual-level back-translation.

[BibT_eX]

[DOI]

,

,

,

Knowl. Based Syst., 2022

A Spatiotemporal Graph Neural Network for session-based recommendation.

[BibT_eX]

[DOI]

,

,

,

,

Expert Syst. Appl., 2022

Vision talks: Visual relationship-enhanced transformer for video-guided machine translation.

[BibT_eX]

[DOI]

,

,

,

Expert Syst. Appl., 2022

Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation.

[BibT_eX]

[DOI]

,

,

CoRR, 2022

Social-path embedding-based transformer for graduation development prediction.

[BibT_eX]

[DOI]

,

,

,

,

Appl. Intell., 2022

Point Prompt Tuning for Temporally Language Grounding.

[BibT_eX]

[DOI]

Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

TriReID: Towards Multi-Modal Person Re-Identification via Descriptive Fusion Model.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment.

[BibT_eX]

[DOI]

,

,

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Two-Stream Interactive Memory Network for Video Facial Expression Recognition.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2022, 2022

Distill The Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation.

[BibT_eX]

[DOI]

,

,

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

A Multi-Feature Fusion Slam System Attaching Semantic Invariant to Points and Lines.

[BibT_eX]

[DOI]

,

,

,

,

,

Sensors, 2021

Visual Spatio-temporal Relation-enhanced Network for Cross-modal Text-Video Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2021

Robust Stereo Visual SLAM for Dynamic Environments With Moving Object.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Access, 2021

Fine-grained Cross-modal Alignment Network for Text-Video Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Elective future: The influence factor mining of students' graduation development based on hierarchical attention neural network model with graph.

[BibT_eX]

[DOI]

,

,

,

,

Appl. Intell., 2020

HHA: An Attentive Prediction Model for Academic Abnormality.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Access, 2020

Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

STRONG: Spatio-Temporal Reinforcement Learning for Cross-Modal Video Moment Localization.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Dynamic facial expression recognition model based on BiLSTM-Attention.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 15th International Conference on Computer Science & Education, 2020

2018

A-Stock Price Fluctuation Forecast Model Based on LSTM.

[BibT_eX]

[DOI]

,

Proceedings of the 14th International Conference on Semantics, Knowledge and Grids, 2018

Loading...