Yanan Wang

Orcid: 0000-0001-6562-0487

Affiliations:
  • KDDI Research Inc., Saitama, Japan
  • Keio University, School of Science for Open and Environmental Systems, Tokyo, Japan


According to our database, Yanan Wang authored at least 23 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Top-down Activity Representation Learning for Video Question Answering.
CoRR, 2024

Multi-object event graph representation learning for Video Question Answering.
CoRR, 2024

TimeGraphs: Graph-based Temporal Reasoning.
CoRR, 2024

Anchor-aware Deep Metric Learning for Audio-visual Retrieval.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

2023
Topic-switch adapted Japanese Dialogue System based on PLATO-2.
CoRR, 2023

VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning.
IEEE Access, 2023

Detecting Dialogue Hallucination Using Graph Neural Networks.
Proceedings of the International Conference on Machine Learning and Applications, 2023

Do I Have Your Attention: A Large Scale Engagement Prediction Dataset and Baselines.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

EmotiW 2023: Emotion Recognition in the Wild Challenge.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023

VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
VQA-GNN: Reasoning with Multimodal Semantic Graph for Visual Question Answering.
CoRR, 2022

VAE-Based Adversarial Multimodal Domain Transfer for Video-Level Sentiment Analysis.
IEEE Access, 2022

Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval.
Proceedings of the IEEE International Symposium on Multimedia, 2022

2020
BERT-Based Dialogue Evaluation Methods with RUBER Framework.
Proceedings of the Advances in Artificial Intelligence, 2020

Advanced Multi-Instance Learning Method with Multi-features Engineering and Conservative Optimization for Engagement Intensity Prediction.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

Implicit Knowledge Injectable Cross Attention Audiovisual Model for Group Emotion Recognition.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

LDNN: Linguistic Knowledge Injectable Deep Neural Network for Group Cohesiveness Understanding.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

An Empirical Study on Feature Extraction in DNN-Based Speech Emotion Recognition.
Proceedings of the HCI International 2020 - Late Breaking Posters, 2020

Efficient Diverse Response Generation in Attention-based Neural Conversational Model with Maximum Mutual Information.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Lightweight Deep Convolutional Neural Networks for Facial Expression Recognition.
Proceedings of the 21st IEEE International Workshop on Multimedia Signal Processing, 2019

Multi-feature and Multi-instance Learning with Anti-overfitting Strategy for Engagement Intensity Prediction.
Proceedings of the International Conference on Multimodal Interaction, 2019

Multi-Attention Fusion Network for Video-based Emotion Recognition.
Proceedings of the International Conference on Multimodal Interaction, 2019

2018
Multi-scale convolutional recurrent neural network with ensemble method for weakly labeled sound event detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

