We stand with Ukraine

We stand with Ukraine

Yi Bin

Orcid: 0000-0001-9714-8738

According to our database¹, Yi Bin authored at least 56 papers between 2015 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Focusing on Relevant Responses for Multi-Modal Rumor Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Knowl. Data Eng., November, 2024

Set of Diverse Queries With Uncertainty Regularization for Composed Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., October, 2024

Filter-based Stance Network for Rumor Verification.

[BibT_eX]

[DOI]

,

,

,

,

,

ACM Trans. Inf. Syst., July, 2024

Align and Retrieve: Composition and Decomposition Learning in Image Retrieval With Text Feedback.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2024

Pixel Bleach Network for Detecting Face Forgery Under Compression.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Multim., 2024

Text-to-image Generation Based on Conditional Semantic Augmentation.

[BibT_eX]

[DOI]

,

,

,

Int. J. Softw. Informatics, 2024

Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation.

[BibT_eX]

[DOI]

Thong Thanh Nguyen

,

,

,

Cong-Duy T. Nguyen

,

,

CoRR, 2024

Multi-Scale Contrastive Learning for Video Temporal Grounding.

[BibT_eX]

[DOI]

Thong Thanh Nguyen

,

,

,

,

Cong-Duy T. Nguyen

,

,

CoRR, 2024

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

GalleryGPT: Analyzing Paintings with Large Multimodal Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Shapley Ensemble Adversarial Attack.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Cong-Duy Nguyen

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

Ensemble Diversity Facilitates Adversarial Transferability.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ask or Recommend: An Empirical Study on Conversational Product Search.

[BibT_eX]

[DOI]

,

,

Mohammad Aliannejadi

,

Evangelos Kanoulas

,

,

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives.

[BibT_eX]

[DOI]

,

,

,

,

,

Jay Zhangjie Wu

,

Cong-Duy Nguyen

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Composition-Aware Image Steganography Through Adversarial Self-Generated Supervision.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Neural Networks Learn. Syst., November, 2023

Personalized fashion outfit generation with user coordination preference learning.

[BibT_eX]

[DOI]

,

,

,

Inf. Process. Manag., September, 2023

Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2023

Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Multim., 2023

Solving Math Word Problems with Reexamination.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Cross-modal Consistency Learning with Fine-grained Fusion Network for Multimodal Fake News Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the ACM Multimedia Asia 2023, 2023

Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Modeling Multi-Relational Connectivity for Personalized Fashion Matching.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Non-Autoregressive Sentence Ordering.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

Multi-Modality MR Image Synthesis via Confidence-Guided Aggregation and Cross-Modality Refinement.

[BibT_eX]

[DOI]

,

,

,

,

IEEE J. Biomed. Health Informatics, 2022

Entity Slot Filling for Visual Captioning.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Circuits Syst. Video Technol., 2022

Non-Autoregressive Cross-Modal Coherence Modelling.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021

NTIRE 2021 Challenge on Perceptual Image Quality Assessment.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

José Costa Pereira

,

,

Steven McDonagh

,

,

,

,

,

Seyed Mehdi Ayyoubzadeh

,

,

Sid Ahmed Fezza

,

,

Wassim Hamidouche

,

,

,

,

,

Kiyoharu Aizawa

CoRR, 2021

Hierarchical Composition Learning for Composed Query Image Retrieval.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Progressive Graph Attention Network for Video Question Answering.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning Hierarchal Channel Attention for Fine-grained Visual Classification.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multi-Perspective Video Captioning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

IQMA Network: Image Quality Multi-Scale Assessment Network.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020

Graph-to-Tree Learning for Solving Math Word Problems.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Describing Video With Attention-Based Bidirectional LSTM.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE Trans. Cybern., 2019

Word-to-region attention network for visual question answering.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Multim. Tools Appl., 2019

English teaching practice based on artificial intelligence technology.

[BibT_eX]

[DOI]

,

Durbadal Mandal

J. Intell. Fuzzy Syst., 2019

Supervised Hashing with Recurrent Scaling.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Web and Big Data - Third International Joint Conference, 2019

MR-NET: Exploiting Mutual Relation for Visual Relationship Detection.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Video Captioning by Adversarial LSTM.

[BibT_eX]

[DOI]

,

,

,

,

,

,

IEEE Trans. Image Process., 2018

2017

Adaptively Attending to Visual Attributes and Linguistic Knowledge for Captioning.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2017 ACM on Multimedia Conference, 2017

CFM@MediaEval 2017 Retrieving Diverse Social Images Task via Re-ranking and Hierarchical Clustering.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

BMC@MediaEval 2017 Multimedia Satellite Task via Regression Random Forest.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

Training Data Selection for Cross-Project Defection Prediction: Which Approach Is Better?

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, 2017

2016

Combining multi-representation for multimedia event detection using co-training.

[BibT_eX]

[DOI]

,

,

,

Neurocomputing, 2016

Bidirectional Long-Short Term Memory for Video Description.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2016

Bidirectional Long-Short Term Memory for Video Description.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015

Local Multimodal Serial Analysis for Fusing EEG-fMRI: A New Method to Study Familial Cortical Myoclonic Tremor and Epilepsy.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

IEEE Trans. Auton. Ment. Dev., 2015

Loading...