Yi Bin

Orcid: 0000-0001-9714-8738

According to our database, Yi Bin authored at least 54 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Focusing on Relevant Responses for Multi-Modal Rumor Detection.
IEEE Trans. Knowl. Data Eng., November, 2024

Set of Diverse Queries With Uncertainty Regularization for Composed Image Retrieval.
IEEE Trans. Circuits Syst. Video Technol., October, 2024

Filter-based Stance Network for Rumor Verification.
ACM Trans. Inf. Syst., July, 2024

Align and Retrieve: Composition and Decomposition Learning in Image Retrieval With Text Feedback.
IEEE Trans. Multim., 2024

Pixel Bleach Network for Detecting Face Forgery Under Compression.
IEEE Trans. Multim., 2024

Text-to-image Generation Based on Conditional Semantic Augmentation.
Int. J. Softw. Informatics, 2024

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping.
CoRR, 2024

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs.
CoRR, 2024

Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

GalleryGPT: Analyzing Paintings with Large Multimodal Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Leveraging Weak Cross-Modal Guidance for Coherence Modelling via Iterative Learning.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Shapley Ensemble Adversarial Attack.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Ensemble Diversity Facilitates Adversarial Transferability.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ask or Recommend: An Empirical Study on Conversational Product Search.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Composition-Aware Image Steganography Through Adversarial Self-Generated Supervision.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Personalized fashion outfit generation with user coordination preference learning.
Inf. Process. Manag., September, 2023

Asynchronous Generative Adversarial Network for Asymmetric Unpaired Image-to-Image Translation.
IEEE Trans. Multim., 2023

Multi-Modal Transformer With Global-Local Alignment for Composed Query Image Retrieval.
IEEE Trans. Multim., 2023

Solving Math Word Problems with Reexamination.
CoRR, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.
CoRR, 2023

Cross-modal Consistency Learning with Fine-grained Fusion Network for Multimodal Fake News Detection.
Proceedings of the ACM Multimedia Asia 2023, 2023

Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Modeling Multi-Relational Connectivity for Personalized Fashion Matching.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Non-Autoregressive Sentence Ordering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Non-Autoregressive Math Word Problem Solver with Unified Tree Structure.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Multi-Modality MR Image Synthesis via Confidence-Guided Aggregation and Cross-Modality Refinement.
IEEE J. Biomed. Health Informatics, 2022

Entity Slot Filling for Visual Captioning.
IEEE Trans. Circuits Syst. Video Technol., 2022

Non-Autoregressive Cross-Modal Coherence Modelling.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
NTIRE 2021 Challenge on Perceptual Image Quality Assessment.
CoRR, 2021

Hierarchical Composition Learning for Composed Query Image Retrieval.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Progressive Graph Attention Network for Video Question Answering.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Learning Hierarchal Channel Attention for Fine-grained Visual Classification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Multi-Perspective Video Captioning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

IQMA Network: Image Quality Multi-Scale Assessment Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

2020
Graph-to-Tree Learning for Solving Math Word Problems.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Describing Video With Attention-Based Bidirectional LSTM.
IEEE Trans. Cybern., 2019

Word-to-region attention network for visual question answering.
Multim. Tools Appl., 2019

English teaching practice based on artificial intelligence technology.
J. Intell. Fuzzy Syst., 2019

Supervised Hashing with Recurrent Scaling.
Proceedings of the Web and Big Data - Third International Joint Conference, 2019

MR-NET: Exploiting Mutual Relation for Visual Relationship Detection.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Video Captioning by Adversarial LSTM.
IEEE Trans. Image Process., 2018

2017
Adaptively Attending to Visual Attributes and Linguistic Knowledge for Captioning.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

CFM@MediaEval 2017 Retrieving Diverse Social Images Task via Re-ranking and Hierarchical Clustering.
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

BMC@MediaEval 2017 Multimedia Satellite Task via Regression Random Forest.
Proceedings of the Working Notes Proceedings of the MediaEval 2017 Workshop co-located with the Conference and Labs of the Evaluation Forum (CLEF 2017), 2017

Training Data Selection for Cross-Project Defection Prediction: Which Approach Is Better?
Proceedings of the 2017 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, 2017

2016
Combining multi-representation for multimedia event detection using co-training.
Neurocomputing, 2016

Bidirectional Long-Short Term Memory for Video Description.
CoRR, 2016

Bidirectional Long-Short Term Memory for Video Description.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015
Local Multimodal Serial Analysis for Fusing EEG-fMRI: A New Method to Study Familial Cortical Myoclonic Tremor and Epilepsy.
IEEE Trans. Auton. Ment. Dev., 2015

