Quanzeng You
Orcid: 0000-0003-3608-0607
According to our database, Quanzeng You authored at least 73 papers between 2011 and 2024.
Bibliography
2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation.
CoRR, 2024
BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data.
CoRR, 2024
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.
CoRR, 2024
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model.
CoRR, 2024
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning.
CoRR, 2024
Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts.
CoRR, 2023
Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis.
CoRR, 2023
CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models.
CoRR, 2023
CoRR, 2023
MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
Disentangled Representation Learning with Causality for Unsupervised Domain Adaptation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
CoRR, 2022
SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering.
CoRR, 2022
QUALIFIER: Question-Guided Self-Attentive Multimodal Fusion Network for Audio Visual Scene-Aware Dialog.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022
Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification.
CoRR, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
IEEE Trans. Multim., 2020
Signal Process. Image Commun., 2020
2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Image Captioning at Will: A Versatile Scheme for Effectively Injecting Sentiments into Image Descriptions.
CoRR, 2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018
"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention.
Proceedings of the Computer Vision - ECCV 2018, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018
2017
IEEE Trans. Image Process., 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017
Proceedings of the Eleventh International Conference on Web and Social Media, 2017
When saliency meets sentiment: Understanding how image content invokes emotion and sentiment.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017
Proceedings of the Image and Graphics - 9th International Conference, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Sampling for Nyström Extension-Based Spectral Clustering: Incremental Perspective and Novel Analysis.
ACM Trans. Knowl. Discov. Data, 2016
A picture tells a thousand words - About you! User interest profiling from user generated visual content.
Signal Process., 2016
Cross-modality Consistent Regression for Joint Visual-Textual Sentiment Analysis of Social Multimedia.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016
Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Sentiment and Emotion Analysis for Social Multimedia: Methodologies and Applications.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
The effect of pets on happiness: A data-driven approach via large-scale social media.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016
Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
IEEE Trans. Multim., 2015
Snap n' shop: Visual search-based mobile shopping made a breeze by machine and crowd intelligence.
Proceedings of the 9th IEEE International Conference on Semantic Computing, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Towards Lifestyle Understanding: Predicting Home and Vacation Locations from User's Online Photo Collections.
Proceedings of the Ninth International Conference on Web and Social Media, 2015
Robust Image Sentiment Analysis Using Progressively Trained and Domain Transferred Deep Networks.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
Proceedings of the Multimedia Data Mining and Analytics - Disruptive Innovation, 2015
2014
Transit tomography using probabilistic time geography: planning routes without a road map.
J. Locat. Based Serv., 2014
Inferring Home Location from User's Photo Collections based on Visual Content and Mobility Patterns.
Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014
The Eyes of the Beholder: Gender Prediction Using Images Posted in Online Social Networks.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014
2013
Are there cultural differences in event driven information propagation over social media?
Proceedings of the 2nd international workshop on Socially-aware multimedia, 2013
Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining, 2013
Proceedings of the Thirteenth International Workshop on Multimedia Data Mining, 2013
Proceedings of the 13th IEEE International Conference on Data Mining Workshops, 2013
2011
Frontiers Comput. Sci. China, 2011
Clusterability Analysis and Incremental Sampling for Nyström Extension Based Spectral Clustering.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011