Quanzeng You

Orcid: 0000-0003-3608-0607

According to our database1, Quanzeng You authored at least 73 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation.
CoRR, 2024

BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data.
CoRR, 2024

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.
CoRR, 2024

Law of Vision Representation in MLLMs.
CoRR, 2024

Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model.
CoRR, 2024

ViTAR: Vision Transformer with Any Resolution.
CoRR, 2024

InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding.
CoRR, 2024

Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning.
CoRR, 2024

Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

COCO is "ALL" You Need for Visual Instruction Fine-tuning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Improving In-Context Learning in Diffusion Models with Visual Context-Modulated Prompts.
CoRR, 2023

Reason out Your Layout: Evoking the Layout Master from Large Language Models for Text-to-Image Synthesis.
CoRR, 2023

CORE-MM: Complex Open-Ended Reasoning Evaluation For Multi-Modal Large Language Models.
CoRR, 2023

RefineVIS: Video Instance Segmentation with Temporal Attention Refinement.
CoRR, 2023

MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Disentangled Representation Learning with Causality for Unsupervised Domain Adaptation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Deep Frequency Filtering for Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention.
CoRR, 2022

SA-VQA: Structured Alignment of Visual and Semantic Representations for Visual Question Answering.
CoRR, 2022

QUALIFIER: Question-Guided Self-Attentive Multimodal Fusion Network for Audio Visual Scene-Aware Dialog.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Lifelong Unsupervised Domain Adaptive Person Re-identification with Coordinated Anti-forgetting and Adaptation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Disentanglement-based Cross-Domain Feature Augmentation for Effective Unsupervised Domain Adaptive Person Re-identification.
CoRR, 2021

Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Sentiment Recognition for Short Annotated GIFs Using Visual-Textual Fusion.
IEEE Trans. Multim., 2020

Double-layer conditional random fields model for human action recognition.
Signal Process. Image Commun., 2020

A Benchmark Dataset for Understandable Medical Language Translation.
CoRR, 2020

Real-time 3D Deep Multi-Camera Tracking.
CoRR, 2020

2019
Real-time Multiple People Hand Localization in 4D Point Clouds.
CoRR, 2019

Action4D: Online Action Recognition in the Crowd and Clutter.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Action4D: Real-time Action Recognition in the Crowd and Clutter.
CoRR, 2018

Image Captioning at Will: A Versatile Scheme for Effectively Injecting Sentiments into Image Descriptions.
CoRR, 2018

Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Risk Prediction on Electronic Health Records with Prior Medical Knowledge.
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018

"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention.
Proceedings of the Computer Vision - ECCV 2018, 2018

End-to-End Convolutional Semantic Embeddings.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

KAME: Knowledge-based Attention Model for Diagnosis Prediction in Healthcare.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Touch Your Heart: A Tone-aware Chatbot for Customer Care on Social Media.
Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 2018

2017
Image-Based Appraisal of Real Estate Properties.
IEEE Trans. Multim., 2017

Adaptive Greedy Dictionary Selection for Web Media Summarization.
IEEE Trans. Image Process., 2017

Social Multimedia Sentiment Analysis.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

Cultural Diffusion and Trends in Facebook Photographs.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

When saliency meets sentiment: Understanding how image content invokes emotion and sentiment.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Aesthetic Quality Assessment of Photos with Faces.
Proceedings of the Image and Graphics - 9th International Conference, 2017

Visual Sentiment Analysis by Attending on Local Image Regions.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Sampling for Nyström Extension-Based Spectral Clustering: Incremental Perspective and Novel Analysis.
ACM Trans. Knowl. Discov. Data, 2016

A picture tells a thousand words - About you! User interest profiling from user generated visual content.
Signal Process., 2016

Image Based Appraisal of Real Estate Properties.
CoRR, 2016

Voting with Feet: Who are Leaving Hillary Clinton and Donald Trump?
CoRR, 2016

Cross-modality Consistent Regression for Joint Visual-Textual Sentiment Analysis of Social Multimedia.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Sentiment and Emotion Analysis for Social Multimedia: Methodologies and Applications.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Image Captioning with Semantic Attention.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

The effect of pets on happiness: A data-driven approach via large-scale social media.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The Benchmark.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
A Multifaceted Approach to Social Multimedia-Based Prediction of Elections.
IEEE Trans. Multim., 2015

Snap n' shop: Visual search-based mobile shopping made a breeze by machine and crowd intelligence.
Proceedings of the 9th IEEE International Conference on Semantic Computing, 2015

Joint Visual-Textual Sentiment Analysis with Deep Neural Networks.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Towards Lifestyle Understanding: Predicting Home and Vacation Locations from User's Online Photo Collections.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

Robust Image Sentiment Analysis Using Progressively Trained and Domain Transferred Deep Networks.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Sentiment Analysis Using Social Multimedia.
Proceedings of the Multimedia Data Mining and Analytics - Disruptive Innovation, 2015

2014
Transit tomography using probabilistic time geography: planning routes without a road map.
J. Locat. Based Serv., 2014

Inferring Home Location from User's Photo Collections based on Visual Content and Mobility Patterns.
Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014

The Eyes of the Beholder: Gender Prediction Using Images Posted in Online Social Networks.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

2013
Are there cultural differences in event driven information propagation over social media?
Proceedings of the 2nd international workshop on Socially-aware multimedia, 2013

Sentribute: image sentiment analysis from a mid-level perspective.
Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining, 2013

Towards social imagematics: sentiment analysis in social multimedia.
Proceedings of the Thirteenth International Workshop on Multimedia Data Mining, 2013

Towards Understanding the Effectiveness of Election Related Images in Social Media.
Proceedings of the 13th IEEE International Conference on Data Mining Workshops, 2013

2011
An improved spectral clustering algorithm based on random walk.
Frontiers Comput. Sci. China, 2011

Clusterability Analysis and Incremental Sampling for Nyström Extension Based Spectral Clustering.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011


  Loading...