Wenqiao Zhang

Orcid: 0000-0002-5988-7609

According to our database, Wenqiao Zhang authored at least 57 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Unleash the Power of Inconsistency-Based Semi-Supervised Active Learning by Dynamic Programming of Curriculum Learning.
IEEE Trans. Knowl. Data Eng., November, 2024

RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection.
CoRR, 2024

Align²LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation.
CoRR, 2024

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition.
CoRR, 2024

Leveraging Invariant Principle for Heterophilic Graph Structure Distribution Shifts.
CoRR, 2024

IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization.
CoRR, 2024

Bridging Local Details and Global Context in Text-Attributed Graphs.
CoRR, 2024

Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration.
CoRR, 2024

DuetRAG: Collaborative Retrieval-Augmented Generation.
CoRR, 2024

LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation.
CoRR, 2024

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models.
CoRR, 2024

MART: Learning Hierarchical Music Audio Representations with Part-Whole Transformer.
Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Intelligent Model Update Strategy for Sequential Recommendation.
Proceedings of the ACM on Web Conference 2024, 2024

Semantic Codebook Learning for Dynamic Recommendation Models.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Bridging Local Details and Global Context in Text-Attributed Graphs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DIEM: Decomposition-Integration Enhancing Multimodal Insights.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
METER: A Dynamic Concept Adaptation Framework for Online Anomaly Detection.
Proc. VLDB Endow., December, 2023

Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Personalized Latent Structure Learning for Recommendation.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Music-PAW: Learning Music Representations via Hierarchical Part-whole Interaction and Contrast.
CoRR, 2023

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer.
CoRR, 2023

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
CoRR, 2023

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.
CoRR, 2023

Denoising Multi-modal Sequential Recommenders with Contrastive Learning.
CoRR, 2023

Toward Cohort Intelligence: A Universal Cohort Representation Learning Framework for Electronic Health Record Analysis.
CoRR, 2023

CAusal and collaborative proxy-tasKs lEarning for Semi-Supervised Domain Adaptation.
CoRR, 2023

IDEAL: Toward High-efficiency Device-Cloud Collaborative and Dynamic Recommendation System.
CoRR, 2023

DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device Model Generalization.
Proceedings of the ACM Web Conference 2023, 2023

Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ART: rule bAsed futuRe-inference deducTion.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

WINNER: Weakly-supervised hIerarchical decompositioN and aligNment for spatio-tEmporal video gRounding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Parameters Efficient Fine-Tuning for Long-Tailed Sequential Recommendation.
Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

Multi-modal Action Chain Abductive Reasoning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Design and Analysis of a 2-DOF Electromagnetic Actuator with an Improved Halbach Array for the Magnetic Suspension Platform.
Sensors, 2022

BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval.
CoRR, 2022

DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly-supervised Disentanglement Network for Video Fingerspelling Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Tell and guess: cooperative learning for natural image caption generation with hierarchical refined attention.
Multim. Tools Appl., 2021

Consensus Graph Representation Learning for Better Grounded Image Captioning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Frame Augmented Alternating Attention Network for Video Question Answering.
IEEE Trans. Multim., 2020

A Novel 2-DOF Lorentz Force Actuator for the Modular Magnetic Suspension Platform.
Sensors, 2020

Relational Graph Learning for Grounded Video Description Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Photo Stream Question Answer.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019
VPModel: High-Fidelity Product Simulation in a Virtual-Physical Environment.
IEEE Trans. Vis. Comput. Graph., 2019

