Wenqiao Zhang

Orcid: 0000-0002-5988-7609

According to our database, Wenqiao Zhang authored at least 57 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

Unleash the Power of Inconsistency-Based Semi-Supervised Active Learning by Dynamic Programming of Curriculum Learning.

IEEE Trans. Knowl. Data Eng., November, 2024

RADAR: Robust Two-stage Modality-incomplete Industrial Anomaly Detection.

CoRR, 2024

Align²LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation.

CoRR, 2024

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition.

CoRR, 2024

Leveraging Invariant Principle for Heterophilic Graph Structure Distribution Shifts.

CoRR, 2024

IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization.

CoRR, 2024

Bridging Local Details and Global Context in Text-Attributed Graphs.

CoRR, 2024

Backpropogation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration.

CoRR, 2024

DuetRAG: Collaborative Retrieval-Augmented Generation.

CoRR, 2024

LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation.

CoRR, 2024

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models.

CoRR, 2024

MART: Learning Hierarchical Music Audio Representations with Part-Whole Transformer.

Proceedings of the Companion Proceedings of the ACM on Web Conference 2024, 2024

Intelligent Model Update Strategy for Sequential Recommendation.

Proceedings of the ACM on Web Conference 2024, 2024

Semantic Codebook Learning for Dynamic Recommendation Models.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fact: Teaching MLLMs with Faithful, Concise and Transferable Rationales.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Bridging Local Details and Global Context in Text-Attributed Graphs.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer.

Wenqiao Zhang

Zheqi Lv

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DIEM: Decomposition-Integration Enhancing Multimodal Insights.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

METER: A Dynamic Concept Adaptation Framework for Online Anomaly Detection.

Proc. VLDB Endow., December, 2023

Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding.

IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Personalized Latent Structure Learning for Recommendation.

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Music-PAW: Learning Music Representations via Hierarchical Part-whole Interaction and Contrast.

CoRR, 2023

Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer.

CoRR, 2023

De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.

CoRR, 2023

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.

CoRR, 2023

Denoising Multi-modal Sequential Recommenders with Contrastive Learning.

CoRR, 2023

Toward Cohort Intelligence: A Universal Cohort Representation Learning Framework for Electronic Health Record Analysis.

CoRR, 2023

CAusal and collaborative proxy-tasKs lEarning for Semi-Supervised Domain Adaptation.

CoRR, 2023

IDEAL: Toward High-efficiency Device-Cloud Collaborative and Dynamic Recommendation System.

CoRR, 2023

DUET: A Tuning-Free Device-Cloud Collaborative Parameters Generation Framework for Efficient Device Model Generalization.

Proceedings of the ACM Web Conference 2023, 2023

Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning.

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ART: rule bAsed futuRe-inference deducTion.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

WINNER: Weakly-supervised hIerarchical decompositioN and aligNment for spatio-tEmporal video gRounding.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Parameters Efficient Fine-Tuning for Long-Tailed Sequential Recommendation.

Proceedings of the Artificial Intelligence - Third CAAI International Conference, 2023

Multi-modal Action Chain Abductive Reasoning.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Design and Analysis of a 2-DOF Electromagnetic Actuator with an Improved Halbach Array for the Magnetic Suspension Platform.

Sensors, 2022

BOSS: Bottom-up Cross-modal Semantic Composition with Hybrid Counterfactual Training for Robust Content-based Image Retrieval.

CoRR, 2022

DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Weakly-supervised Disentanglement Network for Video Fingerspelling Detection.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Collaborative Intelligence Orchestration: Inconsistency-Based Fusion of Semi-Supervised Learning and Active Learning.

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

BoostMIS: Boosting Medical Image Semi-supervised Learning with Adaptive Pseudo Labeling and Informative Active Annotation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Tell and guess: cooperative learning for natural image caption generation with hierarchical refined attention.

Multim. Tools Appl., 2021

Consensus Graph Representation Learning for Better Grounded Image Captioning.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Frame Augmented Alternating Attention Network for Video Question Answering.

IEEE Trans. Multim., 2020

A Novel 2-DOF Lorentz Force Actuator for the Modular Magnetic Suspension Platform.

Sensors, 2020

Relational Graph Learning for Grounded Video Description Generation.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Photo Stream Question Answer.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

2019

VPModel: High-Fidelity Product Simulation in a Virtual-Physical Environment.

IEEE Trans. Vis. Comput. Graph., 2019
