Zhiwu Lu

Orcid: 0000-0001-6429-7956

Affiliations:
  • Renmin University of China, School of Information, Beijing Key Laboratory of Big Data Management and Analysis Methods, Beijing, China
  • City University of Hong Kong (PhD 2011)
  • Peking University, Beijing, China


According to our database, Zhiwu Lu authored at least 163 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents.
CoRR, 2024

CoTBal: Comprehensive Task Balancing for Multi-Task Visual Instruction Tuning.
CoRR, 2024

Enhancing Class-Incremental Learning for Image Classification via Bidirectional Transport and Selective Momentum.
Proceedings of the 2024 International Conference on Multimedia Retrieval, 2024

VDT: General-purpose Video Diffusion Transformers via Mask Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multi-Level Contrastive Learning For Hybrid Cross-Modal Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2024

Progressive Image Synthesis from Semantics to Details with Denoising Diffusion GAN.
Proceedings of the IEEE International Conference on Acoustics, 2024

Image Retrieval with Composed Query by Multi-Scale Multi-Modal Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2024

Unsupervised Continual Learning of Image Representation Via Rememory-Based Simsiam.
Proceedings of the IEEE International Conference on Acoustics, 2024

VEMO: A Versatile Elastic Multi-modal Model for Search-Oriented Multi-task Learning.
Proceedings of the Advances in Information Retrieval, 2024

Dual-Enhanced Coreset Selection with Class-Wise Collaboration for Online Blurry Class Incremental Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval.
Mach. Intell. Res., August, 2023

Text-to-Chinese-painting Method Based on Multi-domain VQGAN.
Int. J. Softw. Informatics, 2023

VDT: An Empirical Study on Video Diffusion with Transformers.
CoRR, 2023

TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat.
CoRR, 2023

Improvable Gap Balancing for Multi-Task Learning.
Proceedings of the Uncertainty in Artificial Intelligence, 2023

CTDA: Contrastive Temporal Domain Adaptation for Action Segmentation.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

Binary Neural Network for Video Action Recognition.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Shot Retrieval and Assembly with Text Script for Video Montage Generation.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

Learning with Adaptive Knowledge for Continual Image-Text Modeling.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

CMMT: Cross-Modal Meta-Transformer for Video-Text Retrieval.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

DiST-GAN: Distillation-based Semantic Transfer for Text-Guided Face Generation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Mixup-Inspired Video Class-Incremental Learning.
Proceedings of the IEEE International Conference on Data Mining, 2023

Task-Sensitive Discriminative Mutual Attention Network for Few-Shot Learning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

R-STAR: Robust Self-Taught Task-Wise Reweighting for Rehearsal-Based Class Incremental Learning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Song-to-Video Translation: Writing a Video from Song Lyrics Based on Multimodal Pre-training.
Proceedings of the Advanced Data Mining and Applications - 19th International Conference, 2023

2022
A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language.
CoRR, 2022

Multimodal foundation models are better simulators of the human brain.
CoRR, 2022

Supervised Contrastive Learning for Few-Shot Action Classification.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

LGDN: Language-Guided Denoising Network for Video-Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Fine-Grained Analysis of Stability and Generalization for Modern Meta Learning Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

BMU-MoCo: Bidirectional Momentum Update for Continual Video-Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Fast-Rate PAC-Bayesian Generalization Bounds for Meta-Learning.
Proceedings of the International Conference on Machine Learning, 2022

Task Relatedness-Based Generalization Bounds for Meta Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Learning Versatile Neural Architectures by Propagating Network Codes.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Text2Poster: Laying Out Stylized Texts on Retrieved Images.
Proceedings of the IEEE International Conference on Acoustics, 2022

COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Visual Prompt Tuning for Few-Shot Text Classification.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

SST-VLM: Sparse Sampling-Twice Inspired Video-Language Model.
Proceedings of the Computer Vision - ACCV 2022, 2022

SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Variational Context: Exploiting Visual and Textual Context for Grounding Referring Expressions.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Zero and Few Shot Learning With Semantic Feature Synthesis and Competitive Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model.
CoRR, 2021

Pre-Trained Models: Past, Present and Future.
CoRR, 2021

WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training.
CoRR, 2021

Contrastive Prototype Learning with Augmented Embeddings for Few-Shot Learning.
CoRR, 2021

Pre-trained models: Past, present and future.
AI Open, 2021

Domain-Adaptive Few-Shot Learning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

AdarGCN: Adaptive Aggregation GCN for Few-Shot Learning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Vid2Int: Detecting Implicit Intention from Long Dialog Videos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Contrastive prototype learning with augmented embeddings for few-shot learning.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Compressed Video Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Supervised Video Representation Learning with Constrained Spatiotemporal Jigsaw.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Weakly-Supervised Attribute Segmentation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Complex Action Segmentation in Compressed Videos.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

IEPT: Instance-Level and Episode-Level Pretext Tasks for Few-Shot Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

MELR: Meta-Learning via Modeling Episode-Level Relationships for Few-Shot Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Z-Score Normalization, Hubness, and Few-Shot Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Counterfactual VQA: A Cause-Effect Look at Language Bias.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

A Global Occlusion-Aware Approach to Self-Supervised Monocular Visual Odometry.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Transferrable Feature and Projection Learning with Class Hierarchy for Zero-Shot Learning.
Int. J. Comput. Vis., 2020

Margin-Based Transfer Bounds for Meta Learning with Deep Feature Embedding.
CoRR, 2020

Domain-Adaptive Few-Shot Learning.
CoRR, 2020

AdarGCN: Adaptive Aggregation GCN for Few-Shot Learning.
CoRR, 2020

Discriminativeness-Preserved Domain Adaptation for Few-Shot Learning.
IEEE Access, 2020

Lightweight Action Recognition in Compressed Videos.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Learning Depth-Guided Convolutions for Monocular 3D Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Large-Scale Cross-Domain Few-Shot Learning.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Few-Shot Zero-Shot Learning: Knowledge Transfer with Less Supervision.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation.
IEEE Trans. Image Process., 2019

Cross-domain mapping learning for transductive zero-shot learning.
Comput. Vis. Image Underst., 2019

Mobile Video Action Recognition.
CoRR, 2019

Coarse-to-Fine Grained Classification.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

Joint Projection and Subspace Learning for Zero-Shot Recognition.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Zero-Shot Learning with Few Seen Class Samples.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

RUM: Network Representation Learning Using Motifs.
Proceedings of the 35th IEEE International Conference on Data Engineering, 2019

Recursive Visual Attention in Visual Dialog.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Large-Scale Few-Shot Learning: Knowledge Transfer With Class Hierarchy.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Face-Focused Cross-Stream Network for Deception Detection in Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Large-Scale Sparse Learning From Noisy Tags for Semantic Segmentation.
IEEE Trans. Cybern., 2018

Zero-Shot Learning with Sparse Attribute Propagation.
CoRR, 2018

Domain-Invariant Projection Learning for Zero-Shot Recognition.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

InsightGAN: Semi-Supervised Feature Learning with Generative Adversarial Network for Drug Abuse Detection.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

Zero-Shot Learning with Superclasses.
Proceedings of the Neural Information Processing - 25th International Conference, 2018

DeepInsight: Multi-Task Multi-Scale Deep Learning for Mental Disorder Diagnosis.
Proceedings of the British Machine Vision Conference 2018, 2018

Extreme Reverse Projection Learning for Zero-Shot Recognition.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Zero-Shot Scene Classification for High Spatial Resolution Remote Sensing Images.
IEEE Trans. Geosci. Remote. Sens., 2017

Multi-instance dictionary learning via multivariate performance measure optimization.
Pattern Recognit., 2017

Learning from Weak and Noisy Labels for Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

RUM: network Representation learning throUgh Multi-level structural information preservation.
CoRR, 2017

Zero-Shot Fine-Grained Classification by Deep Feature Learning with Semantics.
CoRR, 2017

Graph-boosted convolutional neural networks for semantic segmentation.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

FeaBoost: Joint Feature and Label Refinement for Semantic Segmentation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Image classification by visual bag-of-words refinement and reduction.
Neurocomputing, 2016

CMsearch: simultaneous exploration of protein sequence space and structure space improves not only protein homology detection but also protein structure prediction.
Bioinform., 2016

Large Scale Sparse Clustering.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Segmentation with Selectively Propagated Constraints.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

2015
Semantic Sparse Recoding of Visual Content for Image Applications.
IEEE Trans. Image Process., 2015

Noise-robust semi-supervised learning via fast sparse coding.
Pattern Recognit., 2015

Learning descriptive visual representation for image classification and annotation.
Pattern Recognit., 2015

Local similarity learning for pairwise constraint propagation.
Multim. Tools Appl., 2015

Pairwise Constraint Propagation on Multi-View Data.
CoRR, 2015

Pairwise Constraint Propagation: A Survey.
CoRR, 2015

Community Based Spammer Detection in Social Networks.
Proceedings of the Web-Age Information Management - 16th International Conference, 2015

Weakly Supervised Matrix Factorization for Noisily Tagged Image Parsing.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Social Image Parsing by Cross-Modal Data Refinement.
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Noise-Robust Semi-Supervised Learning by Large-Scale Sparse Coding.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Spatial temporal pyramid matching using temporal sparse representation for human motion retrieval.
Vis. Comput., 2014

Graph-based multimodal semi-supervised image classification.
Neurocomputing, 2014

Can Image-Level Labels Replace Pixel-Level Labels for Image Parsing.
CoRR, 2014

Direct Semantic Analysis for Social Image Classification.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Latent semantic learning with structured sparse representation for human action recognition.
Pattern Recognit., 2013

L1-graph construction using structured sparsity.
Neurocomputing, 2013

Exhaustive and Efficient Constraint Propagation: A Graph-Based Learning Approach and Its Applications.
Int. J. Comput. Vis., 2013

Learning Descriptive Visual Representation by Semantic Regularized Matrix Factorization.
Proceedings of the IJCAI 2013, 2013

Multimodal semi-supervised image classification by combining tag refinement, graph-based learning and support vector regression.
Proceedings of the IEEE International Conference on Image Processing, 2013

Unified Constraint Propagation on Multi-View Data.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Image annotation by semantic sparse recoding of visual content.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Modalities consensus for multi-modal constraint propagation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Heterogeneous Constraint Propagation with Constrained Sparse Representation.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

2011
Automatic Image Annotation Based on Generalized Relevance Models.
J. Signal Process. Syst., 2011

Spatial Markov Kernels for Image Categorization and Annotation.
IEEE Trans. Syst. Man Cybern. Part B, 2011

Contextual Kernel and Spectral Methods for Learning the Semantics of Images.
IEEE Trans. Image Process., 2011

Combining multiple clusterings using fast simulated annealing.
Pattern Recognit. Lett., 2011

Robust Image Analysis by L1-Norm Semi-supervised Learning.
CoRR, 2011

Exhaustive and Efficient Constraint Propagation: A Semi-Supervised Learning Perspective and Its Applications.
CoRR, 2011

Combining latent semantic learning and reduced hypergraph learning for semi-supervised image categorization.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Multi-modal constraint propagation for heterogeneous image clustering.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Spectral learning of latent semantics for action recognition.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Latent Semantic Learning by Efficient Sparse Coding with Hypergraph Regularization.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Symmetric Graph Regularized Constraint Propagation.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Combining Context, Consistency, and Diversity Cues for Interactive Image Categorization.
IEEE Trans. Multim., 2010

Gaussian mixture learning via robust competitive agglomeration.
Pattern Recognit. Lett., 2010

Image categorization via robust pLSA.
Pattern Recognit. Lett., 2010

Action Recognition Based on Learnt Motion Semantic Vocabulary.
Proceedings of the Advances in Multimedia Information Processing - PCM 2010, 2010

Constrained Spectral Clustering via Exhaustive and Efficient Constraint Propagation.
Proceedings of the Computer Vision - ECCV 2010, 2010

2009
Generalized Competitive Learning of Gaussian Mixture Models.
IEEE Trans. Syst. Man Cybern. Part B, 2009

Generalized Relevance Models for Automatic Image Annotation.
Proceedings of the Advances in Multimedia Information Processing, 2009

Semantic concept annotation based on audio PLSA model.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Image categorization by learning with context and consistency.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Image categorization with spatial mismatch kernels.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Context-based multi-label image annotation.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

View topics: automatically generated characteristic view for content-based 3D object retrieval.
Proceedings of the 8th ACM International Conference on Image and Video Retrieval, 2009

Image Categorization Based on a Hierarchical Spatial Markov Model.
Proceedings of the Computer Analysis of Images and Patterns, 13th International Conference, 2009

2008
A Semi-supervised Learning Algorithm on Gaussian Mixture with Automatic Model Selection.
Neural Process. Lett., 2008

Unsupervised learning of finite mixtures using entropy regularization and its application to image segmentation.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

From Comparing Clusterings to Combining Clusterings.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Entropy Regularized Likelihood Learning on Gaussian Mixture: Two Gradient Implementations for Automatic Model Selection.
Neural Process. Lett., 2007

Entropy Regularization, Automatic Model Selection, and Unsupervised Image Segmentation.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2007

Unsupervised Image Categorization Using Constrained Entropy-Regularized Likelihood Learning with Pairwise Constraints.
Proceedings of the Advances in Neural Networks, 2007

Asymptotic Convergence Properties of Entropy Regularized Likelihood Learning on Finite Mixtures with Automatic Model Selection.
Proceedings of the Advances in Neural Networks, 2007

2006
A regularized minimum cross-entropy algorithm on mixtures of experts for time series prediction and curve detection.
Pattern Recognit. Lett., 2006

An iterative algorithm for entropy regularized likelihood learning on Gaussian mixture with automatic model selection.
Neurocomputing, 2006

A Generalized Competitive Learning Algorithm on Gaussian Mixture with Automatic Model Selection.
Proceedings of the Rough Sets and Knowledge Technology, First International Conference, 2006

A Publishing Framework for Digitally Augmented Paper Documents: Towards Cross-Media Information Integration.
Proceedings of the Advances in Multimedia Information Processing, 2006

A Gradient Entropy Regularized Likelihood Learning Algorithm on Gaussian Mixture with Automatic Model Selection.
Proceedings of the Advances in Neural Networks - ISNN 2006, Third International Symposium on Neural Networks, Chengdu, China, May 28, 2006

A Regularized Minimum Cross-Entropy Algorithm on Mixtures of Experts for Time Series Prediction.
Proceedings of the Advances in Neural Networks - ISNN 2006, Third International Symposium on Neural Networks, Chengdu, China, May 28, 2006

Unsupervised Image Segmentation Using an Iterative Entropy Regularized Likelihood Learning Algorithm.
Proceedings of the Advances in Neural Networks - ISNN 2006, Third International Symposium on Neural Networks, Chengdu, China, May 28, 2006

2005
A Gradient BYY Harmony Learning Algorithm on Mixture of Experts for Curve Detection.
Proceedings of the Intelligent Data Engineering and Automated Learning, 2005

