Haoyu Lu

Orcid: 0000-0003-2620-6296

According to our database, Haoyu Lu authored at least 35 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
BotCL: a social bot detection model based on graph contrastive learning.
Knowl. Inf. Syst., September, 2024

Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining.
CoRR, 2024

Exploring the Design Space of Visual Context Representation in Video MLLMs.
CoRR, 2024

Towards Event-oriented Long Video Understanding.
CoRR, 2024

Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs.
CoRR, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding.
CoRR, 2024

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism.
CoRR, 2024

VDT: General-purpose Video Diffusion Transformers via Mask Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Multi-Level Contrastive Learning For Hybrid Cross-Modal Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2024

Progressive Image Synthesis from Semantics to Details with Denoising Diffusion GAN.
Proceedings of the IEEE International Conference on Acoustics, 2024

VEMO: A Versatile Elastic Multi-modal Model for Search-Oriented Multi-task Learning.
Proceedings of the Advances in Information Retrieval, 2024

2023
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval.
Mach. Intell. Res., August, 2023

VDT: An Empirical Study on Video Diffusion with Transformers.
CoRR, 2023

Shot Retrieval and Assembly with Text Script for Video Montage Generation.
Proceedings of the 2023 ACM International Conference on Multimedia Retrieval, 2023

A Topology Based Denoising Approach for 2D Scalar Fields.
Proceedings of the IEEE International Conference on Image Processing, 2023

BotCS: A Lightweight Model for Large-Scale Twitter Bot Detection Comparable to GNN-Based Models.
Proceedings of the IEEE International Conference on Communications, 2023

Speech and Noise Dual-Stream Spectrogram Refine Network With Speech Distortion Loss For Robust Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

PSMiner: A Pattern-Aware Accelerator for High-Performance Streaming Graph Pattern Mining.
Proceedings of the 60th ACM/IEEE Design Automation Conference, 2023

2022
Image fragile watermarking algorithm based on deneighbourhood mapping.
IET Image Process., 2022

Monolingual Recognizers Fusion for Code-switching Speech Recognition.
CoRR, 2022

Multimodal foundation models are better simulators of the human brain.
CoRR, 2022

Image Fragile Watermarking Algorithm Based on Deneighborhood Mapping.
CoRR, 2022

MHCRoBERTa: pan-specific peptide-MHC class I binding prediction through transfer learning with label-agnostic protein sequences.
Briefings Bioinform., 2022

LGDN: Language-Guided Denoising Network for Video-Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

BMU-MoCo: Bidirectional Momentum Update for Continual Video-Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Versatile Neural Architectures by Propagating Network Codes.
Proceedings of the Tenth International Conference on Learning Representations, 2022

COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
WenLan 2.0: Make AI Imagine via a Multimodal Foundation Model.
CoRR, 2021

WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training.
CoRR, 2021

Compressed Video Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Self-Supervised Video Representation Learning with Constrained Spatiotemporal Jigsaw.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

2020
PremPS: Predicting the impact of missense mutations on protein stability.
PLoS Comput. Biol., 2020

2019
Affine invariant image watermarking scheme based on ASIFT and Delaunay tessellation.
Multim. Tools Appl., 2019

Deep neural network-based image copyright protection scheme.
J. Electronic Imaging, 2019

