Weijia Wu

Orcid: 0000-0003-3912-7212

Affiliations:
  • National University of Singapore, Singapore


According to our database1, Weijia Wu authored at least 39 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A large cross-modal video retrieval dataset with reading comprehension.
Pattern Recognit., 2025

2024
End-to-End Video Text Spotting with Transformer.
Int. J. Comput. Vis., September, 2024

Continual Learning for Image Segmentation With Dynamic Query.
IEEE Trans. Circuits Syst. Video Technol., June, 2024

Binarizing by Classification: Is Soft Function Really Necessary?
IEEE Trans. Circuits Syst. Video Technol., February, 2024

DSText V2: A comprehensive video text spotting dataset for dense and small text.
Pattern Recognit., 2024

ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification.
CoRR, 2024

VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization.
CoRR, 2024

Towards Accurate Post-training Quantization for Reparameterized Models.
CoRR, 2024

Controllable Dense Captioner with Multimodal Embedding Bridging.
CoRR, 2024

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

ControlCap: Controllable Region-Level Captioning.
Proceedings of the Computer Vision - ECCV 2024, 2024

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

DragAnything: Motion Control for Anything Using Entity Representation.
Proceedings of the Computer Vision - ECCV 2024, 2024

DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Rethinking the Value of Local Feature Fusion in Convolutional Neural Networks.
Neural Process. Lett., December, 2023

Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization.
Neural Process. Lett., December, 2023

Paragraph-to-Image Generation with Information-Enriched Diffusion Model.
CoRR, 2023

MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
CoRR, 2023

Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM.
CoRR, 2023

ICDAR 2023 Video Text Reading Competition for Dense and Small Text.
CoRR, 2023

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PTQD: Accurate Post-Training Quantization for Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Explore Faster Localization Learning For Scene Text Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

ICDAR 2023 Competition on Video Text Reading for Dense and Small Text.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Generative Prompt Model for Weakly Supervised Object Localization.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

BiViT: Extremely Compressed Binary Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
BiViT: Extremely Compressed Binary Vision Transformer.
CoRR, 2022

A novel feature-based model for zero-shot object detection with simulated attributes.
Appl. Intell., 2022

Polygon-Free: Unconstrained Scene Text Detection with Box Annotations.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

2021
A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer.
CoRR, 2021

EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling.
CoRR, 2021

2020
SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervision and Dynamic Self-Training.
CoRR, 2020

Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild.
CoRR, 2020

Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
TextCohesion: Detecting Text for Arbitrary Shapes.
CoRR, 2019

Efficient Barcode Localization Method for Low-Quality Images.
Proceedings of the 3rd International Conference on Graphics and Signal Processing, 2019


  Loading...