Hongwei Xue

According to our database, Hongwei Xue authored at least 15 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Visual Perception by Large Language Model's Weights.
CoRR, 2024

Multi-Modal Generative Embedding Model.
CoRR, 2024

2023
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Stare at What You See: Masked Image Modeling without Reconstruction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
A coarse-to-fine and automatic algorithm for breast diagnosis on multi-series MRI images.
Frontiers Comput. Sci., 2022

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment.
CoRR, 2022

Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Tri-axial Motion Sensing with Mechanomagnetic Effect for Human-Machine Interface.
Proceedings of the Intelligent Robotics and Applications - 15th International Conference, 2022

Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training.
CoRR, 2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning Fine-Grained Motion Embedding for Landscape Animation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Semantic Tag Augmented XlanV Model for Video Captioning.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Unifying Multimodal Transformer for Bi-directional Image and Text Generation.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Sed-Net: Detecting Multi-Type Edits Of Images.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

