Jun Wang

ORCID: 0009-0008-6066-4286

Affiliations:
  • Salesforce AI Research, Palo Alto, CA, USA
  • University of Maryland, College Park, MD, USA (PhD 2023)


According to our database, Jun Wang authored at least 15 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models.
CoRR, 2024

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions.
CoRR, 2024

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations.
CoRR, 2024

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models.
CoRR, 2024

EVE: Enabling Anyone to Train Robot using Augmented Reality.
CoRR, 2024

EVE: Enabling Anyone to Train Robots using Augmented Reality.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

2023
Deep Learning for Scene Perception and Understanding.
PhD thesis, 2023

Align and Attend: Multimodal Summarization with Dual Contrastive Losses.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation.
CoRR, 2022

M3DETR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

ESSumm: Extractive Speech Summarization from Untranscribed Meeting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

PointMotionNet: Point-Wise Motion Learning for Large-Scale LiDAR Point Clouds Sequences.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2020
InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling.
Proceedings of the Computer Vision - ECCV 2020, 2020

