Weicheng Kuo

According to our database1, Weicheng Kuo authored at least 29 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Learning Visual Grounding from Generative Vision and Language Model.
CoRR, 2024

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Region-Centric Image-Language Pretraining for Open-Vocabulary Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
RECLIP: Resource-efficient CLIP by Training with Small Images.
Trans. Mach. Learn. Res., 2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.
Trans. Mach. Learn. Res., 2023

Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection.
CoRR, 2023

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-Vocabulary Object Detection upon Frozen Vision and Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

PaLI: A Jointly-Scaled Multilingual Language-Image Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contrastive Feature Masking Open-Vocabulary Vision Transformer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Learning Open-World Object Proposals Without Learning to Classify.
IEEE Robotics Autom. Lett., 2022

F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models.
CoRR, 2022

PaLI: A Jointly-Scaled Multilingual Language-Image Model.
CoRR, 2022

Pre-training image-language transformers for open-vocabulary tasks.
CoRR, 2022

Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering.
CoRR, 2022

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Video Question Answering with Iterative Video-Text Co-tokenization.
Proceedings of the Computer Vision - ECCV 2022, 2022

FindIt: Generalized Localization with Natural Language Queries.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Noisy Student learning for cross-institution brain hemorrhage detection.
CoRR, 2021

Zero-Shot Detection via Vision and Language Knowledge Distillation.
CoRR, 2021

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
PatchFCN for Intracranial Hemorrhage Detection.
CoRR, 2018

Cost-Sensitive Active Learning for Intracranial Hemorrhage Detection.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

From Lifestyle Vlogs to Everyday Interactions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2015
DeepBox: Learning Objectness with Convolutional Networks.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015


  Loading...