Weicheng Kuo

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

RECLIP: Resource-efficient CLIP by Training with Small Images.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection.

[BibT_eX]

[DOI]

CoRR, 2023

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Open-Vocabulary Object Detection upon Frozen Vision and Language Models.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

PaLI: A Jointly-Scaled Multilingual Language-Image Model.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contrastive Feature Masking Open-Vocabulary Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning.

[BibT_eX]

[DOI]

A. J. Piergiovanni

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Learning Open-World Object Proposals Without Learning to Classify.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2022

F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

PaLI: A Jointly-Scaled Multilingual Language-Image Model.

[BibT_eX]

[DOI]

CoRR, 2022

Pre-training image-language transformers for open-vocabulary tasks.

[BibT_eX]

[DOI]

A. J. Piergiovanni

CoRR, 2022

Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2022

Open-vocabulary Object Detection via Vision and Language Knowledge Distillation.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Learning Representations, 2022

Video Question Answering with Iterative Video-Text Co-tokenization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

FindIt: Generalized Localization with Natural Language Queries.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

Noisy Student learning for cross-institution brain hemorrhage detection.

[BibT_eX]

[DOI]

Emily Lin

Esther L. Yuh

CoRR, 2021

Zero-Shot Detection via Vision and Language Knowledge Distillation.

[BibT_eX]

[DOI]

CoRR, 2021

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

2019

ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

PatchFCN for Intracranial Hemorrhage Detection.

[BibT_eX]

[DOI]

CoRR, 2018

Cost-Sensitive Active Learning for Intracranial Hemorrhage Detection.

[BibT_eX]

[DOI]

Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2018, 2018

From Lifestyle Vlogs to Everyday Interactions.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2015

DeepBox: Learning Objectness with Convolutional Networks.

[BibT_eX]

[DOI]