Matthias Minderer

ORCID: 0000-0002-6428-8256

According to our database, Matthias Minderer authored at least 20 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2024

PaliGemma: A versatile 3B VLM for transfer.

CoRR, 2024

Improving fine-grained understanding in image-text pre-training.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection.

Proceedings of the Computer Vision - ECCV 2024, 2024

On Scaling Up a Multilingual Vision and Language Model.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

PaLI-X: On Scaling up a Multilingual Vision and Language Model.

CoRR, 2023

Scaling Open-Vocabulary Object Detection.

Matthias Minderer

Alexey A. Gritsenko

Neil Houlsby

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution.

Ibrahim M. Alabdulmohsin

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Scaling Vision Transformers to 22 Billion Parameters.

Proceedings of the International Conference on Machine Learning, 2023

Video OWL-ViT: Temporally-consistent open-world localization in video.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FlexiViT: One Model for All Patch Sizes.

Ibrahim Alabdulmohsin

Filip Pavetic

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Decoder Denoising Pretraining for Semantic Segmentation.

Emmanuel Asiedu Brempong

Trans. Mach. Learn. Res., 2022

Simple Open-Vocabulary Object Detection with Vision Transformers.

CoRR, 2022

Simple Open-Vocabulary Object Detection.

Proceedings of the Computer Vision - ECCV 2022, 2022

Denoising Pretraining for Semantic Segmentation.

Emmanuel Asiedu Brempong

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

SCENIC: A JAX Library for Computer Vision Research and Beyond.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Revisiting the Calibration of Modern Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale.

Proceedings of the 9th International Conference on Learning Representations, 2021

On Robustness and Transferability of Convolutional Neural Networks.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Automatic Shortcut Removal for Self-Supervised Representation Learning.

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Unsupervised learning of object structure and dynamics from videos.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
