Matthias Minderer

Orcid: 0000-0002-6428-8256

According to our database1, Matthias Minderer authored at least 20 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
PaliGemma: A versatile 3B VLM for transfer.
CoRR, 2024

Improving fine-grained understanding in image-text pre-training.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection.
Proceedings of the Computer Vision - ECCV 2024, 2024


2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model.
CoRR, 2023

Scaling Open-Vocabulary Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023


Video OWL-ViT: Temporally-consistent open-world localization in video.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FlexiViT: One Model for All Patch Sizes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Decoder Denoising Pretraining for Semantic Segmentation.
Trans. Mach. Learn. Res., 2022

Simple Open-Vocabulary Object Detection with Vision Transformers.
CoRR, 2022


Denoising Pretraining for Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

SCENIC: A JAX Library for Computer Vision Research and Beyond.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Revisiting the Calibration of Modern Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale.
Proceedings of the 9th International Conference on Learning Representations, 2021

On Robustness and Transferability of Convolutional Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Automatic Shortcut Removal for Self-Supervised Representation Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Unsupervised learning of object structure and dynamics from videos.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019


  Loading...