Alexander Kolesnikov

Affiliations:
  • Google Research, Zurich, Switzerland
  • Institute of Science and Technology Austria, Klosterneuburg, Austria (former)


According to our database1, Alexander Kolesnikov authored at least 43 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
PaliGemma: A versatile 3B VLM for transfer.
CoRR, 2024

Toward a Diffusion-Based Generalist for Dense Vision Tasks.
CoRR, 2024


2023
PaLI-3 Vision Language Models: Smaller, Faster, Stronger.
CoRR, 2023

PaLI-X: On Scaling up a Multilingual Vision and Language Model.
CoRR, 2023

A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision.
CoRR, 2023

Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Tuning Computer Vision Models With Task Rewards.
Proceedings of the International Conference on Machine Learning, 2023


PaLI: A Jointly-Scaled Multilingual Language-Image Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Sigmoid Loss for Language Image Pre-Training.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FlexiViT: One Model for All Patch Sizes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers.
Trans. Mach. Learn. Res., 2022

PaLI: A Jointly-Scaled Multilingual Language-Image Model.
CoRR, 2022

Better plain ViT baselines for ImageNet-1k.
CoRR, 2022

UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LiT: Zero-Shot Transfer with Locked-image text Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Scaling Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Knowledge distillation: A good teacher is patient and consistent.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
SI-Score: An image dataset for fine-grained analysis of robustness to object location, rotation and size.
CoRR, 2021

MLP-Mixer: An all-MLP Architecture for Vision.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale.
Proceedings of the 9th International Conference on Learning Representations, 2021

On Robustness and Transferability of Convolutional Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
The Open Images Dataset V4.
Int. J. Comput. Vis., 2020

Are we done with ImageNet?
CoRR, 2020

Big Transfer (BiT): General Visual Representation Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

2019
Large Scale Learning of General Visual Representations for Transfer.
CoRR, 2019

The Visual Task Adaptation Benchmark.
CoRR, 2019

S<sup>4</sup>L: Self-Supervised Semi-Supervised Learning.
CoRR, 2019

Detecting Visual Relationships Using Box Attention.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

S4L: Self-Supervised Semi-Supervised Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Revisiting Self-Supervised Visual Representation Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Detecting Visual Relationships Using Box Attention.
CoRR, 2018

2017
PixelCNN Models with Auxiliary Variables for Natural Image Modeling.
Proceedings of the 34th International Conference on Machine Learning, 2017

iCaRL: Incremental Classifier and Representation Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Probabilistic Image Colorization.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
iCaRL: Incremental Classifier and Representation Learning.
CoRR, 2016

Deep Probabilistic Modeling of Natural Images using a Pyramid Decomposition.
CoRR, 2016

Seed, Expand and Constrain: Three Principles for Weakly-Supervised Image Segmentation.
Proceedings of the Computer Vision - ECCV 2016, 2016

Improving Weakly-Supervised Object Localization By Micro-Annotation.
Proceedings of the British Machine Vision Conference 2016, 2016

2015
Identifying Reliable Annotations for Large Scale Image Segmentation.
CoRR, 2015

2014
Closed-Form Training of Conditional Random Fields for Large Scale Image Segmentation.
CoRR, 2014

Closed-Form Approximate CRF Training for Scalable Image Segmentation.
Proceedings of the Computer Vision - ECCV 2014, 2014


  Loading...