Vijay Kumar B. G

Vincent Bindschaedler

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Exploring Question Decomposition for Zero-Shot VQA.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

OmniLabel: A Challenging Benchmark for Language-Based Object Detection.

[BibT_eX]

[DOI]

Samuel Schulter

Venkateswararao Cherukuri

Yumin Suh

Konstantinos M. Dafnis

Zhixing Zhang

Shiyu Zhao

Dimitris N. Metaxas

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Exploiting Unlabeled Data with Vision and Language Models for Object Detection.

[BibT_eX]

[DOI]

Anastasis Stathopoulos

Manmohan Chandraker

Dimitris N. Metaxas

Proceedings of the Computer Vision - ECCV 2022, 2022

Single-Stream Multi-level Alignment for Vision-Language Pretraining.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

STRIVE: Scene Text Replacement In Videos.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020

Deep Retinal Image Segmentation With Regularization Under Geometric Priors.

[BibT_eX]

[DOI]

Raja Bala

Vishal Monga

IEEE Trans. Image Process., 2020

Large Scale Multimodal Classification Using an Ensemble of Transformer Models and Co-Attention.

[BibT_eX]

[DOI]

Varnith Chordia

Venkateswararao Cherukuri

CoRR, 2020

2019

Multi-Scale Regularized Deep Network for Retinal Vessel Segmentation.

[BibT_eX]

[DOI]

Raja Bala

Vishal Monga

Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Bayesian Semantic Instance Segmentation in Open Set World.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

Multi-modal Cycle-Consistent Generalized Zero-Shot Learning.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

DeepSetNet: Predicting Sets with Deep Neural Networks.

[BibT_eX]

[DOI]

Seyed Hamid Rezatofighi

Proceedings of the IEEE International Conference on Computer Vision, 2017

Smart Mining for Deep Metric Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2017

2016

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue.

[BibT_eX]

[DOI]

Ravi Garg

Ian D. Reid

CoRR, 2016

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2016, 2016

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions.

[BibT_eX]

[DOI]

Gustavo Carneiro

Ian D. Reid

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimising Global Loss Functions.

[BibT_eX]

[DOI]

Gustavo Carneiro

Ian D. Reid

CoRR, 2015

2013

Supervised dictionary learning for action localization.

[BibT_eX]

[DOI]

Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012

Max-margin Non-negative Matrix Factorization.

[BibT_eX]

[DOI]

Irene Kotsia

Image Vis. Comput., 2012

Learning codebook weights for action detection.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011

Max-Margin Semi-NMF.

[BibT_eX]

[DOI]

Irene Kotsia

Proceedings of the British Machine Vision Conference, 2011

2008

A 2D model for face superresolution.

[BibT_eX]

[DOI]

Rangarajan Aravind

Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Face hallucination using OLPP and Kernel Ridge Regression.

[BibT_eX]

[DOI]