Vijay Kumar B. G

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2024
LLM-Assist: Enhancing Closed-Loop Planning with Language-Based Reasoning.
CoRR, 2024

Taming Self-Training for Open-Vocabulary Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Generating Enhanced Negatives for Training Language-Based Object Detectors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Improving Pseudo Labels for Open-Vocabulary Object Detection.
CoRR, 2023

DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Exploring Question Decomposition for Zero-Shot VQA.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

OmniLabel: A Challenging Benchmark for Language-Based Object Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Exploiting Unlabeled Data with Vision and Language Models for Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

Single-Stream Multi-level Alignment for Vision-Language Pretraining.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
STRIVE: Scene Text Replacement In Videos.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Deep Retinal Image Segmentation With Regularization Under Geometric Priors.
IEEE Trans. Image Process., 2020

Large Scale Multimodal Classification Using an Ensemble of Transformer Models and Co-Attention.
CoRR, 2020

2019
Multi-Scale Regularized Deep Network for Retinal Vessel Segmentation.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Bayesian Semantic Instance Segmentation in Open Set World.
Proceedings of the Computer Vision - ECCV 2018, 2018

Multi-modal Cycle-Consistent Generalized Zero-Shot Learning.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
DeepSetNet: Predicting Sets with Deep Neural Networks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Smart Mining for Deep Metric Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2017

2016
Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue.
CoRR, 2016

Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue.
Proceedings of the Computer Vision - ECCV 2016, 2016

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimising Global Loss Functions.
CoRR, 2015

2013
Supervised dictionary learning for action localization.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012
Max-margin Non-negative Matrix Factorization.
Image Vis. Comput., 2012

Learning codebook weights for action detection.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

2011
Max-Margin Semi-NMF.
Proceedings of the British Machine Vision Conference, 2011

2008
A 2D model for face superresolution.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Face hallucination using OLPP and Kernel Ridge Regression.
Proceedings of the International Conference on Image Processing, 2008


  Loading...