Vinay P. Namboodiri

Orcid: 0000-0001-5262-9722

Affiliations:
  • University of Bath, Department of Computer Science, UK
  • Indian Institute of Technology Kanpur, Department of Computer Science and Engineering, Kanpur, India (former)
  • Indian Institute of Technology Bombay, India (former, PhD 2008)


According to our database, Vinay P. Namboodiri authored at least 174 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Rectification-Based Knowledge Retention for Task Incremental Learning.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction.
CoRR, 2024

RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance.
CoRR, 2024

TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation.
CoRR, 2024

Trusting Semantic Segmentation Networks.
CoRR, 2024

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning.
CoRR, 2024

LGR2: Language Guided Reward Relabeling for Accelerating Hierarchical Reinforcement Learning.
CoRR, 2024

Spectral Convolutional Transformer: Harmonizing Real vs. Complex Multi-View Spectral Operators for Vision Transformer.
CoRR, 2024

Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors.
CoRR, 2024

VERSE: Virtual-Gradient Aware Streaming Lifelong Learning with Anytime Inference.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

FACTS: Facial Animation Creation using the Transfer of Styles.
Proceedings of the 45th Annual Conference of the European Association for Computer Graphics, 2024

Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Understanding the Vulnerability of CLIP to Image Compression.
CoRR, 2023

FACTS: Facial Animation Creation using the Transfer of Styles.
CoRR, 2023

PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical Reinforcement Learning.
CoRR, 2023

SpectFormer: Frequency and Attention is what you need in a Vision Transformer.
CoRR, 2023

CRISP: Curriculum inducing Primitive Informed Subgoal Prediction for Hierarchical Reinforcement Learning.
CoRR, 2023

Towards Generating Ultra-High Resolution Talking-Face Videos with Lip synchronization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

FaceOff: A Video-to-Video Face Swapping System.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Audio-Visual Face Reenactment.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Towards Accurate Lip-to-Speech Synthesis in-the-Wild.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Streaming LifeLong Learning With Any-Time Inference.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

READ Avatars: Realistic Emotion-controllable Audio Driven Avatars.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Attentive Contractive Flow with Lipschitz Constrained Self-Attention.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
INR-V: A Continuous Representation Space for Video-based Generative Tasks.
Trans. Mach. Learn. Res., 2022

Context extraction module for deep convolutional neural networks.
Pattern Recognit., 2022

Explanation vs. attention: A two-player game to obtain attention for VQA and visual dialog.
Pattern Recognit., 2022

Few-shot image classification with composite rotation based self-supervised auxiliary task.
Neurocomputing, 2022

Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale.
CoRR, 2022

Fair Visual Recognition in Limited Data Regime using Self-Supervision and Self-Distillation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Auto QA: The Question Is Not Only What, but Also Where.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

VQuAD: Video Question Answering Diagnostic Dataset.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

First Workshop on Content Understanding and Generation for E-commerce.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Generalized Keyword Spotting using ASR embeddings.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Learning Speaker-specific Lip-to-Speech Generation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Learning to Predict Speech in Silent Videos Via Audiovisual Analogy.
Proceedings of the IEEE International Conference on Acoustics, 2022

Compressing Video Calls using Synthetic Talking Heads.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Gradient Based Activations for Accurate Bias-Free Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Uncertainty Class Activation Map (U-CAM) Using Gradient Certainty Method.
IEEE Trans. Image Process., 2021

Probabilistic framework for solving visual dialog.
Pattern Recognit., 2021

MUMC: Minimizing uncertainty of mixture of cues.
Image Vis. Comput., 2021

Informative discriminator for domain adaptation.
Image Vis. Comput., 2021

Calibrating feature maps for deep CNNs.
Neurocomputing, 2021

Revisiting paraphrase question generator using pairwise discriminator.
Neurocomputing, 2021

Exploring dropout discriminator for domain adaptation.
Neurocomputing, 2021

Class Incremental Online Streaming Learning.
CoRR, 2021

Attentive Contractive Flow: Improved Contractive Flows with Lipschitz-constrained Self-Attention.
CoRR, 2021

Prb-GAN: A Probabilistic Framework for GAN Modelling.
CoRR, 2021

Mitigating Uncertainty of Classifier for Unsupervised Domain Adaptation.
CoRR, 2021

Visualizing Music Genres using a Topic Model.
CoRR, 2021

SHAD3S: A model to Sketch, Shade and Shadow.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Multimodal Humor Dataset: Predicting Laughter tracks for Sitcoms.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Self Supervision for Attention Networks.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

RNNP: A Robust Few-Shot Learning Approach.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Improving Few-Shot Learning using Composite Rotation based Auxiliary Task.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Domain Impression: A Source Data Free Domain Adaptation Method.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Do not Forget to Attend to Uncertainty while Mitigating Catastrophic Forgetting.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Visual Speech Enhancement Without A Real Visual Stream.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Towards Automatic Speech to Sign Language Generation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Knowledge Consolidation based Class Incremental Online Learning with Limited Data.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Translating sign language videos to talking faces.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Intelligent video editing: incorporating modern talking face generation algorithms in a video editor.
Proceedings of the ICVGIP '21: Indian Conference on Computer Vision, Graphics and Image Processing, Jodhpur, India, December 19, 2021

Speech Prediction in Silent Videos Using Variational Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2021

Collaborative Learning to Generate Audio-Video Jointly.
Proceedings of the IEEE International Conference on Acoustics, 2021

Rectification-Based Knowledge Retention for Continual Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Revisiting Low Resource Status of Indian Languages in Machine Translation.
Proceedings of the CODS-COMAD 2021: 8th ACM IKDD CODS and 26th COMAD, 2021

Personalized One-Shot Lipreading for an ALS Patient.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Deep Knowledge Distillation using Trainable Dense Attention.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

Audio-Visual Speech Super-Resolution.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

More Parameters? No Thanks!
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Acceleration of Deep Convolutional Neural Networks Using Adaptive Filter Pruning.
IEEE J. Sel. Top. Signal Process., 2020

EDS pooling layer.
Image Vis. Comput., 2020

FALF ConvNets: Fatuous auxiliary loss based filter-pruning for efficient deep CNNs.
Image Vis. Comput., 2020

GIFSL - grafting based improved few-shot learning.
Image Vis. Comput., 2020

HetConv: Beyond Homogeneous Convolution Kernels for Deep CNNs.
Int. J. Comput. Vis., 2020

STEER : Simple Temporal Regularization For Neural ODEs.
CoRR, 2020

CovidAID: COVID-19 Detection Using Chest X-Ray.
CoRR, 2020

Uncertainty based Class Activation Maps for Visual Question Answering.
CoRR, 2020

Bridged Variational Autoencoders for Joint Modeling of Images and Attributes.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

A "Network Pruning Network" Approach to Deep Model Compression.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Can I teach a robot to replicate a line art.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Leveraging Filter Correlations for Deep Model Compression.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Cooperative Initialization based Deep Neural Network Training.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Accuracy Booster: Performance Boosting using Feature Map Re-calibration.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Robust Explanations for Visual Question Answering.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Deep Bayesian Network for Visual Question Generation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Jointly Trained Image and Video Generation using Residual Vectors.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

STEER : Simple Temporal Regularization For Neural ODE.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Visually Precise Query.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

A Multilingual Parallel Corpora Collection Effort for Indian Languages.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Learning to Switch CNNs with Model Agnostic Meta Learning for Fine Precision Visual Servoing.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Stochastic Talking Face Generation Using Latent Distribution Matching.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

SkipConv: Skip Convolution for Computationally Efficient Deep CNNs.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Passive Batch Injection Training Technique: Boosting Network Performance by Injecting Mini-Batches from a different Data Distribution.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

PhraseOut: A Code Mixed Data Augmentation Method for Multilingual Neural Machine Translation.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

Exploring Pair-Wise NMT for Indian Languages.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

CPWC: Contextual Point Wise Convolution for Object Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Minimizing Supervision in Multi-label Categorization.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Determinantal Point Process as an alternative to NMS.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

SD-MTCNN: Self-Distilled Multi-Task CNN.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Spotting words in silent speech videos: a retrieval-based approach.
Mach. Vis. Appl., 2019

Deep Exemplar Networks for VQA and VQG.
CoRR, 2019

Dynamic Attention Networks for Task Oriented Grounding.
CoRR, 2019

Granular Multimodal Attention Networks for Visual Dialog.
CoRR, 2019

A Baseline Neural Machine Translation System for Indian Languages.
CoRR, 2019

InfoRL: Interpretable Reinforcement Learning using Information Maximization.
CoRR, 2019

PUTWorkbench: Analysing Privacy in AI-intensive Systems.
CoRR, 2019

Multi-Layer Pruning Framework for Compressing Single Shot MultiBox Detector.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Stability Based Filter Pruning for Accelerating Deep CNNs.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Towards Automatic Face-to-Face Translation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Looking back at Labels: A Class based Domain Adaptation Technique.
Proceedings of the International Joint Conference on Neural Networks, 2019

Unsupervised Synthesis of Anomalies in Videos: Transforming the Normal.
Proceedings of the International Joint Conference on Neural Networks, 2019

Play and Prune: Adaptive Filter Pruning for Deep Model Compression.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

U-CAM: Visual Explanation Using Uncertainty Based Class Activation Maps.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Cross-language Speech Dependent Lip-synchronization.
Proceedings of the IEEE International Conference on Acoustics, 2019

HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Attending to Discriminative Certainty for Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Curriculum based Dropout Discriminator for Domain Adaptation.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

CVIT's submissions to WAT-2019.
Proceedings of the 6th Workshop on Asian Translation, 2019

2018
Eclectic domain mixing for effective adaptation in action spaces.
Multim. Tools Appl., 2018

Learning Semantic Sentence Embeddings using Pair-wise Discriminator.
CoRR, 2018

Word Spotting in Silent Lip Videos.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Monoaural Audio Source Separation Using Variational Autoencoders.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Unsupervised domain adaptation of deep object detectors.
Proceedings of the 26th European Symposium on Artificial Neural Networks, 2018

Multimodal Differential Network for Visual Question Generation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Differential Attention for Visual Question Answering.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-Agent Diverse Generative Adversarial Networks.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning Semantic Sentence Embeddings using Sequential Pair-wise Discriminator.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Deep active learning for object detection.
Proceedings of the British Machine Vision Conference 2018, 2018

Deep Domain Adaptation in Action Space.
Proceedings of the British Machine Vision Conference 2018, 2018

CVIT-MT Systems for WAT-2018.
Proceedings of the 32nd Pacific Asia Conference on Language, 2018

U-DADA: Unsupervised Deep Action Domain Adaptation.
Proceedings of the Computer Vision - ACCV 2018, 2018

No Modes Left Behind: Capturing the Data Distribution Effectively Using GANs.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning to Estimate Pose by Watching Videos.
CoRR, 2017

SketchSoup: Exploratory Ideation Using Design Sketches.
Comput. Graph. Forum, 2017

Visual Odometry Based Omni-directional Hyperlapse.
Proceedings of the Computer Vision, Pattern Recognition, Image Processing, and Graphics, 2017

Reactive Displays for Virtual Reality.
Proceedings of the IEEE International Symposium on Mixed and Augmented Reality, 2017

Compact Environment-Invariant Codes for Robust Visual Place Recognition.
Proceedings of the 14th Conference on Computer and Robot Vision, 2017

Contextual RNN-GANs for Abstract Reasoning Diagram Generation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Active learning with version spaces for object detection.
CoRR, 2016

Unsupervised Domain Adaptation in the Wild: Dealing with Asymmetric Label Sets.
CoRR, 2016

Message Passing Multi-Agent GANs.
CoRR, 2016

Deep Attributes for One-Shot Face Recognition.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Using Gaussian Processes to Improve Zero-Shot Learning with Relative Attributes.
Proceedings of the Computer Vision - ACCV 2016, 2016

2015
Mind the Gap: Subspace based Hierarchical Domain Adaptation.
CoRR, 2015

Where is my friend? - Person identification in social networks.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Subspace Alignment Based Domain Adaptation for RCNN Detector.
Proceedings of the British Machine Vision Conference 2015, 2015

Adapting RANSAC SVM to Detect Outliers for Robust Classification.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Object and Action Classification with Latent Window Parameters.
Int. J. Comput. Vis., 2014

Object Classification with Adaptable Regions.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Nonuniform image patch exemplars for low level vision.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

2012
Classification with Global, Local and Shared Features.
Proceedings of the Pattern Recognition, 2012

2011
Action recognition: A region based approach.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Systematic evaluation of super-resolution using classification.
Proceedings of the 2011 IEEE Visual Communications and Image Processing, 2011

Object and Action Classification with Latent Variables.
Proceedings of the British Machine Vision Conference, 2011

2008
Regularized depth from defocus.
Proceedings of the International Conference on Image Processing, 2008

Recovery of relative depth from a single observation using an uncalibrated (real-aperture) camera.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
On defocus, diffusion and depth estimation.
Pattern Recognit. Lett., 2007

Retrieval of images of man-made structures based on projective invariance.
Pattern Recognit., 2007

Super-Resolution Using Sub-band Constrained Total Variation.
Proceedings of the Scale Space and Variational Methods in Computer Vision, 2007

Image Restoration using Geometrically Stabilized Reverse Heat Equation.
Proceedings of the International Conference on Image Processing, 2007

Shape Recovery Using Stochastic Heat Flow.
Proceedings of the British Machine Vision Conference 2007, 2007

2006
Improved Kernel-Based Object Tracking Under Occluded Scenarios.
Proceedings of the Computer Vision, Graphics and Image Processing, 5th Indian Conference, 2006

2005
Shock Filters Based on Implicit Cluster Separation.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Use of Linear Diffusion in Depth Estimation Based on Defocus Cue.
Proceedings of the ICVGIP 2004, 2004

Image retrieval based on projective invariance.
Proceedings of the 2004 International Conference on Image Processing, 2004

