Anoop Cherian

ORCID: 0000-0002-5566-0351

According to our database, Anoop Cherian authored at least 105 papers between 2009 and 2024.

Bibliography

2024
Temporally Grounding Instructional Diagrams in Unconstrained Videos.
CoRR, 2024

Disentangled Acoustic Fields For Multimodal Physical Scene Understanding.
CoRR, 2024

Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads.
CoRR, 2024

Sound3DVDet: 3D Sound Source Detection using Multiview Microphone Array and RGB Images.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Pixel-Grounded Prototypical Part Networks.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Multi-level Reasoning for Robotic Assembly: From Sequence Inference to Contact Selection.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Deep Neural Room Acoustics Primitive.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

WI-FI based Indoor Monitoring Enhanced by Multimodal Fusion.
Proceedings of the IEEE International Conference on Acoustics, 2024

RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments - Supplementary Data.
Dataset, April, 2023

Pixel-Grounded Prototypical Part Networks.
CoRR, 2023

Active Sparse Conversations for Improved Audio-Visual Embodied Navigation.
CoRR, 2023

H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Discriminative 3D Shape Modeling for Few-Shot Instance Segmentation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Are Deep Neural Networks SMARTer Than Second Graders?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Tensor Representations for Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Generalized One-Class Learning Using Pairs of Complementary Classifiers.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning Log-Determinant Divergences for Positive Definite Matrices.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

FeLMi : Few shot Learning with hard Mixup.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

Max-Margin Contrastive Learning.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Overview of the Eighth Dialog System Technology Challenge: DSTC8.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Discriminative Video Representation Learning Using Support Vector Classifiers.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Visual Scene Graphs for Audio Source Separation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Image Descriptors for Weakly Annotated Histopathological Breast Cancer Data.
Frontiers Digit. Health, 2020

Optimizing Deep Neural Networks via Discretization of Finite-Time Convergent Flows.
CoRR, 2020

Sound2Sight: Generating Visual Dynamics from Sound and Context.
CoRR, 2020

Spatio-Temporal Scene Graphs for Video Dialog.
CoRR, 2020

Dense Non-Rigid Structure from Motion: A Manifold Viewpoint.
CoRR, 2020

FX-GAN: Self-Supervised GAN Learning via Feature Exchange.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Spatio-Temporal Ranked-Attention Networks for Video Captioning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Representation Learning via Adversarially-Contrastive Optimal Transport.
Proceedings of the 37th International Conference on Machine Learning, 2020

Sound2Sight: Generating Visual Dynamics from Sound and Context.
Proceedings of the Computer Vision - ECCV 2020, 2020

Inferring Temporal Compositions of Actions Using Probabilistic Automata.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Visual Permutation Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2019

Second-order Temporal Pooling for Action Recognition.
Int. J. Comput. Vis., 2019

The Eighth Dialog System Technology Challenge.
CoRR, 2019

Sem-GAN: Semantically-Consistent Image-to-Image Translation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function.
Proceedings of the 36th International Conference on Machine Learning, 2019

Unsupervised Joint 3D Object Model Learning and 6D Pose Estimation for Depth-Based Instance Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

GODS: Generalized One-Class Discriminative Subspaces for Anomaly Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.
Proceedings of the IEEE International Conference on Acoustics, 2019

Audio Visual Scene-Aware Dialog.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7.
CoRR, 2018

Neural Algebra of Classifiers.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Learning Discriminative Video Representations Using Adversarial Perturbations.
Proceedings of the Computer Vision - ECCV 2018, 2018

Video Representation Learning Using Discriminative Pooling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Scalable Dense Non-Rigid Structure-From-Motion: A Grassmannian Perspective.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Non-Linear Temporal Subspace Representations for Activity Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Riemannian Dictionary Learning and Sparse Coding for Positive Definite Matrices.
IEEE Trans. Neural Networks Learn. Syst., 2017

Unsupervised Classification of Polarimetric SAR Images via Riemannian Sparse Coding.
IEEE Trans. Geosci. Remote. Sens., 2017

Human Action Forecasting by Learning Task Grammars.
CoRR, 2017

Learning Discriminative Alpha-Beta-divergence for Positive Definite Matrices (Extended Version).
CoRR, 2017

Action Representation Using Classifier Decision Boundaries.
CoRR, 2017

Sequence Summarization Using Order-constrained Kernelized Feature Subspaces.
CoRR, 2017

Ordered Pooling of Optical Flow Sequences for Action Recognition.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Higher-Order Pooling of CNN Features via Kernel Linearization for Action Recognition.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Active convolutional neural networks for cancerous tissue recognition.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Part-based fine-grained bird image retrieval respecting species correlation.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Clustering Positive Definite Matrices by Learning Information Divergences.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Learning Discriminative αβ-Divergences for Positive Definite Matrices.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Human Pose Forecasting via Deep Markov Models.
Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017

DeepPermNet: Visual Permutation Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Generalized Rank Pooling for Activity Recognition.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Bayesian Nonparametric Clustering for Positive Definite Matrices.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization.
CoRR, 2016

Active Constrained Clustering via non-iterative uncertainty sampling.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Riemannian sparse coding for classification of PolSAR images.
Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium, 2016

Evaluation of feature descriptors for cancerous tissue recognition.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Tensor Representations via Kernel Linearization for Action Recognition from 3D Skeletons.
Proceedings of the Computer Vision - ECCV 2016, 2016

Sparse Coding for Third-Order Super-Symmetric Tensor Descriptors with Application to Texture Recognition.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Dictionary Learning and Sparse Coding for Third-order Super-symmetric Tensors.
CoRR, 2015

A vision based ensemble approach to velocity estimation for miniature rotorcraft.
Auton. Robots, 2015

2014
Efficient Nearest Neighbors via Robust Sparse Hashing.
IEEE Trans. Image Process., 2014

Action recognition using global spatio-temporal features derived from sparse representations.
Comput. Vis. Image Underst., 2014

Nearest Neighbors Using Compact Sparse Codes.
Proceedings of the 31st International Conference on Machine Learning, 2014

Riemannian Sparse Coding for Positive Definite Matrices.
Proceedings of the Computer Vision - ECCV 2014, 2014

Mixing Body-Part Sequences for Human Pose Estimation.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Jensen-Bregman LogDet Divergence with Application to Efficient Similarity Search for Covariance Matrices.
IEEE Trans. Pattern Anal. Mach. Intell., 2013

2012
A multi-sensor visual tracking system for behavior monitoring of at-risk children.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Compact covariance descriptors in 3D point clouds for object recognition.
Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Robust Sparse Hashing.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

2011
Generalized Dictionary Learning for Symmetric Positive Definite Matrices with Application to Nearest Neighbor Retrieval.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet Divergence.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Denoising sparse noise via online dictionary learning.
Proceedings of the IEEE International Conference on Acoustics, 2011

Dirichlet process mixture models on symmetric positive definite matrices for appearance clustering in video surveillance applications.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
Motion estimation of a miniature helicopter using a single onboard camera.
Proceedings of the American Control Conference, 2010

2009
Autonomous altitude estimation of a UAV using a single onboard camera.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Accurate 3D ground plane estimation from a single image.
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009

