Anoop Cherian

Jonathan Le Roux

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

MOST-GAN Pre-trained Model.

Dataset, August, 2023

AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments - Supplementary Data.

Sudipta Paul

Amit Roy-Chowdhury

Nithin Gopalakrishnan Nair

Dataset, April, 2023

Pixel-Grounded Prototypical Part Networks.

CoRR, 2023

Active Sparse Conversations for Improved Audio-Visual Embodied Navigation.

CoRR, 2023

H-SAUR: Hypothesize, Simulate, Act, Update, and Repeat for Understanding Object Articulations from Interactions.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Discriminative 3D Shape Modeling for Few-Shot Instance Segmentation.

Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Aligning Step-by-Step Instructional Diagrams to Video Demonstrations.

Cristian Rodriguez Opazo

Stephen Gould

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Are Deep Neural Networks SMARTer Than Second Graders?

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Tensor Representations for Action Recognition.

Lei Wang

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Generalized One-Class Learning Using Pairs of Complementary Classifiers.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Learning Log-Determinant Divergences for Positive Definite Matrices.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

FeLMi: Few shot Learning with hard Mixup.

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation.

Narendra Ahuja

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments.

Sudipta Paul

Amit Roy-Chowdhury

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Max-Margin Contrastive Learning.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

MOST-GAN: 3D Morphable StyleGAN for Disentangled Face Image Manipulation.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering.

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Overview of the Eighth Dialog System Technology Challenge: DSTC8.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Discriminative Video Representation Learning Using Support Vector Classifiers.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Visual Scene Graphs for Audio Source Separation.

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction.

Narendra Ahuja

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Image Descriptors for Weakly Annotated Histopathological Breast Cancer Data.

Alexander Truskinovsky

Frontiers Digit. Health, 2020

Optimizing Deep Neural Networks via Discretization of Finite-Time Convergent Flows.

Mouhacine Benosman

Orlando Romero

CoRR, 2020

Sound2Sight: Generating Visual Dynamics from Sound and Context.

Narendra Ahuja

CoRR, 2020

Spatio-Temporal Scene Graphs for Video Dialog.

CoRR, 2020

Dense Non-Rigid Structure from Motion: A Manifold Viewpoint.

Suryansh Kumar

Luc Van Gool

Carlos E. P. de Oliveira

Yuchao Dai

Hongdong Li

CoRR, 2020

FX-GAN: Self-Supervised GAN Learning via Feature Exchange.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Spatio-Temporal Ranked-Attention Networks for Video Captioning.

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Representation Learning via Adversarially-Contrastive Optimal Transport.

Shuchin Aeron

Proceedings of the 37th International Conference on Machine Learning, 2020

Sound2Sight: Generating Visual Dynamics from Sound and Context.

Computer Vision - ECCV 2020, 2020

Inferring Temporal Compositions of Actions Using Probabilistic Automata.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Visual Permutation Learning.

IEEE Trans. Pattern Anal. Mach. Intell., 2019

Second-order Temporal Pooling for Action Recognition.

Stephen Gould

Int. J. Comput. Vis., 2019

The Eighth Dialog System Technology Challenge.

CoRR, 2019

Sem-GAN: Semantically-Consistent Image-to-Image Translation.

Alan Sullivan

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function.

Arvind U. Raghunathan

Devesh K. Jha

Proceedings of the 36th International Conference on Machine Learning, 2019

Unsupervised Joint 3D Object Model Learning and 6D Pose Estimation for Depth-Based Instance Segmentation.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

GODS: Generalized One-Class Discriminative Subspaces for Anomaly Detection.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.

Raphael Gontijo Lopes

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

Audio Visual Scene-Aware Dialog.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7.

Huda AlAmri

Vincent Cartillier

Raphael Gontijo Lopes

CoRR, 2018

Neural Algebra of Classifiers.

Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Learning Discriminative Video Representations Using Adversarial Perturbations.

Computer Vision - ECCV 2018, 2018

Video Representation Learning Using Discriminative Pooling.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Scalable Dense Non-Rigid Structure-From-Motion: A Grassmannian Perspective.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Non-Linear Temporal Subspace Representations for Activity Recognition.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Riemannian Dictionary Learning and Sparse Coding for Positive Definite Matrices.

IEEE Trans. Neural Networks Learn. Syst., 2017

Unsupervised Classification of Polarimetric SAR Images via Riemannian Sparse Coding.

IEEE Trans. Geosci. Remote. Sens., 2017

Human Action Forecasting by Learning Task Grammars.

CoRR, 2017

Learning Discriminative Alpha-Beta-divergence for Positive Definite Matrices (Extended Version).

CoRR, 2017

Action Representation Using Classifier Decision Boundaries.

CoRR, 2017

Sequence Summarization Using Order-constrained Kernelized Feature Subspaces.

Richard Hartley

CoRR, 2017

Ordered Pooling of Optical Flow Sequences for Action Recognition.

Fatih Porikli

Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Higher-Order Pooling of CNN Features via Kernel Linearization for Action Recognition.

Stephen Gould

Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Active convolutional neural networks for cancerous tissue recognition.

Alexander Truskinovsky

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Part-based fine-grained bird image retrieval respecting species correlation.

Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Clustering Positive Definite Matrices by Learning Information Divergences.

Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Learning Discriminative αβ-Divergences for Positive Definite Matrices.

Nikos Papanikolopoulos

Proceedings of the IEEE International Conference on Computer Vision, 2017

Human Pose Forecasting via Deep Markov Models.

Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017

DeepPermNet: Visual Permutation Learning.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Generalized Rank Pooling for Activity Recognition.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Bayesian Nonparametric Clustering for Positive Definite Matrices.

IEEE Trans. Pattern Anal. Mach. Intell., 2016

On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization.

CoRR, 2016

Active Constrained Clustering via non-iterative uncertainty sampling.

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Riemannian sparse coding for classification of PolSAR images.

Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium, 2016

Evaluation of feature descriptors for cancerous tissue recognition.

Xinyan Li

Alexander Truskinovsky

Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Tensor Representations via Kernel Linearization for Action Recognition from 3D Skeletons.

Fatih Porikli

Computer Vision - ECCV 2016, 2016

Sparse Coding for Third-Order Super-Symmetric Tensor Descriptors with Application to Texture Recognition.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

Dictionary Learning and Sparse Coding for Third-order Super-symmetric Tensors.

CoRR, 2015

A vision based ensemble approach to velocity estimation for miniature rotorcraft.

Jonathan Andersh

Bérénice Mettler

Auton. Robots, 2015

2014

Efficient Nearest Neighbors via Robust Sparse Hashing.

IEEE Trans. Image Process., 2014

Action recognition using global spatio-temporal features derived from sparse representations.

Guruprasad Somasundaram

Comput. Vis. Image Underst., 2014

Nearest Neighbors Using Compact Sparse Codes.

Proceedings of the 31st International Conference on Machine Learning, 2014

Riemannian Sparse Coding for Positive Definite Matrices.

Computer Vision - ECCV 2014, 2014

Mixing Body-Part Sequences for Human Pose Estimation.

Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013

Jensen-Bregman LogDet Divergence with Application to Efficient Similarity Search for Covariance Matrices.

Arindam Banerjee

IEEE Trans. Pattern Anal. Mach. Intell., 2013

2012

A multi-sensor visual tracking system for behavior monitoring of at-risk children.

Ravishankar Sivalingam

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Compact covariance descriptors in 3D point clouds for object recognition.

Duc Fehr

Ravishankar Sivalingam

Sam Nickolay

Proceedings of the IEEE International Conference on Robotics and Automation, 2012

Robust Sparse Hashing.

Proceedings of the 19th IEEE International Conference on Image Processing, 2012

2011

Generalized Dictionary Learning for Symmetric Positive Definite Matrices with Application to Nearest Neighbor Retrieval.

Machine Learning and Knowledge Discovery in Databases, 2011

Efficient similarity search for covariance matrices via the Jensen-Bregman LogDet Divergence.

Arindam Banerjee

Proceedings of the IEEE International Conference on Computer Vision, 2011

Denoising sparse noise via online dictionary learning.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2011

Dirichlet process mixture models on symmetric positive definite matrices for appearance clustering in video surveillance applications.

Saad Bedros

Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010

Motion estimation of a miniature helicopter using a single onboard camera.

Nikos Papanikolopoulos

Proceedings of the American Control Conference, 2010

2009

Autonomous altitude estimation of a UAV using a single onboard camera.

Jonathan Andersh

Bernard Mettler

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Accurate 3D ground plane estimation from a single image.
