Yogesh S. Rawat

CoRR, 2024

GeoMeter: Probing Depth and Height Perception of Large Visual-Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

AirSketch: Generative Motion to Sketch.

[BibT_eX]

[DOI]

CoRR, 2024

Foundation Models for Video Understanding: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images.

[BibT_eX]

[DOI]

Abhishek Jha

CoRR, 2024

Navigating Hallucinations for Reasoning of Unintentional Activities.

[BibT_eX]

[DOI]

Shresth Grover

Vibhav Vineet

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Robustness Analysis on Foundational Segmentation Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Probing Conceptual Understanding of Large Visual-Language Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Activity-Biometrics: Person Identification from Daily Activities.

[BibT_eX]

[DOI]

Shehreen Azad

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Semi-supervised Active Learning for Video Action Detection.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Self-Supervised Learning for Videos: A Survey.

[BibT_eX]

[DOI]

Madeline C. Schiappa

ACM Comput. Surv., 2023

EZ-CLIP: Efficient Zeroshot Video Action Recognition.

[BibT_eX]

[DOI]

Shahzad Ahmad

Sukalpa Chanda

CoRR, 2023

Semi-supervised Active Learning for Video Action Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Robustness Analysis on Foundational Segmentation Models.

[BibT_eX]

[DOI]

CoRR, 2023

Benchmarking self-supervised video representation learning.

[BibT_eX]

[DOI]

CoRR, 2023

Probing Conceptual Understanding of Large Visual-Language Models.

[BibT_eX]

[DOI]

Michael Cogswell

Ajay Divakaran

CoRR, 2023

On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes.

[BibT_eX]

[DOI]

Rajat Modi

Vibhav Vineet

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revealing the unseen: Benchmarking video action recognition under occlusion.

[BibT_eX]

[DOI]

Shresth Grover

Vibhav Vineet

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PRAT: PRofiling Adversarial aTtacks.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficiently Robustify Pre-Trained Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Large-Scale Robustness Analysis of Video Action Recognition Models.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hybrid Active Learning via Deep Clustering for Video Action Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Multi-modal Robustness Analysis Against Language and Visual Perturbations.

[BibT_eX]

[DOI]

CoRR, 2022

Large-scale Robustness Analysis of Video Action Recognition Models.

[BibT_eX]

[DOI]

CoRR, 2022

Video Action Detection: Analysing Limitations and Challenges.

[BibT_eX]

[DOI]

CoRR, 2022

Pose-guided Generative Adversarial Net for Novel View Action Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

GabriellaV2: Towards better generalization in surveillance videos for Action Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

Robustness Analysis of Video-Language Models Against Visual and Language Perturbations.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Are all Frames Equal? Active Sparse Labeling for Video Action Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

End-to-End Semi-Supervised Learning for Video Action Detection.

[BibT_eX]

[DOI]

Akash Kumar

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SVGraph: Learning Semantic Graphs from Instructional Videos.

[BibT_eX]

[DOI]

Madeline C. Schiappa

Proceedings of the Eighth IEEE International Conference on Multimedia Big Data, 2022

2021

Adversarial Learning for Personalized Tag Recommendation.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2021

"Knights": First Place Submission for VIPriors21 Action Recognition Challenge at ICCV 2021.

[BibT_eX]

[DOI]

CoRR, 2021

TinyAction Challenge: Recognizing Real-world Low-resolution Activities in Videos.

[BibT_eX]

[DOI]

CoRR, 2021

We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Reformulating Zero-shot Action Recognition for Multi-label Actions.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Unsupervised Discriminative Embedding For Sub-Action Learning in Complex Activities.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Novel View Video Prediction using a Dual Representation.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Modeling Multi-Label Action Dependencies for Temporal Action Localization.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PLM: Partial Label Masking for Imbalanced Multi-Label Classification.

[BibT_eX]

[DOI]

Kevin Duarte

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

LARNet: Latent Action Representation for Human Action Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

SSA2D: Single Shot Actor-Action Detection in Videos (Student Abstract).

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Photography and Exploration of Tourist Locations Based on Optimal Foraging Theory.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2020

View-invariant action recognition.

[BibT_eX]

[DOI]

CoRR, 2020

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Surveillance Videos.

[BibT_eX]

[DOI]

CoRR, 2020

UCF-System: Activity Detection in Untrimmed Videos.

[BibT_eX]

[DOI]

Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos.

[BibT_eX]

[DOI]

Proceedings of the 25th International Conference on Pattern Recognition, 2020

TinyVIRAT: Low-resolution Video Action Recognition.

[BibT_eX]

[DOI]

Ugur Demir

Proceedings of the 25th International Conference on Pattern Recognition, 2020

Multi-view Action Recognition Using Cross-View Video Prediction.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

A Recurrent Transformer Network for Novel View Action Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Visual-Textual Capsule Routing for Text-Based Video Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

An Online System for Real-Time Activity Detection in Untrimmed Surveillance Videos.

[BibT_eX]

[DOI]

Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing.

[BibT_eX]

[DOI]

Kevin Duarte

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018

A Spring-Electric Graph Model for Socialized Group Photography.

[BibT_eX]

[DOI]

Mingli Song

IEEE Trans. Multim., 2018

Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries.

[BibT_eX]

[DOI]

CoRR, 2018

Time-Aware and View-Aware Video Rendering for Unsupervised Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2018

Action and Object Detection for TRECVID.

[BibT_eX]

[DOI]

Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

VideoCapsuleNet: A Simplified Network for Action Detection.

[BibT_eX]

[DOI]

Kevin Duarte

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017

ClickSmart: A Context-Aware Viewpoint Recommendation System for Mobile Photography.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2017

2016

ConTagNet: Exploiting User Context for Image Tag Recommendation.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015

Context-Aware Photography Learning for Smart Mobile Devices.

[BibT_eX]

[DOI]

ACM Trans. Multim. Comput. Commun. Appl., 2015

Real-Time Assistance in Multimedia Capture Using Social Media.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2014

Context-Based Photography Learning using Crowdsourced Images and Social Media.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Mode of teaching based segmentation and annotation of video lectures.

[BibT_eX]

[DOI]

Chidansh Amitkumar Bhatt