Yogesh S. Rawat

ORCID: 0000-0003-4052-6798

Affiliations:
  • University of Central Florida, Center for Research in Computer Vision, Orlando, FL, USA
  • National University of Singapore, Singapore (PhD)


According to our database, Yogesh S. Rawat authored at least 69 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Asynchronous Perception Machine For Efficient Test-Time-Training.
CoRR, 2024

GeoMeter: Probing Depth and Height Perception of Large Visual-Language Models.
CoRR, 2024

AirSketch: Generative Motion to Sketch.
CoRR, 2024

Foundation Models for Video Understanding: A Survey.
CoRR, 2024

PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images.
CoRR, 2024

Navigating Hallucinations for Reasoning of Unintentional Activities.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Robustness Analysis on Foundational Segmentation Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Probing Conceptual Understanding of Large Visual-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Activity-Biometrics: Person Identification from Daily Activities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Semi-supervised Active Learning for Video Action Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Self-Supervised Learning for Videos: A Survey.
ACM Comput. Surv., 2023

EZ-CLIP: Efficient Zeroshot Video Action Recognition.
CoRR, 2023

Semi-supervised Active Learning for Video Action Detection.
CoRR, 2023

Robustness Analysis on Foundational Segmentation Models.
CoRR, 2023

Benchmarking self-supervised video representation learning.
CoRR, 2023

Probing Conceptual Understanding of Large Visual-Language Models.
CoRR, 2023

On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Revealing the unseen: Benchmarking video action recognition under occlusion.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PRAT: PRofiling Adversarial aTtacks.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Efficiently Robustify Pre-Trained Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Large-Scale Robustness Analysis of Video Action Recognition Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Hybrid Active Learning via Deep Clustering for Video Action Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Multi-modal Robustness Analysis Against Language and Visual Perturbations.
CoRR, 2022

Large-scale Robustness Analysis of Video Action Recognition Models.
CoRR, 2022

Video Action Detection: Analysing Limitations and Challenges.
CoRR, 2022

Pose-guided Generative Adversarial Net for Novel View Action Synthesis.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

GabriellaV2: Towards better generalization in surveillance videos for Action Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

Robustness Analysis of Video-Language Models Against Visual and Language Perturbations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Are all Frames Equal? Active Sparse Labeling for Video Action Detection.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

End-to-End Semi-Supervised Learning for Video Action Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SVGraph: Learning Semantic Graphs from Instructional Videos.
Proceedings of the Eighth IEEE International Conference on Multimedia Big Data, 2022

2021
Adversarial Learning for Personalized Tag Recommendation.
IEEE Trans. Multim., 2021

"Knights": First Place Submission for VIPriors21 Action Recognition Challenge at ICCV 2021.
CoRR, 2021

TinyAction Challenge: Recognizing Real-world Low-resolution Activities in Videos.
CoRR, 2021

We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Reformulating Zero-shot Action Recognition for Multi-label Actions.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

NoisyActions2M: A Multimedia Dataset for Video Understanding from Noisy Labels.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Unsupervised Discriminative Embedding For Sub-Action Learning in Complex Activities.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Novel View Video Prediction using a Dual Representation.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Modeling Multi-Label Action Dependencies for Temporal Action Localization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

PLM: Partial Label Masking for Imbalanced Multi-Label Classification.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

LARNet: Latent Action Representation for Human Action Synthesis.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

SSA2D: Single Shot Actor-Action Detection in Videos (Student Abstract).
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Photography and Exploration of Tourist Locations Based on Optimal Foraging Theory.
IEEE Trans. Circuits Syst. Video Technol., 2020

View-invariant action recognition.
CoRR, 2020

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Surveillance Videos.
CoRR, 2020

UCF-System: Activity Detection in Untrimmed Videos.
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020

Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

TinyVIRAT: Low-resolution Video Action Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Multi-view Action Recognition Using Cross-View Video Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

A Recurrent Transformer Network for Novel View Action Synthesis.
Proceedings of the Computer Vision - ECCV 2020, 2020

Visual-Textual Capsule Routing for Text-Based Video Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
An Online System for Real-Time Activity Detection in Untrimmed Surveillance Videos.
Proceedings of the 2019 TREC Video Retrieval Evaluation, 2019

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

2018
A Spring-Electric Graph Model for Socialized Group Photography.
IEEE Trans. Multim., 2018

Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries.
CoRR, 2018

Time-Aware and View-Aware Video Rendering for Unsupervised Representation Learning.
CoRR, 2018

Action and Object Detection for TRECVID.
Proceedings of the 2018 TREC Video Retrieval Evaluation, 2018

VideoCapsuleNet: A Simplified Network for Action Detection.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial Network.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

2017
ClickSmart: A Context-Aware Viewpoint Recommendation System for Mobile Photography.
IEEE Trans. Circuits Syst. Video Technol., 2017

2016
ConTagNet: Exploiting User Context for Image Tag Recommendation.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015
Context-Aware Photography Learning for Smart Mobile Devices.
ACM Trans. Multim. Comput. Commun. Appl., 2015

Real-Time Assistance in Multimedia Capture Using Social Media.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

2014
Context-Based Photography Learning using Crowdsourced Images and Social Media.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Mode of teaching based segmentation and annotation of video lectures.
Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014

