Shraman Pramanick

According to our database1, Shraman Pramanick authored at least 13 papers between 2021 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers.
CoRR, 2024

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Autonomous grasping of 3-D objects by a vision-actuated robot arm using Brain-Computer Interface.
Biomed. Signal Process. Control., July, 2023

VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment.
Trans. Mach. Learn. Res., 2023

EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UniVTG: Towards Unified Video-Language Temporal Grounding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Multimodal Learning using Optimal Transport for Sarcasm and Humor Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Where in the World Is This Image? Transformer-Based Geo-localization in the Wild.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
See, hear, read: Leveraging multimodality with guided attention for abstractive text summarization.
Knowl. Based Syst., 2021

Exercise? I thought you said 'Extra Fries': Leveraging Sentence Demarcations and Multi-hop Attention for Meme Affect Analysis.
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021

MOMENTA: A Multimodal Framework for Detecting Harmful Memes and Their Targets.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Detecting Harmful Memes and Their Targets.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021


  Loading...