Vasu Sharma

According to our database1, Vasu Sharma authored at least 31 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DINOv2: Learning Robust Visual Features without Supervision.
Trans. Mach. Learn. Res., 2024

The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks.
CoRR, 2024

An Introduction to Vision-Language Modeling.
CoRR, 2024

Text Quality-Based Pruning for Efficient Training of Language Models.
CoRR, 2024

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM.
CoRR, 2024

ε-ViLM : Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024

Demystifying CLIP Data.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer.
CoRR, 2023

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning.
CoRR, 2023

Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI.
CoRR, 2023

DINOv2: Learning Robust Visual Features without Supervision.
CoRR, 2023

Alexa Arena: A User-Centric Interactive Platform for Embodied AI.
CoRR, 2023

Alexa Arena: A User-Centric Interactive Platform for Embodied AI.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MAViL: Masked Audio-Video Learners.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Shimmy: Accelerating Inter-Container Communication for the IoT Edge.
Proceedings of the IEEE Global Communications Conference, 2023

Flap: Fast Language-Audio Pre-Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning.
CoRR, 2022

PISA: PoIncaré Saliency-Aware Interpolative Augmentation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Tweet Based Reach Aware Temporal Attention Network for NFT Valuation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2019
Induced Attention Invariance: Defending VQA Models against Adversarial Attacks.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

Multimodal Behavioral Markers Exploring Suicidal Intent in Social Media Videos.
Proceedings of the International Conference on Multimodal Interaction, 2019

Community Regularization of Visually-Grounded Dialog.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

2018
Mind Your Language: Learning Visually Grounded Dialog in a Multi-Agent Setting.
CoRR, 2018

Cyclegen: Cyclic consistency based product review generator from attributes.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

BioAMA: Towards an End to End BioMedical Question Answering System.
Proceedings of the BioNLP 2018 workshop, Melbourne, Australia, July 19, 2018, 2018

2017
Segmentation Guided Attention Networks for Visual Question Answering.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Automatic tagging and retrieval of E-Commerce products based on visual features.
Proceedings of the Student Research Workshop, 2016

2015
Analyzing Newspaper Crime Reports for Identification of Safe Transit Paths.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Image summarization using topic modelling.
Proceedings of the 2015 IEEE International Conference on Signal and Image Processing Applications, 2015

A Deep Neural Network based approach for vocal extraction from songs.
Proceedings of the 2015 IEEE International Conference on Signal and Image Processing Applications, 2015


  Loading...