Srikar Appalaraju

According to our database1, Srikar Appalaraju authored at least 24 papers between 2017 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RAVEN: Multitask Retrieval Augmented Vision-Language Learning.
CoRR, 2024

DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Multiple-Question Multiple-Answer Text-VQA.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Industry Track, 2024

DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding.
Proceedings of the Computer Vision - ECCV 2024, 2024

Enhancing Vision-Language Pre-Training with Rich Supervisions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

No Head Left Behind - Multi-Head Alignment Distillation for Transformers.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

DocFormerv2: Local Features for Document Understanding.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
SimCon Loss with Multiple Views for Text Supervised Semantic Segmentation.
CoRR, 2023

MixGen: A New Multi-Modal Data Augmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2023

A Multi-Modal Multilingual Benchmark for Document Image Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Towards Differential Relational Privacy and its use in Question Answering.
CoRR, 2022

SeeTek: Very Large-Scale Open-set Logo Recognition with Text-Aware Metric Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

YORO - Lightweight End to End Visual Grounding.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

LaTr: Layout-Aware Transformer for Scene-Text VQA.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Saliency Driven Perceptual Image Compression.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

DocFormer: End-to-End Transformer for Document Understanding.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Towards Good Practices in Self-supervised Representation Learning.
CoRR, 2020

Hierarchical Auto-Regressive Model for Image Compression Incorporating Object Saliency and a Deep Perceptual Loss.
CoRR, 2020

2019
Unbiased Evaluation of Deep Metric Learning Algorithms.
CoRR, 2019

Human Perceptual Evaluations for Image Compression.
CoRR, 2019

Deep Perceptual Compression.
CoRR, 2019

Scalable Logo Recognition Using Proxies.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

2017
Image similarity using Deep CNN and Curriculum Learning.
CoRR, 2017


  Loading...