Sumit Shekhar

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2024
ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models.
CoRR, 2024

Seeing the Unseen: Visual Metaphor Captioning for Videos.
CoRR, 2024

Unveiling the Invisible: Captioning Videos with Metaphors.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Audio Retrieval for Multimodal Design Documents: A New Dataset and Algorithms.
CoRR, 2023

SALAD : Source-free Active Label-Agnostic Domain Adaptation for Classification, Segmentation and Detection.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

A Neural CRF-based Hierarchical Approach for Linear Text Segmentation.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Open-World Factually Consistent Question Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

"Let's not Quote out of Context": Unified Vision-Language Pretraining for Context Assisted Image Captioning.
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

2022
DistillAdapt: Source-Free Active Visual Domain Adaptation.
CoRR, 2022

DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Talisman: Targeted Active Learning for Object Detection with Rare Classes and Slices Using Submodular Mutual Information.
Proceedings of the Computer Vision - ECCV 2022, 2022

OPAD: An Optimized Policy-based Active Learning Framework for Document Content Analysis.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2020
LEAF-QA: Locate, Encode & Attend for Figure Question Answering.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

ICPR 2020 - Competition on Harvesting Raw Tables from Infographics.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

STL-CQA: Structure-based Transformers with Localization and Encoding for Chart Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Are All the Frames Equally Important?
Proceedings of the Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, 2020

2019
ARComposer: Authoring Augmented Reality Experiences through Text.
Proceedings of the Adjunct Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology, 2019

ICDAR 2019 Competition on Harvesting Raw Tables from Infographics (CHART-Infographics).
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

2018
VoCoG: An Intelligent, Non-Intrusive Assistant for Voice-based Collaborative Group-Viewing.
CoRR, 2018

Show and Recall @ MediaEval 2018 ViMemNet: Predicting Video Memorability.
Proceedings of the Working Notes Proceedings of the MediaEval 2018 Workshop, 2018

2017
Synthesis-based Robust Low Resolution Face Recognition.
CoRR, 2017

Show and Recall: Learning What Makes Videos Memorable.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

2016
Experience Individualization on Online TV Platforms through Persona-based Account Decomposition.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015
Coupled Projections for Adaptation of Dictionaries.
IEEE Trans. Image Process., 2015

Similarity Learning for Product Recommendation and Scoring Using Multi-channel Data.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

Domain adaptive sparse representation-based classification.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Class consistent multi-modal fusion with binary features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Sparse Methods for Robust and Efficient Visual Recognition.
PhD thesis, 2014

Joint Sparse Representation for Robust Multimodal Biometrics Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2014

Unsupervised domain adaptation using parallel transport on Grassmann manifold.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Parametric Dictionaries and Feature Augmentation for Continuous Domain Adaptation.
Proceedings of the 2014 Indian Conference on Computer Vision, 2014

Analysis sparse coding models for image-based classification.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Video-based face recognition via joint sparse representation.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

Generalized Domain-Adaptive Dictionaries.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Joint Sparsity-Based Robust Multimodal Biometrics Recognition.
Proceedings of the Computer Vision - ECCV 2012. Workshops and Demonstrations, 2012

2011
Personal Identification Using Multibiometrics Rank-Level Fusion.
IEEE Trans. Syst. Man Cybern. Part C, 2011

Synthesis-based recognition of low resolution faces.
Proceedings of the 2011 IEEE International Joint Conference on Biometrics, 2011

2010
Palmprint recognition using rank level fusion.
Proceedings of the International Conference on Image Processing, 2010


  Loading...