Sonal Kumar

Orcid: 0009-0006-5516-3060

According to our database, Sonal Kumar authored at least 31 papers between 2009 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
DatUS: Data-Driven Unsupervised Semantic Segmentation With Pretrained Self-Supervised Vision Transformer.
IEEE Trans. Cogn. Dev. Syst., October, 2024

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds.
CoRR, 2024

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities.
CoRR, 2024

LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition.
CoRR, 2024

ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions.
CoRR, 2024

VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap.
CoRR, 2024

Do Vision-Language Models Understand Compound Nouns?
CoRR, 2024

DatUS^2: Data-driven Unsupervised Semantic Segmentation with Pre-trained Self-supervised Vision Transformer.
CoRR, 2024

Do Vision-Language Models Understand Compound Nouns?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

A Closer Look at the Limitations of Instruction Tuning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

IPCL: Iterative Pseudo-Supervised Contrastive Learning to Improve Self-Supervised Feature Representation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Recap: Retrieval-Augmented Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2024

EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

AV-RIR: Audio-Visual Room Impulse Response Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
ASPIRE: Language-Guided Augmentation for Robust Image Classification.
CoRR, 2023

BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NER.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DALE: Generative Data Augmentation for Low-Resource Legal NLP.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
BSSFFS: blockchain-based sybil-secured smart forest fire surveillance.
J. Ambient Intell. Humaniz. Comput., 2022

A novel multimodal dynamic fusion network for disfluency detection in spoken utterances.
CoRR, 2022

Span Classification with Structured Information for Disfluency Detection in Spoken Utterances.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Cisco at AAAI-CAD21 shared task: Predicting Emphasis in Presentation Slides using Contextualised Embeddings.
CoRR, 2021

Cisco at SemEval-2021 Task 5: What's Toxic?: Leveraging Transformers for Multiple Toxic Span Extraction from Online Comments.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

2020
Blockchain-Based Sybil-Secure Data Transmission (SSDT) IoT Framework for Smart City Applications.
Proceedings of the Evolution in Computational Intelligence, 2020

2009
Enabling web services for Classification of Satellite Image.
Proceedings of the 2009 International Conference on Semantic Web & Web Services, 2009

