Diptesh Kanojia

Orcid: 0000-0001-8814-0080

According to our database1, Diptesh Kanojia authored at least 88 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
CreoleVal: Multilingual Multitask Benchmarks for Creoles.
Trans. Assoc. Comput. Linguistics, 2024

Sampling Strategies for Creation of a Benchmark for Dialectal Sentiment Classification.
CoRR, 2024

Are Large Language Models State-of-the-art Quality Estimators for Machine Translation of User-generated Content?
CoRR, 2024

Edit Distances and Their Applications to Downstream Tasks in Research and Commercial Contexts.
CoRR, 2024

Connecting Ideas in 'Lower-Resource' Scenarios: NLP for National Varieties, Creoles and Other Low-resource Scenarios.
CoRR, 2024

AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis.
CoRR, 2024

Unsupervised Audio-Visual Segmentation with Modality Alignment.
CoRR, 2024

Google Translate Error Analysis for Mental Healthcare Information: Evaluating Accuracy, Comprehensibility, and Implications for Multilingual Healthcare Communication.
CoRR, 2024

Airavata: Introducing Hindi Instruction-tuned LLM.
CoRR, 2024

Natural Language Processing for Dialects of a Language: A Survey.
CoRR, 2024

Findings of the Quality Estimation Shared Task at WMT 2024: Are LLMs Closing the Gap in QE?
Proceedings of the Ninth Conference on Machine Translation, 2024

A Multi-task Learning Framework for Evaluating Machine Translation of Emotion-loaded User-generated Content.
Proceedings of the Ninth Conference on Machine Translation, 2024

A Survey of Multimodal Sarcasm Detection.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

StableTalk: Advancing Audio-to-Talking Face Generation with Stable Diffusion and Vision Transformer.
Proceedings of the Pattern Recognition - 27th International Conference, 2024

Centrality-aware Product Retrieval and Ranking.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

What do Large Language Models Need for Machine Translation Evaluation?
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Together We Can: Multilingual Automatic Post-Editing for Low-Resource Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Evaluating Machine Translation for Emotion-loaded User Generated Content (TransEval4Emo-UGC).
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), 2024

Character-level Language Models for Abbreviation and Long-form Detection.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Product Retrieval and Ranking for Alphanumeric Queries.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

DiffSED: Sound Event Detection with Denoising Diffusion.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
APE-then-QE: Correcting then Filtering Pseudo Parallel Corpora for MT Training Data Creation.
CoRR, 2023

CreoleVal: Multilingual Multitask Benchmarks for Creoles.
CoRR, 2023

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection.
CoRR, 2023

Leveraging Foundation models for Unsupervised Audio-Visual Segmentation.
CoRR, 2023

Applications and Challenges of Sentiment Analysis in Real-life Scenarios.
CoRR, 2023

SurreyAI 2023 Submission for the Quality Estimation Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

Findings of the WMT 2023 Shared Task on Quality Estimation.
Proceedings of the Eighth Conference on Machine Translation, 2023

Findings of the WMT 2023 Shared Task on Automatic Post-Editing.
Proceedings of the Eighth Conference on Machine Translation, 2023

Modelling Political Aggression on Social Media Platforms.
Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, 2023

Predict and Use: Harnessing Predicted Gaze to Improve Multimodal Sarcasm Detection.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Quality Estimation-Assisted Automatic Post-Editing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Evaluation of Chinese-English Machine Translation of Emotion-Loaded Microblog Texts: A Human Annotated Dataset for the Quality Assessment of Emotion Translation.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

A Multi-task Learning Framework for Quality Estimation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

YANMTT: Yet Another Neural Machine Translation Toolkit.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2023

2022
An Ensemble Approach to Acronym Extraction using Transformers.
CoRR, 2022

Some Strategies to Capture Karaka-Yogyata with Special Reference to apadana.
CoRR, 2022

Strategies of Effective Digitization of Commentaries and Sub-commentaries: Towards the Construction of Textual History.
CoRR, 2022

Findings of the WMT 2022 Shared Task on Quality Estimation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Findings of the WMT 2022 Shared Task on Automatic Post-Editing.
Proceedings of the Seventh Conference on Machine Translation, 2022

SURREY-CTS-NLP at WASSA2022: An Experiment of Discourse and Sentiment Analysis for the Prediction of Empathy, Distress and Emotion.
Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

PLOD: An Abbreviation Detection Dataset for Scientific Documents.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

HiNER: A large Hindi Named Entity Recognition Dataset.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Harnessing Abstractive Summarization for Fact-Checked Claim Detection.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Investigations into Distributional Semantics for Cognate Detection and Phylogenetics.
PhD thesis, 2021

Pushing the Right Buttons: Adversarial Evaluation of Quality Estimation.
Proceedings of the Sixth Conference on Machine Translation, 2021

Automated Evidence Collection for Fake News Detection.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

"So You Think You're Funny?": Rating the Humour Quotient in Standup Comedy.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Cognition-aware Cognate Detection.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

FrameNet-assisted Noun Compound Interpretation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
"A Passage to India": Pre-trained Word Embeddings for Indian Languages.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Recommendation Chart of Domains for Cross-Domain Sentiment Analysis: Findings of A 20 Domain Study.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Challenge Dataset of Cognates and False Friend Pairs from Indian Languages.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Happy Are Those Who Grade without Seeing: A Multi-Task Learning Approach to Grade Essays Using Gaze Behaviour.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

A Survey on Using Gaze Behaviour for Natural Language Processing.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Cognitively Aided Zero-Shot Automatic Essay Grading.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

Harnessing Deep Cross-lingual Word Embeddings to Infer Accurate Phylogenetic Trees.
Proceedings of the CoDS-COMAD 2020: 7th ACM IKDD CoDS and 25th COMAD, 2020

Keep Your Dimensions on a Leash: True Cognate Detection using Siamese Deep Neural Networks.
Proceedings of the CoDS-COMAD 2020: 7th ACM IKDD CoDS and 25th COMAD, 2020

Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2019
Utilizing Wordnets for Cognate Detection among Indian Languages.
Proceedings of the 10th Global Wordnet Conference, 2019

Cognate Identification to improve Phylogenetic trees for Indian Languages.
Proceedings of the ACM India Joint International Conference on Data Science and Management of Data, 2019

2018
Eyes are the Windows to the Soul: Predicting the Rating of Text Quality Using Gaze Behaviour.
CoRR, 2018

New Vistas to study Bhartrhari: Cognitive NLP.
CoRR, 2018

Hindi Wordnet for Language Teaching: Experiences and Lessons Learnt.
Proceedings of the 9th Global Wordnet Conference, 2018

Semi-automatic WordNet Linking using Word Embeddings.
Proceedings of the 9th Global Wordnet Conference, 2018

pyiwn: A Python based API to access Indian Language WordNets.
Proceedings of the 9th Global Wordnet Conference, 2018

Synthesizing Audio for Hindi WordNet.
Proceedings of the 9th Global Wordnet Conference, 2018

Indian Language Wordnets and their Linkages with Princeton WordNet.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Eyes are the Windows to the Soul: Predicting the Rating of Text Quality Using Gaze Behaviour.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Is your Statement Purposeless? Predicting Computer Science Graduation Admission Acceptance based on Statement Of Purpose.
Proceedings of the 14th International Conference on Natural Language Processing, 2017

Scanpath Complexity: Modeling Reading Effort Using Gaze Information.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Sarcasm Suite: A Browser-Based Engine for Sarcasm Detection and Generation.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Civique: Using Social Media to Detect Urban Emergencies.
CoRR, 2016

Mapping it differently: A solution to the linking challenges.
Proceedings of the 8th Global WordNet Conference, 2016

A picture is worth a thousand words: Using OpenClipArt library for enriching IndoWordNet.
Proceedings of the 8th Global WordNet Conference, 2016

Sophisticated Lexical Databases - Simplified Usage: Mobile Applications and Browser Plugins For Wordnets.
Proceedings of the 8th Global WordNet Conference, 2016

That'll Do Fine!: A Coarse Lexical Resource for English-Hindi MT, Using Polylingual Topic Models.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

SlangNet: A WordNet like resource for English Slang.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Leveraging Cognitive Features for Sentiment Analysis.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Harnessing Cognitive Features for Sarcasm Detection.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Predicting Readers' Sarcasm Understandability by Modeling Gaze Behavior.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Using Multilingual Topic Models for Improved Alignment in English-Hindi MT.
Proceedings of the 12th International Conference on Natural Language Processing, 2015

TransChat: Cross-Lingual Instant Messaging for Indian Languages.
Proceedings of the 12th International Conference on Natural Language Processing, 2015

World WordNet Database Structure: An Efficient Schema for Storing Information of WordNets of the World.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Do not do processing, when you can look up: Towards a Discrimination Net for WSD.
Proceedings of the Seventh Global Wordnet Conference, 2014

PaCMan : Parallel Corpus Management Workbench.
Proceedings of the 11th International Conference on Natural Language Processing, 2014

2013
More than meets the eye: Study of Human Cognition in Sense Annotation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

2012
Discrimination-Net for Hindi.
Proceedings of the COLING 2012, 2012


  Loading...