John P. McCrae

ORCID: 0000-0002-7227-1331

Affiliations:
  • National University of Ireland Galway, Insight Centre for Data Analytics, Ireland
  • University of Bielefeld, CITEC, Germany


According to our database, John P. McCrae authored at least 149 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Text Detoxification as Style Transfer in English and Hindi.
CoRR, 2024

English-to-Low-Resource Translation: A Multimodal Approach for Hindi, Malayalam, Bengali, and Hausa.
Proceedings of the Ninth Conference on Machine Translation, 2024

Findings of the SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages.
Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, 2024

Large Language Models for Few-Shot Automatic Term Extraction.
Proceedings of the Natural Language Processing and Information Systems, 2024

Multilingual Text Style Transfer: Datasets & Models for Indian Languages.
Proceedings of the 17th International Natural Language Generation Conference, 2024

BRECS: Enhanced Binary Representation of Word Embeddings via Cosine Similarity.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

MaCmS: Magahi Code-mixed Dataset for Sentiment Analysis.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Intent Classification by the Use of Automatically Generated Knowledge Graphs.
Inf., May, 2023

Detecting abusive comments at a fine-grained level in a low-resource language.
Nat. Lang. Process. J., 2023

Empowering recommender systems using automatically generated Knowledge Graphs and Reinforcement Learning.
CoRR, 2023

Some Considerations in the Construction of a Historical Language WordNet.
Proceedings of the 12th Global Wordnet Conference, 2023

Documenting the Open Multilingual Wordnet.
Proceedings of the 12th Global Wordnet Conference, 2023

Temporal Domain Adaptation for Historical Irish.
Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

Exploring Techniques to Detect and Mitigate Non-Inclusive Language Bias in Marketing Communications Using a Dictionary-Based Approach.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

PICKD: In-Situ Prompt Tuning for Knowledge-Grounded Dialogue Generation.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

The Cardamom Workbench for Historical and Under-Resourced Languages.
Proceedings of the 4th Conference on Language, Data and Knowledge, 2023

MG2P: An Empirical Study Of Multilingual Training for Manx G2P.
Proceedings of the 4th Conference on Language, Data and Knowledge, 2023

Weakly-supervised Deep Cognate Detection Framework for Low-Resourced Languages Using Morphological Knowledge of Closely-Related Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
When linguistics meets web technologies. Recent advances in modelling linguistic linked data.
Semantic Web, 2022

DravidianCodeMix: sentiment analysis and offensive language identification dataset for Dravidian languages in code-mixed text.
Lang. Resour. Evaluation, 2022

Toward an Integrative Approach for Making Sense Distinctions.
Frontiers Artif. Intell., 2022

TamilEmo: Finegrained Emotion Detection Dataset for Tamil.
CoRR, 2022

Overview of The Shared Task on Homophobia and Transphobia Detection in Social Media Comments.
Proceedings of the Second Workshop on Language Technology for Equality, 2022

Linghub2: Language Resource Discovery Tool for Language Technologies.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

MHE: Code-Mixed Corpora for Similar Language Identification.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Towards the Construction of a WordNet for Old English.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Semantic Aware Answer Sentence Selection Using Self-Learning Based Domain Adaptation.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Proceedings of the 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference.
Proceedings of the 8th Workshop on Linked Data in Linguistics within the 13th Language Resources and Evaluation Conference, 2022

KG-CRuSE: Recurrent Walks over Knowledge Graph for Explainable Conversation Reasoning using Semantic Embeddings.
Proceedings of the 4th Workshop on NLP for Conversational AI, 2022

2021
A Survey of Orthographic Information in Machine Translation.
SN Comput. Sci., 2021

Conversation Concepts: Understanding Topics and Building Taxonomies for Financial Services.
Inf., 2021

Dataset for Identification of Homophobia and Transophobia in Multilingual YouTube Comments.
CoRR, 2021

DravidianMultiModality: A Dataset for Multi-modal Sentiment Analysis in Tamil and Malayalam.
CoRR, 2021

The GlobalWordNet Formats: Updates for 2020.
Proceedings of the 11th Global Wordnet Conference, 2021

Towards a Linking between WordNet and Wikidata.
Proceedings of the 11th Global Wordnet Conference, 2021

Monolingual Word Sense Alignment as a Classification Problem.
Proceedings of the 11th Global Wordnet Conference, 2021

ULD-NUIG at Social Media Mining for Health Applications (#SMM4H) Shared Task 2021.
Proceedings of the Sixth Social Media Mining for Health Workshop and Shared Task, 2021

Automatic Construction of Knowledge Graphs from Text and Structured Data: A Preliminary Literature Review.
Proceedings of the 3rd Conference on Language, Data and Knowledge, 2021

Encoder-Attention-Based Automatic Term Recognition (EA-ATR).
Proceedings of the 3rd Conference on Language, Data and Knowledge, 2021

Meta-Learning for Offensive Language Detection in Code-Mixed Texts.
Proceedings of the FIRE 2021: Forum for Information Retrieval Evaluation, Virtual Event, India, December 13, 2021

Findings of Shared Task on Offensive Language Identification in Tamil and Malayalam.
Proceedings of the FIRE 2021: Forum for Information Retrieval Evaluation, Virtual Event, India, December 13, 2021

Findings of the Sentiment Analysis of Dravidian Languages in Code-Mixed Text.
Proceedings of the Working Notes of FIRE 2021, 2021

Overview of the HASOC-DravidianCodeMix Shared Task on Offensive Language Detection in Tamil and Malayalam.
Proceedings of the Working Notes of FIRE 2021, 2021

Cross-lingual Sentence Embedding using Multi-Task Learning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Few-shot and Zero-shot Approaches to Legal Text Classification: A Case Study in the Financial Sector.
Proceedings of the Natural Legal Language Processing Workshop 2021, 2021

2020
COST Action "European network for Web centred linguistic data science" (NexusLinguarum).
Proces. del Leng. Natural, 2020

NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020.
Proceedings of the Fifth Conference on Machine Translation, 2020

Bilingual Lexicon Induction across Orthographically-distinct Under-Resourced Dravidian Languages.
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 2020

Corpus Creation for Sentiment Analysis in Code-Mixed Tamil-English Text.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

A Sentiment Analysis Dataset for Code-Mixed Malayalam-English.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

iLOD: InterPlanetary File System based Linked Open Data Cloud.
Proceedings of the 6th Workshop on Managing the Evolution and Preservation of the Data Web (MEPDaW) co-located with the 19th International Semantic Web Conference (ISWC 2020), 2020

ULD@NUIG at SemEval-2020 Task 9: Generative Morphemes with an Attention Model for Sentiment Analysis in Code-Mixed Text.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Figure Me Out: A Gold Standard Dataset for Metaphor Interpretation.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

English WordNet 2020: Improving and Extending a WordNet for English using an Open-Source Methodology.
Proceedings of the LREC 2020 Workshop on Multimodal Wordnets, 2020

NUIG at TIAD: Combining Unsupervised NLP and Graph Metrics for Translation Inference.
Proceedings of the 2020 Globalex Workshop on Linked Lexicography, 2020

On the Linguistic Linked Open Data Infrastructure.
Proceedings of the 1st International Workshop on Language Technology Platforms, 2020

Modelling Frequency and Attestations for OntoLex-Lemon.
Proceedings of the 2020 Globalex Workshop on Linked Lexicography, 2020

Some Issues with Building a Multilingual Wordnet.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Overview of the track on Sentiment Analysis for Dravidian Languages in Code-Mixed Text.
Proceedings of the Working Notes of FIRE 2020, 2020

Overview of the track on HASOC-Offensive Language Identification-DravidianCodeMix.
Proceedings of the Working Notes of FIRE 2020, 2020

Contextual Modulation for Relation-Level Metaphor Identification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network.
Proceedings of the 7th IEEE International Conference on Data Science and Advanced Analytics, 2020

Suggest me a movie for tonight: Leveraging Knowledge Graphs for Conversational Recommendation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Unsupervised Deep Language and Dialect Identification for Short Texts.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

A Comparative Study of Different State-of-the-Art Hate Speech Detection Methods in Hindi-English Code-Mixed Data.
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 2020

Challenges of Word Sense Alignment: Portuguese Language Resources.
Proceedings of the 7th Workshop on Linked Data in Linguistics, 2020

Adaptation of Word-Level Benchmark Datasets for Relation-Level Metaphor Identification.
Proceedings of the Second Workshop on Figurative Language Processing, 2020

Linguistic Linked Data - Representation, Generation and Applications
Springer, ISBN: 978-3-030-30224-5, 2020

2019
Foreword to the Special Issue: "Towards the Multilingual Web of Data".
Inf., 2019

Polylingual Wordnet.
CoRR, 2019

English WordNet 2019 - An Open-Source WordNet for English.
Proceedings of the 10th Global Wordnet Conference, 2019

Identification of Adjective-Noun Neologisms using Pretrained Language Models.
Proceedings of the Joint Workshop on Multiword Expressions and WordNet, 2019

WordNet Gloss Translation for Under-resourced Languages using Multilingual Neural Machine Translation.
Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation, 2019

Crowd-Sourcing A High-Quality Dataset for Metaphor Identification in Tweets.
Proceedings of the 2nd Conference on Language, Data and Knowledge, 2019

TIAD 2019 shared task: Leveraging knowledge graphs with neural machine translation for automatic multilingual dictionary generation.
Proceedings of TIAD-2019 Shared Task, 2019

TIAD Shared Task 2019: orthonormal explicit topic analysis for translation inference across dictionaries.
Proceedings of TIAD-2019 Shared Task, 2019

Representing Arabic Lexicons in Lemon - a Preliminary Study.
Proceedings of the Poster Session of the 2nd Conference on Language, 2019

Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages.
Proceedings of the 2nd Conference on Language, Data and Knowledge, 2019

Inferring translation candidates for multilingual dictionary generation with multi-way neural machine translation.
Proceedings of TIAD-2019 Shared Task, 2019

Lexical Sense Alignment using Weighted Bipartite b-Matching.
Proceedings of the Poster Session of the 2nd Conference on Language, 2019

Taxonomy Extraction for Customer Service Knowledge Base Construction.
Proceedings of the Semantic Systems. The Power of AI and Knowledge Graphs, 2019

A Comparative Study of SVM and LSTM Deep Learning Algorithms for Stock Market Prediction.
Proceedings for the 27th AIAI Irish Conference on Artificial Intelligence and Cognitive Science, 2019

2018
MixedEmotions: An Open-Source Toolbox for Multimodal Emotion Analysis.
IEEE Trans. Multim., 2018

A Comparison of Emotion Annotation Approaches for Text.
Inf., 2018

Temporal Analysis of Entity Relatedness and its Evolution using Wikipedia and DBpedia.
CoRR, 2018

ELEXIS - a European infrastructure fostering cooperation and information exchange among lexicographical research communities.
Proceedings of the 9th Global Wordnet Conference, 2018

Towards a Crowd-Sourced WordNet for Colloquial English.
Proceedings of the 9th Global Wordnet Conference, 2018

Mapping WordNet Instances to Wikipedia.
Proceedings of the 9th Global Wordnet Conference, 2018

Improving Wordnets for Under-Resourced Languages Using Machine Translation.
Proceedings of the 9th Global Wordnet Conference, 2018

Teanga: A Linked Data based platform for Natural Language Processing.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A Comparison Of Emotion Annotation Schemes And A New Annotated Data Set.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

A supervised approach to taxonomy extraction using word embeddings.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Automatic Enrichment of Terminological Resources: the IATE RDF Example.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

ELEXIS - Eine europäische Forschungsinfrastruktur für lexikographische Daten.
Proceedings of the 5. Tagung des Verbands Digital Humanities im deutschsprachigen Raum, 2018

Constructing an Annotated Corpus of Verbal MWEs for English.
Proceedings of the Joint Workshop on Linguistic Annotation, 2018

Phrase-Level Metaphor Identification Using Distributed Representations of Word Meaning.
Proceedings of the Workshop on Figurative Language Processing, 2018

2017
The Colloquial WordNet: Extending Princeton WordNet with Neologisms.
Proceedings of the Language, Data, and Knowledge - First International Conference, 2017

OnLiT: An Ontology for Linguistic Terminology.
Proceedings of the Language, Data, and Knowledge - First International Conference, 2017

An Evaluation Dataset for Linked Data Profiling.
Proceedings of the Language, Data, and Knowledge - First International Conference, 2017

2016
Domain adaptation for ontology localization.
J. Web Semant., 2016

Toward a truly multilingual GlobalWordnet Grid.
Proceedings of the 8th Global WordNet Conference, 2016

CILI: the Collaborative Interlingual Index.
Proceedings of the 8th Global WordNet Conference, 2016

Identifying Poorly-Defined Concepts in WordNet with Graph Metrics.
Proceedings of the Knowledge Graphs and Language Technology, 2016

LIXR: Quick, succinct conversion of XML to RDF.
Proceedings of the ISWC 2016 Posters & Demonstrations Track co-located with 15th International Semantic Web Conference (ISWC 2016), 2016

Yuzu: Publishing Any Data as Linked Data.
Proceedings of the ISWC 2016 Posters & Demonstrations Track co-located with 15th International Semantic Web Conference (ISWC 2016), 2016

NUIG-UNLP at SemEval-2016 Task 1: Soft Alignment and Deep Learning for Semantic Textual Similarity.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

The Open Linguistics Working Group: Developing the Linguistic Linked Open Data Cloud.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Expanding wordnets to new languages with multilingual sense disambiguation.
Proceedings of the COLING 2016, 2016

2015
Multilingual linked data.
Semantic Web, 2015

lemonUby - A large, interlinked, syntactically-rich lexical resource for ontologies.
Semantic Web, 2015

Linghub: a Linked Data based portal supporting the discovery of language resources.
Joint Proceedings of the Posters and Demos Track of 11th International Conference on Semantic Systems - SEMANTiCS 2015 and 1st Workshop on Data Science: Methods, Technology and Applications (DSci15), 2015

One Ontology to Bind Them All: The META-SHARE OWL Ontology for the Interoperability of Linguistic Datasets on the Web.
Proceedings of the Semantic Web: ESWC 2015 Satellite Events, Portorož, Slovenia, May 31, 2015

LIME: The Metadata Module for OntoLex.
Proceedings of the Semantic Web. Latest Advances and New Domains, 2015

Linking Four Heterogeneous Language Resources as Linked Data.
Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications, 2015

Reconciling Heterogeneous Descriptions of Language Resources.
Proceedings of the 4th Workshop on Linked Data in Linguistics: Resources and Applications, 2015

2014
Ontology-Based Interpretation of Natural Language
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02154-1, 2014

Representing Swedish Lexical Resources in RDF with lemon.
Proceedings of the ISWC 2014 Posters & Demonstrations Track a track within the 13th International Semantic Web Conference, 2014

Bielefeld SC: Orthonormal Topic Modelling for Grammar Induction.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Representing Multilingual Data as Linked Data: the Case of BabelNet 2.0.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Towards Assured Data Quality and Validation by Data Certification.
Proceedings of the 1st Workshop on Linked Data Quality co-located with 10th International Conference on Semantic Systems, 2014

Language Resources and Linked Data: A Practical Perspective.
Proceedings of the Knowledge Engineering and Knowledge Management, 2014

Default Physical Measurements in SUMO.
Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon, 2014

Modelling the Semantics of Adjectives in the Ontology-Lexicon Interface.
Proceedings of the 4th Workshop on Cognitive Aspects of the Lexicon, 2014

Design Patterns for Engineering the Ontology-Lexicon Interface.
Proceedings of the Towards the Multilingual Semantic Web, 2014

2013
On the Role of Senses in the Ontology-Lexicon.
Proceedings of the New Trends of Research in Ontologies and Lexical Resources, 2013

Towards Open Data for Linguistics: Linguistic Linked Data.
Proceedings of the New Trends of Research in Ontologies and Lexical Resources, 2013

A lemon lexicon for DBpedia.
Proceedings of the NLP & DBpedia workshop co-located with the 12th International Semantic Web Conference (ISWC 2013), 2013

Orthonormal Explicit Topic Analysis for Cross-Lingual Document Matching.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Releasing multimodal data as Linguistic Linked Open Data: An experience report.
Proceedings of the 2nd Workshop on Linked Data in Linguistics, 2013

Linguistic Linked Open Data (LLOD). Introduction and Overview.
Proceedings of the 2nd Workshop on Linked Data in Linguistics, 2013

Mining translations from the web of open linked data.
Proceedings of the Joint Workshop on NLP&LOD and SWAIE: Semantic Web, 2013

2012
Challenges for the multilingual Web of Data.
J. Web Semant., 2012

Interchanging lexical resources on the Semantic Web.
Lang. Resour. Evaluation, 2012

Collaborative semantic editing of linked data lexica.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Using SPIN to Formalise XBRL Accounting Regulations on the Semantic Web.
Proceedings of the Semantic Web: ESWC 2012 Satellite Events, 2012

Integrating WordNet and Wiktionary with lemon.
Proceedings of the Linked Data in Linguistics, 2012

2011
LexInfo: A declarative model for the lexicon-ontology interface.
J. Web Semant., 2011

Combining statistical and semantic approaches to the translation of ontologies and taxonomies.
Proceedings of Fifth Workshop on Syntax, 2011

Linking Lexical Resources and Ontologies on the Semantic Web with Lemon.
Proceedings of the Semantic Web: Research and Applications, 2011

2010
CLOVA: An Architecture for Cross-Language Semantic Data Querying.
Proceedings of the 1st International Workshop on the Multilingual Semantic Web, 2010

An ontology-driven system for detecting global health events.
Proceedings of the COLING 2010, 2010

2009
Automatic extraction of logically consistent ontologies from text corpora.
PhD thesis, 2009

2008
Synonym set extraction from the biomedical literature by lexical pattern discovery.
BMC Bioinform., 2008

