Ion Androutsopoulos

Orcid: 0009-0000-2969-0509

According to our database, Ion Androutsopoulos authored at least 133 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Machine learning in bank merger prediction: A text-based approach.
Eur. J. Oper. Res., January, 2024

LAR-ECHR: A New Legal Argument Reasoning Task and Dataset for Cases of the European Court of Human Rights.
CoRR, 2024

Should I try multiple optimizers when fine-tuning pre-trained Transformers for NLP tasks? Should I tune their hyperparameters?
CoRR, 2024

Archimedes-AUEB at SemEval-2024 Task 5: LLM explains Civil Procedure.
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

Should I try multiple optimizers when fine-tuning a pre-trained Transformer for NLP tasks? Should I tune their hyperparameters?
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Still All Greeklish to Me: Greeklish to Greek Transliteration.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

AUEB NLP Group at ImageCLEFmedical Caption 2024.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Machine Learning for Ancient Languages: A Survey.
Comput. Linguistics, September, 2023

Cache me if you Can: an Online Cost-aware Teacher-Student framework to Reduce the Calls to Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

AUEB NLP Group at ImageCLEFmedical Caption 2023.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

2022
Deception detection in text and its relation to the cultural dimension of individualism/collectivism.
Nat. Lang. Eng., 2022

Restoring and attributing ancient texts using deep neural networks.
Nat., 2022

Diagnostic captioning: a survey.
Knowl. Inf. Syst., 2022

Toxicity detection sensitive to conversational context.
First Monday, 2022

Realistic Zero-Shot Cross-Lingual Transfer in Legal Topic Classification.
Proceedings of the SETN 2022: 12th Hellenic Conference on Artificial Intelligence, Corfu, Greece, September 7, 2022

AUEB NLP Group at ImageCLEFmed Caption 2022.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 2022

Data Augmentation for Biomedical Factoid Question Answering.
Proceedings of the 21st Workshop on Biomedical Language Processing, 2022

From the Detection of Toxic Spans in Online Discussions to the Analysis of Toxic-to-Civil Transfer.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

FiNER: Financial Numeric Entity Recognition for XBRL Tagging.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Processing Long Legal Documents with Pre-trained Transformers: Modding LegalBERT and Longformer.
Proceedings of the Natural Legal Language Processing Workshop, 2022

2021
Toxicity Detection can be Sensitive to the Conversational Context.
CoRR, 2021

EDGAR-CORPUS: Billions of Tokens Make The World Go Round.
CoRR, 2021

Neural Contract Element Extraction Revisited.
CoRR, 2021

SemEval-2021 Task 5: Toxic Spans Detection.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

Paragraph-level Rationale Extraction through Regularization: A case study on European Court of Human Rights Cases.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

AUEB NLP Group at ImageCLEFmed Caption Tasks 2021.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 2021

A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
LEGAL-BERT: The Muppets straight out of Law School.
CoRR, 2020

GREEK-BERT: The Greeks visiting Sesame Street.
Proceedings of the SETN 2020: 11th Hellenic Conference on Artificial Intelligence, 2020

Domain Adversarial Fine-Tuning as an Effective Regularizer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

LEGAL-BERT: "Preparing the Muppets for Court".
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

AUEB-NLP at BioASQ 8: Biomedical Document and Snippet Retrieval.
Proceedings of the Working Notes of CLEF 2020, 2020

Medical Image Tagging by Deep Learning and Retrieval.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2020

AUEB NLP Group at ImageCLEFmed Caption 2020.
Proceedings of the Working Notes of CLEF 2020, 2020

BioMRC: A Dataset for Biomedical Machine Reading Comprehension.
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 2020

Toxicity Detection: Does Context Really Matter?
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
SumQE: a BERT-based Summary Quality Estimation Model.
CoRR, 2019

A Survey on Biomedical Image Captioning.
CoRR, 2019

Extreme Multi-Label Legal Text Classification: A case study in EU Legislation.
CoRR, 2019

SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression.
CoRR, 2019

ConvAI at SemEval-2019 Task 6: Offensive Language Identification and Categorization with Perspective and BERT.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

AUEB at BioASQ 7: Document and Snippet Retrieval.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2019

SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

SUM-QE: a BERT-based Summary Quality Estimation Model.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

AUEB NLP Group at ImageCLEFmed Caption 2019.
Proceedings of the Working Notes of CLEF 2019, 2019

Transfer Learning for Causal Sentence Detection.
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019

Embedding Biomedical Ontologies by Jointly Encoding Network Structure and Textual Node Descriptors.
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019

Large-Scale Multi-Label Text Classification on EU Legislation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Neural Legal Judgment Prediction in English.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Generating Texts with Integer Linear Programming.
CoRR, 2018

Extracting Linguistic Resources from the Web for Concept-to-Text Generation.
CoRR, 2018

AUEB at BioASQ 6: Document and Snippet Retrieval.
CoRR, 2018

Identifying Retweetable Tweets with a Personalized Global Classifier.
Proceedings of the 10th Hellenic Conference on Artificial Intelligence, 2018

Ontology Driven Extraction of Research Processes.
Proceedings of the Semantic Web - ISWC 2018, 2018

BioRead: A New Dataset for Biomedical Reading Comprehension.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Deep Relevance Ranking using Enhanced Document-Query Interactions.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Obligation and Prohibition Extraction Using Hierarchical RNNs.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
A Personalized Global Filter To Predict Retweets.
Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization, 2017

A Deep Learning Approach to Contract Element Extraction.
Proceedings of the Legal Knowledge and Information Systems, 2017

Extracting contract elements.
Proceedings of the 16th edition of the International Conference on Artificial Intelligence and Law, 2017

Improved Abusive Comment Moderation with User Embeddings.
Proceedings of the 2017 Workshop: Natural Language Processing meets Journalism, 2017

Deeper Attention to Abusive User Content Moderation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Deep Learning for User Comment Moderation.
Proceedings of the First Workshop on Abusive Language Online, 2017

2016
AUEB-ABSA at SemEval-2016 Task 5: Ensembles of Classifiers and Embeddings for Aspect Based Sentiment Analysis.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

aueb.twitter.sentiment at SemEval-2016 Task 4: A Weighted Ensemble of SVMs for Twitter Sentiment Analysis.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question Answering.
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016

2015
Evaluation measures for hierarchical classification: a unified view and novel approaches.
Data Min. Knowl. Discov., 2015

LSHTC: A Benchmark for Large-Scale Text Classification.
CoRR, 2015

Probabilistic Cascading for Large Scale Hierarchical Classification.
CoRR, 2015

An overview of the BIOASQ large-scale biomedical semantic indexing and question answering competition.
BMC Bioinform., 2015

SemEval-2015 Task 12: Aspect Based Sentiment Analysis.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Anger detection in call center dialogues.
Proceedings of the 6th IEEE International Conference on Cognitive Infocommunications, 2015

Biomedical Question-focused Multi-document Summarization: ILSP and AUEB at BioASQ3.
Proceedings of the Working Notes of CLEF 2015, 2015

2014
Web-scale classification: web classification in the big data era.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

SemEval-2014 Task 4: Aspect Based Sentiment Analysis.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Multi-Granular Aspect Aggregation in Aspect-Based Sentiment Analysis.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

The Effect of Dimensionality Reduction on Large Scale Hierarchical Classification.
Proceedings of the Information Access Evaluation. Multilinguality, Multimodality, and Interaction, 2014

2013
Generating Natural Language Descriptions from OWL Ontologies: the NaturalOWL System.
J. Artif. Intell. Res., 2013

Using Integer Linear Programming for Content Selection, Lexicalization, and Aggregation to Produce Compact Texts from OWL Ontologies.
Proceedings of the ENLG 2013, 2013

Using Integer Linear Programming in Concept-to-Text Generation to Produce More Compact Texts.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Word Sense Disambiguation as an Integer Linear Programming Problem.
Proceedings of the Artificial Intelligence: Theories and Applications, 2012

Extractive Multi-Document Summarization with Integer Linear Programming and Support Vector Regression.
Proceedings of the COLING 2012, 2012

BioASQ: A Challenge on Large-Scale Biomedical Semantic Indexing and Question Answering.
Proceedings of the Information Retrieval and Knowledge Discovery in Biomedical Text, 2012

2011
A Generate and Rank Approach to Sentence Paraphrasing.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

2010
A Survey of Paraphrasing and Textual Entailment Methods.
J. Artif. Intell. Res., 2010

An extractive supervised two-stage method for sentence compression.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association for Computational Linguistics, 2010

2009
DCC&U: An Extended Digital Curation Lifecycle Model.
Int. J. Digit. Curation, 2009

Finding Short Definitions of Terms on Web Pages.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Adaptive Natural Language Interaction.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

An Open-Source Natural Language Generator for OWL Ontologies and its Use in Protege and Second Life.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

2008
Teaching Nonzero Sum Games Using a Diagrammatic Determination of Equilibria.
INFORMS Trans. Educ., 2008

2007
Source authoring for multilingual generation of personalised object descriptions.
Nat. Lang. Eng., 2007

Named Entity Recognition in Greek Texts with an Ensemble of SVMs and Active Learning.
Int. J. Artif. Intell. Tools, 2007

Word Sense Disambiguation with Spreading Activation Networks Generated from Thesauri.
Proceedings of the IJCAI 2007, 2007

Generating Multilingual Descriptions from Linguistically Annotated OWL Ontologies: the NaturalOWL System.
Proceedings of the Eleventh European Workshop on Natural Language Generation, 2007

A Game-Theoretic Investigation of the Effect of Human Interactive Proofs on Spam E-mail.
Proceedings of the CEAS 2007, 2007

Learning Textual Entailment using SVMs and String Similarity Measures.
Proceedings of the ACL-PASCAL@ACL 2007 Workshop on Textual Entailment and Paraphrasing, 2007

2006
A Greek Named-Entity Recognizer That Uses Support Vector Machines and Active Learning.
Proceedings of the Advances in Artificial Intelligence, 4th Hellenic Conference on AI, 2006

Spam Filtering with Naive Bayes - Which Naive Bayes?
Proceedings of the CEAS 2006, 2006

2005
A Practically Unsupervised Learning Method to Identify Single-Snippet Answers to Definition Questions on the Web.
Proceedings of the HLT/EMNLP 2005, 2005

Exploiting OWL Ontologies in the Multilingual Generation of Object Descriptions.
Proceedings of the Tenth European Workshop on Natural Language Generation, 2005

A Game Theoretic Model of Spam E-Mailing.
Proceedings of the CEAS 2005, 2005

2004
Learning to Identify Single-Snippet Answers to Definition Questions.
Proceedings of the COLING 2004, 2004

Filtron: A Learning-Based Anti-Spam Filter.
Proceedings of the CEAS 2004, 2004

2003
A Memory-Based Approach to Anti-Spam Filtering for Mailing Lists.
Inf. Retr., 2003

Speaking the Users' Languages.
IEEE Intell. Syst., 2003

Learning to Order Facts for Discourse Planning in Natural Language Generation.
Proceedings of the 9th European Workshop on Natural Language Generation, 2003

2002
Symbolic Authoring for Multilingual Natural Language Generation.
Proceedings of the Methods and Applications of Artificial Intelligence, 2002

Ellogon: A New Text Engineering Platform.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

2001
Generating Multilingual Personalized Descriptions of Museum Exhibits - The M-PIRO Project.
CoRR, 2001

A Greek Morphological Lexicon and Its Exploitation by Natural Language Processing Applications.
Proceedings of the Advances in Informatics, 8th Panhellenic Conference on Informatics, 2001

Stacking Classifiers for Anti-Spam Filtering of E-Mail.
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2001

2000
Learning to Filter Spam E-Mail: A Comparison of a Naive Bayesian and a Memory-Based Approach.
CoRR, 2000

Selectional Restrictions in HPSG.
CoRR, 2000

An evaluation of Naive Bayesian anti-spam filtering.
CoRR, 2000

An experimental comparison of naive bayesian and keyword-based anti-spam filtering with personal e-mail messages.
Proceedings of the SIGIR 2000: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2000

Learning Rules for Large-Vocabulary Word Sense Disambiguation: A Comparison of Various Classifiers.
Proceedings of the Natural Language Processing, 2000

Automatic Web Rating: Filtering Obscene Content on the Web.
Proceedings of the Research and Advanced Technology for Digital Libraries, 2000

Selectional Restrictions in HPSG.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

1999
Temporal Meaning Representations in a Natural Language Front-End.
CoRR, 1999

Resolving Part-of-Speech Ambiguity in the Greek Language Using Learning Techniques.
CoRR, 1999

1998
Time, tense and aspect in natural language database interfaces.
Nat. Lang. Eng., 1998

1996
A principled framework for constructing natural language interfaces to temporal databases.
PhD thesis, 1996

A Framework for Natural Language Interfaces to Temporal Databases.
CoRR, 1996

A Principled Framework for Constructing Natural Language Interfaces To Temporal Databases.
CoRR, 1996

1995
Natural language interfaces to databases - an introduction.
Nat. Lang. Eng., 1995

Experience Using TSQL2 in a Natural Language Interface.
Proceedings of the Recent Advances in Temporal Databases, 1995

