Elena Tutubalina

Orcid: 0000-0001-7936-0284

According to our database, Elena Tutubalina authored at least 89 papers between 2014 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.





NEREL: a Russian information extraction dataset with rich annotation for nested entities, relations, and wikidata entity links.
Lang. Resour. Evaluation, June, 2024

Data and models for stance and premise detection in COVID-19 tweets: Insights from the Social Media Mining for Health (SMM4H) 2022 shared task.
J. Biomed. Informatics, January, 2024

The overview of the BioRED (Biomedical Relation Extraction Dataset) track at BioCreative VIII.
Database J. Biol. Databases Curation, January, 2024

Call for papers: Special issue on biomedical multimodal large language models - novel approaches and applications.
J. Biomed. Informatics, 2024

∇²DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials.
CoRR, 2024

Biomedical Entity Representation with Graph-Augmented Multi-Objective Transformer.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Gradual Optimization Learning for Conformational Energy Minimization.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Chemical Language Models Have Problems with Chemistry: A Case Study on Molecule Captioning Task.
Proceedings of the Second Tiny Papers Track at ICLR 2024, 2024

BioASQ at CLEF2024: The Twelfth Edition of the Large-Scale Biomedical Semantic Indexing and Question Answering Challenge.
Proceedings of the Advances in Information Retrieval, 2024

Biomedical Concept Normalization over Nested Entities with Partial UMLS Terminology in Russian.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Overview of BioASQ 2024: The Twelfth BioASQ Challenge on Large-Scale Biomedical Semantic Indexing and Question Answering.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2024

Overview of BioNNE Task on Biomedical Nested Named Entity Recognition at BioASQ 2024.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

MMCP Team at ImageCLEFmed MEDVQA-GI 2024 Task: Diffusion Models for Text-to-Image Generation of Colonoscopy Images.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

HSE NLP Team at MEDIQA-CORR 2024 Task: In-Prompt Ensemble with Entities and Knowledge Graph for Medical Error Correction.
Proceedings of the 6th Clinical Natural Language Processing Workshop, 2024

AIRI NLP Team at EHRSQL 2024 Shared Task: T5 and Logistic Regression to the Rescue.
Proceedings of the 6th Clinical Natural Language Processing Workshop, 2024

Cross-Lingual Transfer Learning in Drug-Related Information Extraction from User-Generated Texts.
Program. Comput. Softw., December, 2023

NEREL-BIO: a dataset of biomedical abstracts annotated with nested named entities.
Bioinform., April, 2023

nach0: Multimodal Natural and Chemical Languages Foundation Model.
CoRR, 2023

Gradual Optimization Learning for Conformational Energy Minimization.
CoRR, 2023

Vote'n'Rank: Revision of Benchmarking with Social Choice Theory.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Graph-Enriched Biomedical Entity Representation Transformer.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2023

Multimodal model with text and drug embeddings for adverse drug reaction classification.
J. Biomed. Informatics, 2022

Findings of the The RuATD Shared Task 2022 on Artificial Text Detection in Russian.
CoRR, 2022

RuNNE-2022 Shared Task: Recognizing Nested Named Entities.
CoRR, 2022

Cross-Domain Limitations of Neural Models on Biomedical Relation Classification.
IEEE Access, 2022

Entity Linking over Nested Named Entities for Russian.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Medical Crossing: a Cross-lingual Evaluation of Clinical Entity Linking.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

A Comprehensive Evaluation of Biomedical Entity-centric Search.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

PAUQ: Text-to-SQL in Russian.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Overview of the Seventh Social Media Mining for Health Applications (#SMM4H) Shared Tasks at COLING 2022.
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, 2022

SMM4H 2022 Task 2: Dataset for stance and premise detection in tweets about health mandates related to COVID-19.
Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, 2022

RuCCoN: Clinical Concept Normalization in Russian.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

DetIE: Multilingual Open Information Extraction Inspired by Object Detection.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter.
J. Am. Medical Informatics Assoc., 2021

Many Heads but One Brain: an Overview of Fusion Brain Challenge on AI Journey 2021.
CoRR, 2021

NEREL: A Russian Dataset with Nested Named Entities and Relations.
CoRR, 2021

The Russian Drug Reaction Corpus and neural models for drug reactions and effectiveness detection in user reviews.
Bioinform., 2021

Medical concept normalization in clinical trials with drug and disease representation learning.
Bioinform., 2021

KFU NLP Team at SMM4H 2021 Tasks: Cross-lingual and Cross-modal BERT-based Models for Adverse Drug Effects.
Proceedings of the Sixth Social Media Mining for Health Workshop and Shared Task, 2021

Overview of the Sixth Social Media Mining for Health Applications (#SMM4H) Shared Tasks at NAACL 2021.
Proceedings of the Sixth Social Media Mining for Health Workshop and Shared Task, 2021

NEREL: A Russian Dataset with Nested Named Entities, Relations and Events.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

Drug and Disease Interpretation Learning with Biomedical Entity Representation Transformer.
Proceedings of the Advances in Information Retrieval, 2021

Selection of Pseudo-Annotated Data for Adverse Drug Reaction Classification Across Drug Groups.
Proceedings of the Analysis of Images, Social Networks and Texts, 2021

Near-Zero-Shot Suggestion Mining with a Little Help from WordNet.
Proceedings of the Analysis of Images, Social Networks and Texts, 2021

Improving unsupervised neural aspect extraction for online discussions using out-of-domain classification.
J. Intell. Fuzzy Syst., 2020

Multiple features for clinical relation extraction: A machine learning approach.
J. Biomed. Informatics, 2020

So What's the Plan? Mining Strategic Planning Documents.
CoRR, 2020

RecVAE: A New Variational Autoencoder for Top-N Recommendations with Implicit Feedback.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

KFU NLP Team at SMM4H 2020 Tasks: Cross-lingual Transfer Learning with Pretrained Language Models for Drug Reactions.
Proceedings of the Fifth Social Media Mining for Health Applications Workshop & Shared Task, 2020

Overview of the Fifth Social Media Mining for Health Applications (#SMM4H) Shared Tasks at COLING 2020.
Proceedings of the Fifth Social Media Mining for Health Applications Workshop & Shared Task, 2020

On Biomedical Named Entity Recognition: Experiments in Interlingual Transfer for Clinical and Social Media Texts.
Proceedings of the Advances in Information Retrieval, 2020

Topic Modeling as a Method of Educational Text Structuring.
Proceedings of the 13th International Conference on Developments in eSystems Engineering, 2020

Fair Evaluation in Concept Normalization: a Large-scale Comparative Analysis for BERT-based Models.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Ad Lingua: Text Classification Improves Symbolism Prediction in Image Advertisements.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Generating Sport Summaries: A Case Study for Russian.
Proceedings of the Analysis of Images, Social Networks and Texts, 2020

RuREBus: A Case Study of Joint Named Entity Recognition and Relation Extraction from E-Government Domain.
Proceedings of the Analysis of Images, Social Networks and Texts, 2020

Entity-Level Classification of Adverse Drug Reaction: A Comparative Analysis of Neural Network Models.
Program. Comput. Softw., 2019

CommentsRadar: Dive into Unique Data on All Comments on the Web.
CoRR, 2019

KFU NLP Team at SMM4H 2019 Tasks: Want to Extract Adverse Drugs Reactions from Tweets? BERT to The Rescue.
Proceedings of the Fourth Social Media Mining for Health Application Workshop & Shared Task, 2019

A Comparative Study on Feature Selection in Relation Extraction from Electronic Health Records.
Proceedings of the Selected Papers of the XXI International Conference on Data Analytics and Management in Data Intensive Domains (DAMDID/RCDL 2019), 2019

Distant Supervision for Sentiment Attitude Extraction.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Comparative analysis of context representation models in the relation extraction task from biomedical texts.
Proceedings of the 6th International Young Scientists Conference on Information Technologies, 2019

AspeRa: Aspect-Based Rating Prediction Model.
Proceedings of the Advances in Information Retrieval, 2019

Biomedical Entities Impact on Rating Prediction for Psychiatric Drugs.
Proceedings of the Analysis of Images, Social Networks and Texts, 2019

Wear the Right Head: Comparing Strategies for Encoding Sentences for Aspect Extraction.
Proceedings of the Analysis of Images, Social Networks and Texts, 2019

Deep Neural Models for Medical Concept Normalization in User-Generated Texts.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Detecting Adverse Drug Reactions from Biomedical Texts with Neural Networks.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

AspeRa: Aspect-Based Rating Prediction Based on User Reviews.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019

Entity-level Classification of Adverse Drug Reactions: a Comparison of Neural Network Models.
Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019, 2019

Report on the 12th Russian Summer School in Information Retrieval (RuSSIR 2018).
SIGIR Forum, 2018

Exploring convolutional neural networks and topic models for user profiling from drug reviews.
Multim. Tools Appl., 2018

Medical concept normalization in social media posts with recurrent neural networks.
J. Biomed. Informatics, 2018

Sequence Learning with RNNs for Medical Concept Normalization in User-Generated Texts.
CoRR, 2018

Demographic Prediction Based on User Reviews about Medications.
Computación y Sistemas, 2017

An Encoder-Decoder Model for ICD-10 Coding of Death Certificates.
CoRR, 2017

Deep Learning for ICD Coding: Looking for Medical Concepts in Clinical Documents in English and in French.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2017

KFU at CLEF eHealth 2017 Task 1: ICD-10 Coding of English Death Certificates with Recurrent Neural Networks.
Proceedings of the Working Notes of CLEF 2017, 2017

Automated Detection of Adverse Drug Reactions from Social Media Posts with Machine Learning.
Proceedings of the Analysis of Images, Social Networks and Texts, 2017

Mining Complaints to Improve a Product: a Study about Problem Phrase Extraction from User Reviews.
Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, 2016

A Sentiment-Aware Topic Model for Extracting Failures from Product Reviews.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Automated Prediction of Demographic Information from Medical User Reviews.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2016

Identifying Product Failures from Reviews in Noisy Data by Distant Supervision.
Proceedings of the Knowledge Engineering and Semantic Web - 7th International Conference, 2016

Constructing Aspect-Based Sentiment Lexicons with Topic Modeling.
Proceedings of the Analysis of Images, Social Networks and Texts, 2016

Sentiment classification of reviews and twitter posts based on dictionaries.
Russ. Digit. Libr. J., 2015

Dependency-Based Problem Phrase Extraction from User Reviews of Products.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Clustering-based Approach to Multiword Expression Extraction and Ranking.
Proceedings of the 11th Workshop on Multiword Expressions, 2015

Inferring Sentiment-Based Priors in Topic Models.
Proceedings of the Advances in Artificial Intelligence and Its Applications, 2015

Target-Based Topic Model for Problem Phrase Extraction.
Proceedings of the Advances in Information Retrieval, 2015

Clause-Based Approach to Extracting Problem Phrases from User Reviews of Products.
Proceedings of the Analysis of Images, Social Networks and Texts, 2014
