Arzucan Özgür

Orcid: 0000-0001-8376-1056

Affiliations:
  • Bogaziçi University, Department of Computer Engineering, Istanbul, Turkey
  • University of Michigan, Department of EECS, Ann Arbor, MI, USA


According to our database1, Arzucan Özgür authored at least 102 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Dealing with Data Scarcity in Spoken Question Answering.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Evaluating the Quality of a Corpus Annotation Scheme Using Pretrained Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Nested Named Entity Recognition using Multilayer BERT-based Model.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2024), 2024

Detecting Hate Speech in Turkish Print Media: A Corpus and A Hybrid Approach with Target-oriented Linguistic Knowledge.
Proceedings of the 7th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2024

Overview of the Hate Speech Detection in Turkish and Arabic Tweets (HSD-2Lang) Shared Task at CASE 2024.
Proceedings of the 7th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2024

2023
Improving the filtering of false positive single nucleotide variations by combining genomic features with quality metrics.
Bioinform., December, 2023

A Framework for Improving the Generalizability of Drug-Target Affinity Prediction Models.
J. Comput. Biol., November, 2023

A Computational Software for Training Robust Drug-Target Affinity Prediction Models: pydebiaseddta.
J. Comput. Biol., November, 2023

Editorial: Pattern recognition for healthcare analytics.
Frontiers Digit. Health, March, 2023

Uncertainty-Aware Personal Assistant for Making Personalized Privacy Decisions.
ACM Trans. Internet Techn., 2023

Can We Explain Privacy?
IEEE Internet Comput., 2023

Generative Language Models on Nucleotide Sequences of Human Genes.
CoRR, 2023

Evaluation of GPT and BERT-based models on identifying protein-protein interactions in biomedical text.
CoRR, 2023

SIU2023-NST - Hate Speech Detection Contest.
Proceedings of the 31st Signal Processing and Communications Applications Conference, 2023

A Dataset for Investigating the Impact of Context for Offensive Language Detection in Tweets.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

TULAP - An Accessible and Sustainable Platform for Turkish Natural Language Processing Resources.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. EACL 2023, 2023

Explain to Me: Towards Understanding Privacy Decisions.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
Cluster-based mention typing for named entity disambiguation.
Nat. Lang. Eng., 2022

Resources for Turkish dependency parsing: introducing the BOUN Treebank and the BoAT annotation tool.
Lang. Resour. Evaluation, 2022

Exploiting Pretrained Biochemical Language Models for Targeted Drug Design.
CoRR, 2022

A Self-aware Personal Assistant for Making Personalized Privacy Decisions.
CoRR, 2022

A Hybrid Deep Dependency Parsing Approach Enhanced With Rules and Morphology: A Case Study for Turkish.
IEEE Access, 2022

Improving Code-Switching Dependency Parsing with Semi-Supervised Auxiliary Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

A Framework for Automatic Generation of Spoken Question-Answering Data.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

A SHAP-based Active Learning Approach for Creating High-Quality Training Data.
Proceedings of the IEEE International Conference on Big Data, 2022

Enhancements to the BOUN Treebank Reflecting the Agglutinative Nature of Turkish.
Proceedings of the ALTNLP The International Conference and workshop on Agglutanative Language Technologies as a challenge of Natural Language Processing, 2022

A Dataset and BERT-based Models for Targeted Sentiment Analysis on Turkish Texts.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

2021
DebiasedDTA: Model Debiasing to Boost Drug-Target Affinity Prediction.
CoRR, 2021

Sentiment Analysis of Customer Comments in Banking using BERT-based Approaches.
Proceedings of the 29th Signal Processing and Communications Applications Conference, 2021

Twitter Dataset and Evaluation of Transformers for Turkish Sentiment Analysis.
Proceedings of the 29th Signal Processing and Communications Applications Conference, 2021

BOUN at SemEval-2021 Task 9: Text Augmentation Techniques for Fact Verification in Tabular Data.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

PIDNA at BioASQ MESINESP: Hybrid Semantic Indexing for Biomedical Articles in Spanish.
Proceedings of the Working Notes of CLEF 2021 - Conference and Labs of the Evaluation Forum, Bucharest, Romania, September 21st - to, 2021

2020
A Hybrid Approach to Dependency Parsing: Combining Rules and Morphology with Deep Learning.
CoRR, 2020

Exploring Chemical Space using Natural Language Processing Methodologies for Drug Discovery.
CoRR, 2020

The RELX Dataset and Matching the Multilingual Blanks for Cross-lingual Relation Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Vapur: A Search Engine to Find Related Protein - Compound Pairs in COVID-19 Literature.
Proceedings of the 1st Workshop on NLP for COVID-19@ EMNLP 2020, Online, December 2020, 2020


BOUN-REX at CLEF-2020 ChEMU Task 2: Evaluating Pretrained Transformers for Event Extraction.
Proceedings of the Working Notes of CLEF 2020, 2020

2019
WideDTA: prediction of drug-target binding affinity.
CoRR, 2019

Machine learning-based identification and rule-based normalization of adverse drug reactions in drug labels.
BMC Bioinform., 2019

Statistical representation models for mutation information within genomic data.
BMC Bioinform., 2019

Linking entities through an ontology using word embeddings and syntactic re-ranking.
BMC Bioinform., 2019

Overview of the BioCreative VI Precision Medicine Track: mining protein interactions and mutations for precision medicine.
Database J. Biol. Databases Curation, 2019

Identifying Image Related Sentences in News Articles.
Proceedings of the 27th Signal Processing and Communications Applications Conference, 2019

Turkish Tweet Classification with Transformer Encoder.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

BOUN-ISIK Participation: An Unsupervised Approach for the Named Entity Normalization and Relation Extraction of Bacteria Biotopes.
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019

Turkish Treebanking: Unifying and Constructing Efforts.
Proceedings of the 13th Linguistic Annotation Workshop, 2019

2018
Segmenting hashtags and analyzing their grammatical structure.
J. Assoc. Inf. Sci. Technol., 2018

A chemical language based approach for protein - ligand interaction prediction.
CoRR, 2018

Ontology-based literature mining and class effect analysis of adverse drug reactions associated with neuropathy-inducing drugs.
J. Biomed. Semant., 2018

DeepDTA: deep drug-target binding affinity prediction.
Bioinform., 2018

A novel methodology on distributed representations of proteins using their interacting ligands.
Bioinform., 2018

A Morphology-Based Representation Model for LSTM-Based Dependency Parsing of Agglutinative Languages.
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018

2017
Ontology-based literature mining of E. coli vaccine-associated gene interaction networks.
J. Biomed. Semant., 2017

BIOSSES: a semantic sentence similarity estimation system for the biomedical domain.
Bioinform., 2017

Automatic query generation using word embeddings for retrieving passages describing experimental methods.
Database J. Biol. Databases Curation, 2017

Extracting Adverse Drug Reactions using Deep Learning and Dictionary Based Approaches.
Proceedings of the 2017 Text Analysis Conference, 2017

Description of the BOUN System for the Trilingual Entity Detection and Linking Tasks at TAC KBP 2017.
Proceedings of the 2017 Text Analysis Conference, 2017

Political opinion/sentiment prediction via long short term memory recurrent neural networks on Twitter.
Proceedings of the 25th Signal Processing and Communications Applications Conference, 2017

Turkish tweet sentiment analysis with word embedding and machine learning.
Proceedings of the 25th Signal Processing and Communications Applications Conference, 2017

BUSEM at SemEval-2017 Task 4A Sentiment Analysis with Word Embedding and Long Short Term Memory RNN Approaches.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

2016
Automated Neuroanatomical Relation Extraction: A Linguistically Motivated Approach with a PVT Connectivity Graph Case Study.
Frontiers Neuroinformatics, 2016

A comparative study of SMILES-based compound similarity functions for drug-target interaction prediction.
BMC Bioinform., 2016

BioCreative V BioC track overview: collaborative biocurator assistant task for BioGRID.
Database J. Biol. Databases Curation, 2016

The Interaction Network Ontology-supported modeling and mining of complex interactions represented with multiple keywords in biomedical literature.
BioData Min., 2016

Sentence Similarity based on Dependency Tree Kernels for Multi-document Summarization.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Named Entity Recognition on Twitter for Turkish using Semi-supervised Learning with Word Embeddings.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Segmenting Hashtags using Automatically Created Training Data.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Towards Building a Political Protest Database to Explain Changes in the Welfare State.
Proceedings of the 10th SIGHUM Workshop on Language Technology for Cultural Heritage, 2016

Ignet: A Centrality and INO-based Web System for Analyzing and Visualizing Literature-mined Networks.
Proceedings of the Joint International Conference on Biological Ontology and BioCreative, 2016

Ontology-Based Categorization of Bacteria and Habitat Entities using Information Retrieval Techniques.
Proceedings of the 4th BioNLP Shared Task Workshop, BioNLP 2016, 2016

2015
Detection and categorization of bacteria habitats using shallow linguistic analysis.
BMC Bioinform., December, 2015

Development and application of an interaction network ontology for literature mining of vaccine-associated gene-gene interactions.
J. Biomed. Semant., 2015

GLASS: a comprehensive database for experimentally validated GPCR-ligand associations.
Bioinform., 2015

Extension of the Interaction Network Ontology for Literature Mining of Gene-gene Interaction Networks from Sentences with Multiple Interaction Keywords.
Proceedings of International Workshop on Biomedical Data Mining, 2015

Question Analysis for a Closed Domain Question Answering System.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2015

2014
Rule-based focus extraction in Turkish question answering systems.
Proceedings of the 2014 22nd Signal Processing and Communications Applications Conference (SIU), 2014

Self-training a Constituency Parser using n-gram Trees.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Improving Named Entity Recognition for Morphologically Rich Languages Using Word Embeddings.
Proceedings of the 13th International Conference on Machine Learning and Applications, 2014

A Graph-based Approach for Contextual Text Normalization.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Analyzing Stemming Approaches for Turkish Multi-Document Summarization.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Expanding machine translation training data with an out-of-domain corpus using language modeling based vocabulary saturation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

2013
N-gram Parsing for Jointly Training a Discriminative Constituency Parser.
Polibits, 2013

PHISTO: pathogen-host interaction search tool.
Bioinform., 2013

BOUNCE: Sentiment Classification in Twitter using Rich Feature Sets.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Word Polarity Detection Using a Multilingual Approach.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

Bacteria Biotope Detection, Ontology-based Normalization, and Relation Extraction using Syntactic Rules.
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013

2012
Identification of fever and vaccine-associated gene interaction networks using ontology-based literature mining.
J. Biomed. Semant., 2012

2011
U-Compare bio-event meta-service: compatible BioNLP event extraction services.
BMC Bioinform., 2011

Mining of vaccine-associated IFN-γ gene interaction networks using the Vaccine Ontology.
J. Biomed. Semant., 2011

2010
Text and Network Mining for Literature-Based Scientific Discovery in Biomedicine.
PhD thesis, 2010

Citation Summarization Through Keyphrase Extraction.
Proceedings of the COLING 2010, 2010

2009
Michigan molecular interactions r2: from interacting proteins to pathways.
Nucleic Acids Res., 2009

Detecting Speculations and their Scopes in Scientific Text.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Supervised Classification for Extracting Biomedical Events.
Proceedings of the BioNLP 2009 Workshop Companion Volume for Shared Task, BioNLP@HLT-NAACL 2009, 2009

2008
Identifying gene-disease associations using centrality on a literature mined gene-interaction network.
Proceedings of the Proceedings 16th International Conference on Intelligent Systems for Molecular Biology (ISMB), 2008

2007
Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing.
Proceedings of the EMNLP-CoNLL 2007, 2007

2006
Efficient indexing technique for XML-based electronic product catalogs.
Electron. Commer. Res. Appl., 2006

Classification of Skewed and Homogenous Document Corpora with Class-Based and Corpus-Based Keywords.
Proceedings of the KI 2006: Advances in Artificial Intelligence, 2006

2005
Text Categorization with Class-Based and Corpus-Based Keyword Selection.
Proceedings of the Computer and Information Sciences, 2005

2004
Social Network of Co-occurrence in News Articles.
Proceedings of the Computer and Information Sciences, 2004


  Loading...