Esin Durmus

Orcid: 0009-0009-7331-8160

According to our database1, Esin Durmus authored at least 45 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Benchmarking Large Language Models for News Summarization.
Trans. Assoc. Comput. Linguistics, 2024

Sabotage Evaluations for Frontier Models.
CoRR, 2024

How will advanced AI systems impact democracy?
CoRR, 2024

NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Towards Understanding Sycophancy in Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Collective Constitutional AI: Aligning a Language Model with Public Input.
Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024

2023
Holistic Evaluation of Language Models.
Trans. Mach. Learn. Res., 2023

Evaluating Human-Language Model Interaction.
Trans. Mach. Learn. Res., 2023

Evaluating and Mitigating Discrimination in Language Model Decisions.
CoRR, 2023

Specific versus General Principles for Constitutional AI.
CoRR, 2023

Towards Understanding Sycophancy in Language Models.
CoRR, 2023

Studying Large Language Model Generalization with Influence Functions.
CoRR, 2023

Measuring Faithfulness in Chain-of-Thought Reasoning.
CoRR, 2023

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning.
CoRR, 2023

Towards Measuring the Representation of Subjective Global Opinions in Language Models.
CoRR, 2023

Opportunities and Risks of LLMs for Scalable Deliberation with Polis.
CoRR, 2023

Whose Opinions Do Language Models Reflect?
Proceedings of the International Conference on Machine Learning, 2023

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale.
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023

When Do Pre-Training Biases Propagate to Downstream Tasks? A Case Study in Text Summarization.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Towards Reference-free Text Simplification Evaluation with a BERT Siamese Network Architecture.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Contrastive Error Attribution for Finetuned Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Tracing and Removing Data Errors in Natural Language Generation Datasets.
CoRR, 2022

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code.
CoRR, 2022

Language modeling via stochastic processes.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving Faithfulness by Augmenting Negative Summaries from Fake Documents.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Faithful or Extractive? On Mitigating the Faithfulness-Abstractiveness Trade-off in Abstractive Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Spurious Correlations in Reference-Free Evaluation of Text Generation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Towards Understanding Persuasion in Computational Argumentation.
PhD thesis, 2021

Towards Understanding Persuasion in Computational Argumentation.
CoRR, 2021

On the Opportunities and Risks of Foundation Models.
CoRR, 2021

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.
CoRR, 2021

Leveraging Topic Relatedness for Argument Persuasion.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization.
CoRR, 2020

Exploring the Role of Argument Structure in Online Debate Persuasion.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

WikiLingua: A New Benchmark Dataset for Multilingual Abstractive Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Modeling the Factors of User Success in Online Debate.
Proceedings of the World Wide Web Conference, 2019

The Role of Pragmatic and Discourse Context in Determining Argument Impact.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Persuasion of the Undecided: Language vs. the Listener.
Proceedings of the 6th Workshop on Argument Mining, ArgMining@ACL 2019, Florence, Italy, 2019

Determining Relative Argument Specificity and Stance for Complex Argumentative Structures.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A Corpus for Modeling User and Language Effects in Argumentation on Online Debating.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Exploring the Role of Prior Beliefs for Argument Persuasion.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Understanding the Effect of Gender and Stance in Opinion Expression in Debates on "Abortion".
Proceedings of the Second Workshop on Computational Modeling of People's Opinions, 2018

2016
Cornell Belief and Sentiment System at TAC 2016.
Proceedings of the 2016 Text Analysis Conference, 2016


  Loading...