Steffen Eger

Orcid: 0000-0003-4663-8336

According to our database, Steffen Eger authored at least 116 papers between 2010 and 2024.

Bibliography

2024
Evaluating Large Language Models for Structured Science Summarization in the Open Research Knowledge Graph.
Inf., June, 2024

Towards Explainable Evaluation Metrics for Machine Translation.
J. Mach. Learn. Res., 2024

How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs.
CoRR, 2024

LLM-based multi-agent poetry generation in non-cooperative environments.
CoRR, 2024

DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ.
CoRR, 2024

Syntactic Language Change in English and German: Metrics, Parsers, and Convergences.
CoRR, 2024

Is there really a Citation Age Bias in NLP?
CoRR, 2024

AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Evaluating Diversity in Automatic Poetry Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

BMX: Boosting Natural Language Generation Metrics with Explainability.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Dependencies over Times and Tools (DoTT).
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
MENLI: Robust Evaluation Metrics from Natural Language Inference.
Trans. Assoc. Comput. Linguistics, 2023

NLLG Quarterly arXiv Report 09/23: What are the most influential current AI Papers?
CoRR, 2023

NLLG Quarterly arXiv Report 06/23: What are the most influential current AI Papers?
CoRR, 2023

Cross-lingual Cross-temporal Summarization: Dataset, Models, Evaluation.
CoRR, 2023

Cross-Genre Argument Mining: Can Language Models Automatically Fill in Missing Discourse Markers?
CoRR, 2023

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP.
CoRR, 2023

ChatGPT: A Meta-Analysis after 2.5 Months.
CoRR, 2023

Semantically-Informed Regressive Encoder Score.
Proceedings of the Eighth Conference on Machine Translation, 2023

The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics.
Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems, 2023

Team NLLG submission for Eval4NLP 2023 Shared Task: Retrieval-Augmented In-Context Learning for NLG Evaluation.
Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems, 2023

Transformers Go for the LOLs: Generating (Humourous) Titles from Scientific Abstracts End-to-End.
Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems, 2023

EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

UScore: An Effective Approach to Fully Unsupervised Evaluation Metrics for Machine Translation.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Trade-Offs Between Fairness and Privacy in Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Measuring Social Solidarity During Crisis: The Role of Design Choices.
J. Soc. Comput., 2022

BMX: Boosting Machine Translation Metrics with Explainability.
CoRR, 2022

FairGer: Using NLP to Measure Support for Women and Migrants in 155 Years of German Parliamentary Debates.
CoRR, 2022

Can we do that simpler? Simple, Efficient, High-Quality Evaluation Metrics for NLG.
CoRR, 2022

Towards Explainable Evaluation Metrics for Natural Language Generation.
CoRR, 2022

Detecting Stance in Scientific Papers: Did we get more Negative Recently?
CoRR, 2022

Findings of the WMT 2022 Shared Task on Quality Estimation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Reproducibility Issues for BERT-based Evaluation Metrics.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Layer or Representation Space: What Makes BERT-based Evaluation Metrics Robust?
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Constrained Density Matching and Modeling for Cross-lingual Alignment of Contextualized Representations.
Proceedings of the Asian Conference on Machine Learning, 2022

2021
Graph routing between capsules.
Neural Networks, 2021

TUDa at WMT21: Sentence-Level Direct Assessment with Adapters.
Proceedings of the Sixth Conference on Machine Translation, 2021

Inducing Language-Agnostic Multilingual Representations.
Proceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics, 2021

Diachronic Analysis of German Parliamentary Proceedings: Ideological Shifts through the Lens of Political Biases.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2021

TUDA-Reproducibility @ ReproGen: Replicability of Human Evaluation of Text-to-Text and Concept-to-Text Generation.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

The Eval4NLP Shared Task on Explainable Quality Estimation: Overview and Results.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Better than Average: Paired Evaluation of NLP systems.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
DBPal: A Fully Pluggable NL2SQL Training Pipeline.
Proceedings of the 2020 International Conference on Management of Data, 2020

CMCE at SemEval-2020 Task 1: Clustering on Manifolds of Contextualized Embeddings to Detect Historical Meaning Shifts.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

PO-EMO: Conceptualization, Annotation, and Modeling of Aesthetic Emotions in German and English Poetry.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

How to Probe Sentence Embeddings in Low-Resource Languages: On Structural Design Choices for Probing Task Evaluation.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Probing Multilingual BERT for Genetic and Typological Signals.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Vec2Sent: Probing Sentence Embeddings with Natural Language Generation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Practitioner's view: A comparison and a survey of lemmatization and morphological tagging in German and Latin.
J. Lang. Model., 2019

DBPal: Weak Supervision for Learning a Natural Language Interface to Databases.
CoRR, 2019

Predicting Research Trends From Arxiv.
CoRR, 2019

Pitfalls in the Evaluation of Sentence Embeddings.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Does My Rebuttal Matter? Insights from a Major NLP Conference.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Towards Scalable and Reliable Capsule Networks for Challenging NLP Applications.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Semantic Change and Emerging Tropes In a Large Corpus of New High German Poetry.
Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

2018
Concatenated p-mean Word Embeddings as Universal Cross-Lingual Sentence Representations.
CoRR, 2018

ArgumenText: Searching for Arguments in Heterogeneous Sources.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Multi-Task Learning for Argumentation Mining in Low-Resource Settings.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

One Size Fits All? A simple LSTM for non-literal token and construction-level classification.
Proceedings of the Second Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, 2018

Is it Time to Swish? Comparing Deep Learning Activation Functions Across NLP tasks.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Cross-lingual Argumentation Mining: Machine Translation (and a bit of Projection) is All You Need!
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Killing Four Birds with Two Stones: Multi-Task Learning for Non-Literal Language Detection.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

PD3: Better Low-Resource Cross-Lingual Transfer By Combining Direct Transfer and Annotation Projection.
Proceedings of the 5th Workshop on Argument Mining, 2018

2017
The Combinatorics of Weighted Vector Compositions.
CoRR, 2017

EELECTION at SemEval-2017 Task 10: Ensemble of nEural Learners for kEyphrase ClassificaTION.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

How Many Stemmata with Root Degree k?
Proceedings of the 15th Meeting on the Mathematics of Language, 2017

What is the Essence of a Claim? Cross-Domain Claim Identification.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Neural End-to-End Learning for Computational Argumentation Mining.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
A Comparison of Four Character-Level String-to-String Translation Models for (OCR) Spelling Error Correction.
Prague Bull. Math. Linguistics, 2016

Opinion dynamics and wisdom under out-group discrimination.
Math. Soc. Sci., 2016

On the Number of Many-to-Many Alignments of Multiple Sequences.
J. Autom. Lang. Comb., 2016

Wikidition: Automatic lexiconization and linkification of text corpora.
it Inf. Technol., 2016

Language classification from bilingual word embedding graphs.
CoRR, 2016

Lemmatization and Morphological Tagging in German and Latin: A Comparison and a Survey of the State-of-the-art.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Still not there? Comparing Traditional Sequence-to-Sequence Models to Encoder-Decoder Neural Networks on Monotone String Translation Tasks.
Proceedings of the COLING 2016, 2016

Language classification from bilingual word embedding graphs.
Proceedings of the COLING 2016, 2016

On the Linearity of Semantic Change: Investigating Meaning Variation via Dynamic Graph Models.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Some Elementary Congruences for the Number of Weighted Integer Compositions.
J. Integer Seq., 2015

On the Number of Many-to-Many Alignments of N Sequences.
CoRR, 2015

Towards Semantic Language Classification: Inducing and Clustering Semantic Association Networks from Europarl.
Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics, 2015

Designing and Comparing G2P-Type Lemmatizers for a Morphology-Rich Language.
Proceedings of the Systems and Frameworks for Computational Morphology, 2015

Lexicon-assisted tagging and lemmatization in Latin: A comparison of six taggers and two lemmatization models.
Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, 2015

Deriving a Primal Form for the Quadratic Power Kernel.
Proceedings of the KI 2015: Advances in Artificial Intelligence, 2015

Improving G2p from wiktionary and other (web) resources.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Complex Decomposition of the Negative Distance Kernel.
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

Do we need bigram alignment models? On the effect of alignment quality on transduction accuracy in G2P.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Multiple Many-to-Many Sequence Alignment for Combining String-Valued Variables: A G2P Experiment.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Stirling's Approximation for Central Extended Binomial Coefficients.
Am. Math. Mon., 2014

A Proof of the Mann-Shanks Primality Criterion Conjecture for Extended Binomial Coefficients.
Integers, 2014

Corrections to the results derived in "A Unified Approach to Algorithms Generating Unrestricted and Restricted Integer Compositions and Integer Partitions"; and a comparison of four restricted integer composition generation algorithms.
CoRR, 2014

2013
Sequence Segmentation by Enumeration: An Exploration.
Prague Bull. Math. Linguistics, 2013

A Contribution to the Theory of Word Length Distribution Based on a Stochastic Word Length Distribution Model.
J. Quant. Linguistics, 2013

Sequence alignment with arbitrary steps and further generalizations, with applications to alignments in linguistics.
Inf. Sci., 2013

(Failure of the) Wisdom of the crowds in an endogenous opinion dynamics model with multiply biased agents.
CoRR, 2013

Opinion dynamics under opposition.
CoRR, 2013

2012
The Combinatorics of String Alignments: Reconsidering the Problem.
J. Quant. Linguistics, 2012

Stirling's approximation for central polynomial coefficients.
CoRR, 2012

Asymptotic normality of integer compositions inside a rectangle.
CoRR, 2012

Lexical semantic typologies from bilingual corpora - A framework.
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012

S-restricted monotone alignments.
Proceedings of the 11th Conference on Natural Language Processing, 2012

S-Restricted Monotone Alignments: Algorithm, Search Space, and Applications.
Proceedings of the COLING 2012, 2012

2010
Investigating lexical competition - An Empirical Case Study of the German Spelling Reform of 1996/2004/2006.
J. Lang. Technol. Comput. Linguistics, 2010

An Ensemble of Classifiers Methodology for Stemming in Inflectional Languages: Using the Example of Latvian.
Proceedings of the Human Language Technologies - The Baltic Perspective, 2010

