Sebastian Gehrmann

Orcid: 0000-0002-8257-9516

According to our database1, Sebastian Gehrmann authored at least 66 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs).
CoRR, 2024

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning.
CoRR, 2024

On the Role of Summary Content Units in Text Summarization Evaluation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Academics Can Contribute to Domain-Specialized Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024


2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

PaLM: Scaling Language Modeling with Pathways.
J. Mach. Learn. Res., 2023

Diagnosing AI Explanation Methods with Folk Concepts of Behavior.
J. Artif. Intell. Res., 2023

Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text.
J. Artif. Intell. Res., 2023

PaLM 2 Technical Report.
CoRR, 2023

BloombergGPT: A Large Language Model for Finance.
CoRR, 2023

TaTA: A Multilingual Table-to-Text Dataset for African Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Dialect-robust Evaluation of Generated Text.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Benchmarking Large Language Model Capabilities for Conditional Generation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization.
CoRR, 2022

Towards Computationally Verifiable Semantic Grounding for Language Models.
CoRR, 2022

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code.
CoRR, 2022

Intriguing Properties of Compression on Multilingual Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Framework for Automated Text Generation Benchmarking.
Proceedings of the Workshop on Scientific Document Understanding co-located with 36th AAAI Conference on Artificial Inteligence, 2022

2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2021

SynthBio: A Case Study in Human-AI Collaborative Curation of Text Datasets.
CoRR, 2021

Reusable Templates and Guides For Documenting Datasets and Models for Natural Language Processing and Generation: A Case Study of the HuggingFace and GEM Data and Model Cards.
CoRR, 2021

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics.
CoRR, 2021

SynthBio: A Case Study in Faster Curation of Text Datasets.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Automatic Construction of Evaluation Suites for Natural Language Generation Datasets.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

LMdiff: A Visual Diff Tool to Compare Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

Learning Compact Metrics for MT.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Visual Interaction with Deep Learning Models through Collaborative Semantic Inference.
IEEE Trans. Vis. Comput. Graph., 2020

Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics.
CoRR, 2020

Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias.
CoRR, 2020

Evaluating an automated mediator for joint narratives in a conflict situation.
Behav. Inf. Technol., 2020

Learning to Evaluate Translation Beyond English: BLEURT Submissions to the WMT Metrics 2020 Shared Task.
Proceedings of the Fifth Conference on Machine Translation, 2020

Investigating Gender Bias in Language Models Using Causal Mediation Analysis.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Corpus for Detecting High-Context Medical Conditions in Intensive Care Patient Notes Focusing on Frequently Readmitted Patients.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2020

ToTTo: A Controlled Table-To-Text Generation Dataset.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformer Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

Interpretability and Analysis in Neural NLP.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

2019
Seq2seq-Vis: A Visual Debugging Tool for Sequence-to-Sequence Models.
IEEE Trans. Vis. Comput. Graph., 2019

Memory-Augmented Recurrent Neural Networks Can Learn Generalized Dyck Languages.
CoRR, 2019

exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models.
CoRR, 2019

Encoder-Agnostic Adaptation for Conditional Language Generation.
CoRR, 2019

LSTM Networks Can Perform Dynamic Counting.
CoRR, 2019

Improving Human Text Comprehension through Semi-Markov CRF-based Neural Section Title Generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Margin Call: an Accessible Web-based Text Viewer with Generated Paragraph Summaries in the Margin.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

Generating Abstractive Summaries with Finetuned Language Models.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

Interactive Visual Exploration of Latent Space (IVELS) for peptide auto-encoder model selection.
Proceedings of the Deep Generative Models for Highly Structured Data, 2019

Identifying documented medical non-adherence from clinical notes using natural language processing.
Proceedings of the AMIA 2019, 2019

GLTR: Statistical Detection and Visualization of Generated Text.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
LSTMVis: A Tool for Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks.
IEEE Trans. Vis. Comput. Graph., 2018

Behind the scenes: A medical natural language processing project.
Int. J. Medical Informatics, 2018

End-to-End Content and Plan Selection for Data-to-Text Generation.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

E2E NLG Challenge Submission: Towards Controllable Generation of Diverse Natural Language.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

Debugging Sequence-to-Sequence Models with Seq2Seq-Vis.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Bottom-Up Abstractive Summarization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Comparing Rule-Based and Deep Learning Models for Patient Phenotyping.
CoRR, 2017

2016
Visual Analysis of Hidden State Dynamics in Recurrent Neural Networks.
CoRR, 2016

2015
Deploying AI Methods to Support Collaborative Writing: a Preliminary Investigation.
Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, 2015

2004
A detailed 3D model of the human hand.
J. Vis., 2004

2003
3-D-Atlas des kardiovaskulären Systems des Menschen auf der Basis des Visible Human.
PhD thesis, 2003


  Loading...