Oyvind Tafjord

According to our database¹, Oyvind Tafjord authored at least 55 papers between 2016 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

[Chart: publications per year, 2016 to 2025, broken down by type: book, in proceedings, article, PhD thesis, dataset, other.]

Bibliography

2025

2 OLMo 2 Furious.

CoRR, January, 2025

2024

From Models to Microtheories: Distilling a Model's Topical Knowledge for Grounded Question Answering.

Co-authors include: Bhavana Dalvi Mishra, Alexander Sabol, Benjamin Van Durme

CoRR, 2024

Establishing Task Scaling Laws via Compute-Efficient Model Ladders.

Co-authors include: Alexander Wettig, Ananya Harsh Jha, Dirk Groeneveld, Hannaneh Hajishirzi

CoRR, 2024

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training.

CoRR, 2024

SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs.

CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.

CoRR, 2024

Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions.

Co-authors include: Sarah Wiegreffe, Yonatan Belinkov, Hannaneh Hajishirzi, Ashish Sabharwal

CoRR, 2024

OLMES: A Standard for Language Model Evaluations.

Co-authors include: Hannaneh Hajishirzi

CoRR, 2024

OLMo: Accelerating the Science of Language Models.

CoRR, 2024

Paloma: A Benchmark for Evaluating Language Model Fit.

Co-authors include: Valentin Hofmann, Ananya Harsh Jha, Evan Pete Walsh, Dirk Groeneveld, Hanna Hajishirzi, Kyle Richardson

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DiscoveryWorld: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents.

Co-authors include: Peter A. Jansen, Marc-Alexandre Côté, Bhavana Dalvi Mishra, Bodhisattwa Prasad Majumder

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic.

Co-authors include: Zhengping Jiang, Bhavana Dalvi Mishra, Peter A. Jansen, Benjamin Van Durme

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Digital Socrates: Evaluating LLMs through Explanation Critiques.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

OLMo: Accelerating the Science of Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Faithful Reasoning over Scientific Claims.

Co-authors include: Neset Özkan Tan, Michael Witbrock

Proceedings of the AAAI 2024 Spring Symposium Series, 2024

2023

Catwalk: A Unified Language Model Evaluation Framework for Many Datasets.

Co-authors include: Dirk Groeneveld, Kyle Richardson

CoRR, 2023

BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability.

Co-authors include: Bhavana Dalvi Mishra

CoRR, 2023

CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization.

Co-authors include: Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Peter A. Jansen, Chris Callison-Burch

CoRR, 2023

Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy.

Co-authors include: Sarah Wiegreffe, Matthew Finlayson, Ashish Sabharwal

CoRR, 2023

Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy.

Co-authors include: Sarah Wiegreffe, Matthew Finlayson, Ashish Sabharwal

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Language Models with Rationality.

Co-authors include: Ashish Sabharwal, Kyle Richardson, Hinrich Schütze

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022

Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering.

CoRR, 2022

Towards Teachable Reasoning Systems.

CoRR, 2022

Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning.

Co-authors include: Bhavana Dalvi Mishra

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Teachable Reasoning Systems: Using a Dynamic Memory of User Feedback for Continual System Improvement.

Co-authors include: Bhavana Dalvi Mishra

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LILA: A Unified Benchmark for Mathematical Reasoning.

Co-authors include: Matthew Finlayson, Tanmay Rajpurohit, Ashish Sabharwal

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

General-Purpose Question-Answering with Macaw.

CoRR, 2021

Enriching a Model's Notion of Belief using a Persistent Memory.

Co-authors include: Hinrich Schütze

CoRR, 2021

Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge.

Co-authors include: Sumithra Bhakthavatsalam, Daniel Khashabi, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick

CoRR, 2021

BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief.

Co-authors include: Hinrich Schütze

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Explaining Answers with Entailment Trees.

Co-authors include: Leighanna Pipatanangkura

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding.

Co-authors include: Mrinmaya Sachan, Snigdha Chaturvedi

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge.

Co-authors include: Jonathan Berant

CoRR, 2020

UnifiedQA: Crossing Format Boundaries With a Single QA System.

Co-authors include: Daniel Khashabi, Ashish Sabharwal, Hannaneh Hajishirzi

CoRR, 2020

From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project.

Co-authors include: Daniel Khashabi, Bhavana Dalvi Mishra, Kyle Richardson, Ashish Sabharwal, Carissa Schoenick, Sumithra Bhakthavatsalam, Dirk Groeneveld, Michal Guerquin, Michael Schmitz

AI Mag., 2020

Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge.

Co-authors include: Jonathan Berant

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Multi-class Hierarchical Question Classification for Multiple Choice Science Exams.

Co-authors include: Peter A. Jansen, Harish Tayyar Madabushi

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Transformers as Soft Reasoners over Language.

Co-authors include: Kyle Richardson

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

"You are grounded!": Latent Name Artifacts in Pre-trained Language Models.

Co-authors include: Rachel Rudinger

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

UnifiedQA: Crossing Format Boundaries With a Single QA System.

Co-authors include: Daniel Khashabi, Ashish Sabharwal, Hannaneh Hajishirzi

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

SUPP.AI: finding evidence for supplement-drug interactions.

Co-authors include: Carissa Schoenick

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019

Extracting evidence of supplement-drug interactions from literature.

Co-authors include: Carissa Schoenick

CoRR, 2019

QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Reasoning Over Paragraph Effects in Situations.

Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

QUAREL: A Dataset and Models for Answering Questions about Qualitative Relationships.

Co-authors include: Ashish Sabharwal

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Declarative Question Answering over Knowledge Bases Containing Natural Language Text with Answer Set Programming.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

AllenNLP: A Deep Semantic Natural Language Processing Platform.

Co-authors include: Matthew E. Peters, Michael Schmitz, Luke Zettlemoyer

CoRR, 2018

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge.

Co-authors include: Ashish Sabharwal, Carissa Schoenick

CoRR, 2018

2017

Moving beyond the Turing Test with the Allen AI Science Challenge.

Co-authors include: Carissa Schoenick, Peter D. Turney

Commun. ACM, 2017

2016

Semantic Parsing to Probabilistic Programs for Situated Question Answering.

Co-authors include: Jayant Krishnamurthy

CoRR, 2016

Semantic Parsing to Probabilistic Programs for Situated Question Answering.

Co-authors include: Jayant Krishnamurthy, Aniruddha Kembhavi

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Combining Retrieval, Statistics, and Inference to Answer Elementary Science Questions.

Co-authors include: Ashish Sabharwal, Peter D. Turney, Daniel Khashabi

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016