Oyvind Tafjord

According to our database1, Oyvind Tafjord authored at least 51 papers between 2016 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs.
CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.
CoRR, 2024

Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions.
CoRR, 2024

OLMES: A Standard for Language Model Evaluations.
CoRR, 2024

DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024


Digital Socrates: Evaluating LLMs through Explanation Critiques.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


Faithful Reasoning over Scientific Claims.
Proceedings of the AAAI 2024 Spring Symposium Series, 2024

2023
Paloma: A Benchmark for Evaluating Language Model Fit.
CoRR, 2023

Catwalk: A Unified Language Model Evaluation Framework for Many Datasets.
CoRR, 2023

BaRDa: A Belief and Reasoning Dataset that Separates Factual Accuracy and Reasoning Ability.
CoRR, 2023

CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization.
CoRR, 2023

Attentiveness to Answer Choices Doesn't Always Entail High QA Accuracy.
CoRR, 2023

Increasing Probability Mass on Answer Choices Does Not Always Improve Accuracy.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Language Models with Rationality.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering.
CoRR, 2022

Towards Teachable Reasoning Systems.
CoRR, 2022

Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Entailer: Answering Questions with Faithful and Truthful Chains of Reasoning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Towards Teachable Reasoning Systems: Using a Dynamic Memory of User Feedback for Continual System Improvement.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

LILA: A Unified Benchmark for Mathematical Reasoning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
General-Purpose Question-Answering with Macaw.
CoRR, 2021

Enriching a Model's Notion of Belief using a Persistent Memory.
CoRR, 2021

Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge.
CoRR, 2021

BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Explaining Answers with Entailment Trees.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

"Let Your Characters Tell Their Story": A Dataset for Character-Centric Narrative Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge.
CoRR, 2020

UnifiedQA: Crossing Format Boundaries With a Single QA System.
CoRR, 2020

From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project.
AI Mag., 2020

Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Multi-class Hierarchical Question Classification for Multiple Choice Science Exams.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Transformers as Soft Reasoners over Language.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

"You are grounded!": Latent Name Artifacts in Pre-trained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

UnifiedQA: Crossing Format Boundaries With a Single QA System.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

SUPP.AI: finding evidence for supplement-drug interactions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 2020

2019
Extracting evidence of supplement-drug interactions from literature.
CoRR, 2019

QuaRTz: An Open-Domain Dataset of Qualitative Relationship Questions.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Reasoning Over Paragraph Effects in Situations.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

QUAREL: A Dataset and Models for Answering Questions about Qualitative Relationships.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Declarative Question Answering over Knowledge Bases Containing Natural Language Text with Answer Set Programming.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
AllenNLP: A Deep Semantic Natural Language Processing Platform.
CoRR, 2018

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge.
CoRR, 2018

2017
Moving beyond the Turing Test with the Allen AI Science Challenge.
Commun. ACM, 2017

2016
Semantic Parsing to Probabilistic Programs for Situated Question Answering.
CoRR, 2016

Semantic Parsing to Probabilistic Programs for Situated Question Answering.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Combining Retrieval, Statistics, and Inference to Answer Elementary Science Questions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016


  Loading...