Tushar Khot

Affiliations:
  • AI2, Allen Institute for Artificial Intelligence, Seattle, US


According to our database1, Tushar Khot authored at least 92 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models.
CoRR, 2024

DISCOVERYWORLD: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents.
CoRR, 2024

Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

ADaPT: As-Needed Decomposition and Planning with Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

SynerGPT: In-Context Learning for Personalized Drug Synergy Prediction and Drug Design.
CoRR, 2023

Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance.
CoRR, 2023

Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback.
CoRR, 2023

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Specializing Smaller Language Models towards Multi-Step Reasoning.
Proceedings of the International Conference on Machine Learning, 2023

Decomposed Prompting: A Modular Approach for Solving Complex Tasks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Complexity-Based Prompting for Multi-step Reasoning.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
♫ MuSiQue: Multihop Questions via Single-hop Question Composition.
Trans. Assoc. Comput. Linguistics, 2022

Teaching Broad Reasoning Skills via Decomposition-Guided Contexts.
CoRR, 2022

Better Retrieval May Not Lead to Better Question Answering.
CoRR, 2022

Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Hey AI, Can You Solve Complex Tasks by Talking to Agents?
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies.
Trans. Assoc. Comput. Linguistics, 2021

Structure learning for relational logistic regression: an ensemble approach.
Data Min. Knowl. Discov., 2021

PROMPT WAYWARDNESS: The Curious Case of Discretized Interpretation of Continuous Prompts.
CoRR, 2021

Learning to Solve Complex Tasks by Talking to Agents.
CoRR, 2021

MuSiQue: Multi-hop Questions via Single-hop Question Composition.
CoRR, 2021

Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge.
CoRR, 2021

Temporal Reasoning on Implicit Events from Distant Supervision.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

GooAQ: Open Question Answering with Diverse Answer Types.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions?
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

ReadOnce Transformers: Reusable Representations of Text for Transformers.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
UnQovering Stereotyping Biases via Underspecified Questions.
CoRR, 2020

Measuring and Reducing Non-Multifact Reasoning in Multi-hop Question Answering.
CoRR, 2020

UnifiedQA: Crossing Format Boundaries With a Single QA System.
CoRR, 2020

Natural Perturbation for Robust Question Answering.
CoRR, 2020

From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project.
AI Mag., 2020

Is Multihop QA in DiRe Condition? Measuring and Reducing Disconnected Reasoning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

UNQOVERing Stereotypical Biases via Underspecified Questions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

UnifiedQA: Crossing Format Boundaries With a Single QA System.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

More Bang for Your Buck: Natural Perturbation for Robust Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

A Simple Yet Strong Pipeline for HotpotQA.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

IIRC: A Dataset of Incomplete Information Reading Comprehension Questions.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

QASC: A Dataset for Question Answering via Sentence Composition.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
On the Capabilities and Limitations of Reasoning for Natural Language Understanding.
CoRR, 2019

Repurposing Entailment for Multi-Hop Question Answering Tasks.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

What's Missing: A Knowledge Gap Guided Approach for Multi-hop Question Answering.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Exploiting Explicit Paths for Multi-hop Reading Comprehension.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Exploiting Explicit Paths for Multi-hop Reading Comprehension.
CoRR, 2018

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge.
CoRR, 2018

Can a Suit of Armor Conduct Electricity? A New Dataset for Open Book Question Answering.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Bridging Knowledge Gaps in Neural Entailment via Symbolic Models.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

AdvEntuRe: Adversarial Training for Textual Entailment with Knowledge-Guided Examples.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

SciTaiL: A Textual Entailment Dataset from Science Question Answering.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Question Answering as Global Reasoning Over Semantic Abstractions.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Markov logic networks for adverse drug event extraction from text.
Knowl. Inf. Syst., 2017

Relational Restricted Boltzmann Machines: A Probabilistic Logic Learning Approach.
Proceedings of the Inductive Logic Programming - 27th International Conference, 2017

Learning What is Essential in Questions.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Answering Complex Questions Using Open Information Extraction.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Scaling Lifted Probabilistic Inference and Learning Via Graph Databases.
Proceedings of the 2016 SIAM International Conference on Data Mining, 2016

Inductive Logic Programming Meets Relational Databases: Efficient Learning of Markov Logic Networks.
Proceedings of the Inductive Logic Programming - 26th International Conference, 2016

Question Answering via Integer Programming over Semi-Structured Knowledge.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Learning Continuous-Time Bayesian Networks in Relational Domains: A Non-Parametric Approach.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Combining Retrieval, Statistics, and Inference to Answer Elementary Science Questions.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Gradient-based boosting for statistical relational learning: the Markov logic network and missing data cases.
Mach. Learn., 2015

Markov Logic Networks for Natural Language Question Answering.
CoRR, 2015

Anomaly Detection in Text: The Value of Domain Knowledge.
Proceedings of the Twenty-Eighth International Florida Artificial Intelligence Research Society Conference, 2015

Exploring Markov Logic Networks for Question Answering.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Extracting Adverse Drug Events from Text Using Human Advice.
Proceedings of the Artificial Intelligence in Medicine, 2015

Learning Probabilistic Logic Models with Human Advice.
Proceedings of the 2015 AAAI Spring Symposia, 2015

Knowledge-Based Probabilistic Logic Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Boosted Statistical Relational Learners - From Benchmarks to Data-Driven Medicine
Springer Briefs in Computer Science, Springer, ISBN: 978-3-319-13644-8, 2014

Relational learning helps in three-way classification of Alzheimer patients from structural magnetic resonance images of the brain.
Int. J. Mach. Learn. Cybern., 2014

Statistical Relational Learning for Handwriting Recognition.
Proceedings of the Inductive Logic Programming - 24th International Conference, 2014

Effectively Creating Weakly Labeled Training Examples via Approximate Domain Knowledge.
Proceedings of the Inductive Logic Programming - 24th International Conference, 2014

Learning from Imbalanced Data in Relational Domains: A Soft Margin Approach.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

Classification from One Class of Examples for Relational Domains.
Proceedings of the Statistical Relational Artificial Intelligence, 2014

Relational One-Class Classification: A Non-Parametric Approach.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Bootstrapping Knowledge Base Acceleration.
Proceedings of The Twenty-Second Text REtrieval Conference, 2013

Accelerating Imitation Learning in Relational Domains via Transfer by Initialization.
Proceedings of the Inductive Logic Programming - 23rd International Conference, 2013

Using Commonsense Knowledge to Automatically Create (Noisy) Training Examples from Text.
Proceedings of the Statistical Relational Artificial Intelligence, 2013

2012
Gradient-based boosting for statistical relational learning: The relational dependency network case.
Mach. Learn., 2012

Accelarating Imitation Learning in Relational Domains via Transfer by Initialization.
Proceedings of the 2nd International Workshop on Statistical Relational AI (StaRAI-12), 2012

Learning Relational Structure for Temporal Relation Extraction.
Proceedings of the 2nd International Workshop on Statistical Relational AI (StaRAI-12), 2012

A Machine Learning Pipeline for Three-Way Classification of Alzheimer Patients from Structural Magnetic Resonance Images of the Brain.
Proceedings of the 11th International Conference on Machine Learning and Applications, 2012

2011
Learning Markov Logic Networks via Functional Gradient Boosting.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

2010
Exploiting Causal Independence in Markov Logic Networks: Combining Undirected and Directed Models.
Proceedings of the Statistical Relational Artificial Intelligence, 2010

2009
Some new directions in graph-based semi-supervised learning.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009


  Loading...