Doug Downey

Orcid: 0000-0002-4737-8444

Affiliations:
  • Northwestern University, Evanston, IL, USA


According to our database1, Doug Downey authored at least 102 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The Semantic Reader Project.
Commun. ACM, October, 2024

SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature.
CoRR, 2024

MARG: Multi-Agent Review Generation for Scientific Papers.
CoRR, 2024

CARE: Extracting Experimental Findings From Clinical Literature.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

TOPICAL: TOPIC Pages AutomagicaLly.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 2024

ARIES: A Corpus of Scientific Paper Edits Made in Response to Peer Reviews.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SciMON: Scientific Inspiration Machines Optimized for Novelty.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
LIMEADE: From AI Explanations to Advice Taking.
ACM Trans. Interact. Intell. Syst., December, 2023

A Computational Inflection for Scientific Discovery.
Commun. ACM, August, 2023

Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery.
CoRR, 2023

Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks.
CoRR, 2023

The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces.
CoRR, 2023

The Semantic Scholar Open Data Platform.
CoRR, 2023

SciRepEval: A Multi-Format Benchmark for Scientific Document Representations.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

S2abEL: A Dataset for Entity Linking from Scientific Tables.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Embedding Recycling for Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Penguins Don't Fly: Reasoning about Generics through Instantiations and Exceptions.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Relatedly: Scaffolding Literature Reviews with Existing Related Work Sections.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups.
Trans. Assoc. Comput. Linguistics, 2022

ABNIRML: Analyzing the Behavior of Neural IR Models.
Trans. Assoc. Comput. Linguistics, 2022

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation.
CoRR, 2022

Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search.
CoRR, 2022

Infrastructure for Rapid Open Knowledge Network Development.
AI Mag., 2022

FeedLens: Polymorphic Lenses for Personalizing Exploratory Search over Knowledge Graphs.
Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Few-Shot Self-Rationalization with Natural Language Prompts.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

S2AMP: a high-coverage dataset of scholarly mentorship inferred from publications.
Proceedings of the JCDL '22: The ACM/IEEE Joint Conference on Digital Libraries in 2022, Cologne, Germany, June 20, 2022

ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts.
Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration.
Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

Exploring the Role of Local and Global Explanations in Recommender Systems.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction.
Proceedings of the 4th Conference on Automated Knowledge Base Construction, 2022

2021
Incorporating Visual Layout Structures for Scientific Text Classification.
CoRR, 2021

Simplified Data Wrangling with ir_datasets.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

CODE: Compiler-based Neuron-aware Ensemble training.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

S2AND: A Benchmark and Evaluation System for Author Name Disambiguation.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2021

"It doesn't look good for a date": Transforming Critiques into Preferences for Conversational Recommendation Systems.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Who's on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

2020
Explanation-Based Tuning of Opaque Machine Learners with Application to Paper Recommendation.
CoRR, 2020

Practical Methods for Semi-automated Peer Grading in a Classroom Setting.
Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization, 2020

High-Precision Extraction of Emerging Concepts from Scientific Literature.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Abductive Commonsense Reasoning.
Proceedings of the 8th International Conference on Learning Representations, 2020

G-DAug: Generative Data Augmentation for Commonsense Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Stolen Probability: A Structural Weakness of Neural Language Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

SPECTER: Document-level Representation Learning using Citation-informed Transformers.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Just Add Functions: A Neural-Symbolic Language Model.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Multi-sense Definition Modeling using Word Sense Decompositions.
CoRR, 2019

CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense.
CoRR, 2019

A Semantic Cover Approach for Topic Modeling.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

Using Large Corpus N-gram Statistics to Improve Recurrent Neural Language Models.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A new evaluation framework for topic modeling algorithms based on synthetic corpora.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Construction of the Literature Graph in Semantic Scholar.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Estimating Marginal Probabilities of n-grams for Recurrent Neural Language Models.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Extracting Commonsense Properties from Embeddings with Limited Human Guidance.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Sampling Informative Training Data for RNN Language Models.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018

OTyper: A Neural Architecture for Open Named Entity Typing.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Controlling Global Statistics in Recurrent Neural Network Text Generation.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
VecShare: A Framework for Sharing Word Representation Vectors.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

PAG2ADMG: A Novel Methodology to Enumerate Causal Graph Structures.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Definition Modeling: Learning to Define Word Embeddings in Natural Language.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Beating the Artificial Chaos: Fighting OSN Spam Using Its Own Templates.
IEEE/ACM Trans. Netw., 2016

Learning Hierarchically Decomposable Concepts with Active Over-Labeling.
Proceedings of the IEEE 16th International Conference on Data Mining, 2016

2015
TabEL: Entity Linking in Web Tables.
Proceedings of the Semantic Web - ISWC 2015, 2015

Efficient Methods for Incorporating Knowledge into Topic Models.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Efficient Methods for Inferring Large Sparse Topic Hierarchies.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Learning Representations for Weakly Supervised Natural Language Processing Tasks.
Comput. Linguistics, 2014

WebSAIL wikifier at ERD 2014.
Proceedings of the ERD'14, 2014

Analyzing the content emphasis of web search engines.
Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Adding High-Precision Links to Wikipedia.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Spam ain't as diverse as it seems: throttling OSN spam with templates underneath.
Proceedings of the 30th Annual Computer Security Applications Conference, 2014

2013
WebSAIL Wikifier: English Entity Linking at TAC 2013.
Proceedings of the Sixth Text Analysis Conference, 2013

Overcoming the Memory Bottleneck in Distributed Training of Latent Variable Models of Text.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Methods for exploring and mining tables on Wikipedia.
Proceedings of the ACM SIGKDD Workshop on Interactive Data Exploration and Analytics, 2013

Using natural language to integrate, evaluate, and optimize extracted knowledge bases.
Proceedings of the 2013 workshop on Automated knowledge base construction, 2013

A probabilistic graphical model for brand reputation assessment in social networks.
Proceedings of the Advances in Social Networks Analysis and Mining 2013, 2013

Scaling Semi-supervised Naive Bayes with Feature Marginals.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Sentiment identification by incorporating syntax, semantics and context information.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Explanatory semantic relatedness and explicit spatialization for exploratory search.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

2011
Language Models as Representations for Weakly Supervised NLP Tasks.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Local and Global Algorithms for Disambiguation to Wikipedia.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Analysis of a probabilistic model of redundancy in unsupervised information extraction.
Artif. Intell., 2010

Improved Extraction Assessment through Better Language Models.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

2008
Look Ma, No Hands: Analyzing the Monotonic Feature Abstraction for Text Classification.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

It's a Contradiction - no, it's not: A Case Study using Functional Relations.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Understanding the relationship between searchers' queries and information goals.
Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007
Heads and tails: studies of web search with common and rare queries.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Models of Searching and Browsing: Languages, Studies, and Application.
Proceedings of the IJCAI 2007, 2007

Locating Complex Named Entities in Web Text.
Proceedings of the IJCAI 2007, 2007

Sparse Information Extraction: Unsupervised Language Models to the Rescue.
Proceedings of the ACL 2007, 2007

2005
Unsupervised named-entity extraction from the Web: An experimental study.
Artif. Intell., 2005

KnowItNow: Fast, Scalable Information Extraction from the Web.
Proceedings of the HLT/EMNLP 2005, 2005

A Probabilistic Model of Redundancy in Information Extraction.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

2004
Web-scale information extraction in knowitall: (preliminary results).
Proceedings of the 13th international conference on World Wide Web, 2004

Methods for Domain-Independent Information Extraction from the Web: An Experimental Comparison.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004


  Loading...