Doug Downey

Kyle Lo

Daniel S. Weld

ACM Trans. Interact. Intell. Syst., December, 2023

A Computational Inflection for Scientific Discovery.

Commun. ACM, August, 2023

Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery.

CoRR, 2023

Beyond Summarization: Designing AI Support for Real-World Expository Writing Tasks.

CoRR, 2023

The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces.

CoRR, 2023

The Semantic Scholar Open Data Platform.

CoRR, 2023

SciRepEval: A Multi-Format Benchmark for Scientific Document Representations.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

S2abEL: A Dataset for Entity Linking from Scientific Tables.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

PaperMage: A Unified Toolkit for Processing, Representing, and Manipulating Visually-Rich Scientific Documents.

Yoganand Chandrasekhar

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Embedding Recycling for Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Penguins Don't Fly: Reasoning about Generics through Instantiations and Exceptions.

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Relatedly: Scaffolding Literature Reviews with Existing Related Work Sections.

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

CiteSee: Augmenting Citations in Scientific Papers with Persistent and Personalized Historical Context.

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Are Layout-Infused Language Models Robust to Layout Distribution Shifts? A Case Study with Scientific Documents.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

VILA: Improving Structured Content Extraction from Scientific PDFs Using Visual Layout Groups.

Trans. Assoc. Comput. Linguistics, 2022

ABNIRML: Analyzing the Behavior of Neural IR Models.

Trans. Assoc. Comput. Linguistics, 2022

I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation.

CoRR, 2022

Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search.

CoRR, 2022

Infrastructure for Rapid Open Knowledge Network Development.

AI Mag., 2022

FeedLens: Polymorphic Lenses for Personalizing Exploratory Search over Knowledge Graphs.

Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities.

Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Few-Shot Self-Rationalization with Natural Language Prompts.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

S2AMP: a high-coverage dataset of scholarly mentorship inferred from publications.

Proceedings of the JCDL '22: The ACM/IEEE Joint Conference on Digital Libraries in 2022, Cologne, Germany, June 20, 2022

ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Learning to Perform Complex Tasks through Compositional Fine-Tuning of Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Building a Shared Conceptual Model of Complex, Heterogeneous Data Systems: A Demonstration.

Proceedings of the 12th Conference on Innovative Data Systems Research, 2022

Exploring the Role of Local and Global Explanations in Recommender Systems.

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks.

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction.

Tara Safavi

Tom Hope

Proceedings of the 4th Conference on Automated Knowledge Base Construction, 2022

2021

Incorporating Visual Layout Structures for Scientific Text Classification.

CoRR, 2021

Simplified Data Wrangling with ir_datasets.

Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

CODE: Compiler-based Neuron-aware Ensemble training.

Ettore M. G. Trainiti

Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

S2AND: A Benchmark and Evaluation System for Author Name Disambiguation.

Shivashankar Subramanian

Daniel King

Sergey Feldman

Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2021

"It doesn't look good for a date": Transforming Critiques into Preferences for Conversational Recommendation Systems.

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Who's on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains.

Benjamin Charles Germain Lee

Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts.

Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

2020

Explanation-Based Tuning of Opaque Machine Learners with Application to Paper Recommendation.

Kyle Lo

Daniel S. Weld

CoRR, 2020

Practical Methods for Semi-automated Peer Grading in a Classroom Setting.

Zheng Yuan

Proceedings of the 28th ACM Conference on User Modeling, Adaptation and Personalization, 2020

High-Precision Extraction of Emerging Concepts from Scientific Literature.

Daniel King

Daniel S. Weld

Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Abductive Commonsense Reasoning.

Proceedings of the 8th International Conference on Learning Representations, 2020

G-DAug: Generative Data Augmentation for Commonsense Reasoning.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Stolen Probability: A Structural Weakness of Neural Language Models.

Gregory Kimmel

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

SPECTER: Document-level Representation Learning using Citation-informed Transformers.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Just Add Functions: A Neural-Symbolic Language Model.

Rajagopal Venkatesaramani

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Multi-sense Definition Modeling using Word Sense Decompositions.

CoRR, 2019

CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense.

CoRR, 2019

A Semantic Cover Approach for Topic Modeling.

Bradley A. Malin

Yevgeniy Vorobeychik

Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

Using Large Corpus N-gram Statistics to Improve Recurrent Neural Language Models.

Yiben Yang

Ji-Ping Wang

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

A new evaluation framework for topic modeling algorithms based on synthetic corpora.

Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018

Construction of the Literature Graph in Semantic Scholar.

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Estimating Marginal Probabilities of n-grams for Recurrent Neural Language Models.

Lidong Bing

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Extracting Commonsense Properties from Embeddings with Limited Human Guidance.

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Sampling Informative Training Data for RNN Language Models.

Jared Fernandez

Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018

OTyper: A Neural Architecture for Open Named Entity Typing.

Zheng Yuan

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

Controlling Global Statistics in Recurrent Neural Network Text Generation.

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

VecShare: A Framework for Sharing Word Representation Vectors.

Jared Fernandez

Zhaocheng Yu

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

PAG2ADMG: A Novel Methodology to Enumerate Causal Graph Structures.

Nishant Subramani

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Definition Modeling: Learning to Define Word Embeddings in Natural Language.

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Beating the Artificial Chaos: Fighting OSN Spam Using Its Own Templates.

IEEE/ACM Trans. Netw., 2016

Learning Hierarchically Decomposable Concepts with Active Over-Labeling.

Yuji Mo

Stephen D. Scott

Chandra Sekhar Bhagavatula

Proceedings of the IEEE 16th International Conference on Data Mining, 2016

2015

TabEL: Entity Linking in Web Tables.

Proceedings of the Semantic Web - ISWC 2015, 2015

Efficient Methods for Incorporating Knowledge into Topic Models.

Yi Yang

Jordan L. Boyd-Graber

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Efficient Methods for Inferring Large Sparse Topic Hierarchies.

Chandra Bhagavatula

Yi Yang

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014

Learning Representations for Weakly Supervised Natural Language Processing Tasks.

Comput. Linguistics, 2014

WebSAIL wikifier at ERD 2014.

Chandra Bhagavatula

Proceedings of the ERD'14, 2014

Analyzing the content emphasis of web search engines.

Mohammed A. Alam

Proceedings of the 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2014

Adding High-Precision Links to Wikipedia.

Chandra Bhagavatula

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Spam ain't as diverse as it seems: throttling OSN spam with templates underneath.

Proceedings of the 30th Annual Computer Security Applications Conference, 2014

2013

WebSAIL Wikifier: English Entity Linking at TAC 2013.

Proceedings of the Sixth Text Analysis Conference, 2013

Overcoming the Memory Bottleneck in Distributed Training of Latent Variable Models of Text.

Yi Yang

Alexander Yates

Chandra Sekhar Bhagavatula

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Methods for exploring and mining tables on Wikipedia.

Proceedings of the ACM SIGKDD Workshop on Interactive Data Exploration and Analytics, 2013

Using natural language to integrate, evaluate, and optimize extracted knowledge bases.

Chandra Sekhar Bhagavatula

Alexander Yates

Proceedings of the 2013 workshop on Automated knowledge base construction, 2013

A probabilistic graphical model for brand reputation assessment in social networks.

Proceedings of the Advances in Social Networks Analysis and Mining 2013, 2013

Scaling Semi-supervised Naive Bayes with Feature Marginals.

Michael Lucas

Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012

Sentiment identification by incorporating syntax, semantics and context information.

Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Explanatory semantic relatedness and explicit spatialization for exploratory search.

Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

2011

Language Models as Representations for Weakly Supervised NLP Tasks.

Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 2011

Local and Global Algorithms for Disambiguation to Wikipedia.

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010

Analysis of a probabilistic model of redundancy in unsupervised information extraction.

Stephen Soderland

Artif. Intell., 2010

Improved Extraction Assessment through Better Language Models.

Arun Ahuja

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

2008

Look Ma, No Hands: Analyzing the Monotonic Feature Abstraction for Text Classification.

Advances in Neural Information Processing Systems 21, 2008

It's a Contradiction - no, it's not: A Case Study using Functional Relations.

Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Understanding the relationship between searchers' queries and information goals.

Proceedings of the 17th ACM Conference on Information and Knowledge Management, 2008

2007

Heads and tails: studies of web search with common and rare queries.

Susan T. Dumais

Eric Horvitz

Proceedings of SIGIR 2007: the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Models of Searching and Browsing: Languages, Studies, and Application.

Susan T. Dumais

Eric Horvitz

Proceedings of the IJCAI 2007, 2007

Locating Complex Named Entities in Web Text.

Matthew Broadhead

Proceedings of the IJCAI 2007, 2007

Sparse Information Extraction: Unsupervised Language Models to the Rescue.

Stefan Schoenmackers

Proceedings of the ACL 2007, 2007

2005

Unsupervised named-entity extraction from the Web: An experimental study.

Artif. Intell., 2005

KnowItNow: Fast, Scalable Information Extraction from the Web.

Proceedings of the HLT/EMNLP 2005, 2005

A Probabilistic Model of Redundancy in Information Extraction.
