Noah A. Smith

Orcid: 0000-0002-2310-6380

Affiliations:
  • University of Washington, Seattle, WA, USA
  • Allen Institute for AI, Seattle, WA, USA
  • Carnegie Mellon University, Pittsburgh, USA


According to our database1, Noah A. Smith authored at least 400 papers between 2000 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Morphosyntactic probing of multilingual BERT models.
Nat. Lang. Eng., 2024

Using proprietary language models in academic research requires explicit justification.
Nat. Comput. Sci., 2024

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback.
CoRR, 2024

Raising the Stakes: Performance Pressure Improves AI-Assisted Decision Making.
CoRR, 2024

ComPO: Community Preferences for Language Model Personalization.
CoRR, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models.
CoRR, 2024

OLMoE: Open Mixture-of-Experts Language Models.
CoRR, 2024

Toward a More Complete OMR Solution.
CoRR, 2024

CPS-TaskForge: Generating Collaborative Problem Solving Environments for Diverse Communication Tasks.
CoRR, 2024

Does Liking Yellow Imply Driving a School Bus? Semantic Leakage in Language Models.
CoRR, 2024

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?
CoRR, 2024

The Art of Saying No: Contextual Noncompliance in Language Models.
CoRR, 2024

MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization.
CoRR, 2024

MUSE: Machine Unlearning Six-Way Evaluation for Language Models.
CoRR, 2024

Decoding-Time Language Model Alignment with Multiple Objectives.
CoRR, 2024

Evaluating Copyright Takedown Methods for Language Models.
CoRR, 2024

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models.
CoRR, 2024

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback.
CoRR, 2024

What Can Natural Language Processing Do for Peer Review?
CoRR, 2024

Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically.
CoRR, 2024

A Taxonomy of Ambiguity Types for NLP.
CoRR, 2024

RewardBench: Evaluating Reward Models for Language Modeling.
CoRR, 2024

Encode Once and Decode in Parallel: Efficient Transformer Decoding.
CoRR, 2024

Third-Party Language Model Performance Prediction from Instruction.
CoRR, 2024

OLMo: Accelerating the Science of Language Models.
CoRR, 2024

Tuning Language Models by Proxy.
CoRR, 2024

How Language Model Hallucinations Can Snowball.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

In-Context Pretraining: Language Modeling Beyond Document Boundaries.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

What's In My Big Data?
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Evaluating n-Gram Novelty of Language Models Using Rusty-DAWG.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

BLINK: Multimodal Large Language Models Can See but Not Perceive.
Proceedings of the Computer Vision - ECCV 2024, 2024

A Call for Clarity in Beam Search: How It Works and When It Stops.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Estimating the Causal Effect of Early ArXiving on Paper Acceptance.
Proceedings of the Causal Learning and Reasoning, 2024

Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

Set the Clock: Temporal Alignment of Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024


Time is Encoded in the Weights of Finetuned Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024


2023
Transparency Helps Reveal When Language Models Learn Meaning.
Trans. Assoc. Comput. Linguistics, 2023

Paloma: A Benchmark for Evaluating Language Model Fit.
CoRR, 2023

Language Models: A Guide for the Perplexed.
CoRR, 2023

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2.
CoRR, 2023

ACID: Abstractive, Content-Based IDs for Document Retrieval with Language Models.
CoRR, 2023

What's In My Big Data?
CoRR, 2023

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore.
CoRR, 2023

Efficiency Pentathlon: A Standardized Arena for Efficiency Evaluation.
CoRR, 2023

DIALGEN: Collaborative Human-LM Generated Dialogues for Improved Understanding of Human-Human Conversations.
CoRR, 2023

Scaling Expert Language Models with Unsupervised Domain Discovery.
CoRR, 2023

LEXPLAIN: Improving Model Explanations via Lexicon Supervision.
Proceedings of the The 12th Joint Conference on Lexical and Computational Semantics, 2023

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

RealTime QA: What's the Answer Right Now?
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Selective Annotation Makes Language Models Better Few-Shot Learners.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Binding Language Models in Symbolic Languages.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Measuring and Narrowing the Compositionality Gap in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

We're Afraid Language Models Aren't Modeling Ambiguity.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Demystifying Prompts in Language Models via Perplexity Estimation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Elaboration-Generating Commonsense Question Answering at Scale.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Self-Instruct: Aligning Language Models with Self-Generated Instructions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

One Embedder, Any Task: Instruction-Finetuned Text Embeddings.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Stubborn Lexical Bias in Data and Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Reproducibility in NLP: What Have We Learned from the Checklist?
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

NarrowBERT: Accelerating Masked Language Model Pretraining and Inference.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Data-Efficient Finetuning Using Cross-Task Nearest Neighbors.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Risks and NLP Design: A Case Study on Procedural Document QA.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Saturated Transformers are Constant-Depth Threshold Circuits.
Trans. Assoc. Comput. Linguistics, 2022

Self-Instruct: Aligning Language Model with Self Generated Instructions.
CoRR, 2022

PromptCap: Prompt-Guided Task-Aware Image Captioning.
CoRR, 2022

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models.
CoRR, 2022

Benchmarking Generalization via In-Context Instructions on 1, 600+ Language Tasks.
CoRR, 2022

Beam Decoding with Controlled Patience.
CoRR, 2022

Computational Lens on Cognition: Study Of Autobiographical Versus Imagined Stories With Large-Scale Language Models.
CoRR, 2022

Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Time Waits for No One! Analysis and Challenges of Temporal Misalignment.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Transparent Human Evaluation for Image Captioning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

DEMix Layers: Disentangling Domains for Modular Language Modeling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Domain Mismatch Doesn't Always Prevent Cross-lingual Transfer Learning.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

The Engage Corpus: A Social Media Dataset for Text-Based Recommender Systems.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Measuring the Carbon Intensity of AI in Cloud Instances.
Proceedings of the FAccT '22: 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, June 21, 2022

UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Modeling Context With Linear Attention for Scalable Document-Level Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Unsupervised Learning of Hierarchical Conversation Structure.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

WANLI: Worker and AI Collaboration for Natural Language Inference Dataset Creation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Twist Decoding: Diverse Generators Guide Each Other.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

In-Context Learning for Few-Shot Dialogue State Tracking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

How Much Does Attention Actually Attend? Questioning the Importance of Attention in Pretrained Transformers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

ABC: Attention with Bounded-memory Control.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine Text.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Generating Scientific Definitions with Controllable Complexity.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Infusing Finetuning with Semantic Dependencies.
Trans. Assoc. Comput. Linguistics, 2021

Provable Limitations of Acquiring Meaning from Ungrounded Form: What Will Future Language Models Understand?
Trans. Assoc. Comput. Linguistics, 2021

Scarecrow: A Framework for Scrutinizing Machine Text.
CoRR, 2021

On the Power of Saturated Transformers: A View from Circuit Complexity.
CoRR, 2021

Specializing Multilingual Language Models: An Empirical Study.
CoRR, 2021

On-the-Fly Controlled Text Generation with Experts and Anti-Experts.
CoRR, 2021

Go Forth and Prosper: Language Modeling with Ancient Textual History.
CoRR, 2021

Probing Across Time: What Does RoBERTa Know and When?
CoRR, 2021

GENIE: A Leaderboard for Human-in-the-Loop Evaluation of Text Generation.
CoRR, 2021

A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Choose Your Own Adventure: Paired Suggestions in Collaborative Writing for Evaluating Story Generation Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Random Feature Attention.
Proceedings of the 9th International Conference on Learning Representations, 2021

Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation.
Proceedings of the 9th International Conference on Learning Representations, 2021

Measuring Association Between Labels and Free-Text Rationales.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Sentence Bottleneck Autoencoders from Transformer Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Probing Across Time: What Does RoBERTa Know and When?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Finetuning Pretrained Transformers into RNNs.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Expected Validation Performance and Estimation of a Random Variable's Maximum.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Competency Problems: On Finding and Removing Artifacts in Language Data.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Challenges in Automated Debiasing for Toxic Language Detection.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study.
Proceedings of the 3rd Conference on Automated Knowledge Base Construction, 2021

Shortformer: Better Language Modeling using Shorter Inputs.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Explaining Relationships Between Scientific Documents.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Promoting Graph Awareness in Linearized Graph-to-Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings.
Trans. Assoc. Comput. Linguistics, 2020

On Consequentialism and Fairness.
Frontiers Artif. Intell., 2020

Parameter Norm Growth During Training of Transformers.
CoRR, 2020

Parsing with Multilingual BERT, a Small Corpus, and a Small Treebank.
CoRR, 2020

Deep Encoder, Shallow Decoder: Reevaluating the Speed-Quality Tradeoff in Machine Translation.
CoRR, 2020

Evaluating NLP Models via Contrast Sets.
CoRR, 2020

Multi-View Learning for Vision-and-Language Navigation.
CoRR, 2020

Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping.
CoRR, 2020

Citation Text Generation.
CoRR, 2020

Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction.
Comput. Linguistics, 2020

Contextual word representations: putting words into computers.
Commun. ACM, 2020

Green AI.
Commun. ACM, 2020

Multilevel Text Alignment with Cross-Document Attention.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Thinking Like a Skeptic: Defeasible Inference in Natural Language.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Grounded Compositional Outputs for Adaptive Language Modeling.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Plug and Play Autoencoders for Conditional Text Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Multilingual Amazon Reviews Corpus.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Parsing with Multilingual BERT, a Small Treebank, and a Small Corpus.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Writing Strategies for Science Communication: Data and Computational Analysis.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020


Explain like I am a Scientist: The Linguistic Barriers of Entry to r/science.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

The Right Tool for the Job: Matching Model and Instance Complexities.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Recollection versus Imagination: Exploring Human Memory and Cognition via Neural Language Models.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Social Bias Frames: Reasoning about Social and Power Implications of Language.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Improving Transformer Models by Reordering their Sublayers.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Mixture of h - 1 Heads is Better than h Heads.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Formal Hierarchy of RNN Architectures.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Exploring the Effect of Author and Reader Identity in Online Story Writing: the STORIESINTHEWILD Corpus.
Proceedings of the First Joint Workshop on Narrative Understanding, Storylines, and Events, 2020

2019
Analyzing Privacy Policies at Scale: From Crowdsourcing to Automated Annotations.
ACM Trans. Web, 2019

Measuring Online Debaters' Persuasive Skill from Text over Time.
Trans. Assoc. Comput. Linguistics, 2019

Situating Sentence Embedders with Nearest Neighbor Overlap.
CoRR, 2019

Improving Natural Language Inference with a Pretrained Parser.
CoRR, 2019

Shallow Syntax in Deep Water.
CoRR, 2019

Contextual Word Representations: A Contextual Introduction.
CoRR, 2019

To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks.
Proceedings of the 4th Workshop on Representation Learning for NLP, 2019

Polyglot Contextual Representations Improve Crosslingual Transfer.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Linguistic Knowledge and Transferability of Contextual Representations.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Deep Weighted Averaging Classifiers.
Proceedings of the Conference on Fairness, Accountability, and Transparency, 2019

Knowledge Enhanced Contextual Word Representations.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

PaLM: A Hybrid Parser and Language Model.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Robust Navigation with Language Pretraining and Stochastic Sampling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Topics to Avoid: Demoting Latent Confounds in Text Classification.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

RNN Architecture Learning with Sparse Regularization.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Show Your Work: Improved Reporting of Experimental Results.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Quoref: A Reading Comprehension Dataset with Questions Requiring Coreferential Reasoning.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Low-Resource Parsing with Crosslingual Contextualized Representations.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Evaluating Gender Bias in Machine Translation.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Is Attention Interpretable?
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

The Risk of Racial Bias in Hate Speech Detection.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Variational Pretraining for Semi-supervised Text Classification.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Sentence Mover's Similarity: Automatic Evaluation for Multi-Sentence Texts.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

ATOMIC: An Atlas of Machine Commonsense for If-Then Reasoning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Framing Effects: Choice of Slogans Used to Advertise Online Experiments Can Boost Recruitment and Lead to Sample Biases.
Proc. ACM Hum. Comput. Interact., 2018

You May Not Need Attention.
CoRR, 2018

Semantic Matching Against a Corpus: New Applications and Methods.
CoRR, 2018

SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines.
CoRR, 2018

"You are no Jack Kennedy": On Media Selection of Highlights from Presidential Debates.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

LSTMs Exploit Linguistic Attributes of Data.
Proceedings of The Third Workshop on Representation Learning for NLP, 2018

Learning Joint Semantic Parsers from Disjoint Data.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Parsing Tweets into Universal Dependencies.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Annotation Artifacts in Natural Language Inference Data.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Sounding Board: A User-Centric and Content-Driven Social Chatbot.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Neural Text Generation in Stories Using Entity Representations as Context.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

The Importance of Calibration for Estimating Proportions from Annotations.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Creative Writing with a Machine in the Loop: Case Studies on Slogans and Stories.
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018

Natural Language Processing for Analyzing Disaster Recovery Trends Expressed in Large Text Corpora.
Proceedings of the 2018 IEEE Global Humanitarian Technology Conference, 2018

Neural Cross-lingual Named Entity Recognition with Minimal Resources.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Syntactic Scaffolds for Semantic Structures.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Rational Recurrences.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Bridging CNNs, RNNs, and Weighted Finite-State Machines.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Backpropagating through Structured Argmax using a SPIGOT.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Neural Models for Documents with Metadata.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Event2Mind: Commonsense Inference on Events, Intents, and Reactions.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Polyglot Semantic Role Labeling.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
End-to-End Neural Segmental Models for Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2017

Frame-Semantic Parsing with Softmax-Margin Segmental RNNs and a Syntactic Scaffold.
CoRR, 2017

Multi-task Learning with CTC and Segmental CRF for Speech Recognition.
CoRR, 2017

Random Search for Hyperparameters using Determinantal Point Processes.
CoRR, 2017

A Neural Framework for Generalized Topic Models.
CoRR, 2017

Greedy Transition-Based Dependency Parsing with Stack LSTMs.
Comput. Linguistics, 2017

Multitask Learning with CTC and Segmental CRF for Speech Recognition.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Dynamic Entity Representations in Neural Language Models.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

What Do Recurrent Neural Network Grammars Learn About Syntax?
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Story Cloze Task: UW NLP System.
Proceedings of the 2nd Workshop on Linking Models of Lexical, 2017

The Effect of Different Writing Tasks on Linguistic Style: A Case Study of the ROC Story Cloze Task.
Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), 2017

Friendships, Rivalries, and Trysts: Characterizing Relations between Ideas in Texts.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Deep Multitask Learning for Semantic Dependency Parsing.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Neural Discourse Structure for Text Categorization.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Many Languages, One Parser.
Trans. Assoc. Comput. Linguistics, 2016

Segmental Recurrent Neural Networks.
Proceedings of the 4th International Conference on Learning Representations, 2016

Character Sequence Models for ColorfulWords.
CoRR, 2016

Massively Multilingual Word Embeddings.
CoRR, 2016

One Parser, Many Languages.
CoRR, 2016

Crowdsourcing Annotations for Websites' Privacy Policies: Can It Really Work?
Proceedings of the 25th International Conference on World Wide Web, 2016

UW-CSE at SemEval-2016 Task 10: Detecting Multiword Expressions and Supersenses using Double-Chained Conditional Random Fields.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

CMU at SemEval-2016 Task 8: Graph-based AMR Parsing with Infinite Ramp Loss.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Generation from Abstract Meaning Representation using Tree Transducers.
Proceedings of the NAACL HLT 2016, 2016

Recurrent Neural Network Grammars.
Proceedings of the NAACL HLT 2016, 2016

Segmental Recurrent Neural Networks for End-to-End Speech Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Friends with Motives: Using Text to Infer Influence on SCOTUS.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Semi-Supervised Learning of Sequence Models with Method of Moments.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Distilling an Ensemble of Greedy Dependency Parsers into One MST Parser.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Character Sequence Models for Colorful Words.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Analyzing Framing through the Casts of Characters in the News.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Training with Exploration Improves a Greedy Stack LSTM Parser.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Greedy, Joint Syntactic-Semantic Parsing with Stack LSTMs.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

Hierarchical Character-Word Models for Language Identification.
Proceedings of The Fourth International Workshop on Natural Language Processing for Social Media, 2016

A Neural Model for Language Identification in Code-Switched Tweets.
Proceedings of the Second Workshop on Computational Approaches to Code Switching@EMNLP 2016, 2016

2015
AD<sup>3</sup>: alternating directions dual decomposition for MAP inference in graphical models.
J. Mach. Learn. Res., 2015

Bayesian Optimization of Text Representations.
CoRR, 2015

Annotating Character Relationships in Literary Texts.
CoRR, 2015

Modeling User Arguments, Interactions, and Attributes for Stance Prediction in Online Debate Forums.
Proceedings of the 2015 SIAM International Conference on Data Mining, Vancouver, BC, Canada, April 30, 2015

A Corpus and Model Integrating Multiword Expressions and Supersenses.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Transforming Dependencies into Phrase Structures.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Retrofitting Word Vectors to Semantic Lexicons.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Toward Abstractive Summarization Using Semantic Representations.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Contextualized Sarcasm Detection on Twitter.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

Learning Word Representations with Hierarchical Sparse Coding.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Bayesian Optimization of Text Representations.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Extractive Summarization by Maximizing Semantic Volume.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

A Utility Model of Authors in the Scientific Community.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Open Extraction of Fine-Grained Political Statements.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Improved Transition-based Parsing by Modeling Characters instead of Words with LSTMs.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

A Supertag-Context Model for Weakly-Supervised CCG Parser Learning.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Frame-Semantic Role Labeling with Heterogeneous Annotations.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Sparse Overcomplete Word Vector Representations.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Transition-Based Dependency Parsing with Stack Long Short-Term Memory.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

The Media Frames Corpus: Annotations of Frames Across Issues.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

The Utility of Text: The Case of Amicus Briefs and the Supreme Court.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Weakly-Supervised Grammar-Informed Bayesian CCG Parser Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Dynamic Language Models for Streaming Text.
Trans. Assoc. Comput. Linguistics, 2014

Discriminative Lexical Semantic Segmentation with Gaps: Running the MWE Gamut.
Trans. Assoc. Comput. Linguistics, 2014

Unsupervised Discovery of Biographical Structure from Text.
Trans. Assoc. Comput. Linguistics, 2014

Narrative framing of consumer sentiment in online restaurant reviews.
First Monday, 2014

An Empirical Comparison of Parsing Methods for Stanford Dependencies.
CoRR, 2014

Phrase Dependency Machine Translation with Quasi-Synchronous Tree-to-Tree Features.
Comput. Linguistics, 2014

Frame-Semantic Parsing.
Comput. Linguistics, 2014

CMU: Arc-Factored, Discriminative Semantic Dependency Parsing.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Conditional Random Field Autoencoders for Unsupervised Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, 2014

Comprehensive Annotation of Multiword Expressions in a Social Web Corpus.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Making the Most of Bag of Words: Sentence Regularization with Alternating Direction Method of Multipliers.
Proceedings of the 31th International Conference on Machine Learning, 2014

Identifying Relevant Text Fragments to Help Crowdsource Privacy Policy Annotations.
Proceedings of the Seconf AAAI Conference on Human Computation and Crowdsourcing, 2014

A Dependency Parser for Tweets.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Weakly-Supervised Bayesian Learning of a CCG Supertagger.
Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014

A Step Towards Usable Privacy Policy: Automatic Alignment of Privacy Statements.
Proceedings of the COLING 2014, 2014

Linguistic Structured Sparsity in Text Categorization.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Overview of the 2014 NLP Unshared Task in PoliInformatics.
Proceedings of the Workshop on Language Technologies and Computational Social Science@ACL 2014, 2014

Unsupervised Alignment of Privacy Policies using Hidden Markov Models.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Simplified Dependency Annotations with GFL-Web.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

A Discriminative Graph-Based Parser for the Abstract Meaning Representation.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

A Bayesian Mixed Effects Model of Literary Character.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Distributed Representations of Geographically Situated Language.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Linguistic structure prediction with the sparseptron.
XRDS, 2013

New Alignment Methods for Discriminative Book Summarization
CoRR, 2013

A Sparse and Adaptive Prior for Time-Dependent Model Parameters.
CoRR, 2013

Predicting the NFL using Twitter.
Proceedings of the 2nd Workshop on Machine Learning and Data Mining for Sports Analytics co-located with 2013 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2013), 2013

Supersense Tagging for Arabic: the MT-in-the-Middle Attack.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

A Simple, Fast, and Effective Reparameterization of IBM Model 2.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Knowledge-Rich Morphological Priors for Bayesian Language Models.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

A Penny for Your Tweets: Campaign Contributions and Capitol Hill Microblogs.
Proceedings of the Seventh International Conference on Weblogs and Social Media, 2013

Measuring Ideological Proportions in Political Speeches.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Learning Topics and Positions from Debatepedia.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Translating into Morphologically Rich Languages with Synthetic Phrases.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Inferring Social Rank in an Old Assyrian Trade Network.
Proceedings of the 8th Annual International Conference of the Alliance of Digital Humanities Organizations, 2013

A Framework for (Under)specifying Dependency Syntax without Overloading Annotators.
Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, 2013

Learning to Extract International Relations from Political Context.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Turning on the Turbo: Fast Third-Order Non-Projective Turbo Parsers.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Learning Latent Personas of Film Characters.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
pycdec: A Python Interface to cdec.
Prague Bull. Math. Linguistics, 2012

Censorship and deletion practices in Chinese social media.
First Monday, 2012

Alternating Directions Dual Decomposition
CoRR, 2012

Mapping the geographical diffusion of new words
CoRR, 2012

Adversarial Evaluation for Models of Natural Language
CoRR, 2012

Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning.
Comput. Linguistics, 2012

An Exact Dual Decomposition Algorithm for Shallow Semantic Parsing with Constraints.
Proceedings of the First Joint Conference on Lexical and Computational Semantics, 2012

Textual Predictors of Bill Survival in Congressional Committees.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Structured Sparsity in Natural Language Processing: Models, Algorithms and Applications.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Concavity and Initialization for Unsupervised Dependency Parsing.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Structured Ramp Loss Minimization for Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Graph-Based Lexicon Expansion with Sparsity-Inducing Penalties.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Word Salad: Relating Food Prices and Descriptions.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Recall-Oriented Learning of Named Entities in Arabic Wikipedia.
Proceedings of the EACL 2012, 2012

Transliteration by Sequence Labeling with Lattice Encodings and Reranking.
Proceedings of the 4th Named Entity Workshop, 2012

A Probabilistic Model for Canonicalizing Named Entity Mentions.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Discovering Factions in the Computational Linguistics Community.
Proceedings of the Special Workshop on Rediscovering 50 Years of Discoveries@ACL 2012, 2012

Coarse Lexical Semantic Annotation with Supersenses: An Arabic Case Study.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Linguistic Structure Prediction
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02143-5, 2011

Products of weighted logic programs.
Theory Pract. Log. Program., 2011

Online Learning of Structured Predictors with Multiple Kernels.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

Generative Models of Monolingual and Bilingual Gappy Patterns.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

The CMU-ARK German-English Translation System.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Author Age Prediction from Text using Linear Regression.
Proceedings of the 5th ACL Workshop on Language Technology for Cultural Heritage, 2011

An Augmented Lagrangian Approach to Constrained MAP Inference.
Proceedings of the 28th International Conference on Machine Learning, 2011

Predicting a Scientific Community's Response to an Article.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Structured Sparsity in Structured Prediction.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Dual Decomposition with Many Overlapping Components.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Quasi-Synchronous Phrase Dependency Grammars for Machine Translation.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Structured Databases of Named Entities from Bayesian Nonparametrics.
Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, 2011

Unsupervised Structure Prediction with Non-Parallel Multilingual Guidance.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Unsupervised Bilingual POS Tagging with Markov Random Fields.
Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, 2011

Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Discovering Sociolinguistic Associations with Structured Sparsity.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Unsupervised Word Alignment with Arbitrary Features.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Semi-Supervised Frame-Semantic Parsing for Unknown Predicates.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010
Covariance in Unsupervised Learning of Probabilistic Grammars.
J. Mach. Learn. Res., 2010

SEMAFOR: Frame Argument Resolution with Log-Linear Models.
Proceedings of the 5th International Workshop on Semantic Evaluation, 2010

Empirical Risk Minimization with Approximations of Probabilistic Grammars.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Shedding (a Thousand Points of) Light on Biased Language.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Movie Reviews and Revenues: An Experiment in Text Regression.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Rating Computer-Generated Questions with Mechanical Turk.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

Tree Edit Models for Recognizing Textual Entailments, Paraphrases, and Answers to Questions.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Good Question! Statistical Ranking for Question Generation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Probabilistic Frame-Semantic Parsing.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Variational Inference for Adaptor Grammars.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

What's Worthy of Comment? Content and Comment Volume in Political Blogs.
Proceedings of the Fourth International Conference on Weblogs and Social Media, 2010

From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series.
Proceedings of the Fourth International Conference on Weblogs and Social Media, 2010

Turbo Parsers: Dependency Parsing by Approximate Variational Inference.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

A Latent Variable Model for Geographic Lexical Variation.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Distributed Asynchronous Online Learning for Natural Language Processing.
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010

Nonparametric Word Segmentation for Machine Translation.
Proceedings of the COLING 2010, 2010

Viterbi Training for PCFGs: Hardness Results and Competitiveness of Uniform Initialization.
Proceedings of the ACL 2010, 2010

Favor Short Dependencies: Parsing with Soft and Hard Constraints on Dependency Length.
Proceedings of the Trends in Parsing Technology, 2010

2009
Nonextensive Information Theoretic Kernels on Measures.
J. Mach. Learn. Res., 2009

Predicting Response to Political Blog Posts with Topic Models.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Preference Grammars: Softening Syntactic Constraints to Improve Statistical Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Predicting Risk from Financial Reports with Regression.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Shared Logistic Normal Distributions for Soft Parameter Tying in Unsupervised Grammar Induction.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

From Episodes to Sagas: Understanding the News by Identifying Temporally Related Story Sequences.
Proceedings of the Third International Conference on Weblogs and Social Media, 2009

Tutorial summary: Structured prediction for natural language processing.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Polyhedral outer approximations with application to natural language parsing.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

Feature-Rich Translation by Quasi-Synchronous Lattice Parsing.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Cube Summing, Approximate Inference with Non-Local Features, and Dynamic Programming without Semirings.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

Concise Integer Linear Programming Formulations for Dependency Parsing.
Proceedings of the ACL 2009, 2009

Paraphrase Identification as Probabilistic Quasi-Synchronous Recognition.
Proceedings of the ACL 2009, 2009

Variational Inference for Grammar Induction with Prior Knowledge.
Proceedings of the ACL 2009, 2009

Leveraging Structural Relations for Fluent Compressions at Multiple Compression Rates.
Proceedings of the ACL 2009, 2009

2008
<i>Computational Approaches to Morphology and Syntax</i> Brian Roark and Richard Sproat (Oregon Health and Science University and University of Illinois at Urbana-Champaign) Oxford: Oxford University Press (Oxford surveys in syntax and morphology, edited by Robert D. Van Valin Jr, volume 4), 2007, xx+316 pp; hardbound, ISBN 978-0-19-927477-2.
Comput. Linguistics, 2008

Rich Source-Side Context for Statistical Machine Translation.
Proceedings of the Third Workshop on Statistical Machine Translation, 2008

Logistic Normal Priors for Unsupervised Probabilistic Grammar Induction.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Relative keyboard input system.
Proceedings of the 13th International Conference on Intelligent User Interfaces, 2008

Nonextensive entropic kernels.
Proceedings of the Machine Learning, 2008

Dynamic Programming Algorithms as Products of Weighted Logic Programs.
Proceedings of the Logic Programming, 24th International Conference, 2008

Stacking Dependency Parsers.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Wider Pipelines: N-Best Alignments and Parses in MT Training.
Proceedings of the 8th Conference of the Association for Machine Translation in the Americas: Research Papers, 2008

2007
Weighted and Probabilistic Context-Free Grammars Are Equally Expressive.
Comput. Linguistics, 2007

What is the Jeopardy Model? A Quasi-Synchronous Grammar for QA.
Proceedings of the EMNLP-CoNLL 2007, 2007

Probabilistic Models of Nonprojective Dependency Trees.
Proceedings of the EMNLP-CoNLL 2007, 2007

Joint Morphological and Syntactic Disambiguation.
Proceedings of the EMNLP-CoNLL 2007, 2007

Computationally Efficient M-Estimation of Log-Linear Structure Models.
Proceedings of the ACL 2007, 2007

2006
Vine Parsing and Minimum Risk Reranking for Speed and Precision.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

Annealing Structural Bias in Multilingual Weighted Grammar Induction.
Proceedings of the ACL 2006, 2006

2005
Context-Based Morphological Disambiguation with Random Fields.
Proceedings of the HLT/EMNLP 2005, 2005

Compiling Comp Ling: Weighted Dynamic Programming and the Dyna Language.
Proceedings of the HLT/EMNLP 2005, 2005

Parsing with Soft and Hard Constraints on Dependency Length.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

Contrastive Estimation: Training Log-Linear Models on Unlabeled Data.
Proceedings of the ACL 2005, 2005

2004
Bilingual Parsing with Factored Estimation: Using English to Parse Korean.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing , 2004

Annealing Techniques For Unsupervised Statistical Language Learning.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004

Dyna: A Language for Weighted Dynamic Programming.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26, 2004, 2004

2003
The Web as a Parallel Corpus.
Comput. Linguistics, 2003

2002
From Words to Corpora: Recognizing Translation.
Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, 2002

2000
Cairo: An Alignment Visualization Tool.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000


  Loading...