Shay B. Cohen

Orcid: 0000-0003-4753-8353

Affiliations:
  • University of Edinburgh
  • Department of Computer Science, Columbia University


According to our database1, Shay B. Cohen authored at least 141 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
On the Trade-off between Redundancy and Cohesiveness in Extractive Summarization.
J. Artif. Intell. Res., 2024

CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning.
CoRR, 2024

PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning.
CoRR, 2024

What can Large Language Models Capture about Code Functional Equivalence?
CoRR, 2024

einspace: Searching for Neural Architectures from Fundamental Operations.
CoRR, 2024

Spectral Editing of Activations for Large Language Model Alignment.
CoRR, 2024

'Keep it Together': Enforcing Cohesion in Extractive Summaries by Simulating Human Memory.
CoRR, 2024

CivilSum: A Dataset for Abstractive Summarization of Indian Court Decisions.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Are Large Language Model Temporally Grounded?
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

LeanReasoner: Boosting Complex Logical Reasoning with Lean.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Evaluating Automatic Metrics with Incremental Machine Translation Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Modeling News Interactions and Influence for Financial Market Prediction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Causal Explanations for Sequential Decision-Making in Multi-Agent Systems.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Large Language Models Relearn Removed Concepts.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Rant or rave: variation over time in the language of online reviews.
Lang. Resour. Evaluation, September, 2023

Erasure of Unaligned Attributes from Neural Representations.
Trans. Assoc. Comput. Linguistics, 2023

Are Large Language Models Temporally Grounded?
CoRR, 2023

Neuron to Graph: Interpreting Language Model Neurons at Scale.
CoRR, 2023

Knowledge Base Question Answering for Space Debris Queries.
CoRR, 2023

Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making.
CoRR, 2023

PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Detecting and Mitigating Hallucinations in Multilingual Summarisation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

AMR Parsing is Far from Solved: GrAPES, the Granular AMR Parsing Evaluation Suite.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Joint Matrix Factorization Analysis of Multilingual Representations.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Gold Doesn't Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

BERT Is Not The Count: Learning to Match Mathematical Statements with Proofs.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

DISCOSQA: A Knowledge Base Question Answering System for Space Debris based on Program Induction.
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

The Larger they are, the Harder they Fail: Language Models do not Recognize Identifier Swaps in Python.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Nonparametric Learning of Two-Layer ReLU Residual Units.
Trans. Mach. Learn. Res., 2022

A Human-Centric Method for Generating Causal Explanations in Natural Language for Autonomous Vehicle Motion Planning.
CoRR, 2022

Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents by Sampling Summary Views.
CoRR, 2022

On the Trade-off between Redundancy and Local Coherence in Summarization.
CoRR, 2022

Abstractive Summarization Guided by Latent Hierarchical Document Structure.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Sentence-Incremental Neural Coreference Resolution.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Factorizing Content and Budget Decisions in Abstractive Summarization of Long Documents.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Understanding Domain Learning in Language Models Through Subpopulation Analysis.
Proceedings of the Fifth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2022

Co-training an Unsupervised Constituency Parser with Weak Supervision.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Bottom-up unranked tree-to-graph transducers for translation into semantic graphs.
Theor. Comput. Sci., 2021

Unsupervised Extractive Summarization by Human Memory Simulation.
CoRR, 2021

Learning to Match Mathematical Statements with Proofs.
CoRR, 2021

Narration Generation for Cartoon Videos.
CoRR, 2021

Universal Discourse Representation Structure Parsing.
Comput. Linguistics, 2021

Text Generation from Discourse Representation Structures.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

A Root of a Problem: Optimizing Single-Root Dependency Parsing.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

A Differentiable Relaxation of Graph Segmentation and Alignment for AMR Parsing.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Open-Domain Contextual Link Prediction and its Complementarity with Entailment Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

A Closer Look into the Robustness of Neural Dependency Parsers Using Better Adversarial Examples.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Reducing Quantity Hallucinations in Abstractive Summarization.
CoRR, 2020

Learning Two-Layer Residual Networks with Nonparametric Function Estimation by Convex Programming.
CoRR, 2020

Obfuscation for Privacy-preserving Syntactic Parsing.
Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, 2020

Tensors over Semirings for Latent-Variable Weighted Logic Programs.
Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, 2020

English-to-Chinese Transliteration with Phonetic Auxiliary Task.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Learning Latent Forests for Medical Relation Extraction.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Compositional languages emerge in a neural iterated learning model.
Proceedings of the 8th International Conference on Learning Representations, 2020

Reducing the Frequency of Hallucinated Quantities in Abstractive Summaries.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Lightweight, Dynamic Graph Convolutional Networks for AMR-to-Text Generation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-Step Inference for Reasoning Over Paragraphs.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

The Role of Reentrancies in Abstract Meaning Representation Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Machine Reading of Historical Events.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Learning Dialog Policies from Weak Demonstrations.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Bayesian Analysis in Natural Language Processing, Second Edition
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02170-1, 2019

Unlexicalized Transition-based Discontinuous Constituency Parsing.
Trans. Assoc. Comput. Linguistics, 2019

What is this Article about? Extreme Summarization with Topic-aware Convolutional Neural Networks.
J. Artif. Intell. Res., 2019

Jointly Extracting and Compressing Documents with Summary State Representations.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Structural Neural Encoders for AMR-to-text Generation.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Discontinuous Constituency Parsing with a Stack-Free Transition System and a Dynamic Oracle.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Partners in Crime: Multi-view Sequential Inference for Movie Understanding.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Semantic Role Labeling with Iterative Structure Refinement.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Experimenting with Power Divergences for Language Modeling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Wide-Coverage Neural A* Parsing for Minimalist Grammars.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Discourse Representation Parsing for Sentences and Documents.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Duality of Link Prediction and Entailment Graph Induction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Learning Typed Entailment Graphs with Global Soft Constraints.
Trans. Assoc. Comput. Linguistics, 2018

Whodunnit? Crime Drama as a Case for Natural Language Understanding.
Trans. Assoc. Comput. Linguistics, 2018

Bayesian Analysis in Natural Language Processing Shay Cohen (University of Edinburgh)Morgan & Claypool (Synthesis Lectures on Human Language Technologies, edited by Graeme Hirst, volume 35), 2016, xxvii+246 pp; paperback, ISBN 9781627058735, $85.00; ebook, ISBN 9781627054218, $68.00; doi: 10.2200/S00719ED1V01Y201605HLT035.
Comput. Linguistics, 2018

Ranking Sentences for Extractive Summarization with Reinforcement Learning.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Abstract Meaning Representation for Paraphrase Detection.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Cross-Lingual Abstract Meaning Representation Parsing.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Multilingual Clustering of Streaming News.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Privacy-preserving Neural Representations of Text.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Local String Transduction as Sequence Labeling.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Document Modeling with External Attention for Sentence Extraction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Discourse Representation Structure Parsing.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Stock Movement Prediction from Tweets and Historical Prices.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Canonical Correlation Inference for Mapping Abstract Scenes to Text.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Neural Extractive Summarization with Side Information.
CoRR, 2017

Latent-Variable PCFGs: Background and Applications.
Proceedings of the 15th Meeting on the Mathematics of Language, 2017

Split and Rephrase.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

An Incremental Parser for Abstract Meaning Representation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017


2016
Bayesian Analysis in Natural Language Processing
Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02161-9, 2016

Encoding Prior Knowledge with Eigenword Embeddings.
Trans. Assoc. Comput. Linguistics, 2016

Parsing Linear Context-Free Rewriting Systems with Fast Matrix Multiplication.
Comput. Linguistics, 2016

Paraphrase Generation from Latent-Variable PCFGs for Semantic Parsing.
Proceedings of the INLG 2016, 2016

Semi-Supervised Learning of Sequence Models with Method of Moments.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Low-Rank Approximation of Weighted Tree Automata.
Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, 2016

Optimizing Spectral Learning for Parsing.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Weighted Tree Automata Approximation by Singular Value Truncation.
CoRR, 2015

Lexical Event Ordering with an Edge-Factored Model.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Coactive Learning for Interactive Machine Translation.
Proceedings of the 4th Workshop on Machine Learning for Interactive Systems, 2015

Diversity in Spectral Learning for Natural Language Parsing.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Conversation Trees: A Grammar Model for Topic Structure in Forums.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

A Coactive Learning View of Online Structured Prediction in Statistical Machine Translation.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

2014
Online Adaptor Grammars with Hybrid Inference.
Trans. Assoc. Comput. Linguistics, 2014

Spectral learning of latent-variable PCFGs: algorithms and sample complexity.
J. Mach. Learn. Res., 2014

The Visualization of Change in Word Meaning over Time using Temporal Word Embeddings.
CoRR, 2014

Latent-Variable Synchronous CFGs for Hierarchical Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Spectral Unsupervised Parsing with Additive Tree Metrics.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

A Provably Correct Learning Algorithm for Latent-Variable PCFGs.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Lexical Inference over Multi-Word Predicates: A Distributional Approach.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Experiments with Spectral Learning of Latent-Variable PCFGs.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Approximate PCFG Parsing Using Tensor Decomposition.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Spectral Learning Algorithms for Natural Language Processing.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Spectral Learning of Refinement HMMs.
Proceedings of the Seventeenth Conference on Computational Natural Language Learning, 2013

The effect of non-tightness on Bayesian estimation of PCFGs.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Elimination of Spurious Ambiguity in Transition-Based Dependency Parsing
CoRR, 2012

Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning.
Comput. Linguistics, 2012

Tensor Decomposition for Fast Parsing with Latent-Variable PCFGs.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Spectral Learning of Latent-Variable PCFGs.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Products of weighted logic programs.
Theory Pract. Log. Program., 2011

Exact Inference for Generative Probabilistic Non-Projective Dependency Parsing.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Unsupervised Structure Prediction with Non-Parallel Multilingual Guidance.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Unsupervised Bilingual POS Tagging with Markov Random Fields.
Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, 2011

2010
Covariance in Unsupervised Learning of Probabilistic Grammars.
J. Mach. Learn. Res., 2010

Empirical Risk Minimization with Approximations of Probabilistic Grammars.
Proceedings of the Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6-9 December 2010, 2010

Variational Inference for Adaptor Grammars.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Viterbi Training for PCFGs: Hardness Results and Competitiveness of Uniform Initialization.
Proceedings of the ACL 2010, 2010

2009
Shared Logistic Normal Distributions for Soft Parameter Tying in Unsupervised Grammar Induction.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Variational Inference for Grammar Induction with Prior Knowledge.
Proceedings of the ACL 2009, 2009

2008
Logistic Normal Priors for Unsupervised Probabilistic Grammar Induction.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Dynamic Programming Algorithms as Products of Weighted Logic Programs.
Proceedings of the Logic Programming, 24th International Conference, 2008

2007
Feature Selection via Coalitional Game Theory.
Neural Comput., 2007

Joint Morphological and Syntactic Disambiguation.
Proceedings of the EMNLP-CoNLL 2007, 2007

2005
Feature Selection Based on the Shapley Value.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005


  Loading...