Jacob Eisenstein

Affiliations:
  • Google Research, Seattle, WA, USA
  • Georgia Institute of Technology, School of Interactive Computing, Atlanta, GA, USA


According to our database, Jacob Eisenstein authored at least 145 papers between 1999 and 2024.

Bibliography

2024
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning.
CoRR, 2024

Predicting the Target Word of Game-playing Conversations using a Low-Rank Dialect Adapter for Decoder Models.
CoRR, 2024

Robust Preference Optimization through Reward Model Distillation.
CoRR, 2024

Theoretical guarantees on the best-of-n alignment policy.
CoRR, 2024

Transforming and Combining Rewards for Aligning Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking.
CoRR, 2023

MD3: The Multi-Dialect Dataset of Dialogues.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Selectively Answering Ambiguous Questions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Dialect-robust Evaluation of Generated Text.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond.
Trans. Assoc. Comput. Linguistics, 2022

Time-Aware Language Models as Temporal Knowledge Bases.
Trans. Assoc. Comput. Linguistics, 2022

Underspecification Presents Challenges for Credibility in Modern Machine Learning.
J. Mach. Learn. Res., 2022

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models.
CoRR, 2022

Pre-trained Sentence Embeddings for Implicit Discourse Relation Classification.
CoRR, 2022

Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model.
CoRR, 2022

Uninformative Input Features and Counterfactual Invariance: Two Perspectives on Spurious Correlations in Natural Language.
CoRR, 2022

Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

The MultiBERTs: BERT Reproductions for Robustness Analysis.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Predicting Long-Term Citations from Short-Term Linguistic Influence.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Sparse, Dense, and Attentional Representations for Text Retrieval.
Trans. Assoc. Comput. Linguistics, 2021

Follow the leader: Documents on the leading edge of semantic change get more citations.
J. Assoc. Inf. Sci. Technol., 2021

Learning to Look Inside: Augmenting Token-Based Encoders with Character-Level Information.
CoRR, 2021

Revisiting the Primacy of English in Zero-shot Cross-lingual Transfer.
CoRR, 2021

Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests.
CoRR, 2021

Abolitionist Networks: Modeling Language Change in Nineteenth-Century Activist Newspapers.
CoRR, 2021

Tuiteamos o pongamos un tuit? Investigating the Social Constraints of Loanword Integration in Spanish Social Media.
CoRR, 2021

Counterfactual Invariance to Spurious Correlations in Text Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Learning to Recognize Dialect Features.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

2020
How We Do Things With Words: Analyzing Text as Social and Cultural Data.
Frontiers Artif. Intell., 2020

Characterizing Collective Attention via Descriptor Context: A Case Study of Public Discussions of Crisis Events.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020

Will it Unblend?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

AdvAug: Robust Adversarial Augmentation for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Characterizing Collective Attention via Descriptor Context in Public Discussions of Crisis Events.
CoRR, 2019

Unsupervised Domain Adaptation of Contextualized Embeddings: A Case Study in Early Modern English.
CoRR, 2019

Measuring and Modeling Language Change.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Correcting Whitespace Errors in Digitized Historical Texts.
Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, 2019

Detecting Social Influence in Event Cascades by Comparing Discriminative Rankers.
Proceedings of the 2019 ACM SIGKDD Workshop on Causal Discovery, 2019

Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Character Eyes: Seeing Language through Character-Level Taggers.
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019

Clinical Concept Extraction for Document-Level Coding.
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019

Training on Synthetic Noise Improves Robustness to Natural Noise in Machine Translation.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

The Referential Reader: A Recurrent Entity Network for Anaphora Resolution.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Mind Your POV: Convergence of Articles and Editors Towards Wikipedia's Neutrality Norm.
Proc. ACM Hum. Comput. Interact., 2018

The Internet's Hidden Rules: An Empirical Study of Reddit Norm Violations at Micro, Meso, and Macro Scales.
Proc. ACM Hum. Comput. Interact., 2018

Stylistic Variation in Social Media Part-of-Speech Tagging.
CoRR, 2018

Discriminative Modeling of Social Influence for Prediction and Explanation in Event Cascades.
CoRR, 2018

Interactional Stancetaking in Online Forums.
Comput. Linguistics, 2018

Si O No, Que Penses? Catalonian Independence and Linguistic Identity on Social Media.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Explainable Prediction of Medical Codes from Clinical Text.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Making "fetch" happen: The influence of social and linguistic context on nonstandard word growth and decline.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Predicting Semantic Relations using Global Graph Properties.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

2017
Overcoming Language Variation in Sentiment Analysis with Social Attention.
Trans. Assoc. Comput. Linguistics, 2017

You Can't Stay Here: The Efficacy of Reddit's 2015 Ban Examined Through Hate Speech.
Proc. ACM Hum. Comput. Interact., 2017

Making "fetch" happen: The influence of social and linguistic context on the success of lexical innovations.
CoRR, 2017

A Kernel Independence Test for Geographical Language Variation.
Comput. Linguistics, 2017

Mimicking Word Embeddings using Subword RNNs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

#Anorexia, #anarexia, #anarexyia: Characterizing online community practices with orthographic variation.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017

A Multidimensional Lexicon for Interpersonal Stancetaking.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Unsupervised Learning for Lexicon-Based Classification.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
More emojis, less :) The competition for paralinguistic function in microblog writing.
First Monday, 2016

A Latent Variable Recurrent Neural Network for Discourse Relation Language Models.
CoRR, 2016

Nonparametric Bayesian Storyline Detection from Microtexts.
CoRR, 2016

Shallow Discourse Parsing Using Distributed Argument Representations and Bayesian Optimization.
CoRR, 2016

The Social Dynamics of Language Change in Online Networks.
Proceedings of the Social Informatics - 8th International Conference, 2016

Part-of-Speech Tagging for Historical English.
Proceedings of the NAACL HLT 2016, 2016

A Latent Variable Recurrent Neural Network for Discourse-Driven Language Models.
Proceedings of the NAACL HLT 2016, 2016

Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

A Joint Model of Rhetorical Discourse Structure and Summarization.
Proceedings of the Workshop on Structured Prediction for NLP@EMNLP 2016, 2016

Morphological Priors for Probabilistic Neural Word Embeddings.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
One Vector is Not Enough: Entity-Augmented Distributed Semantics for Discourse Relations.
Trans. Assoc. Comput. Linguistics, 2015

Exploratory Thematic Analysis for Digitized Archival Collections.
Digit. Scholarsh. Humanit., 2015

Identifying visual attributes for object recognition from text and taxonomy.
Comput. Vis. Image Underst., 2015

Putting Things in Context: Community-specific Embedding Projections for Sentiment Analysis.
CoRR, 2015

Unsupervised Domain Adaptation with Feature Embeddings.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Emoticons vs. Emojis on Twitter: A Causal Inference Approach.
CoRR, 2015

Entity-Augmented Distributional Semantics for Discourse Relations.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Document Context Language Models.
CoRR, 2015

Unsupervised Multi-Domain Adaptation with Feature Embeddings.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

"You're Mr. Lebowski, I'm the Dude": Inducing Address Term Formality in Signed Social Networks.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Psychological Effects of Urban Crime Gleaned from Social Media.
Proceedings of the Ninth International Conference on Web and Social Media, 2015

Confounds and Consequences in Geotagged Twitter Data.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Closing the Gap: Domain Adaptation from Explicit to Implicit Discourse Relations.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Better Document-level Sentiment Analysis from RST Discourse Parsing.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
One Vector is Not Enough: Entity-Augmented Distributional Semantics for Discourse Relations.
CoRR, 2014

Unsupervised Induction of Signed Social Networks from Content and Structure.
CoRR, 2014

Exploratory Thematic Analysis for Historical Newspaper Archives.
Proceedings of the 9th Annual International Conference of the Alliance of Digital Humanities Organizations, 2014

Fast Easy Unsupervised Domain Adaptation with Marginalized Structured Dropout.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Modeling Factuality Judgments in Social Media Text.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

POS induction with distributional and morphological information using a distance-dependent Chinese restaurant process.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Representation Learning for Text-level Discourse Parsing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Mining Themes and Interests in the Asperger's and Autism Community.
Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2014

2013
Automated text mining for requirements analysis of policy documents.
Proceedings of the 21st IEEE International Requirements Engineering Conference, 2013

Discourse Connectors for Latent Subjectivity in Sentiment Analysis.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

What to do about bad language on the internet.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

A Log-Linear Model for Unsupervised Text Normalization.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Discriminative Improvements to Distributional Sentence Similarity.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012
Mapping the geographical diffusion of new words.
CoRR, 2012

Gender in Twitter: Styles, stances, and social networks.
CoRR, 2012

Document hierarchies from text and links.
Proceedings of the 21st World Wide Web Conference 2012, 2012

TopicViz: interactive topic exploration in document collections.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2012

Bootstrapping a Unified Model of Lexical and Phonetic Acquisition.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011

TopicScape: Semantic Navigation of Document Collections.
CoRR, 2011

Unified analysis of streaming news.
Proceedings of the 20th International Conference on World Wide Web, 2011

Sparse Additive Generative Models of Text.
Proceedings of the 28th International Conference on Machine Learning, 2011

Structured Databases of Named Entities from Bayesian Nonparametrics.
Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, 2011

Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Discovering Sociolinguistic Associations with Structured Sparsity.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
A Latent Variable Model for Geographic Lexical Variation.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

2009
Multilingual Part-of-Speech Tagging: Two Unsupervised Approaches.
J. Artif. Intell. Res., 2009

Learning Document-Level Semantic Properties from Free-Text Annotations.
J. Artif. Intell. Res., 2009

Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: a Bayesian Non-Parametric Approach.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Hierarchical Text Segmentation from Multi-Scale Lexical Cohesion.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Reading to Learn: Constructing Features from Semantic Abstracts.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

2008
Structured models of gesture for discourse processing.
PhD thesis, 2008

Gesture Salience as a Hidden Variable for Coreference Resolution and Keyframe Extraction.
J. Artif. Intell. Res., 2008

Unsupervised Multilingual Learning for POS Tagging.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Bayesian Unsupervised Topic Segmentation.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Gestural Cohesion for Topic Segmentation.
Proceedings of the ACL 2008, 2008

Discourse Topic and Gestural Form.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Conditional Modality Fusion for Coreference Resolution.
Proceedings of the ACL 2007, 2007

Turning Lectures into Comic Books Using Linguistically Salient Gestures.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007

2006
Natural gesture in descriptive monologues.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2006

Gesture Improves Coreference Resolution.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Semantic Back-Pointers from Gesture.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Gesture Features for Coreference Resolution.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Interacting with communication appliances: an evaluation of two computer vision-based selection techniques.
Proceedings of the 2006 Conference on Human Factors in Computing Systems, 2006

2004
A Salience-Based Approach to Gesture-Speech Alignment.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Visual and linguistic information in gesture classification.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Gestural cues for speech understanding.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Building the Design Studio of the Future.
Proceedings of the Making Pen-Based Interaction Intelligent and Natural, 2004

2003
Device Independence and Extensibility in Gesture Recognition.
Proceedings of the IEEE Virtual Reality Conference 2003 (VR 2003), 2003

2002
Model-Based Approaches to Reengineering Web Pages.
Proceedings of the Task Models and Diagrams for User Interface Design: Proceedings of the First International Workshop on Task Models and Diagrams for User Interface Design, 2002

XIML: a common representation for interaction data.
Proceedings of the 7th International Conference on Intelligent User Interfaces, 2002

A GUI editor that generates tutoring agents.
Proceedings of the 7th International Conference on Intelligent User Interfaces, 2002

Agents and GUIs from task models.
Proceedings of the 7th International Conference on Intelligent User Interfaces, 2002

2001
Applying model-based techniques to the development of UIs for mobile computers.
Proceedings of the 6th International Conference on Intelligent User Interfaces, 2001

Modeling preference for adaptive user-interfaces.
Proceedings of the Universal Access In HCI: Towards an Information Society for All, 2001

Alternative Representations and Abstractions for Moving Sensors Databases.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001

2000
Adapting to mobile contexts with user-interface modeling.
Proceedings of the 3rd IEEE Workshop on Mobile Computing Systems and Applications (WMCSA 2000), 2000

Adaptation in automated user-interface design.
Proceedings of the 5th International Conference on Intelligent User Interfaces, 2000

1999
Towards a general computational framework for model-based interface development systems.
Knowl. Based Syst., 1999

Individual and/versus social creativity (panel session).
Proceedings of the 3rd Conference on Creativity & Cognition, 1999

Learning Design Guidelines by Theory Refinement.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

