2024
InfAlign: Inference-aware language model alignment.
CoRR, 2024
ALTA: Compiler-Based Analysis of Transformers.
CoRR, 2024
Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning.
CoRR, 2024
Predicting the Target Word of Game-playing Conversations using a Low-Rank Dialect Adapter for Decoder Models.
CoRR, 2024
Robust Preference Optimization through Reward Model Distillation.
CoRR, 2024
Theoretical guarantees on the best-of-n alignment policy.
CoRR, 2024
Transforming and Combining Rewards for Aligning Large Language Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
2023
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking.
CoRR, 2023
MD3: The Multi-Dialect Dataset of Dialogues.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Selectively Answering Ambiguous Questions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Dialect-robust Evaluation of Generated Text.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond.
Trans. Assoc. Comput. Linguistics, 2022
Time-Aware Language Models as Temporal Knowledge Bases.
Trans. Assoc. Comput. Linguistics, 2022
Underspecification Presents Challenges for Credibility in Modern Machine Learning.
J. Mach. Learn. Res., 2022
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models.
CoRR, 2022
Pre-trained Sentence Embeddings for Implicit Discourse Relation Classification.
CoRR, 2022
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model.
CoRR, 2022
Uninformative Input Features and Counterfactual Invariance: Two Perspectives on Spurious Correlations in Natural Language.
CoRR, 2022
Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
The MultiBERTs: BERT Reproductions for Robustness Analysis.
Proceedings of the Tenth International Conference on Learning Representations, 2022
Predicting Long-Term Citations from Short-Term Linguistic Influence.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
2021
Sparse, Dense, and Attentional Representations for Text Retrieval.
Trans. Assoc. Comput. Linguistics, 2021
Follow the leader: Documents on the leading edge of semantic change get more citations.
J. Assoc. Inf. Sci. Technol., 2021
Learning to Look Inside: Augmenting Token-Based Encoders with Character-Level Information.
CoRR, 2021
Revisiting the Primacy of English in Zero-shot Cross-lingual Transfer.
CoRR, 2021
Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests.
CoRR, 2021
Abolitionist Networks: Modeling Language Change in Nineteenth-Century Activist Newspapers.
CoRR, 2021
Tuiteamos o pongamos un tuit? Investigating the Social Constraints of Loanword Integration in Spanish Social Media.
CoRR, 2021
Counterfactual Invariance to Spurious Correlations in Text Classification.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Learning to Recognize Dialect Features.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
2020
How We Do Things With Words: Analyzing Text as Social and Cultural Data.
Frontiers Artif. Intell., 2020
Characterizing Collective Attention via Descriptor Context: A Case Study of Public Discussions of Crisis Events.
Proceedings of the Fourteenth International AAAI Conference on Web and Social Media, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
AdvAug: Robust Adversarial Augmentation for Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Characterizing Collective Attention via Descriptor Context in Public Discussions of Crisis Events.
CoRR, 2019
Unsupervised Domain Adaptation of Contextualized Embeddings: A Case Study in Early Modern English.
CoRR, 2019
Measuring and Modeling Language Change.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Correcting Whitespace Errors in Digitized Historical Texts.
Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, 2019
Detecting Social Influence in Event Cascades by Comparing Discriminative Rankers.
Proceedings of the 2019 ACM SIGKDD Workshop on Causal Discovery, 2019
Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Character Eyes: Seeing Language through Character-Level Taggers.
Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2019
Clinical Concept Extraction for Document-Level Coding.
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019
Training on Synthetic Noise Improves Robustness to Natural Noise in Machine Translation.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019
The Referential Reader: A Recurrent Entity Network for Anaphora Resolution.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
Mind Your POV: Convergence of Articles and Editors Towards Wikipedia's Neutrality Norm.
Proc. ACM Hum. Comput. Interact., 2018
The Internet's Hidden Rules: An Empirical Study of Reddit Norm Violations at Micro, Meso, and Macro Scales.
Proc. ACM Hum. Comput. Interact., 2018
Stylistic Variation in Social Media Part-of-Speech Tagging.
CoRR, 2018
Discriminative Modeling of Social Influence for Prediction and Explanation in Event Cascades.
CoRR, 2018
Interactional Stancetaking in Online Forums.
Comput. Linguistics, 2018
Si O No, Que Penses? Catalonian Independence and Linguistic Identity on Social Media.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Explainable Prediction of Medical Codes from Clinical Text.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018
Making "fetch" happen: The influence of social and linguistic context on nonstandard word growth and decline.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Predicting Semantic Relations using Global Graph Properties.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
2017
Overcoming Language Variation in Sentiment Analysis with Social Attention.
Trans. Assoc. Comput. Linguistics, 2017
You Can't Stay Here: The Efficacy of Reddit's 2015 Ban Examined Through Hate Speech.
Proc. ACM Hum. Comput. Interact., 2017
Making "fetch" happen: The influence of social and linguistic context on the success of lexical innovations.
CoRR, 2017
A Kernel Independence Test for Geographical Language Variation.
Comput. Linguistics, 2017
Mimicking Word Embeddings using Subword RNNs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017
#Anorexia, #anarexia, #anarexyia: Characterizing online community practices with orthographic variation.
Proceedings of the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), 2017
A Multidimensional Lexicon for Interpersonal Stancetaking.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
Unsupervised Learning for Lexicon-Based Classification.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
More emojis, less : ) The competition for paralinguistic function in microblog writing.
First Monday, 2016
A Latent Variable Recurrent Neural Network for Discourse Relation Language Models.
CoRR, 2016
Nonparametric Bayesian Storyline Detection from Microtexts.
CoRR, 2016
Shallow Discourse Parsing Using Distributed Argument Representations and Bayesian Optimization.
CoRR, 2016
The Social Dynamics of Language Change in Online Networks.
Proceedings of the Social Informatics - 8th International Conference, 2016
Part-of-Speech Tagging for Historical English.
Proceedings of the NAACL HLT 2016, 2016
A Latent Variable Recurrent Neural Network for Discourse-Driven Language Models.
Proceedings of the NAACL HLT 2016, 2016
Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
A Joint Model of Rhetorical Discourse Structure and Summarization.
Proceedings of the Workshop on Structured Prediction for NLP@EMNLP 2016, 2016
Morphological Priors for Probabilistic Neural Word Embeddings.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
2015
One Vector is Not Enough: Entity-Augmented Distributed Semantics for Discourse Relations.
Trans. Assoc. Comput. Linguistics, 2015
Exploratory Thematic Analysis for Digitized Archival Collections.
Digit. Scholarsh. Humanit., 2015
Identifying visual attributes for object recognition from text and taxonomy.
Comput. Vis. Image Underst., 2015
Putting Things in Context: Community-specific Embedding Projections for Sentiment Analysis.
CoRR, 2015
Unsupervised Domain Adaptation with Feature Embeddings.
Proceedings of the 3rd International Conference on Learning Representations, 2015
Emoticons vs. Emojis on Twitter: A Causal Inference Approach.
CoRR, 2015
Entity-Augmented Distributional Semantics for Discourse Relations.
Proceedings of the 3rd International Conference on Learning Representations, 2015
Document Context Language Models.
CoRR, 2015
Unsupervised Multi-Domain Adaptation with Feature Embeddings.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
"You're Mr. Lebowski, I'm the Dude": Inducing Address Term Formality in Signed Social Networks.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
Psychological Effects of Urban Crime Gleaned from Social Media.
Proceedings of the Ninth International Conference on Web and Social Media, 2015
Confounds and Consequences in Geotagged Twitter Data.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Closing the Gap: Domain Adaptation from Explicit to Implicit Discourse Relations.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Better Document-level Sentiment Analysis from RST Discourse Parsing.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
2014
One Vector is Not Enough: Entity-Augmented Distributional Semantics for Discourse Relations.
CoRR, 2014
Unsupervised Induction of Signed Social Networks from Content and Structure.
CoRR, 2014
Exploratory Thematic Analysis for Historical Newspaper Archives.
Proceedings of the 9th Annual International Conference of the Alliance of Digital Humanities Organizations, 2014
Fast Easy Unsupervised Domain Adaptation with Marginalized Structured Dropout.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Modeling Factuality Judgments in Social Media Text.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
POS induction with distributional and morphological information using a distance-dependent Chinese restaurant process.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Representation Learning for Text-level Discourse Parsing.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Mining Themes and Interests in the Asperger's and Autism Community.
Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2014
2013
Automated text mining for requirements analysis of policy documents.
Proceedings of the 21st IEEE International Requirements Engineering Conference, 2013
Discourse Connectors for Latent Subjectivity in Sentiment Analysis.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013
What to do about bad language on the internet.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013
A Log-Linear Model for Unsupervised Text Normalization.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
Discriminative Improvements to Distributional Sentence Similarity.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013
2012
Mapping the geographical diffusion of new words.
CoRR, 2012
Gender in Twitter: Styles, stances, and social networks.
CoRR, 2012
Document hierarchies from text and links.
Proceedings of the 21st World Wide Web Conference 2012, 2012
TopicViz: interactive topic exploration in document collections.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2012
Bootstrapping a Unified Model of Lexical and Phonetic Acquisition.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012
2011
Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text.
Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011
TopicScape: Semantic Navigation of Document Collections.
CoRR, 2011
Unified analysis of streaming news.
Proceedings of the 20th International Conference on World Wide Web, 2011
Sparse Additive Generative Models of Text.
Proceedings of the 28th International Conference on Machine Learning, 2011
Structured Databases of Named Entities from Bayesian Nonparametrics.
Proceedings of the First workshop on Unsupervised Learning in NLP@EMNLP 2011, 2011
Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011
Discovering Sociolinguistic Associations with Structured Sparsity.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011
2010
A Latent Variable Model for Geographic Lexical Variation.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010
2009
Multilingual Part-of-Speech Tagging: Two Unsupervised Approaches.
J. Artif. Intell. Res., 2009
Learning Document-Level Semantic Properties from Free-Text Annotations.
J. Artif. Intell. Res., 2009
Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: a Bayesian Non-Parametric Approach.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009
Hierarchical Text Segmentation from Multi-Scale Lexical Cohesion.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009
Reading to Learn: Constructing Features from Semantic Abstracts.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009
2008
Structured models of gesture for discourse processing.
PhD thesis, 2008
Gesture Salience as a Hidden Variable for Coreference Resolution and Keyframe Extraction.
J. Artif. Intell. Res., 2008
Unsupervised Multilingual Learning for POS Tagging.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008
Bayesian Unsupervised Topic Segmentation.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008
Gestural Cohesion for Topic Segmentation.
Proceedings of the ACL 2008, 2008
Discourse Topic and Gestural Form.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008
2007
Conditional Modality Fusion for Coreference Resolution.
Proceedings of the ACL 2007, 2007
Turning Lectures into Comic Books Using Linguistically Salient Gestures.
Proceedings of the Twenty-Second AAAI Conference on Artificial Intelligence, 2007
2006
Natural gesture in descriptive monologues.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2006
Gesture Improves Coreference Resolution.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006
Semantic Back-Pointers from Gesture.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006
Gesture Features for Coreference Resolution.
Proceedings of the Machine Learning for Multimodal Interaction, 2006
Interacting with communication appliances: an evaluation of two computer vision-based selection techniques.
Proceedings of the 2006 Conference on Human Factors in Computing Systems, 2006
2004
A Salience-Based Approach to Gesture-Speech Alignment.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004
Visual and linguistic information in gesture classification.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004
Gestural cues for speech understanding.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004
Building the Design Studio of the Future.
Proceedings of the Making Pen-Based Interaction Intelligent and Natural, 2004
2003
Device Independence and Extensibility in Gesture Recognition.
Proceedings of the IEEE Virtual Reality Conference 2003 (VR 2003), 2003
2002
Model-Based Approaches to Reengineering Web Pages.
Proceedings of the Task Models and Diagrams for User Interface Design: Proceedings of the First International Workshop on Task Models and Diagrams for User Interface Design, 2002
XIML: a common representation for interaction data.
Proceedings of the 7th International Conference on Intelligent User Interfaces, 2002
A GUI editor that generates tutoring agents.
Proceedings of the 7th International Conference on Intelligent User Interfaces, 2002
Agents and GUIs from task models.
Proceedings of the 7th International Conference on Intelligent User Interfaces, 2002
2001
Applying model-based techniques to the development of UIs for mobile computers.
Proceedings of the 6th International Conference on Intelligent User Interfaces, 2001
Modeling preference for adaptive user-interfaces.
Proceedings of the Universal Access In HCI: Towards an Information Society for All, 2001
Alternative Representations and Abstractions for Moving Sensors Databases.
Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, 2001
2000
Adapting to mobile contexts with user-interface modeling.
Proceedings of the 3rd IEEE Workshop on Mobile Computing Systems and Applications (WMCSA 2000), 2000
Adaptation in automated user-interface design.
Proceedings of the 5th International Conference on Intelligent User Interfaces, 2000
1999
Towards a general computational framework for model-based interface development systems.
Knowl. Based Syst., 1999
Individual and/versus social creativity (panel session).
Proceedings of the 3rd Conference on Creativity & Cognition, 1999
Learning Design Guidelines by Theory Refinement.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999