Yulia Tsvetkov
ORCID: 0000-0002-4634-7128
Affiliations:
- University of Washington, Paul G. Allen School of Computer Science and Engineering, USA
- Carnegie Mellon University, Pittsburgh, PA, USA
According to our database, Yulia Tsvetkov authored at least 171 papers between 2010 and 2024.
Bibliography
2024
CoRR, 2024
CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs.
CoRR, 2024
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization.
CoRR, 2024
Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically.
CoRR, 2024
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge.
CoRR, 2024
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.
CoRR, 2024
Proceedings of the ACM on Web Conference 2024, 2024
LatticeGen: Hiding Generated Text in a Lattice for Privacy-Aware Large Language Model Generation on Cloud.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024
P³Sum: Preserving Author's Perspective in News Summarization with Diffusion Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Beyond the Stethoscope: Operationalizing Interactive Clinical Reasoning in Large Language Models via Proactive Information Seeking.
Proceedings of the 12th IEEE International Conference on Healthcare Informatics, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Mental Health Stigma across Diverse Genders in Generative Large Language Models - Abstract (abstract).
Proceedings of Machine Learning for Cognitive and Mental Health Workshop (ML4CMH 2024) Co-located with the Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2024), 2024
2023
What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization.
CoRR, 2023
Knowledge Crosswords: Geometric Reasoning over Structured Knowledge with Large Language Models.
CoRR, 2023
LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud.
CoRR, 2023
CoRR, 2023
CoRR, 2023
CooK: Empowering General-Purpose Language Models with Modular and Collaborative Knowledge.
CoRR, 2023
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Toward Human Readable Prompt Tuning: Kubrick's The Shining is a good movie, and a good prompt too?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023
Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
ORCA: Interpreting Prompted Language Models via Locating Supporting Data Evidence in the Ocean of Pretraining Data.
CoRR, 2022
CoRR, 2022
VoynaSlov: A Data Set of Russian Social Media Activity during the 2022 Ukraine-Russia War.
CoRR, 2022
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Challenges and Opportunities in Information Manipulation Detection: An Examination of Wartime Russian Media.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Speaker Information Can Guide Models to Better Inductive Biases: A Case Study On Predicting Code-Switching.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
2021
CoRR, 2021
An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
Proceedings of the Fifteenth International AAAI Conference on Web and Social Media, 2021
Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models.
Proceedings of the 9th International Conference on Learning Representations, 2021
DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues.
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Influence Tuning: Demoting Spurious Correlations via Instance Attribution and Instance-Driven Updates.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
Cross-Cultural Similarity Features for Cross-Lingual Transfer Learning of Pragmatically Motivated Tasks.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021
2020
Frontiers Artif. Intell., 2020
Ranking Transfer Languages with Pragmatically-Motivated Features for Multilingual Sentiment Analysis.
CoRR, 2020
StructSum: Incorporating Latent and Explicit Sentence Dependencies for Single Document Summarization.
CoRR, 2020
Where New Words Are Born: Distributional Semantic Analysis of Neologisms and Their Semantic Neighborhoods.
CoRR, 2020
Proceedings of the Social Informatics - 12th International Conference, 2020
LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for Multi-Granular Propaganda Span Identification.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020
Stress and burnout in open source: toward finding, understanding, and mitigating unhealthy interactions.
Proceedings of the ICSE-NIER 2020: 42nd International Conference on Software Engineering, New Ideas and Emerging Results, Seoul, South Korea, 27 June, 2020
Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History.
Proceedings of the 8th International Conference on Learning Representations, 2020
Proceedings of the "I Can't Believe It's Not Better!" at NeurIPS Workshops, 2020
On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020
A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards.
Proceedings of the Fourth Workshop on Neural Generation and Translation, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the Eighth International Workshop on Natural Language Processing for Social Media, 2020
2019
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology.
CoRR, 2019
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019
Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue, 2019
Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019
Contextual Affective Analysis: A Case Study of People Portrayals in Online #MeToo Stories.
Proceedings of the Thirteenth International Conference on Web and Social Media, 2019
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs.
Proceedings of the 7th International Conference on Learning Representations, 2019
Learning to Generate Word- and Phrase-Embeddings for Efficient Phrase-Based Neural Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
Finding Microaggressions in the Wild: A Case for Locating Elusive Phenomena in Social Media Posts.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019
A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019
2018
Trans. Assoc. Comput. Linguistics, 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018
Framing and Agenda-Setting in Russian News: a Computational Analysis of Intricate Political Strategies.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018
2017
Proceedings of the Social Informatics, 2017
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017
2016
Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation Learning.
CoRR, 2016
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016
Polyglot Neural Language Models: A Case Study in Cross-Lingual Phonetic Representation Learning.
Proceedings of the NAACL HLT 2016, 2016
Proceedings of the NAACL HLT 2016, 2016
Learning the Curriculum with Bayesian Optimization for Task-Specific Word Representation Learning.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016
2015
Proceedings of the NetWordS Final Conference on Word Knowledge and Word Usage: Representations and Processes in the Mental Lexicon, Pisa, Italy, March 30, 2015
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Not All Contexts Are Created Equal: Better Word Representations with Variable Attention.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015
2014
Identification of Multiword Expressions by Combining Multiple Linguistic Information Sources.
Comput. Linguistics, 2014
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014
Proceedings of the COLING 2014, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
2013
Generating English Determiners in Phrase-Based Translation with Synthetic Translation Options.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, 2013
2012
Nat. Lang. Eng., 2012
2011
Identification of Multi-word Expressions by Combining Multiple Linguistic Information Sources.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011
2010
Proceedings of the International Conference on Language Resources and Evaluation, 2010