Sherry Tongshuang Wu

Orcid: 0000-0003-1630-0588

Affiliations:
  • Carnegie Mellon University, PA, USA


According to our database1, Sherry Tongshuang Wu authored at least 76 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design.
Trans. Assoc. Comput. Linguistics, 2024

Large Language Models Enable Few-Shot Clustering.
Trans. Assoc. Comput. Linguistics, 2024

A large-scale audit of dataset licensing and attribution in AI.
Nat. Mac. Intell., 2024

HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation.
CoRR, 2024

Residual Kolmogorov-Arnold Network for Enhanced Deep Learning.
CoRR, 2024

What Is Wrong with My Model? Identifying Systematic Problems with Semantic Data Slicing.
CoRR, 2024

What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs.
CoRR, 2024

SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning.
CoRR, 2024

WebCanvas: Benchmarking Web Agents in Online Environments.
CoRR, 2024

Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness.
CoRR, 2024

Evaluating Mathematical Reasoning Beyond Accuracy.
CoRR, 2024

Large Language Models Help Humans Verify Truthfulness - Except When They Are Convincingly Wrong.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Synthetic Multimodal Question Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Selenite: Scaffolding Online Sensemaking with Comprehensive Overviews Elicited from Large Language Models.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

Trust and Reliance in Evolving Human-AI Workflows (TREW).
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2024

Generating Situated Reflection Triggers About Alternative Solution Paths: A Case Study of Generative AI for Computer-Supported Collaborative Learning.
Proceedings of the Artificial Intelligence in Education - 25th International Conference, 2024

How to Teach Programming in the AI Era? Using LLMs as a Teachable Agent for Debugging.
Proceedings of the Artificial Intelligence in Education - 25th International Conference, 2024

Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Better Synthetic Data by Retrieving and Transforming Existing Datasets.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Towards Natural Language-Based Visualization Authoring.
IEEE Trans. Vis. Comput. Graph., 2023

Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation.
Trans. Assoc. Comput. Linguistics, 2023

Do LLMs exhibit human-like response biases? A case study in survey design.
CoRR, 2023

Measuring Adversarial Datasets.
CoRR, 2023

The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI.
CoRR, 2023

HypoCompass: Large-Language-Model-based Tutor for Hypothesis Construction in Debugging for Novices.
CoRR, 2023

From Nuisance to News Sense: Augmenting the News with Cross-Document Evidence and Context.
CoRR, 2023

Selenite: Scaffolding Decision Making with Comprehensive Overviews Elicited from Large Language Models.
CoRR, 2023

"Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs.
CoRR, 2023

LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs.
CoRR, 2023

Seeing Seeds Beyond Weeds: Green Teaming Generative AI for Beneficial Uses.
CoRR, 2023

Tool Learning with Foundation Models.
CoRR, 2023

Parachute: Evaluating Interactive Human-LM Co-writing Systems.
CoRR, 2023

Synergi: A Mixed-Initiative System for Scholarly Synthesis and Sensemaking.
Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, 2023

ScatterShot: Interactive In-context Example Curation for Text Transformation.
Proceedings of the 28th International Conference on Intelligent User Interfaces, 2023

BiasX: "Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Prompt2Model: Generating Deployable Models from Natural Language Instructions.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

NewsSense: Reference-free Verification via Cross-document Comparison.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

ConvXAI : Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing.
Proceedings of the Computer Supported Cooperative Work and Social Computing, 2023

LLMs and the Infrastructure of CSCW.
Proceedings of the Computer Supported Cooperative Work and Social Computing, 2023

Workshop on Trust and Reliance in AI-Human Teams (TRAIT).
Proceedings of the Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

Is AI the better programming partner? Human-Human Pair Programming vs. Human-AI pAIr Programming.
Proceedings of the Workshop on Empowering Education with LLMs, 2023

DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Capabilities for Better ML Engineering.
Proceedings of the Workshop on Artificial Intelligence Safety 2023 (SafeAI 2023) co-located with the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), 2023

2022
Interactive AI Model Debugging and Correction
PhD thesis, 2022

DeHumor: Visual Analytics for Decomposing Humor.
IEEE Trans. Vis. Comput. Graph., 2022

Decisions that Explain Themselves: A User-Centric Deep Reinforcement Learning Explanation System.
CoRR, 2022

StoryBuddy: A Human-AI Collaborative Chatbot for Parent-Child Interactive Storytelling with Flexible Parental Involvement.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

AI Chains: Transparent and Controllable Human-AI Interaction by Chaining Large Language Model Prompts.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

PromptChainer: Chaining Large Language Model Prompts through Visual Programming.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Pretty Princess vs. Successful Leader: Gender Roles in Greeting Card Messages.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Fantastic Questions and Where to Find Them: FairytaleQA - An Authentic Dataset for Narrative Comprehension.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Are Shortest Rationales the Best Explanations for Human Understanding?
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Tailor: Generating and Perturbing Text with Semantic Controls.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2021

It is AI's Turn to Ask Human a Question: Question and Answer Pair Generation for Children Storybooks in FairytaleQA Dataset.
CoRR, 2021

Polyjuice: Automated, General-purpose Counterfactual Generation.
CoRR, 2021

Beyond Accuracy: Behavioral Testing of NLP Models with Checklist (Extended Abstract).
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Principles and Interactive Tools for Evaluating and Improving the Behavior of Natural Language Processing models.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance.
Proceedings of the CHI '21: CHI Conference on Human Factors in Computing Systems, 2021

Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Tempura: Query Analysis with Structural Templates.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

No Explainability without Accountability: An Empirical Study of Explanations and Feedback in Interactive ML.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

Interactive Attention Model Explorer for Natural Language Processing Tasks with Unbalanced Data Sizes.
Proceedings of the 2020 IEEE Pacific Visualization Symposium, 2020

Beyond Accuracy: Behavioral Testing of NLP Models with CheckList.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Local Decision Pitfalls in Interactive Machine Learning: An Investigation into Feature Selection in Sentiment Analysis.
ACM Trans. Comput. Hum. Interact., 2019

Interactive Context-Aware Anomaly Detection Guided by User Feedback.
IEEE Trans. Hum. Mach. Syst., 2019

Errudite: Scalable, Reproducible, and Testable Error Analysis.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Technology-Enabled Disinformation: Summary, Lessons, and Recommendations.
CoRR, 2018

2017
NameClarifier: A Visual Analytics System for Author Name Disambiguation.
IEEE Trans. Vis. Comput. Graph., 2017

2016
PieceStack: Toward Better Understanding of Stacked Graphs.
IEEE Trans. Vis. Comput. Graph., 2016

NetworkSeer: Visual analysis for social network in MOOCs.
Proceedings of the 2016 IEEE Pacific Visualization Symposium, 2016

STAC: Enhancing stacked graphs for time series analysis.
Proceedings of the 2016 IEEE Pacific Visualization Symposium, 2016

2015
mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support.
Database J. Biol. Databases Curation, 2015


  Loading...