Joyce Y. Chai

Orcid: 0000-0002-9658-2230

Affiliations:
  • University of Michigan, Ann Arbor, MI, USA
  • Michigan State University, East Lansing, USA (former)


According to our database1, Joyce Y. Chai authored at least 148 papers between 1995 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under Ambiguities.
CoRR, 2024

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning.
CoRR, 2024

Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models.
CoRR, 2024

Multi-Object Hallucination in Vision-Language Models.
CoRR, 2024

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions.
CoRR, 2024

3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination.
CoRR, 2024

LinkGPT: Teaching Large Language Models To Predict Missing Links.
CoRR, 2024

DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences.
CoRR, 2024

Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations.
CoRR, 2024

GIPCOL: Graph-Injected Soft Prompting for Compositional Zero-Shot Learning.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Groundhog Grounding Large Language Models to Holistic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inversion-Free Image Editing with Language-Guided Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Partition-Based Active Learning for Graph Neural Networks.
Trans. Mach. Learn. Res., 2023

Inversion-Free Image Editing with Natural Language.
CoRR, 2023

Efficient In-Context Learning in Vision-Language Models for Egocentric Videos.
CoRR, 2023

Natural Language Instructions for Intuitive Human Interaction with Robotic Assistants in Field Construction Work.
CoRR, 2023

BAD: BiAs Detection for Large Language Models in the context of candidate screening.
CoRR, 2023

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Pragmatic Communication with Embodied Agents.
Proceedings of the 28th International Conference on Intelligent User Interfaces, 2023

Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MetaReVision: Meta-Learning with Retrieval for Visually Grounded Compositional Concept Acquisition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

NLP Reproducibility For All: Understanding Experiences of Beginners.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

In-Context Analogical Reasoning with Pre-Trained Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Spoken language interaction with robots: Recommendations for future research.
Comput. Speech Lang., 2022

Prompting Large Pre-trained Vision-Language Models For Compositional Concept Learning.
CoRR, 2022

DANLI: Deliberative Agent for Following Natural Language Instructions.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Learning to Mediate Disparities Towards Pragmatic Communication.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models.
CoRR, 2021

Zero-Shot Compositional Concept Learning.
CoRR, 2021

Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Beyond the Tip of the Iceberg: Assessing Coherence of Text Classifiers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Experience Grounds Language.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2019
X-ToM: Explaining with Theory-of-Mind for Gaining Justified Human Trust.
CoRR, 2019

Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps.
CoRR, 2019

Commonsense Reasoning for Natural Language Understanding: A Survey of Benchmarks, Resources, and Approaches.
CoRR, 2019

Natural Language Interaction with Explainable AI Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Explainable AI as Collaborative Task Solving.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Language to Action: Towards Interactive Task Learning with Physical Agents.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Commonsense Justification for Action Explanation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Language to Action: Towards Interactive Task Learning with Physical Agents.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

What Action Causes This? Towards Naive Physical Action-Effect Prediction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Detecting clinically related content in online patient posts.
J. Biomed. Informatics, 2017

Interactive Learning of State Representation through Natural Language Instruction and Explanation.
CoRR, 2017

Interactive Learning of Grounded Verb Semantics towards Human-Robot Communication.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Collaborative Language Grounding Toward Situated Human-Robot Dialogue.
AI Mag., 2016

Grounded Semantic Role Labeling.
Proceedings of the NAACL HLT 2016, 2016

Jointly Learning Grounded Task Structures from Language Instruction and Visual Demonstration.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Program robots manufacturing tasks by natural language instructions.
Proceedings of the IEEE International Conference on Automation Science and Engineering, 2016

Incremental Acquisition of Verb Hypothesis Space towards Physical World Interaction.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Physical Causality of Action Verbs in Grounded Language Understanding.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Task Learning through Visual Demonstration and Situated Dialogue.
Proceedings of the Symbiotic Cognitive Systems, 2016

What's Hot in Human Language Technology: Highlights from NAACL HLT 2015.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Exception Handling for Natural Language Control of Robots.
Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction, 2015

Embodied Collaborative Referring Expression Generation in Situated Human-Robot Interaction.
Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction, 2015

Question Types in Online Health Communities.
Proceedings of the AMIA 2015, 2015

Learning to Mediate Perceptual Differences in Situated Human-Robot Dialogue.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Back to the Blocks World: Learning New Actions through Situated Human-Robot Dialogue.
Proceedings of the SIGDIAL 2014 Conference, 2014

Teaching Robots New Actions through Natural Language Instructions.
Proceedings of the 23rd IEEE International Symposium on Robot and Human Interactive Communication, 2014

Perceptive feedback for natural language control of robotic operations.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Collaborative effort towards common ground in situated human-robot dialogue.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2014

Probabilistic Labeling for Efficient Referential Grounding based on Collaborative Discourse.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Collaborative Models for Referring Expression Generation in Situated Dialogue.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Introduction to the special section on eye gaze and conversation.
ACM Trans. Interact. Intell. Syst., 2013

Modeling Collaborative Referring for Situated Referential Grounding.
Proceedings of the SIGDIAL 2013 Conference, 2013

Towards Situated Dialogue: Revisiting Referring Expression Generation.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Shared Gaze in Situated Referential Grounding: An Empirical Study.
Proceedings of the Eye Gaze in Intelligent User Interfaces, 2013

2012
Introduction to the special issue on eye gaze in intelligent human-machine interaction.
ACM Trans. Interact. Intell. Syst., 2012

Semantic Role Labeling of Implicit Arguments for Nominal Predicates.
Comput. Linguistics, 2012

Towards Mediating Shared Perceptual Basis in Situated Dialogue.
Proceedings of the SIGDIAL 2012 Conference, 2012

Autonomous Self-Assessment of Autocorrections: Exploring Text Message Dialogues.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Towards online adaptation and personalization of key-target resizing for mobile devices.
Proceedings of the 17th International Conference on Intelligent User Interfaces, 2012

Integrating word acquisition and referential grounding towards physical world interaction.
Proceedings of the International Conference on Multimodal Interaction, 2012

2011
Beyond Normalization: Pragmatics of Word Form in Text Messages.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

A Joint Model of Implicit Arguments for Nominal Predicates.
Proceedings of the ACL 2011 Workshop on Relational Models of Semantics, 2011

2010
Context-based Word Acquisition for Situated Dialogue in a Virtual World.
J. Artif. Intell. Res., 2010

Hand Gestures in Disambiguating Types of You Expressions in Multiparty Meetings.
Proceedings of the SIGDIAL 2010 Conference, 2010

Workshop: eye gaze in intelligent human machine interaction.
Proceedings of the 15th International Conference on Intelligent User Interfaces, 2010

Towards Conversation Entailment: An Empirical Investigation.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Fusing Eye Gaze with Speech Recognition Hypotheses to Resolve Exophoric References in Situated Dialogue.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Beyond NomBank: A Study of Implicit Arguments for Nominal Predicates.
Proceedings of the ACL 2010, 2010

Ambiguities in Spatial Language Understanding in Situated Human Robot Dialogue.
Proceedings of the Dialog with Robots, 2010

2009
What do We Know about Conversation Participants: Experiments on Conversation Entailment.
Proceedings of the SIGDIAL 2009 Conference, 2009

The Role of Interactivity in Human-Machine Conversation for Automatic Word Acquisition.
Proceedings of the SIGDIAL 2009 Conference, 2009

The Role of Implicit Argumentation in Nominal SRL.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Between linguistic attention and gaze fixations inmultimodal conversational interfaces.
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009

Communicative gestures in coreference identification in multiparty meetings.
Proceedings of the 11th International Conference on Multimodal Interfaces, 2009

2008
Beyond attention: the role of deictic gesture in intention recognition in multimodal conversational interfaces.
Proceedings of the 13th International Conference on Intelligent User Interfaces, 2008

What's in a gaze?: the role of eye-gaze in reference resolution in multimodal conversational interfaces.
Proceedings of the 13th International Conference on Intelligent User Interfaces, 2008

Incorporating Temporal and Semantic Information with Eye Gaze for Automatic Word Acquisition in Multimodal Conversational Systems.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

2007
An empirical investigation of user term feedback in text-based targeted image search.
ACM Trans. Inf. Syst., 2007

Discourse processing for context question answering based on linguistic knowledge.
Knowl. Based Syst., 2007

Michigan State University at the 2007 TREC ciQA Task.
Proceedings of The Sixteenth Text REtrieval Conference, 2007

An Exploration of Eye Gaze in Spoken Language Processing for Multimodal Conversational Interfaces.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Automated Vocabulary Acquisition and Interpretation in Multimodal Conversational Systems.
Proceedings of the ACL 2007, 2007

Eye Gaze for Attention Prediction in Multimodal Human-Machine Conversation.
Proceedings of the Interaction Challenges for Intelligent Assistants, 2007

2006
A statistical framework for query translation disambiguation.
ACM Trans. Asian Lang. Inf. Process., 2006

Cognitive Principles in Robust Multimodal Interpretation.
J. Artif. Intell. Res., 2006

Automated performance assessment in interactive QA.
Proceedings of the SIGIR 2006: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2006

Towards intelligent QA interfaces: discourse processing for context questions.
Proceedings of the 11th International Conference on Intelligent User Interfaces, 2006

Salience modeling based on non-verbal modalities for spoken language understanding.
Proceedings of the 8th International Conference on Multimodal Interfaces, 2006

Towards Conversational QA: Automatic Identification of Problematic Situations and User Intent.
Proceedings of the ACL 2006, 2006

2005
User term feedback in interactive text-based image retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

A maximum coherence model for dictionary-based cross-language information retrieval.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

Study of cross lingual information retrieval using on-line translation systems.
Proceedings of the SIGIR 2005: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2005

A Salience Driven Approach to Robust Input Interpretation in Multimodal Conversational Systems.
Proceedings of the HLT/EMNLP 2005, 2005

Linguistic theories in efficient multimodal reference resolution: an empirical investigation.
Proceedings of the 10th International Conference on Intelligent User Interfaces, 2005

Learn to weight terms in information retrieval using category information.
Proceedings of the Machine Learning, 2005

2004
An automatic weighting scheme for collaborative filtering.
Proceedings of the SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004

Performance Evaluation and Error Analysis for Multimodal Reference Resolution in a Conversation System.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Effective automatic image annotation via a coherent language model and active learning.
Proceedings of the 12th ACM International Conference on Multimedia, 2004

A probabilistic approach to reference resolution in multimodal user interfaces.
Proceedings of the 9th International Conference on Intelligent User Interfaces, 2004

MSU at ImageCLEF: Cross Language and Interactive Image Retrieval.
Proceedings of the Multilingual Information Access for Text, 2004

Regularizing translation models for better automatic image annotation.
Proceedings of the 2004 ACM CIKM International Conference on Information and Knowledge Management, 2004

Optimization in Multimodal Interpretation.
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004

2002
Natural Language Assistant: A Dialog System for Online Product Recommendation.
AI Mag., 2002

Operations for context-based multimodal interpretation in conversational systems.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Context-Based Multimodal Input Understanding in Conversational Systems.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Semantics-based Representation for Multimodal Interpretation in Conversational Systems.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

2001
The Role of a Natural Language Conversational Interface in Online Sales: A Case Study.
Int. J. Speech Technol., 2001

Conversational Sales Assistant for Online Shopping.
Proceedings of the First International Conference on Human Language Technology Research, 2001

A Conversational Interface for Online Shopping.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Natural Language Sales Assistant - A Web-Based Dialog System for Online Sales.
Proceedings of the Thirteenth Innovative Applications of Artificial Intelligence Conference, 2001

2000
Enabling technologies: natural language dialogue for personalized interaction.
Commun. ACM, 2000

Dynamic User Level and Utility Measurement for Adaptive Dialog in a Help-Desk System.
Proceedings of the SIGDIAL 2000 Workshop, 2000

Comparative Evaluation of a Natural Language Dialog Based System and a Menu Driven System for Information Access: a Case Study.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Evaluation of a Generic Lexical Semantic Resource in Information Extraction.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

A multi-modal dialog system for business transactions.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Two Dimensional Generalization in Information Extraction.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

The Use of Word Sense Disambiguation in an Information Extraction System.
Proceedings of the Sixteenth National Conference on Artificial Intelligence and Eleventh Conference on Innovative Applications of Artificial Intelligence, 1999

1997
A WordNet Based Rule Generalization Engine for Meaning Extraction System.
Proceedings of the Foundations of Intelligent Systems, 10th International Symposium, 1997

A Trainable Message Understanding System.
Proceedings of the 1997 Meeting of the ACL Special Interest Group in Natural Language Learning: Computational Natural Language Learning, 1997

Duke's Trainable Information and Meaning Extraction System (Duke TIMES).
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

Corpus Based Statistical Generalization Tree in Rule Optimization.
Proceedings of the Fifth Workshop on Very Large Corpora, 1997

The Role of WordNet in The Creation of a Trainable Message Understanding System.
Proceedings of the Fourteenth National Conference on Artificial Intelligence and Ninth Innovative Applications of Artificial Intelligence Conference, 1997

1995
A trainable system for the extraction of meaning from text.
Proceedings of the 1995 Conference of the Centre for Advanced Studies on Collaborative Research, 1995


  Loading...