Alice Oh

Orcid: 0000-0002-7884-3038

Affiliations:
  • Korea Advanced Institute of Science and Technology (KAIST), Computer Science Department, Daejeon, South Korea
  • Massachusetts Institute of Technology (MIT), Cambridge, MA, USA (PhD 2008)


According to our database1, Alice Oh authored at least 122 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
KoBBQ: Korean Bias Benchmark for Question Answering.
Trans. Assoc. Comput. Linguistics, 2024

Survey of Cultural Awareness in Language Models: Text and Beyond.
CoRR, 2024

LLM-Driven Learning Analytics Dashboard for Teachers in EFL Writing Education.
CoRR, 2024

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines.
CoRR, 2024

Machine Learning Approach to Brain Tumor Detection and Classification.
CoRR, 2024

Uncovering Factor Level Preferences to Improve Human-Model Alignment.
CoRR, 2024

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages.
CoRR, 2024

Designing Prompt Analytics Dashboards to Analyze Student-ChatGPT Interactions in EFL Writing.
CoRR, 2024

BEnQA: A Question Answering and Reasoning Benchmark for Bengali and English.
CoRR, 2024

Multi-FAct: Assessing Multilingual LLMs' Multi-Regional Knowledge using FActScore.
CoRR, 2024

DREsS: Dataset for Rubric-based Essay Scoring on EFL Writing.
CoRR, 2024

Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Translating Subgraphs to Nodes Makes Simple GNNs Strong and Efficient for Subgraph Representation Learning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Learning from Teaching Assistants to Formulate Subgoals for Programming Tasks: Exploring the Potential for AI Teaching Assistants.
Proceedings of the Joint Proceedings of the Human-Centric eXplainable AI in Education and the Leveraging Large Language Models for Next Generation Educational Technologies Workshops (HEXED-L3MNGET 2024) co-located with 17th International Conference on Educational Data Mining (EDM 2024), 2024

CHOP: Integrating ChatGPT into EFL Oral Presentation Practice.
Proceedings of the Joint Proceedings of the Human-Centric eXplainable AI in Education and the Leveraging Large Language Models for Next Generation Educational Technologies Workshops (HEXED-L3MNGET 2024) co-located with 17th International Conference on Educational Data Mining (EDM 2024), 2024

The Generative AI Paradox in Evaluation: "What It Can Solve, It May Not Evaluate".
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

BEnQA: A Question Answering Benchmark for Bengali and English.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Cultural Differences in Students' Privacy Concerns in Learning Analytics across Germany, South Korea, Spain, Sweden, and the United States.
CoRR, 2023

Peer Reviews of Peer Reviews: A Randomized Controlled Trial and Other Experiments.
CoRR, 2023

FABRIC: Automated Scoring and Feedback Generation for Essays.
CoRR, 2023

Learning from Teaching Assistants to Program with Subgoals: Exploring the Potential for AI Teaching Assistants.
CoRR, 2023

CReHate: Cross-cultural Re-annotation of English Hate Speech Dataset.
CoRR, 2023

EliRank: A Code Editing History Based Ranking Model for Early Detection of Students in Need.
Proceedings of the Tenth ACM Conference on Learning @ Scale, 2023

RECIPE: How to Integrate ChatGPT into EFL Writing Education.
Proceedings of the Tenth ACM Conference on Learning @ Scale, 2023

The Role of Gender in Students' Privacy Concerns about Learning Analytics: Evidence from five countries.
Proceedings of the LAK 2023: 13th International Learning Analytics and Knowledge Conference, 2023

Time-Aware Representation Learning for Time-Sensitive Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Rethinking Annotation: Can Language Learners Contribute?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Ranking-Enhanced Unsupervised Sentence Representation Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created through Human-Machine Collaboration.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Translating Hanja historical documents to understandable Korean and English.
CoRR, 2022

Efficient Representation Learning of Subgraphs by Subgraph-To-Node Translation.
CoRR, 2022

HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

CS1QA: A Dataset for Assisting Code-based Question Answering in an Introductory Programming Course.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Translating Hanja Historical Documents to Contemporary Korean and English.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

KOLD: Korean Offensive Language Dataset.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Virtual Knowledge Graph Construction for Zero-Shot Domain-Specific Document Retrieval.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Models and Benchmarks for Representation Learning of Partially Observed Subgraphs.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

Two-Step Question Retrieval for Open-Domain QA.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Cocode: Providing Social Presence with Co-learner Screen Sharing in Online Programming Classes.
Proc. ACM Hum. Comput. Interact., 2021

KLUE: Korean Language Understanding Evaluation.
CoRR, 2021

Pythonpad: Server-free Python Hands-on Exercise for Online Programming Classes.
Proceedings of the SIGCSE '21: The 52nd ACM Technical Symposium on Computer Science Education, 2021


Emergent Communication under Varying Sizes and Connectivities.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

How to Find Your Friendly Neighborhood: Graph Attention Design with Self-Supervision.
Proceedings of the 9th International Conference on Learning Representations, 2021

Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Dimensional Emotion Detection from Categorical Emotion.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Learning Bill Similarity with Annotated and Augmented Corpora of Bills.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Knowledge-Enhanced Evidence Retrieval for Counterargument Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Mitigating Language-Dependent Ethnic Bias in BERT.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Weakly Supervised Pre-Training for Multi-Hop Retriever.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Denoising Recurrent Neural Networks for Classifying Crash-Related Events.
IEEE Trans. Intell. Transp. Syst., 2020

K-EmoCon, a multimodal sensor dataset for continuous emotion recognition in naturalistic conversations.
CoRR, 2020

Detecting Contract Cheaters in Online Programming Classes with Keystroke Dynamics.
Proceedings of the L@S'20: Seventh ACM Conference on Learning @ Scale, 2020

Context-Aware Answer Extraction in Question Answering.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Suicidal Risk Detection for Military Personnel.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Cocode: Co-learner Screen Sharing for Social Translucence in Online Programming Courses.
Proceedings of the Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, 2020

Speaker Sensitive Response Evaluation Model.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Toward Dimensional Emotion Detection from Categorical Emotion Annotations.
CoRR, 2019

Homogeneity-Based Transmissive Process to Model True and False News in Social Networks.
Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, 2019

Conversation Model Fine-Tuning for Classifying Client Utterances in Counseling Dialogues.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

automaTA: Human-Machine Interaction for Answering Context-Specific Questions.
Proceedings of the Sixth ACM Conference on Learning @ Scale, 2019

Variational Hierarchical User-based Conversation Model.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Additive Compositionality of Word Vectors.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

2018
Leveraging the Crowd to Detect and Reduce the Spread of Fake News and Misinformation.
Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018

Non-Linear Editing of Text-Based Screencasts.
Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology, 2018

Elicast: embedding interactive exercises in instructional programming screencasts.
Proceedings of the Fifth Annual ACM Conference on Learning at Scale, 2018

Hierarchical Dirichlet Gaussian Marked Hawkes Process for Narrative Reconstruction in Continuous Time Domain.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Conversational Decision Making Model for Predicting King's Decision in the Annals of the Joseon Dynasty.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Subword-level Word Vector Representations for Korean.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Joint Modeling of Topics, Citations, and Topical Authority in Academic Corpora.
Trans. Assoc. Comput. Linguistics, 2017

Hierarchical Dirichlet scaling process.
Mach. Learn., 2017

Non-Linear Editor for Text-Based Screencast.
Proceedings of the Adjunct Publication of the 30th Annual ACM Symposium on User Interface Software and Technology, 2017

Rotated Word Vector Representations and their Interpretability.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Eliph: Effective Visualization of Code History for Peer Assessment in Programming Education.
Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, 2017

Analysis of the Effect of Competition on Player Immersion and Engagement in a Mobile Game.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

An adaptive vocabulary learning application through modeling learner's linguistic proficiency and interests.
Proceedings of the 2017 IEEE International Conference on Big Data and Smart Computing, 2017

2016
Elivate: A Real-Time Assistant for Students and Lecturers as Part of an Online CS Education Platform.
Proceedings of the Third ACM Conference on Learning @ Scale, 2016

Elice: An online CS Education Platform to Understand How Students Learn Programming.
Proceedings of the Third ACM Conference on Learning @ Scale, 2016

How to Compete Online for News Audience: Modeling Words that Attract Clicks.
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

Topical Interest and Degree of Involvement of Bilingual Editors in Wikipedia.
Proceedings of the Wiki, 2016

The Proficiency-Congruency Dilemma: Virtual Team Design and Performance in Multiplayer Online Games.
Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 2016

2015
Understanding Editing Behaviors in Multilingual Wikipedia.
CoRR, 2015

Five Centuries of Monarchy in Korea: Mining the Text of the Annals of the Joseon Dynasty.
Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, 2015

Towards Understanding Relational Orientation: Attachment Theory and Facebook Activities.
Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing, 2015

How WEIRD is HCI?: Extending HCI Principles to other Countries and Cultures.
Proceedings of the 33rd Annual ACM Conference Extended Abstracts on Human Factors in Computing Systems, 2015

Social Media Dynamics of Global Co-presence During the 2014 FIFA World Cup.
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

2014
Effective ranking and search techniques for Web resources considering semantic relationships.
Inf. Process. Manag., 2014

A computational analysis of agenda setting.
Proceedings of the 23rd International World Wide Web Conference, 2014

Sociolinguistic analysis of Twitter in multilingual societies.
Proceedings of the 25th ACM Conference on Hypertext and Social Media, 2014

Self-disclosure topic model for classifying and analyzing Twitter conversations.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013
Context-Dependent Conceptualization.
Proceedings of the IJCAI 2013, 2013

A Hierarchical Aspect-Sentiment Model for Online Reviews.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
"Discovering emotion influence patterns in online social network conversations" by Suin Kim, JinYeong Bak, and Alice Oh, with Ching-man Au Yeung as coordinator.
SIGWEB Newsl., 2012

Bayesian Group Nonnegative Matrix Factorization for EEG Analysis
CoRR, 2012

Summarizing Reviews with Variable-length Syntactic Patterns and Topic Models
CoRR, 2012

Variable Selection for Latent Dirichlet Allocation
CoRR, 2012

Do You Feel What I Feel? Social Aspects of Emotions in Twitter Conversations.
Proceedings of the Sixth International Conference on Weblogs and Social Media, 2012

Dirichlet Process with Mixed Random Measures: A Nonparametric Topic Model for Labeled Data.
Proceedings of the 29th International Conference on Machine Learning, 2012

Modeling topic hierarchies with the recursive chinese restaurant process.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Self-Disclosure and Relationship Strength in Twitter Conversations.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Aspect and sentiment unification model for online review analysis.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

Analyzing social media in escalating crisis situations.
Proceedings of the 2011 IEEE International Conference on Intelligence and Security Informatics, 2011

Accounting for data dependencies within a hierarchical dirichlet process mixture model.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Topic Chains for Understanding a News Corpus.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2011

2010
Learning Influence Propagation of Personal Blogs with Content and Network Analyses.
Proceedings of the 2010 IEEE Second International Conference on Social Computing, 2010

Users' needs for social tagging and sharing on mobile contacts.
Proceedings of the 12th Conference on Human-Computer Interaction with Mobile Devices and Services, 2010

iLight: information flashlight on objects using handheld projector.
Proceedings of the 28th International Conference on Human Factors in Computing Systems, 2010

2009
User Evaluation of a System for Classifying and Displaying Political Viewpoints of Weblogs.
Proceedings of the Third International Conference on Weblogs and Social Media, 2009

Temporal Issue Trend Identifications in Blogs.
Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, 2009

2008
Generating multiple summaries based on computational model of perspective.
PhD thesis, 2008

Generating Baseball Summaries from Multiple Perspectives by Reordering Content.
Proceedings of the INLG 2008, 2008

2002
Stochastic natural language generation for spoken dialog systems.
Comput. Speech Lang., 2002

Face-Responsive Interfaces: From Direct Manipulation to Perceptive Presence.
Proceedings of the UbiComp 2002: Ubiquitous Computing, 4th International Conference, Göteborg, Sweden, September 29, 2002

Evaluating look-to-talk: a gaze-aware interface in a collaborative environment.
Proceedings of the Extended abstracts of the 2002 Conference on Human Factors in Computing Systems, 2002

2000
Task and domain specific modelling in the Carnegie Mellon communicator system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Creating natural dialogs in the carnegie mellon communicator system.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999


  Loading...