Kee-Eung Kim

According to our database1, Kee-Eung Kim authored at least 101 papers between 1998 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models.
CoRR, 2024

SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization.
CoRR, 2024

Diversification of Adaptive Policy for Effective Offline Reinforcement Learning.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

A Submodular Optimization Approach to Accountable Loan Approval.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Stitching Sub-trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Trustworthy residual vehicle value prediction for auto finance.
AI Mag., December, 2023

Adapting Text-based Dialogue State Tracker for Spoken Dialogues.
CoRR, 2023

Regularized Behavior Cloning for Blocking the Leakage of Past Action Information.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Information-Theoretic State Space Model for Multi-View Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation.
CoRR, 2022

Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning to Embed Multi-Modal Contexts for Situated Conversational Agents.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Data Augmentation for Learning to Play in Text-Based Games.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

PAC-Net: A Model Pruning Approach to Inductive Transfer Learning.
Proceedings of the International Conference on Machine Learning, 2022

DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations.
Proceedings of the Tenth International Conference on Learning Representations, 2022

GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Corrigendum to 'Extensions to Hybrid Code Networks for FAIR Dialog Data' Computer Speech & Language volume 53 (2019) Pages 80-91.
Comput. Speech Lang., 2021

Augment & Valuate : A Data Enhancement Pipeline for Data-Centric AI.
CoRR, 2021

Multi-View Representation Learning via Total Correlation Objective.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation.
Proceedings of the 38th International Conference on Machine Learning, 2021

Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic.
Proceedings of the 9th International Conference on Learning Representations, 2021

Monte-Carlo Planning and Learning with Language Action Value Estimates.
Proceedings of the 9th International Conference on Learning Representations, 2021

Representation Balancing Offline Model-based Reinforcement Learning.
Proceedings of the 9th International Conference on Learning Representations, 2021

Personalized Treatment Using Biologics: An Analysis Using Counterfactual Regression Based on Deep Learning.
Proceedings of the 42nd International Conference on Information Systems, 2021

Dual Correction Strategy for Ranking Distillation in Top-N Recommender System.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Layered Behavior Modeling via Combining Descriptive and Prescriptive Approaches: A Case Study of Infantry Company Engagement.
IEEE Trans. Syst. Man Cybern. Syst., 2020

Foreword: special issue for the journal track of the 11th Asian Conference on Machine Learning (ACML 2019).
Mach. Learn., 2020

Foreword: special issue for the journal track of the 12th Asian conference on machine learning (ACML 2020).
Mach. Learn., 2020

Variational Interaction Information Maximization for Cross-domain Disentanglement.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforcement Learning for Control with Multiple Frequencies.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Batch Reinforcement Learning with Hyperparameter Gradients.
Proceedings of the 37th International Conference on Machine Learning, 2020

Variational Inference for Sequential Data with Future Likelihood Estimates.
Proceedings of the 37th International Conference on Machine Learning, 2020

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Residual Neural Processes.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Bayesian optimistic Kullback-Leibler exploration.
Mach. Learn., 2019

A Machine Learning-Based Approach for the Prediction of Acute Coronary Syndrome Requiring Revascularization.
J. Medical Syst., 2019

Extensions to hybrid code networks for FAIR dialog dataset.
Comput. Speech Lang., 2019

Policy Optimization Through Approximated Importance Sampling.
CoRR, 2019

PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Trust Region Sequential Variational Inference.
Proceedings of The 11th Asian Conference on Machine Learning, 2019

2018
Cross-Language Neural Dialog State Tracker for Large Ontologies Using Hierarchical Attention.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Monte-Carlo Tree Search for Constrained POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Bayesian Approach to Generative Adversarial Imitation Learning.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

OP-CAS: Collision Avoidance with Overtaking Maneuvers.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Imitation Learning via Kernel Mean Embedding.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Foreword: special issue for the journal track of the 8th Asian conference on machine learning (ACML 2016).
Mach. Learn., 2017

Hybrid modeling and simulation of tactical maneuvers in computer generated force.
Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics, 2017

Generative Local Metric Learning for Kernel Regression.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Constrained Bayesian Reinforcement Learning via Approximate Linear Programming.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Hierarchically-partitioned Gaussian Process Approximation.
Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016
Dialog History Construction with Long-Short Term Memory for Robust Generative Dialog State Tracking.
Dialogue Discourse, 2016

Neural dialog state tracker for large ontologies by attention mechanism.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Bayesian Reinforcement Learning with Behavioral Feedback.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Multi-view Automatic Lip-Reading Using Neural Network.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Hierarchical Bayesian Inverse Reinforcement Learning.
IEEE Trans. Cybern., 2015

Information-Theoretic Bounded Rationality.
CoRR, 2015

Reactive bandits with attitude.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Tighter Value Function Bounds for Bayesian Reinforcement Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Reward Shaping for Model-Based Bayesian Reinforcement Learning.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
Optimizing Generative Dialog State Tracker via Cascading Gradient Descent.
Proceedings of the SIGDIAL 2014 Conference, 2014

2013
Engineering Statistical Dialog State Trackers: A Case Study on DSTC.
Proceedings of the SIGDIAL 2013 Conference, 2013

Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning.
Proceedings of the IJCAI 2013, 2013

2012
Exploiting symmetries for single- and multi-agent Partially Observable Stochastic Domains.
Artif. Intell., 2012

Cost-Sensitive Exploration in Bayesian Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011
Robust Performance Evaluation of POMDP-Based Dialogue Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2011

Inverse Reinforcement Learning in Partially Observable Environments.
J. Mach. Learn. Res., 2011

A Geometric Traversal Algorithm for Reward-Uncertain MDPs.
Proceedings of the UAI 2011, 2011

MAP Inference for Bayesian Inverse Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Point-Based Value Iteration for Constrained POMDPs.
Proceedings of the IJCAI 2011, 2011

Closing the Gap: Improved Bounds on Optimal POMDP Solutions.
Proceedings of the 21st International Conference on Automated Planning and Scheduling, 2011

A POMDP-Based Optimal Control of P300-Based Brain-Computer Interfaces.
Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010
Point-Based Bounded Policy Iteration for Decentralized POMDPs.
Proceedings of the PRICAI 2010: Trends in Artificial Intelligence, 2010

A POMDP approach to P300-based brain-computer interfaces.
Proceedings of the 15th International Conference on Intelligent User Interfaces, 2010

2008
Effects of user modeling on POMDP-based dialogue systems.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Symbolic Heuristic Search Value Iteration for Factored POMDPs.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

Exploiting Symmetries in POMDPs for Point-Based Algorithms.
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007
Place Recognition Using Multiple Wearable Cameras.
Proceedings of the Ubiquitous Computing Systems, 4th International Symposium, 2007

2006
Hand Grip Pattern Recognition for Mobile User Interfaces.
Proceedings of the Proceedings, 2006

2005
Variable bandwidth allocation scheme for energy efficient wireless sensor network.
Proceedings of IEEE International Conference on Communications, 2005

2003
Solving factored MDPs using non-homogeneous partitions.
Artif. Intell., 2003

2002
Solving Factored MDPs with Large Action Space Using Algebraic Decision Diagrams.
Proceedings of the PRICAI 2002: Trends in Artificial Intelligence, 2002

2001
Representations and Algorithms for Large Stochastic Planning Problems.
PhD thesis, 2001

Solving Factored MDPs via Non-Homogeneous Partitioning.
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

2000
Learning to Cooperate via Policy Search.
Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

Approximate Solutions to Factored Markov Decision Processes via Greedy Search in the Space of Finite State Controllers.
Proceedings of the Fifth International Conference on Artificial Intelligence Planning Systems, 2000

1999
Learning Finite-State Controllers for Partially Observable Environments.
Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Solving POMDPs by Searching the Space of Finite Policies.
Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

1998
Solving Stochastic Planning Problems with Large State and Action Spaces.
Proceedings of the Fourth International Conference on Artificial Intelligence Planning Systems, 1998

Solving Very Large Weakly Coupled Markov Decision Processes.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998


  Loading...