Kee-Eung Kim

Jun Zhu

Mach. Learn., 2020

Foreword: special issue for the journal track of the 12th Asian conference on machine learning (ACML 2020).

Vineeth N. Balasubramanian

Mach. Learn., 2020

Variational Interaction Information Maximization for Cross-domain Disentanglement.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Reinforcement Learning for Control with Multiple Frequencies.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Batch Reinforcement Learning with Hyperparameter Gradients.

Proceedings of the 37th International Conference on Machine Learning, 2020

Variational Inference for Sequential Data with Future Likelihood Estimates.

Proceedings of the 37th International Conference on Machine Learning, 2020

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Residual Neural Processes.

Seunghoon Hong

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues.

Youngsoo Jang

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Bayesian optimistic Kullback-Leibler exploration.

Mach. Learn., 2019

A Machine Learning-Based Approach for the Prediction of Acute Coronary Syndrome Requiring Revascularization.

J. Medical Syst., 2019

Extensions to hybrid code networks for FAIR dialog dataset.

Comput. Speech Lang., 2019

Policy Optimization Through Approximated Importance Sampling.

CoRR, 2019

PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Trust Region Sequential Variational Inference.

Proceedings of The 11th Asian Conference on Machine Learning, 2019

2018

Cross-Language Neural Dialog State Tracker for Large Ontologies Using Hierarchical Attention.

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Monte-Carlo Tree Search for Constrained POMDPs.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Bayesian Approach to Generative Adversarial Imitation Learning.

Wonseok Jeon

Seokin Seo

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

OP-CAS: Collision Avoidance with Overtaking Maneuvers.

Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Imitation Learning via Kernel Mean Embedding.

Hyun Soo Park

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Foreword: special issue for the journal track of the 8th Asian conference on machine learning (ACML 2016).

Mach. Learn., 2017

Hybrid modeling and simulation of tactical maneuvers in computer generated force.

Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics, 2017

Generative Local Metric Learning for Kernel Regression.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Constrained Bayesian Reinforcement Learning via Approximate Linear Programming.

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Hierarchically-partitioned Gaussian Process Approximation.

Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017

2016

Dialog History Construction with Long-Short Term Memory for Robust Generative Dialog State Tracking.

Dialogue Discourse, 2016

Neural dialog state tracker for large ontologies by attention mechanism.

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Bayesian Reinforcement Learning with Behavioral Feedback.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Multi-view Automatic Lip-Reading Using Neural Network.

Daehyun Lee

Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015

Hierarchical Bayesian Inverse Reinforcement Learning.

IEEE Trans. Cybern., 2015

Information-Theoretic Bounded Rationality.

CoRR, 2015

Reactive bandits with attitude.

Pedro A. Ortega

Daniel D. Lee

Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

Approximate Linear Programming for Constrained Partially Observable Markov Decision Processes.

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Tighter Value Function Bounds for Bayesian Reinforcement Learning.

Kanghoon Lee

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

Reward Shaping for Model-Based Bayesian Reinforcement Learning.

Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014

Optimizing Generative Dialog State Tracker via Cascading Gradient Descent.

Proceedings of the SIGDIAL 2014 Conference, 2014

2013

Engineering Statistical Dialog State Trackers: A Case Study on DSTC.

Proceedings of the SIGDIAL 2013 Conference, 2013

Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning.

Proceedings of the IJCAI 2013, 2013

2012

Exploiting symmetries for single- and multi-agent Partially Observable Stochastic Domains.

Byung Kon Kang

Artif. Intell., 2012

Cost-Sensitive Exploration in Bayesian Reinforcement Learning.

Dongho Kim

Pascal Poupart

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions.

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011

Robust Performance Evaluation of POMDP-Based Dialogue Systems.

Dongho Kim

Jin H. Kim

IEEE ACM Trans. Audio Speech Lang. Process., 2011

Inverse Reinforcement Learning in Partially Observable Environments.

J. Mach. Learn. Res., 2011

A Geometric Traversal Algorithm for Reward-Uncertain MDPs.

Eunsoo Oh

Proceedings of the UAI 2011, 2011

MAP Inference for Bayesian Inverse Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

Point-Based Value Iteration for Constrained POMDPs.

Proceedings of the IJCAI 2011, 2011

Closing the Gap: Improved Bounds on Optimal POMDP Solutions.

Pascal Poupart

Dongho Kim

Proceedings of the 21st International Conference on Automated Planning and Scheduling, 2011

A POMDP-Based Optimal Control of P300-Based Brain-Computer Interfaces.

Jaeyoung Park

Yoon-Kyu Song

Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

2010

Point-Based Bounded Policy Iteration for Decentralized POMDPs.

Youngwook Kim

Proceedings of the PRICAI 2010: Trends in Artificial Intelligence, 2010

A POMDP approach to P300-based brain-computer interfaces.

Jaeyoung Park

Sungho Jo

Proceedings of the 15th International Conference on Intelligent User Interfaces, 2010

2008

Effects of user modeling on POMDP-based dialogue systems.

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Symbolic Heuristic Search Value Iteration for Factored POMDPs.

Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

Exploiting Symmetries in POMDPs for Point-Based Algorithms.

Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence, 2008

2007

Place Recognition Using Multiple Wearable Cameras.

Proceedings of the Ubiquitous Computing Systems, 4th International Symposium, 2007

2006

Hand Grip Pattern Recognition for Mobile User Interfaces.

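Proceedings of the Twenty-First National Conference on Artificial Intelligence and the Eighteenth Innovative Applications of Artificial Intelligence Conference, 2006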

2005

Variable bandwidth allocation scheme for energy efficient wireless sensor network.

SeongHwan Cho

Proceedings of IEEE International Conference on Communications, 2005

2003

Solving factored MDPs using non-homogeneous partitions.

Artif. Intell., 2003

2002

Solving Factored MDPs with Large Action Space Using Algebraic Decision Diagrams.

Proceedings of the PRICAI 2002: Trends in Artificial Intelligence, 2002

2001

Representations and Algorithms for Large Stochastic Planning Problems.

PhD thesis, 2001

Solving Factored MDPs via Non-Homogeneous Partitioning.

Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence, 2001

2000

Learning to Cooperate via Policy Search.

Leonid Peshkin

Proceedings of the UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence, Stanford University, Stanford, California, USA, June 30, 2000

Approximate Solutions to Factored Markov Decision Processes via Greedy Search in the Space of Finite State Controllers.

Proceedings of the Fifth International Conference on Artificial Intelligence Planning Systems, 2000

1999

Learning Finite-State Controllers for Partially Observable Environments.

Leonid Peshkin

Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

Solving POMDPs by Searching the Space of Finite Policies.

Anthony R. Cassandra

Proceedings of the UAI '99: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden, July 30, 1999

1998

Solving Stochastic Planning Problems with Large State and Action Spaces.

Robert Givan

Proceedings of the Fourth International Conference on Artificial Intelligence Planning Systems, 1998

Solving Very Large Weakly Coupled Markov Decision Processes.
