Hyeong Soo Chang

Orcid: 0000-0003-3298-0018

According to our database1, Hyeong Soo Chang authored at least 57 papers between 2000 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
On Supervised Online Rolling-Horizon Control for Infinite-Horizon Discounted Markov Decision Processes.
IEEE Trans. Autom. Control., February, 2024

2021
An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems.
Autom., 2021

2020
An asymptotically optimal strategy for constrained multi-armed bandit problems.
Math. Methods Oper. Res., 2020

An Index-based Deterministic Asymptotically Optimal Algorithm for Constrained Multi-armed Bandit Problems.
CoRR, 2020

2017
Value set iteration for two-person zero-sum Markov games.
Autom., 2017

2016
Sleeping experts and bandits approach to constrained Markov decision processes.
Autom., 2016

2015
Random search for constrained Markov decision processes with multi-policy improvement.
Autom., 2015

2014
Value set iteration for Markov decision processes.
Autom., 2014

An exact iterative search algorithm for constrained Markov decision processes.
Autom., 2014

2013
Policy set iteration for Markov decision processes.
Autom., 2013

On functional equations for Kth best policies in Markov decision processes.
Autom., 2013

2012
A Stochastic Approximation Framework for a Class of Randomized Optimization Algorithms.
IEEE Trans. Autom. Control., 2012

A policy iteration heuristic for constrained discounted controlled Markov Chains.
Optim. Lett., 2012

On the Probability of Correct Selection in Ordinal Comparison over Dynamic Networks.
J. Optim. Theory Appl., 2012

Approximate stochastic annealing for online control of infinite horizon Markov decision processes.
Autom., 2012

2011
Dynamic sample budget allocation in model-based optimization.
J. Glob. Optim., 2011

2010
Adaptive Adversarial Multi-Armed Bandit Approach to Two-Person Zero-Sum Markov Games.
IEEE Trans. Autom. Control., 2010

An Approximate Stochastic Annealing algorithm for finite horizon Markov decision processes.
Proceedings of the 49th IEEE Conference on Decision and Control, 2010

2009
Decentralized Learning in Finite Markov Chains: Revisited.
IEEE Trans. Autom. Control., 2009

2008
Converging Coevolutionary Algorithm for Two-Person Zero-Sum Discounted Markov Games With Perfect Information.
IEEE Trans. Autom. Control., 2008

Finite-Step Approximation Error Bounds for Solving Average-Reward-Controlled Markov Set-Chains.
IEEE Trans. Autom. Control., 2008

A population-based cross-entropy method with dynamic sample allocation.
Proceedings of the 47th IEEE Conference on Decision and Control, 2008

2007
Recursive Learning Automata Approach to Markov Decision Processes.
IEEE Trans. Autom. Control., 2007

An Asymptotically Efficient Simulation-Based Algorithm for Finite Horizon Stochastic Dynamic Programming.
IEEE Trans. Autom. Control., 2007

Solving Controlled Markov Set-Chains With Discounting via Multipolicy Improvement.
IEEE Trans. Autom. Control., 2007

A policy improvement method for constrained average Markov decision processes.
Oper. Res. Lett., 2007

A survey of some simulation-based algorithms for Markov decision processes.
Commun. Inf. Syst., 2007

Adversarial multi-armed bandit approach to two-person zero-sum Markov games.
Proceedings of the 46th IEEE Conference on Decision and Control, 2007

2006
A Policy Improvement Method in Constrained Stochastic Dynamic Programming.
IEEE Trans. Autom. Control., 2006

Perfect information two-person zero-sum markov games with imprecise transition probabilities.
Math. Methods Oper. Res., 2006

Converging Marriage in Honey-Bees Optimization and Application to Stochastic Dynamic Programming.
J. Glob. Optim., 2006

Advances in Dynamic Games, Applications to Economics, Finance, Optimization, and Stochastic Control: A.S. Nowak, K. Szajowski (Eds.); Birkhauser, Boston, 2005, ISBN 0-8176-4362-1.
Autom., 2006

Adversarial Multi-Armed Bandit Approach to Stochastic Optimization.
Proceedings of the 45th IEEE Conference on Decision and Control, 2006

Reinforcement learning with supervision by combining multiple learnings and expert advices.
Proceedings of the American Control Conference, 2006

2005
Evolutionary policy iteration for solving Markov decision processes.
IEEE Trans. Autom. Control., 2005

Error bounds for finite step approximations for solving infinite horizon controlled Markov set-chains.
IEEE Trans. Autom. Control., 2005

An Adaptive Sampling Algorithm for Solving Markov Decision Processes.
Oper. Res., 2005

An ant system based multicasting in mobile ad hoc network.
Proceedings of the IEEE Congress on Evolutionary Computation, 2005

Recursive Learning Automata for Control of Partially Observable Markov Decision Processes.
Proceedings of the 44th IEEE IEEE Conference on Decision and Control and 8th European Control Conference Control, 2005

On Solving Controlled Markov Set-Chains via Multi-Policy Improvement.
Proceedings of the 44th IEEE IEEE Conference on Decision and Control and 8th European Control Conference Control, 2005

2004
Multi-policy iteration with a distributed voting.
Math. Methods Oper. Res., 2004

Parallel Rollout for Online Solution of Partially Observable Markov Decision Processes.
Discret. Event Dyn. Syst., 2004

An adaptation of particle swarm optimization for Markov decision processes.
Proceedings of the IEEE International Conference on Systems, 2004

An ant system based exploration-exploitation for reinforcement learning.
Proceedings of the IEEE International Conference on Systems, 2004

Localization for a class of two-team zero-sum Markov games.
Proceedings of the 43rd IEEE Conference on Decision and Control, 2004

An ant system approach to Markov decision processes.
Proceedings of the 2004 American Control Conference, 2004

2003
Two-person zero-sum Markov games: receding horizon approach.
IEEE Trans. Autom. Control., 2003

Multitime scale Markov decision processes.
IEEE Trans. Autom. Control., 2003

Parallelizing Parallel Rollout Algorithm for Solving Markov Decision Processes.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2003

On the use of flow migration for handling short-term overloads.
Proceedings of the Global Telecommunications Conference, 2003

An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems.
Proceedings of the 42nd IEEE Conference on Decision and Control, 2003

A distributed algorithm for solving a class of multi-agent Markov decision problems.
Proceedings of the 42nd IEEE Conference on Decision and Control, 2003

2002
Scheduling Multiclass Packet Streams to Minimize Weighted Loss.
Queueing Syst. Theory Appl., 2002

Receding horizon approach to Markov games for infinite horizon discounted cost.
Proceedings of the 41st IEEE Conference on Decision and Control, 2002

A model for multi-time scaled sequential decision making processes.
Proceedings of the 41st IEEE Conference on Decision and Control, 2002

2000
A framework for simulation-based network control via hindsight optimization.
Proceedings of the 39th IEEE Conference on Decision and Control, 2000

On-line Scheduling via Sampling.
Proceedings of the Fifth International Conference on Artificial Intelligence Planning Systems, 2000


  Loading...