Kazuteru Miyazaki

Proceedings of the PRIMA 2018: Principles and Practice of Multi-Agent Systems - 21st International Conference, Tokyo, Japan, October 29, 2018

Proposal and Evaluation of an Indirect Reward Assignment Method for Reinforcement Learning by Profit Sharing Method.

[BibT_eX]

[DOI]

Naoki Kodama

Proceedings of the Intelligent Systems and Applications, 2018

A Proposal for Reducing the Number of Trial-and-Error Searches for Deep Q-Networks Combined with Exploitation-Oriented Learning.

[BibT_eX]

[DOI]

Naoki Kodama

Taku Harada

Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018

2017

Proposal of PSwithEFP and its Evaluation in Multi-Agent Reinforcement Learning.

[BibT_eX]

[DOI]

Koudai Furukawa

J. Adv. Comput. Intell. Intell. Informatics, 2017

Exploitation-Oriented Learning with Deep Learning - Introducing Profit Sharing to a Deep Q-Network -.

[BibT_eX]

[DOI]

J. Adv. Comput. Intell. Intell. Informatics, 2017

Proposal of a Deep Q-network with Profit Sharing.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual International Conference on Biologically Inspired Cognitive Architectures, 2017

2016

Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multi-agent Learning.

[BibT_eX]

[DOI]

Koudai Furukawa

Proceedings of the IEEE International Conference on Agents, 2016

Proposal of an Action Selection Strategy with Expected Failure Probability and Its Evaluation in Multi-agent Reinforcement Learning.

[BibT_eX]

[DOI]

Koudai Furukawa

Proceedings of the Multi-Agent Systems and Agreement Technologies, 2016

A Study of an Indirect Reward on Multi-agent Environments.

[BibT_eX]

[DOI]

Proceedings of the 7th Annual International Conference on Biologically Inspired Cognitive Architectures, 2016

2014

The Necessity of a Secondary System in Machine Consciousness.

[BibT_eX]

[DOI]

Jun'ichi Takeno

Proceedings of the 5th Annual International Conference on Biologically Inspired Cognitive Architectures, 2014

2013

Proposal of an Exploitation-oriented Learning Method on Multiple Rewards and Penalties Environments and the Design Guideline.

[BibT_eX]

[DOI]

J. Comput., 2013

2012

Proposal of the Continuous-Valued Penalty Avoiding Rational Policy Making Algorithm.

[BibT_eX]

[DOI]

J. Adv. Comput. Intell. Intell. Informatics, 2012

Introduction of Fixed Mode States into Online Reinforcement Learning with Penalties and Rewards and its Application to Biped Robot Waist Trajectory Generation.

[BibT_eX]

[DOI]

Seiya Kuroda

J. Adv. Comput. Intell. Intell. Informatics, 2012

Proposal of an Active Course Classification Support system with Exploitation-oriented Learning extended by positive and negative examples.

[BibT_eX]

[DOI]

Masaaki Ida

Proceedings of the 6th International Conference on Soft Computing and Intelligent Systems (SCIS), 2012

Evaluation of the Improved Penalty Avoiding Rational Policy Making Algorithm in Real World Environment.

[BibT_eX]

[DOI]

Masaki Itou

Proceedings of the Intelligent Information and Database Systems - 4th Asian Conference, 2012

2011

Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning.

[BibT_eX]

[DOI]

Masaaki Ida

Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot.

[BibT_eX]

[DOI]

Seiya Kuroda

Proceedings of the Recent Advances in Reinforcement Learning - 9th European Workshop, 2011

2010

The Penalty Avoiding Rational Policy Making Algorithm in Continuous Action Spaces.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Data Engineering and Automated Learning, 2010

2009

A New Improved Penalty Avoiding Rational Policy Making Algorithm for Keepaway with Continuous State Spaces.

[BibT_eX]

[DOI]

Takuji Watanabe

J. Adv. Comput. Intell. Intell. Informatics, 2009

Exploitation-Oriented Learning PS-r#.

[BibT_eX]

[DOI]

J. Adv. Comput. Intell. Intell. Informatics, 2009

2008

Proposal of Exploitation-Oriented Learning PS-r#.

[BibT_eX]

[DOI]

Proceedings of the Intelligent Data Engineering and Automated Learning, 2008

2007

Reinforcement Learning for Penalty Avoidance in Continuous State Spaces.

[BibT_eX]

[DOI]

J. Adv. Comput. Intell. Intell. Informatics, 2007

2006

Multi User Learning Agent on the Distribution of MDPs.

[BibT_eX]

[DOI]

Daisuke Katagami

Katsumi Nitta

Proceedings of the 15th IEEE International Symposium on Robot and Human Interactive Communication, 2006

2004

Development of a reinforcement learning system to play Othello.

[BibT_eX]

[DOI]

Sougo Tsuboi

Artif. Life Robotics, 2004

2001

Rationality of Reward Sharing in Multi-agent Reinforcement Learning.

[BibT_eX]

[DOI]

New Gener. Comput., 2001

2000

Reinforcement learning for penalty avoiding policy making.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Systems, 2000

1999

Multi-agent Reinforcement Learning for Crane Control Problem: Designing Rewards for Conflict Resolution.

[BibT_eX]

[DOI]

Sachiyo Arai

Proceedings of the Fourth International Symposium on Autonomous Decentralized Systems, 1999

1997

k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning.

[BibT_eX]

[DOI]

Masayuki Yamamura

Artif. Intell., 1997

Reinforcement Learning in POMDPs with Function Approximation.

[BibT_eX]

Hajime Kimura