TY - GEN
T1 - Networked reinforcement learning
AU - Oku, Makito
AU - Aihara, Kazuyuki
PY - 2008
Y1 - 2008
N2 - Recently, many models of reinforcement learning with hierarchical or modular structures have been proposed. They decompose a task into simpler sub-tasks and solve them with multiple agents. In these models, however, the topological relations among agents are severely restricted. By relaxing these restrictions, we propose networked reinforcement learning, in which each agent in a network acts in parallel as if the other agents were part of the environment. Although convergence to an optimal policy is no longer guaranteed, we show by numerical simulations that our model performs well, at least in some simple situations.
KW - Hierarchical reinforcement learning
KW - Modular reinforcement learning
KW - Partially observable Markov decision process
UR - http://www.scopus.com/inward/record.url?scp=78449232010&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:78449232010
SN - 9784990288020
T3 - Proceedings of the 13th International Symposium on Artificial Life and Robotics, AROB 13th'08
SP - 469
EP - 472
BT - Proceedings of the 13th International Symposium on Artificial Life and Robotics, AROB 13th'08
T2 - 13th International Symposium on Artificial Life and Robotics, AROB 13th'08
Y2 - 31 January 2008 through 2 February 2008
ER -