Please submit manuscripts in either of the following two submission systems

    ScholarOne Manuscripts

  • ScholarOne
  • 勤云稿件系统

  • 登录

Search by Issue

  • 2024 Vol.31
  • 2023 Vol.30
  • 2022 Vol.29
  • 2021 Vol.28
  • 2020 Vol.27
  • 2019 Vol.26
  • 2018 Vol.25
  • 2017 Vol.24
  • 2016 vol.23
  • 2015 vol.22
  • 2014 vol.21
  • 2013 vol.20
  • 2012 vol.19
  • 2011 vol.18
  • 2010 vol.17
  • 2009 vol.16
  • No.1
  • No.2

Supervised by Ministry of Industry and Information Technology of The People's Republic of China Sponsored by Harbin Institute of Technology Editor-in-chief Yu Zhou ISSNISSN 1005-9113 CNCN 23-1378/T

期刊网站二维码
微信公众号二维码
Related citation:Li Jun,PanQiShu.A multiagent reinforcement learning approach based on different states[J].Journal of Harbin Institute Of Technology(New Series),2010,17(3):419-423.DOI:10.11916/j.issn.1005-9113.2010.03.024.
【Print】   【HTML】   【PDF download】   View/Add Comment  Download reader   Close
←Previous|Next→ Back Issue    Advanced Search
This paper has been: browsed 1132times   downloaded 657times 本文二维码信息
码上扫一扫!
Shared by: Wechat More
A multiagent reinforcement learning approach based on different states
Author NameAffiliation
Li Jun School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China 
PanQiShu School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China 
Abstract:
In this paper we describe a new reinforcement learning approach based on different states. When the multiagent is in coordination state,we take all coordinative agents as players and choose the learning approach based on game theory. When the multiagent is in indedependent state,we make each agent use the independent learning. We demonstrate that the proposed method on the pursuit-evasion problem can solve the dimension problems induced by both the state and the action space scale exponentially with the number of agents and no convergence problems,and we compare it with other related multiagent learning methods. Simulation experiment results show the feasibility of the algorithm.
Key words:  MAS  reinforcement learning  Q-learning  pursuit-evasion problem
DOI:10.11916/j.issn.1005-9113.2010.03.024
Clc Number:TP18
Fund:

LINKS