Please submit manuscripts in either of the following two submission systems

    ScholarOne Manuscripts

  • ScholarOne
  • 勤云稿件系统

  • 登录

Search by Issue

  • 2024 Vol.31
  • 2023 Vol.30
  • 2022 Vol.29
  • 2021 Vol.28
  • 2020 Vol.27
  • 2019 Vol.26
  • 2018 Vol.25
  • 2017 Vol.24
  • 2016 vol.23
  • 2015 vol.22
  • 2014 vol.21
  • 2013 vol.20
  • 2012 vol.19
  • 2011 vol.18
  • 2010 vol.17
  • 2009 vol.16
  • No.1
  • No.2

Supervised by Ministry of Industry and Information Technology of The People's Republic of China Sponsored by Harbin Institute of Technology Editor-in-chief Yu Zhou ISSNISSN 1005-9113 CNCN 23-1378/T

期刊网站二维码
微信公众号二维码
Related citation:Jun Li,Wen-Long Song,Yu-Rong He.Research of Multiagent Coordination and Cooperation Algorithm[J].Journal of Harbin Institute Of Technology(New Series),2013,20(3):109-112.DOI:10.11916/j.issn.1005-9113.2013.03.018.
【Print】   【HTML】   【PDF download】   View/Add Comment  Download reader   Close
←Previous|Next→ Back Issue    Advanced Search
This paper has been: browsed 1673times   downloaded 987times 本文二维码信息
码上扫一扫!
Shared by: Wechat More
Research of Multiagent Coordination and Cooperation Algorithm
Author NameAffiliation
Jun Li College Mechanical and Electrical, Northeast Forestry University, Harbin 150040, China 
Wen-Long Song College Mechanical and Electrical, Northeast Forestry University, Harbin 150040, China 
Yu-Rong He School of Energy Science and Engineering, Harbin Institute of Technology, Harbin 150001, China 
Abstract:
To solve the problem of conflict and deadlock with agents in multiagent system, an algorithm of multiagent coordination and cooperation was proposed. Taking agent in multiagent system as a player, the pursuit problem Markov model was built. The solution was introduced to get the optimal Nash equilibrium by multiagent reinforcement learning. The method of probability and statistics and Bayes formula was used to estimate the policy knowledge of other players. Relative mean deviation method was used to evaluate the confidence degree in order to increase the convergence speed. The simulation results on pursuit problem showed the feasibility and validity of the given algorithm.
Key words:  multiagent system  Markov games  Nash equilibrium  reinforcement learning
DOI:10.11916/j.issn.1005-9113.2013.03.018
Clc Number:TP391.9
Fund:

LINKS