Research of Multiagent Coordination and Cooperation Algorithm

Jun Li; Wen-Long Song; Yu-Rong He

Please submit manuscripts in either of the following two submission systems

ScholarOne Manuscripts

ScholarOne

勤云稿件系统

Search by Issue

Search by Keywords

News & AnnouncementMORE

【03-29】2015 Outstanding Reviewers
【03-27】2014 Outstanding Reviewers
【02-18】2013 Outstanding Reviewers
【12-29】The First Outstanding Reviewers
【05-04】Copyright Transfer Agreement
【04-04】To authors

Supervised by Ministry of Industry and Information Technology of The People's Republic of China Sponsored by Harbin Institute of Technology Editor-in-chief Yu Zhou ISSNISSN 1005-9113 CNCN 23-1378/T

期刊网站二维码

微信公众号二维码

Related citation:

Jun Li,Wen-Long Song,Yu-Rong He.Research of Multiagent Coordination and Cooperation Algorithm[J].Journal of Harbin Institute Of Technology(New Series),2013,20(3):109-112.DOI:10.11916/j.issn.1005-9113.2013.03.018.

【Print】【HTML】【PDF download】【View/Add Comment】【Download reader】【 Close 】

←Previous|Next→

Back Issue Advanced Search

This paper has been: browsed 1776times downloaded 1078times	码上扫一扫！
Shared by: Wechat More Font:larger+\|default\|smaller-
Research of Multiagent Coordination and Cooperation Algorithm

Author Name	Affiliation
Jun Li	College Mechanical and Electrical, Northeast Forestry University, Harbin 150040, China
Wen-Long Song	College Mechanical and Electrical, Northeast Forestry University, Harbin 150040, China
Yu-Rong He	School of Energy Science and Engineering, Harbin Institute of Technology, Harbin 150001, China

Abstract:

To solve the problem of conflict and deadlock with agents in multiagent system, an algorithm of multiagent coordination and cooperation was proposed. Taking agent in multiagent system as a player, the pursuit problem Markov model was built. The solution was introduced to get the optimal Nash equilibrium by multiagent reinforcement learning. The method of probability and statistics and Bayes formula was used to estimate the policy knowledge of other players. Relative mean deviation method was used to evaluate the confidence degree in order to increase the convergence speed. The simulation results on pursuit problem showed the feasibility and validity of the given algorithm.

Key words: multiagent system Markov games Nash equilibrium reinforcement learning

DOI：10.11916/j.issn.1005-9113.2013.03.018

Clc Number:TP391.9

Fund:

Search by Issue

Search by Keywords

News & AnnouncementMORE

LINKS