引用本文: | 王庆文,史浩山,戚茜.Ad Hoc网络Q学习稳定蚁群路由算法[J].哈尔滨工业大学学报,2012,44(7):120.DOI:10.11918/j.issn.0367-6234.2012.07.023 |
| WANG Qing-wen,SHI Hao-shan,QI Qian.A stable ant colony routing algorithm based on Q-learning for Ad Hoc Networks[J].Journal of Harbin Institute of Technology,2012,44(7):120.DOI:10.11918/j.issn.0367-6234.2012.07.023 |
|
摘要: |
针对Ad Hoc网络路由协议存在的对动态拓扑适应性差和链路不稳定问题,提出了一种Q学习稳定蚁群路由算法(SACRQ),该算法综合了蚁群优化和Q学习算法的思想,将信息素映射为Q学习算法的Q值,增强节点对动态环境的学习能力.在路由选择方面,使用自适应伪随机比率选择下一跳节点,避免算法陷入局部最优或是停滞;提出了新的链路稳定度来衡量链路的鲁棒性,结合鲁棒性和信息素强度两种因素选择下一跳链路.该算法增加了链路的鲁棒性,对Ad Hoc网络动态拓扑适应性强.仿真结果表明,SACRQ的路由发现数量、平均端对端延迟、冲突数量和每次路由发现吞吐量4种指标均优于ARA和AODV. |
关键词: Ad Hoc网络 Q学习 蚁群 路由算法 鲁棒性 |
DOI:10.11918/j.issn.0367-6234.2012.07.023 |
分类号:TP393 |
基金项目:教育部博士点基金资助项目(20050699037);
国家自然科学基金资助项目 (60472074). |
|
A stable ant colony routing algorithm based on Q-learning for Ad Hoc Networks |
WANG Qing-wen1,2, SHI Hao-shan2, QI Qian3
|
1.Dept. of Aerospace Engineering, The Second Artillery Engineering University, 710025 Xi'an,China;2.School of Electronic Engineering, Northwestern Polytechnical University, 710129 Xi'an, China;3.School of Marine, Northwestern Polytechnical University, 710072 Xi'an, China
|
Abstract: |
To solve the problem of poor flexibility and frequent route breakage caused by dynamic topology in Ad Hoc network routing protocols, a stable ant colony routing algorithm based on Q-learning (SACRQ) is proposed, which synthesizes the Ant Colony Optimization and the Q-learning algorithm. The pheromone level is equal to the Q value to enhance the learning ability of nodes. To avoid local peak, SARCQ applies an adaptive pseudo random proportional action choice rule to select the next hop. A new robustness of the links metric is presented to calculate the probability of the route selection together with the pheromone level. The algorithm enhances the stability of the links and demonstrates high flexibility to the dynamic topology of the network. Simulation results show that SACRQ achieves better performance in terms of the number of the route discovery, the average end-to-end delay, the number of collisions and the average throughput per route discovery, which is respectively compared with the ARA and AODV. |
Key words: Ad Hoc networks Q-learning ant colony optimization routing algorithm robustness |