Welcome to the website of the editorial office of the Journal of Harbin Institute of Technology!


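Supervised by: Ministry of Industry and Information Technology of the People's Republic of China
Sponsored by: Harbin Institute of Technology
Editor-in-Chief: LI Longqiu
ISSN 0367-6234
CN 23-1235/T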


Cite this article: LI Yulong, LIANG Xinwu. Robotic grasp detection algorithm integrating attention mechanism and multi-task learning[J]. Journal of Harbin Institute of Technology, 2023, 55(12): 9. DOI: 10.11918/202212037

Robotic grasp detection algorithm integrating attention mechanism and multi-task learning
LI Yulong, LIANG Xinwu
(School of Aeronautics and Astronautics, Shanghai Jiao Tong University, Shanghai 200240, China)

DOI: 10.11918/202212037

CLC number: TP241

Document code: A

Funding: National Natural Science Foundation of China (62173230); Shanghai Science and Technology Program (22511101400)

Abstract:

Robotic grasping mainly consists of grasp detection, trajectory planning, and execution, and accurate grasp detection is the key to completing a grasping task. To achieve more accurate grasp detection and improve robot grasping performance, this paper proposes a grasp detection algorithm, built on a key point detection algorithm, that integrates attention and multi-task learning. First, in view of the characteristics of the task, a coordinate attention (CA) module is introduced at the feature extraction stage to explicitly learn channel and spatial features and make full use of feature information. Second, a multi-task weight learning algorithm is added to the loss function to learn the optimal weights for the grasp center coordinates, the gripper opening width, and the rotation angle. Finally, experiments are conducted on the Cornell dataset and the larger-scale Jacquard dataset. The results show that the proposed method is markedly faster at detection than classical methods such as sliding-window and anchor-box approaches, and more accurate than plain key point detection methods. The proposed model achieves accuracies of 98.8% and 95.7% on the two datasets, respectively. Detection examples show that the model also produces good grasps for unconventional objects, results under different Jaccard coefficient conditions show that it performs well in precise grasping, and experiments with different initial values for the weight learning algorithm indicate that it is robust. In addition, ablation experiments analyze how much each module contributes to model performance.
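The abstract does not state which multi-task weight learning scheme is used, so the following is only an illustrative sketch. It assumes a commonly used simplified form of uncertainty-based weighting, in which each task loss L_i is scaled by exp(-s_i) and a regularizing term s_i is added, with s_i a learnable scalar. The task names and loss values are hypothetical, and the losses are held fixed here so that the behavior of the weights can be checked in isolation; in real training they would change at every step and s_i would be optimized together with the network.

```python
import math

def weighted_total(losses, log_vars):
    """Total loss: sum_i exp(-s_i) * L_i + s_i (assumed weighting form)."""
    return sum(math.exp(-s) * l + s for l, s in zip(losses, log_vars))

def grad_log_vars(losses, log_vars):
    """Gradient of the total loss with respect to each s_i: 1 - exp(-s_i) * L_i."""
    return [1.0 - math.exp(-s) * l for l, s in zip(losses, log_vars)]

def learn_log_vars(losses, init, lr=0.1, steps=500):
    """Plain gradient descent on the s_i while the task losses stay fixed."""
    s = list(init)
    for _ in range(steps):
        g = grad_log_vars(losses, s)
        s = [si - lr * gi for si, gi in zip(s, g)]
    return s

# Hypothetical per-task losses: grasp center, opening width, rotation angle.
task_losses = [0.5, 2.0, 4.0]

# Two different initializations of the learnable parameters.
s_a = learn_log_vars(task_losses, [0.0, 0.0, 0.0])
s_b = learn_log_vars(task_losses, [2.0, -1.0, 2.0])

# Effective task weights exp(-s_i); for fixed losses they approach 1 / L_i.
weights = [math.exp(-s) for s in s_a]
```

For fixed losses the objective is convex in each s_i, so both initializations settle at s_i = ln(L_i): tasks with larger losses receive smaller weights, which keeps any single task from dominating the total.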

Key words: grasp detection; key point estimation; attention mechanism; learnable weights; deep learning
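The abstract reports results under different Jaccard coefficient conditions without spelling out the evaluation rule. A minimal sketch follows, assuming the rectangle metric commonly used with the Cornell and Jacquard datasets: a predicted grasp rectangle counts as correct when its angle is within 30 degrees of a ground-truth rectangle and their Jaccard index exceeds a threshold (0.25 by default). The tuple layout `(cx, cy, w, h, theta)` and all function names are illustrative choices, not taken from the paper.

```python
import math

def rect_corners(cx, cy, w, h, theta):
    """Counter-clockwise corners of a w-by-h rectangle centered at (cx, cy), rotated by theta radians."""
    c, s = math.cos(theta), math.sin(theta)
    local = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    return [(cx + x * c - y * s, cy + x * s + y * c) for x, y in local]

def polygon_area(pts):
    """Shoelace formula for a simple polygon."""
    n = len(pts)
    total = sum(pts[i][0] * pts[(i + 1) % n][1] - pts[(i + 1) % n][0] * pts[i][1] for i in range(n))
    return abs(total) / 2.0

def side(a, b, p):
    """Cross product (b - a) x (p - a); non-negative when p lies on or left of the edge a->b."""
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])

def clip(subject, clipper):
    """Sutherland-Hodgman clipping of a convex polygon by a convex counter-clockwise polygon."""
    output = subject
    for i in range(len(clipper)):
        a, b = clipper[i], clipper[(i + 1) % len(clipper)]
        current, output = output, []
        for j in range(len(current)):
            p, q = current[j - 1], current[j]
            dp, dq = side(a, b, p), side(a, b, q)
            if (dp >= 0) != (dq >= 0):
                # The segment p->q crosses the clipping edge: keep the crossing point.
                t = dp / (dp - dq)
                output.append((p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1])))
            if dq >= 0:
                output.append(q)
        if not output:
            break
    return output

def jaccard(r1, r2):
    """Intersection over union of two rotated rectangles given as (cx, cy, w, h, theta)."""
    p1, p2 = rect_corners(*r1), rect_corners(*r2)
    overlap = clip(p1, p2)
    inter = polygon_area(overlap) if len(overlap) >= 3 else 0.0
    return inter / (polygon_area(p1) + polygon_area(p2) - inter)

def is_correct(pred, truth, iou_thresh=0.25, angle_thresh=math.radians(30)):
    """Rectangle metric: angle difference (modulo 180 degrees) and Jaccard index must both pass."""
    d = abs(pred[4] - truth[4]) % math.pi
    d = min(d, math.pi - d)
    return d <= angle_thresh and jaccard(pred, truth) > iou_thresh
```

Raising `iou_thresh` makes the criterion stricter, which is one way to probe how precisely predicted rectangles match the annotations.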
