PYNQ框架的高精度异构无预选框检测模型实现

张瑞琰; 姜秀杰; 安军社; 崔天舒

期刊检索

关键词检索

新闻公告MORE

【03-25】投稿请提供保密审查证明
【05-04】论文版权转让协议
【07-05】出版伦理声明
【04-04】告作者书
【07-11】审稿人的职责
【11-26】《哈尔滨工业大学学报》入选中国科技期刊卓越行动计划领军期刊
【10-17】《哈工大学报》入选“第5届中国精品科技期刊”
【12-30】《哈工大学报》入选“世界学术影响力Q2期刊”
【01-03】《哈工大学报》入选“2018中国国际影响力优秀学术期刊”
【11-01】哈工大学报荣获2016、2018、2020年度“中国高校百佳科技期刊奖”
【03-24】哈工大学报10篇论文入选中国精品科技期刊顶尖学术论文
【12-05】哈工大学报2024优秀审稿专家
【12-18】哈工大学报2023优秀审稿专家
【12-24】哈工大学报2022优秀审稿专家
【12-21】哈工大学报2021优秀审稿专家
【12-10】哈工大学报2020优秀审稿专家

主管单位 中华人民共和国
工业和信息化部 主办单位 哈尔滨工业大学主编李隆球 国际刊号ISSN 0367-6234 国内刊号CN 23-1235/T

期刊网站二维码

微信公众号二维码

引用本文:	张瑞琰,姜秀杰,安军社,崔天舒.PYNQ框架的高精度异构无预选框检测模型实现[J].哈尔滨工业大学学报,2022,54(5):24.DOI:10.11918/202111015
	ZHANG Ruiyan,JIANG Xiujie,AN Junshe,CUI Tianshu.Realization of high-precision heterogeneous anchor-free detection model based on PYNQ framework[J].Journal of Harbin Institute of Technology,2022,54(5):24.DOI:10.11918/202111015

【打印本页】【HTML】【下载PDF全文】【查看/发表评论】【下载PDF阅读器】【关闭】

过刊浏览高级检索

本文已被：浏览 737次下载 744次	码上扫一扫！
分享到：微信更多字体:加大+\|默认\|缩小-
PYNQ框架的高精度异构无预选框检测模型实现
张瑞琰^1,2,姜秀杰¹,安军社¹,崔天舒¹
(1.复杂航天系统电子信息技术重点实验室(中国科学院国家空间科学中心),北京 100190; 2.中国科学院大学,北京 100049)

摘要:

由于深度卷积网络的参数量及计算量过大,多尺度目标检测网络难以快速高精度地部署在许多资源及功耗受限的平台上。为解决此问题,本文基于Python productivity for ZYNQ(PYNQ)框架实现了无预选框检测模型CTiny的IP核设计及异构系统架构部署。首先,提出在卷积核中分段量化整体缩放系数的方式,使得预训练的高精度算法低损地部署于可编程门阵列(field programmable gate array,FPGA)上；其次,基于PYNQ框架实现了CTiny模型的系统搭建,包含ResNet主干网络、反卷积网络和分支检测网络；最后,将图片预处理及后处理等耗时计算从串行的ARM端移入并行的FPGA中,进一步缩减了总处理时长。实验结果表明:在PYNQ-Z2开发板上部署CTiny模型后,本文所提量化方式在公开光学遥感数据集NWPU VHR-10的平均检测精度达到81.60%,相较于截断量化提升了14.27%,实现了部署精简无预选框检测网络的精度低损耗的需求,且后处理的处理时长由ARM端的9.228 s缩减为了FPGA端的0.008 s,提高了检测模型的速度。

关键词: 目标检测 Python productivity for ZYNQ 光学遥感图像无预选框整体缩放系数

DOI：10.11918/202111015

分类号:TP391.4

文献标识码:A

基金项目:中国科学院复杂航天系统电子信息技术重点实验室自主部署基金(Y42613A32S)

Realization of high-precision heterogeneous anchor-free detection model based on PYNQ framework

ZHANG Ruiyan^1,2,JIANG Xiujie¹,AN Junshe¹,CUI Tianshu¹

(1.Key Laboratory of Electronics and Information Technology for Space Systems(National Space Science Center, Chinese Academy of Sciences), Beijing 100190, China; 2.University of Chinese Academy of Sciences, Beijing 100049, China)

Abstract:

Due to the large number of parameters and large amount of calculation of deep convolutional networks, it is difficult to quickly and accurately deploy multi-scale target detection networks on many platforms with limited resources and power consumption. To solve this problem, based on the Python productivity for ZYNQ (PYNQ) framework, this paper realizes the IP core design and heterogeneous system architecture deployment of CTiny model, which is an anchor-free object detection model. First, a method of segmental quantization of the overall scaling factors in the convolution kernel was proposed, so that the pre-trained high-precision algorithm could be deployed on the field programmable gate array (FPGA) with low loss. Then, the system of the CTiny model was constructed based on the PYNQ framework, including ResNet backbone network, deconvolution network, and branch detection network. Finally, the time-consuming calculation such as picture preprocessing and post-processing was moved from serial ARM to parallel FPGA, further reducing the total processing time. Experimental results show that after deploying the CTiny model on the PYNQ-Z2 development board, the proposed quantization method achieved a mean average precision of 81.60% in the public optical remote sensing dataset NWPU VHR-10, which increased by 14.27% than truncated quantization. It has realized the requirement of deploying a tiny anchor-free object detection network with low loss. In addition, the processing time of post-processing was reduced from 9.228 s on the ARM side to 0.008 s on the FPGA side, which improved the speed of the detection model.

Key words: object detection Python productivity for ZYNQ optical remote sensing image anchor-free overall scaling factor

期刊检索

关键词检索

新闻公告MORE

友情链接LINKS