期刊检索

  • 2024年第56卷
  • 2023年第55卷
  • 2022年第54卷
  • 2021年第53卷
  • 2020年第52卷
  • 2019年第51卷
  • 2018年第50卷
  • 2017年第49卷
  • 2016年第48卷
  • 2015年第47卷
  • 2014年第46卷
  • 2013年第45卷
  • 2012年第44卷
  • 2011年第43卷
  • 2010年第42卷
  • 第1期
  • 第2期

主管单位 中华人民共和国
工业和信息化部
主办单位 哈尔滨工业大学 主编 李隆球 国际刊号ISSN 0367-6234 国内刊号CN 23-1235/T

期刊网站二维码
微信公众号二维码
引用本文:张瑞琰,姜秀杰,安军社,崔天舒.PYNQ框架的高精度异构无预选框检测模型实现[J].哈尔滨工业大学学报,2022,54(5):24.DOI:10.11918/202111015
ZHANG Ruiyan,JIANG Xiujie,AN Junshe,CUI Tianshu.Realization of high-precision heterogeneous anchor-free detection model based on PYNQ framework[J].Journal of Harbin Institute of Technology,2022,54(5):24.DOI:10.11918/202111015
【打印本页】   【HTML】   【下载PDF全文】   查看/发表评论  下载PDF阅读器  关闭
过刊浏览    高级检索
本文已被:浏览 634次   下载 627 本文二维码信息
码上扫一扫!
分享到: 微信 更多
PYNQ框架的高精度异构无预选框检测模型实现
张瑞琰1,2,姜秀杰1,安军社1,崔天舒1
(1.复杂航天系统电子信息技术重点实验室(中国科学院国家空间科学中心),北京 100190; 2.中国科学院大学,北京 100049)
摘要:
由于深度卷积网络的参数量及计算量过大,多尺度目标检测网络难以快速高精度地部署在许多资源及功耗受限的平台上。为解决此问题,本文基于Python productivity for ZYNQ(PYNQ)框架实现了无预选框检测模型CTiny的IP核设计及异构系统架构部署。首先,提出在卷积核中分段量化整体缩放系数的方式,使得预训练的高精度算法低损地部署于可编程门阵列(field programmable gate array,FPGA)上;其次,基于PYNQ框架实现了CTiny模型的系统搭建,包含ResNet主干网络、反卷积网络和分支检测网络;最后,将图片预处理及后处理等耗时计算从串行的ARM端移入并行的FPGA中,进一步缩减了总处理时长。实验结果表明:在PYNQ-Z2开发板上部署CTiny模型后,本文所提量化方式在公开光学遥感数据集NWPU VHR-10的平均检测精度达到81.60%,相较于截断量化提升了14.27%,实现了部署精简无预选框检测网络的精度低损耗的需求,且后处理的处理时长由ARM端的9.228 s缩减为了FPGA端的0.008 s,提高了检测模型的速度。
关键词:  目标检测  Python productivity for ZYNQ  光学遥感图像  无预选框  整体缩放系数
DOI:10.11918/202111015
分类号:TP391.4
文献标识码:A
基金项目:中国科学院复杂航天系统电子信息技术重点实验室自主部署基金(Y42613A32S)
Realization of high-precision heterogeneous anchor-free detection model based on PYNQ framework
ZHANG Ruiyan1,2,JIANG Xiujie1,AN Junshe1,CUI Tianshu1
(1.Key Laboratory of Electronics and Information Technology for Space Systems(National Space Science Center, Chinese Academy of Sciences), Beijing 100190, China; 2.University of Chinese Academy of Sciences, Beijing 100049, China)
Abstract:
Due to the large number of parameters and large amount of calculation of deep convolutional networks, it is difficult to quickly and accurately deploy multi-scale target detection networks on many platforms with limited resources and power consumption. To solve this problem, based on the Python productivity for ZYNQ (PYNQ) framework, this paper realizes the IP core design and heterogeneous system architecture deployment of CTiny model, which is an anchor-free object detection model. First, a method of segmental quantization of the overall scaling factors in the convolution kernel was proposed, so that the pre-trained high-precision algorithm could be deployed on the field programmable gate array (FPGA) with low loss. Then, the system of the CTiny model was constructed based on the PYNQ framework, including ResNet backbone network, deconvolution network, and branch detection network. Finally, the time-consuming calculation such as picture preprocessing and post-processing was moved from serial ARM to parallel FPGA, further reducing the total processing time. Experimental results show that after deploying the CTiny model on the PYNQ-Z2 development board, the proposed quantization method achieved a mean average precision of 81.60% in the public optical remote sensing dataset NWPU VHR-10, which increased by 14.27% than truncated quantization. It has realized the requirement of deploying a tiny anchor-free object detection network with low loss. In addition, the processing time of post-processing was reduced from 9.228 s on the ARM side to 0.008 s on the FPGA side, which improved the speed of the detection model.
Key words:  object detection  Python productivity for ZYNQ  optical remote sensing image  anchor-free  overall scaling factor

友情链接LINKS