Bulletin of Surveying and Mapping ›› 2021, Vol. 0 ›› Issue (9): 21-27. doi: 10.13474/j.cnki.11-2246.2021.0267

• Academic Research •

An optimized Faster R-CNN small target detection method

CHENG Rui1, GAO Jian1, XING Qiang2, SUN Zhongchang2

  1. College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;
  2. Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
  • Received: 2020-09-02 Revised: 2021-03-19 Online: 2021-09-25 Published: 2021-10-11
  • Corresponding author: XING Qiang. E-mail: xingqiang@aircas.ac.cn
  • About the author: CHENG Rui (1994-), male, master's student, mainly engaged in remote sensing digital image processing. E-mail: 1218012332@njupt.edu.cn
  • Funding:
    Strategic Priority Research Program of the Chinese Academy of Sciences (XDA19090121; XDA19030104)

Abstract: Image object detection is an active research area in computer vision and digital image processing. Its main task is to find the objects of interest in an image and determine their location and category. Deep learning models now underlie the mainstream object detection algorithms, and applying them to problems in many disciplines has become a trend. This paper combines the Faster R-CNN (faster region-based convolutional neural network) deep learning algorithm with related image processing algorithms, using ResNet50 and ResNet101 as backbone networks together with a feature pyramid network, to monitor changes in vehicle numbers in Wuhan during the COVID-19 epidemic and thereby analyze the intensity of activity within the city. The results show that the proposed vehicle detection method achieves a precision of 0.96, a recall of 0.915, and an average precision of 0.8538. The vehicle counts before the epidemic (November 17, 2019) and during it (February 22, 2020) are as follows: Wuhan Convergence Center, 263 and 32; Wangjiazui Overpass, 89 and 44; Xinxing Industrial Park, 554 and 347; Jingkai Future City, 188 and 57. The epidemic thus led to reduced travel by residents and markedly lower vehicle activity in Wuhan.

Key words: target detection, Faster R-CNN algorithm, deep learning, image processing, novel coronavirus
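
The vehicle counts quoted in the abstract can be turned into relative decreases with a few lines of arithmetic. The sketch below is illustrative only and is not taken from the paper: the names `COUNTS`, `relative_decrease` and `summarize` are hypothetical, and the percentages it prints are derived here from the four reported count pairs rather than reported by the authors.

```python
# Vehicle counts quoted in the abstract: (before, during) the epidemic.
# Before = 2019-11-17, during = 2020-02-22.
COUNTS = {
    "Wuhan Convergence Center": (263, 32),
    "Wangjiazui Overpass": (89, 44),
    "Xinxing Industrial Park": (554, 347),
    "Jingkai Future City": (188, 57),
}


def relative_decrease(before: int, during: int) -> float:
    """Fraction by which the count fell relative to the earlier date."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (before - during) / before


def summarize(counts):
    """Return per-site decreases and the pooled decrease over all sites."""
    per_site = {
        site: relative_decrease(before, during)
        for site, (before, during) in counts.items()
    }
    total_before = sum(before for before, _ in counts.values())
    total_during = sum(during for _, during in counts.values())
    return per_site, relative_decrease(total_before, total_during)


if __name__ == "__main__":
    per_site, pooled = summarize(COUNTS)
    for site, drop in per_site.items():
        print(f"{site}: {drop:.1%}")
    print(f"All four sites: {pooled:.1%}")
```

Under these figures the drop ranges from about 37.4% (Xinxing Industrial Park) to about 87.8% (Wuhan Convergence Center), and the pooled count over the four sites falls from 1094 to 480 vehicles, roughly 56.1%.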

CLC number: