Detection of small target in aerial photography based on deep learning

Hua LIANG; Yu-long SONG; Feng QIAN; Ce SONG

doi:10.3788/YJYXS20183309.0793

您当前的位置：

首页 >

文章列表页 >

Detection of small target in aerial photography based on deep learning

Image Processing | 更新时间：2020-07-06

- Detection of small target in aerial photography based on deep learning
- Chinese Journal of Liquid Crystals and Displays Vol. 33, Issue 9, Pages: 793-800(2018)
- 作者机构：
  
  1.中国科学院长春光学精密机械与物理研究所, 吉林长春 130033
  2.中国科学院大学, 北京 100049
- 作者简介：
- 基金信息：
- DOI：10.3788/YJYXS20183309.0793
  CLC： TP391
- Received：02 April 2018，
  
  Accepted：08 June 2018，
  
  Published：05 September 2018
- 稿件说明：
移动端阅览
Hua LIANG, Yu-long SONG, Feng QIAN, et al. Detection of small target in aerial photography based on deep learning[J]. Chinese journal of liquid crystals and displays, 2018, 33(9): 793-800.
DOI：

Hua LIANG, Yu-long SONG, Feng QIAN, et al. Detection of small target in aerial photography based on deep learning[J]. Chinese journal of liquid crystals and displays, 2018, 33(9): 793-800. DOI： 10.3788/YJYXS20183309.0793.

摘要

针对航拍图像中对地小目标识别率低、定位效果差的问题，提出了一种基于深度学习的目标检测算法。该算法利用VGG16网络作为微调网络，并添加部分深层网络，通过提取目标浅层特征与深层特征进行联合训练，克服检测过程中定位与识别相互矛盾的问题。提出把奇异值分解技术应用于卷积特征压缩处理，降低模型的计算与存储需求，并且采用多尺度训练方法以适应航空目标尺度的变化。实验结果表明，在通用数据集PASCAL上可以实现0.76 mAP，检测速度达16 fps，在专用航空目标数据集UCAS-AOD上可以实现0.63 mAP，检测速度达18 fps。基本满足对小目标检测精确度的要求，并且检测速度可以接近实时检测效果。

Abstract

In order to solve the problem of low recognition rate and poor positioning in aerial images

a target detection method based on deep learning is proposed. This algorithm uses VGG16 network as a fine tuning network and adds some deep network in it. Joint training is carried out by extracting the features of the shallow layers and the deep features of the target to overcome the contradiction between location and recognition in the process of detection. The singular value decomposition technology is used to compress the convolution features to reduce the computing and storage requirements of the model

and Multi scale training method is adopted to adapt to the change of aerial target scale. The experimental results show that 0.76 mAP can be implemented on the general data set PASCAL

and the detection speed is 16 fps. The 0.63 mAP can be achieved on the special aviation target data set UCAS-AOD

and the detection speed is 18 fps. It can satisfy the requirements for small target detection accuracy

and the detection speed can be close to the real-time detection effect.

关键词

Keywords

references

LOWED G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60(2):91-110.

齐冰洁, 刘金国, 张博研, 等.高分辨率遥感图像SIFT和SURF算法匹配性能研究[J].中国光学, 2017, 10(3):331-339.

QI B J, LIU J G, ZHANG B Y, et al . Research on matching performance of SIFT and SURF algorithms for high resolution remote sensing image[J]. Chinese Optics , 2017, 10(3):331-339. (in Chinese)

王梅, 屠大维, 周许超. SIFT特征匹配和差分相乘融合的运动目标检测[J].光学精密工程, 2011, 19(4):892-899.

WANG M, TU D W, ZHOU X C. Moving object detection by combining SIFT and differential multiplication[J]. Optics and Precision Engineering , 2011, 19(4):892-899. (in Chinese)

耿庆田, 赵浩宇, 于繁华, 等.基于改进HOG特征提取的车型识别算法[J].中国光学, 2018, 11(2):174-181.

GENG Q T, ZHAO H Y, YU F H, et al . Vehicle type recognition algorithm based on improved HOG feature[J]. Chinese Optics , 2018, 11(2):174-181. (in Chinese)

GIRSHICK R, DONAHUE J, DARRELL T, et al . Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, OH, USA: IEEE, 2014: 580-587.

FELZENSZWALB P F, GIRSHICK R B, MCALLESTER D, et al . Object detection with discriminatively trained part-based models[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2010, 32(9):1627-1645.

Felzenszwalb P, Mcallester D, Ramanan D. A discriminatively trained, multiscale, deformable part mode[J]. IEEE Conference on Computer Visionand Pattern Recognition [J].2008, 8: 1-8. https://blog.csdn.net/xp215774576/article/details/41981415

刘峰, 沈同圣, 马新星, 等.基于多波段深度神经网络的舰船目标识别[J].光学精密工程, 2017, 25(11):2939-2946.

LIU F, SHEN T S, MA X X, et al . Ship recognition based on multi-band deep neural network[J]. Optics and Precision Engineering , 2017, 25(11):2939-2946. (in Chinese)

李宇, 刘雪莹, 张洪群, 等.基于卷积神经网络的光学遥感图像检索[J].光学精密工程, 2018, 26(1):200-207.

LI Y, LIU X Y, ZHANG H Q, et al . Optical remote sensing image retrieval based on convolutional neural networks[J]. Optics and Precision Engineering , 2018, 26(1):200-207. (in Chinese)

LIN T Y, DOLLAR P, GIRSHICK R, et al . Feature pyramid networks for object detection[C]// Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition . Honolulu, Hawaii, USA: IEEE, 2017: 936-944.

LIU W, ANGUELOV D, ERHAN D, et al . SSD: Single shot MultiBox detector[C]// Proceedings of the 14th European Conference . Amsterdam, The Netherlands: Springer, 2016: 21-37.

HE K M, ZHANG X Y, REN S Q, et al . Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2015, 37(9):1904-1916.

REN S Q, HE K M, GIRSHICK R, et al . Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017, 39(6):1137-1149.

REDMON J, DIVVALA S, GIRSHICK R, et al . You only look once: Unified, real-time object detection[C]// Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition . Las Vegas, NV, USA: IEEE, 2016: 779-788.

BODLA N, SINGH B, CHELLAPPA R, et al . Soft-NMS -Improving object detection with one line of code[C]// Proceedings of 2017 IEEE International Conference on Computer Vision . Venice, Italy: IEEE, 2017: 5562-5570.

SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[J]. arXiv: 1409.1556, 2014. http://cn.arxiv.org/abs/1409.1556

LECUN Y, BOTTOU L, BENGIO Y, et al . Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE , 1998, 86(11):2278-2324.

KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[C]// Proceedings of the 25th International Conference on Neural Information Processing Systems . Lake Tahoe, Nevada: Curran Associates Inc., 2012: 1097-1105.

ZEILER M D, FERGUS R. Visualizing and understanding convolutional networks[C]// Proceedings of the 13th European Conference . Zurich, Switzerland: Springer, 2014: 818-833.

Views

下载量

CSCD

Alert me when the article has been cited

提交

Tools

Publicity Resources

Design of heterogeneous FPGA hardware accelerator based on CNN

Development of AOI inspection of Mura defects on TFT-LCD surface

Lightweight image super-resolution combining residual learning and layer attention

Remote sensing image change detection based on CNN-Transformer structure

Wavefront correction method based on P-U-net for pyramid wavefront detector

Related Author

JI Haolin

XU Wei

PIAO Yongjie

WU Xiaobin

GAO Tan

CHEN Zekang

SHEN Yi

ZHAI Chenyang

Related Institution

Key Laboratory of Space-based Dynamic & Rapid Optical Imaging Technology， Chinese Academy of Sciences

Changchun Institute of Optics， Fine Mechanics and Physics， Chinese Academy of Sciences

College of Engineering， Shantou University

Guangdong Provincial Key Laboratory of Automotive Display and Touch Technologies

School of Electronic Information and Artificial Intelligence， Shaanxi University of Science & Technology， Xi′an

AI问答

Address：No.3888 Dong Nanhu Road, Changchun, Jilin, China 130033 Postal code：130033
Tel：0431-86176059 Email：yjxs@ciomp.ac.cn
Technical support is provided by Beijing Founder electronics co., LTD 吉ICP备11002662号-17 京公网安备11010802024621
It is recommended to read the content of this site in Chrome&IE9+. Please switch to extreme mode in browser 360.
Cookies We use cookies to help provide and enhance our service and tailor content. By continuing, you agree to the use of cookies.

⁰