Hua LIANG, Yu-long SONG, Feng QIAN, et al. Detection of small target in aerial photography based on deep learning[J]. Chinese journal of liquid crystals and displays, 2018, 33(9): 793-800.
DOI:
Hua LIANG, Yu-long SONG, Feng QIAN, et al. Detection of small target in aerial photography based on deep learning[J]. Chinese journal of liquid crystals and displays, 2018, 33(9): 793-800. DOI: 10.3788/YJYXS20183309.0793.
Detection of small target in aerial photography based on deep learning
In order to solve the problem of low recognition rate and poor positioning in aerial images
a target detection method based on deep learning is proposed. This algorithm uses VGG16 network as a fine tuning network and adds some deep network in it. Joint training is carried out by extracting the features of the shallow layers and the deep features of the target to overcome the contradiction between location and recognition in the process of detection. The singular value decomposition technology is used to compress the convolution features to reduce the computing and storage requirements of the model
and Multi scale training method is adopted to adapt to the change of aerial target scale. The experimental results show that 0.76 mAP can be implemented on the general data set PASCAL
and the detection speed is 16 fps. The 0.63 mAP can be achieved on the special aviation target data set UCAS-AOD
and the detection speed is 18 fps. It can satisfy the requirements for small target detection accuracy
and the detection speed can be close to the real-time detection effect.
关键词
Keywords
references
LOWED G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60(2):91-110.
QI B J, LIU J G, ZHANG B Y, et al . Research on matching performance of SIFT and SURF algorithms for high resolution remote sensing image[J]. Chinese Optics , 2017, 10(3):331-339. (in Chinese)
WANG M, TU D W, ZHOU X C. Moving object detection by combining SIFT and differential multiplication[J]. Optics and Precision Engineering , 2011, 19(4):892-899. (in Chinese)
GENG Q T, ZHAO H Y, YU F H, et al . Vehicle type recognition algorithm based on improved HOG feature[J]. Chinese Optics , 2018, 11(2):174-181. (in Chinese)
GIRSHICK R, DONAHUE J, DARRELL T, et al . Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition. Columbus, OH, USA: IEEE, 2014: 580-587.
FELZENSZWALB P F, GIRSHICK R B, MCALLESTER D, et al . Object detection with discriminatively trained part-based models[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2010, 32(9):1627-1645.
Felzenszwalb P, Mcallester D, Ramanan D. A discriminatively trained, multiscale, deformable part mode[J]. IEEE Conference on Computer Visionand Pattern Recognition [J].2008, 8: 1-8. https://blog.csdn.net/xp215774576/article/details/41981415
LIU F, SHEN T S, MA X X, et al . Ship recognition based on multi-band deep neural network[J]. Optics and Precision Engineering , 2017, 25(11):2939-2946. (in Chinese)
LI Y, LIU X Y, ZHANG H Q, et al . Optical remote sensing image retrieval based on convolutional neural networks[J]. Optics and Precision Engineering , 2018, 26(1):200-207. (in Chinese)
LIN T Y, DOLLAR P, GIRSHICK R, et al . Feature pyramid networks for object detection[C]// Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition . Honolulu, Hawaii, USA: IEEE, 2017: 936-944.
LIU W, ANGUELOV D, ERHAN D, et al . SSD: Single shot MultiBox detector[C]// Proceedings of the 14th European Conference . Amsterdam, The Netherlands: Springer, 2016: 21-37.
HE K M, ZHANG X Y, REN S Q, et al . Spatial pyramid pooling in deep convolutional networks for visual recognition[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2015, 37(9):1904-1916.
REN S Q, HE K M, GIRSHICK R, et al . Faster R-CNN:Towards real-time object detection with region proposal networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence , 2017, 39(6):1137-1149.
REDMON J, DIVVALA S, GIRSHICK R, et al . You only look once: Unified, real-time object detection[C]// Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition . Las Vegas, NV, USA: IEEE, 2016: 779-788.
BODLA N, SINGH B, CHELLAPPA R, et al . Soft-NMS -Improving object detection with one line of code[C]// Proceedings of 2017 IEEE International Conference on Computer Vision . Venice, Italy: IEEE, 2017: 5562-5570.
SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition[J]. arXiv: 1409.1556, 2014. http://cn.arxiv.org/abs/1409.1556
LECUN Y, BOTTOU L, BENGIO Y, et al . Gradient-based learning applied to document recognition[J]. Proceedings of the IEEE , 1998, 86(11):2278-2324.
KRIZHEVSKY A, SUTSKEVER I, HINTON G E. ImageNet classification with deep convolutional neural networks[C]// Proceedings of the 25th International Conference on Neural Information Processing Systems . Lake Tahoe, Nevada: Curran Associates Inc., 2012: 1097-1105.
ZEILER M D, FERGUS R. Visualizing and understanding convolutional networks[C]// Proceedings of the 13th European Conference . Zurich, Switzerland: Springer, 2014: 818-833.