• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Cai, Fenghuang (Cai, Fenghuang.) [1] (Scholars:蔡逢煌) | Zhang, Jiaxiang (Zhang, Jiaxiang.) [2] | Huang, Jie (Huang, Jie.) [3] (Scholars:黄捷)

Indexed by:

EI Scopus PKU CSCD

Abstract:

To address the challenges of significant changes in the field of view and complex spatiotemporal information in unmanned aerial vehicle aerial image target detection, a model for small object detection in aerial photography based on low dimensional image feature fusion is presented grounded on the YOLOv5(you only look once version 5) architecture. Coordinate attention is introduced to improve the inverted residuals of MobileNetV3, thereby increasing the spatial dimension information of images while reducing parameters of the model. The YOLOv5 feature pyramid network structure is improved to incorporate feature images from shallow networks. The ability of the model to represent low-dimensional effective information of images is enhanced, and consequently the detection accuracy of the proposed model for small objects is improved. To reduce the impact of complex background in the image, the parameter-free average attention module is introduced to focus on both spatial attention and channel attention. VariFocal Loss is adopted to reduce the weight proportion of negative samples in the training process. Experiments on VisDrone dataset demonstrate the effectiveness of the proposed model. The detection accuracy is effectively improved while the model complexity is significantly reduced. © 2024 Science Press. All rights reserved.

Keyword:

Aerial photography Antennas Complex networks Image enhancement Object detection Object recognition

Community:

  • [ 1 ] [Cai, Fenghuang]College of Electrical Engineering and Automation, Fuzhou University, Fuzhou; 350108, China
  • [ 2 ] [Zhang, Jiaxiang]College of Electrical Engineering and Automation, Fuzhou University, Fuzhou; 350108, China
  • [ 3 ] [Huang, Jie]College of Electrical Engineering and Automation, Fuzhou University, Fuzhou; 350108, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Pattern Recognition and Artificial Intelligence

ISSN: 1003-6059

CN: 34-1089/TP

Year: 2024

Issue: 2

Volume: 37

Page: 162-171

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 6

Online/Total:1299/9718754
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1