特征协同与细粒度感知的遥感图像小目标检测

肖振久,张杰浩,林渤翰. 特征协同与细粒度感知的遥感图像小目标检测[J]. 光电工程,2024,51(6): 240066. doi: 10.12086/oee.2024.240066
引用本文: 肖振久,张杰浩,林渤翰. 特征协同与细粒度感知的遥感图像小目标检测[J]. 光电工程,2024,51(6): 240066. doi: 10.12086/oee.2024.240066
Xiao Z J, Zhang J H, Lin B H. Feature coordination and fine-grained perception of small targets in remote sensing images[J]. Opto-Electron Eng, 2024, 51(6): 240066. doi: 10.12086/oee.2024.240066
Citation: Xiao Z J, Zhang J H, Lin B H. Feature coordination and fine-grained perception of small targets in remote sensing images[J]. Opto-Electron Eng, 2024, 51(6): 240066. doi: 10.12086/oee.2024.240066

特征协同与细粒度感知的遥感图像小目标检测

  • 基金项目:
    辽宁省高等学校基本科研项目(LJKMZ20220699);辽宁工程技术大学学科创新团队项目(LNTU20TD-23)
详细信息
    作者简介:
    *通讯作者: 张杰浩,zjhao0409@163.com
  • 中图分类号: TP391.4

Feature coordination and fine-grained perception of small targets in remote sensing images

  • Fund Project: Project supported by Basic Scientific Research Project of Liaoning Provincial Universities (LJKMZ20220699), and Subject Innovation Team Project of Liaoning Technical University (LNTU20TD-23)
More Information
  • 针对遥感图像中小目标多、排列密集导致的漏检问题,提出一种特征协同与细粒度感知的遥感图像小目标检测算法。首先,构造精细特征协同策略,通过智能调整卷积核参数,优化了特征间的交互和整合过程;通过精确控制信息流,实现从粗糙到精细的渐进式特征精化。在此基础上,本文设计一个细粒度感知模块,将感知注意力与移动反向卷积结合形成一个增强型检测头,显著增强网络对于极小尺寸物体的感知能力。最后,为了提升模型训练的效率,采用MPDIoU和NWD作为回归损失函数,解决位置偏差,加快模型收敛。在DOTA1.0数据集和DOTA1.5数据集上的实验结果表明,改进后算法相比于基准方法,平均精度分别提高7.4%和6.1%,相较于其他算法具有明显优势,显著改善遥感图像中小目标的漏检情况。

  • Overview: With the rapid development of remote sensing image technology, remote sensing image target detection is widely used in many important fields, including military target location and identification, natural environment protection, disaster detection, and urban planning and construction. The task of remote sensing image target detection is to accurately identify and locate the specific target in the image, and speculate its type and position. Different from targets in natural scenes, targets in remote sensing images have the characteristics of large scenes, small targets, multi-scale, complex backgrounds, overlapping occlusion, etc., so it is a challenging task to detect specific objects accurately. At present, great breakthroughs have been made in remote sensing image target detection algorithms, but the effect of small target detection is still not ideal. Small target detection faces two major difficulties: Little feature information of the target, scarce positive samples, and unbalanced classification; The target location is difficult, the background is complex, and contains a lot of redundant information, which causes serious interference to the detection. This makes it challenging to extract the edge features from aerial images and distinguish the object from the background. Therefore, the research on object detection and application in remote sensing images has important theoretical and practical significance. Addressing the challenge of missed detection caused by many small targets and dense arrangement in remote sensing images, this study introduces a small target detection algorithm for remote sensing applications, leveraging a combination of feature synergy and micro-perception strategies. Initially, we propose a refined feature synergistic fusion strategy that optimizes the interaction and integration of features across different scales by intelligently adjusting the parameters of convolution kernels. This strategy facilitates progressive refinement of features from coarse to fine granularity. Building upon this foundation, a micro-perception unit is developed in this paper, incorporating perceptual attention mechanisms with moving inverse convolution to form an advanced detection head. This innovative approach substantially boosts the network's capability to detect very small objects. Furthermore, to augment the training efficiency of the model, we employ MPDIoU and NWD as regression loss functions, mitigating positional bias issues and expediting model convergence. Experimental evaluations on the DOTA1.0 dataset and DOTA1.5 dataset reveal that our algorithm substantially improves mean Average Precision (mAP) by 7.4% and 6.1% over the baseline method, which has obvious advantages over other algorithms. The results underscore the algorithm's efficacy in significantly reducing the incidence of missed detections of small targets within remote sensing imagery.

  • 加载中
  • 图 1  模型总体结构

    Figure 1.  Overall model structure

    图 2  KernelWarehouse的结构图

    Figure 2.  Structure of KernelWarehouse

    图 3  特征协同模块

    Figure 3.  Feature synergy module

    图 4  细粒度感知检测头

    Figure 4.  Fine-grained aware detect head

    图 5  感知注意力机制

    Figure 5.  Aware attention mechanism

    图 6  YOLOv8 算法训练结果图

    Figure 6.  Training results of the YOLOv8 algorithm

    图 7  所提算法训练结果图

    Figure 7.  Training results of the proposed algorithm

    图 8  检测效果对比图

    Figure 8.  Comparison of detection results

    表 1  所提算法在DOTA1.0数据集的消融实验

    Table 1.  Ablation experiments of the proposed algorithm in the DOTA1.0 dataset

    NumberABCDPrecision/%Recall/%FPSmAP@0.5 /%mAP@0.5:.0.95 /%
    1××××79.068.743474.350.9
    2×××82.170.238476.552.5
    3×××84.469.037075.251.6
    4×××80.169.747675.551.4
    5×××80.570.445475.851.8
    6××83.372.356678.053.9
    7××84.969.928576.452.8
    8×84.873.840279.855.6
    9×82.371.741677.053.0
    1084.375.645481.758.0
    下载: 导出CSV

    表 2  不同算法在DOTA1.0数据集上的实验结果

    Table 2.  Experimental results of different algorithms on DOTA1.0 dataset; unit: %

    CategoryYOLOv5YOLOv7CornerNetR-FCNYOLO-BiFPNYOLO-PWCAYOLO-DCTIOurs
    SV75.176.510.149.881.777.686.888.4
    LV86.786.750.245.185.485.790.990.3
    PL93.692.364.781.191.892.691.993.0
    ST74.370.957.967.477.972.785.777.5
    SH89.689.131.349.389.187.581.191.7
    HA87.783.580.545.288.284.188.589.8
    GTF71.155.524.958.969.364.273.675.4
    SBF62.158.422.741.866.864.570.670.7
    TC94.094.985.568.993.494.093.896.0
    SP82.879.518.553.364.164.283.085.0
    BD76.171.138.258.974.078.777.378.8
    RA64.347.144.551.459.462.063.965.2
    BC78.372.262.552.176.275.582.182.5
    BR59.245.326.231.656.451.857.157.7
    HC84.481.712.133.983.076.385.287.2
    mAP@0.578.673.642.052.680.278.880.681.7
    下载: 导出CSV

    表 3  不同算法在DOTA1.5数据集上的实验结果

    Table 3.  Experimental results of different algorithms on DOTA1.5 dataset; unit: %

    CategoryYOLOv5YOLOv7CornerNetR-FCNYOLO-BiFPNYOLO-PWCAYOLO-DCTIOurs
    SV57.866.550.359.770.571.275.277.3
    LV71.482.159.658.977.886.388.589.6
    PL80.588.476.577.384.190.780.190.1
    ST77.880.968.170.576.273.175.777.6
    SH76.785.360.764.877.186.487.389.4
    HA82.681.777.875.186.980.689.190.5
    GTF73.780.664.160.377.575.374.976.1
    SBF63.268.458.361.673.466.875.175.2
    TC85.583.280.779.887.689.583.791.0
    SP76.178.672.973.460.570.180.481.3
    BD79.378.268.670.680.183.784.884.9
    RA73.475.470.266.574.668.770.176.9
    BC78.381.173.474.880.482.680.783.1
    BR60.362.563.266.367.169.862.670.2
    HC68.865.662.760.178.977.482.480.3
    CC62.767.864.566.973.170.374.275.6
    mAP@0.576.375.672.070.878.777.978.180.4
    下载: 导出CSV
  • [1]

    马梁, 苟于涛, 雷涛, 等. 基于多尺度特征融合的遥感图像小目标检测[J]. 光电工程, 2022, 49(4): 210363. doi: 10.12086/oee.2022.210363

    Ma L, Gou Y T, Lei T, et al. Small object detection based on multi-scale feature fusion using remote sensing images[J]. Opto-Electron Eng, 2022, 49(4): 210363. doi: 10.12086/oee.2022.210363

    [2]

    陈旭, 彭冬亮, 谷雨. 基于改进YOLOv5s的无人机图像实时目标检测[J]. 光电工程, 2022, 49(3): 210372. doi: 10.12086/oee.2022.210372

    Chen X, Peng D L, Gu Y. Real-time object detection for UAV images based on improved YOLOv5s[J]. Opto-Electron Eng, 2022, 49(3): 210372. doi: 10.12086/oee.2022.210372

    [3]

    王友伟, 郭颖, 邵香迎. 基于改进级联算法的遥感图像目标检测[J]. 光学学报, 2022, 42(24): 2428004. doi: 10.3788/AOS202242.2428004

    Wang Y W, Guo Y, Shao X Y. Target detection in remote sensing images based on improved cascade algorithm[J]. Acta Opt Sin, 2022, 42(24): 2428004. doi: 10.3788/AOS202242.2428004

    [4]

    王家宝, 程塨, 谢星星, 等. 多元信息监督的遥感图像有向目标检测[J]. 遥感学报, 2023, 27(12): 2726−2735. doi: 10.11834/jrs.20211564

    Wang J B, Cheng G, Xie X X, et al. Multi-information supervision in optical remote sensing images[J]. Natl Remote Sens Bull, 2023, 27(12): 2726−2735. doi: 10.11834/jrs.20211564

    [5]

    张德银, 赵志恒, 谢逸戈, 等. 基于改进YOLOv8的遥感图像飞机目标检测研究[J]. 自动化应用, 2024, 65(2): 193−195,198. doi: 10.19769/j.zdhy.2024.02.060

    Zhang D Y, Zhao Z H, Xie Y G, et al. Research on aircraft target detection in remote sensing images based on improved YOLOv8[J]. Autom Appl, 2024, 65(2): 193−195,198. doi: 10.19769/j.zdhy.2024.02.060

    [6]

    Zhang Y L, Jin H Y. Detector consistency research on remote sensing object detection[J]. Remote Sens, 2023, 15(17): 4130. doi: 10.3390/rs15174130

    [7]

    Lyu Z, Jin H F, Zhen T, et al. Small object recognition algorithm of grain pests based on SSD feature fusion[J]. IEEE Access, 2021, 9: 43202−43213. doi: 10.1109/ACCESS.2021.3066510

    [8]

    Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[J]. IEEE Trans Pattern Anal Mach Intell, 2020, 42(2): 318−327. doi: 10.1109/TPAMI.2018.2858826

    [9]

    Redmon J, Divvala S, Girshick R, et al. You only look once: unified, real-time object detection[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016: 779–788. https://doi.org/10.1109/CVPR.2016.91.

    [10]

    Redmon J, Farhadi A. YOLO9000: better, faster, stronger[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017: 6517–6525. https://doi.org/10.1109/CVPR.2017.690.

    [11]

    Redmon J, Farhadi A. YOLOv3: an incremental improvement[Z]. arXiv: 1804.02767, 2018. https://doi.org/10.48550/arXiv.1804.02767.

    [12]

    Bochkovskiy A, Wang C Y, Liao H Y M. YOLOv4: optimal speed and accuracy of object detection[Z]. arXiv: 2004.10934, 2020. https://doi.org/10.48550/arXiv.2004.10934.

    [13]

    Ge Z, Liu S T, Wang F, et al. YOLOX: exceeding YOLO series in 2021[Z]. arXiv: 2107.08430, 2021. https://doi.org/10.48550/arXiv.2107.08430.

    [14]

    Girshick R. Fast R-CNN[C]//Proceedings of 2015 IEEE International Conference on Computer Vision, 2015: 1440–1448. https://doi.org/10.1109/ICCV.2015.169.

    [15]

    Dai J F, Li Y, He K M, et al. R-FCN: object detection via region-based fully convolutional networks[C]//Proceedings of the 30th International Conference on Neural Information Processing Systems, 2016: 29.

    [16]

    Li Y T, Fan Q S, Huang H S, et al. A modified YOLOv8 detection network for UAV aerial image recognition[J]. Drones, 2023, 7(5): 304. doi: 10.3390/drones7050304

    [17]

    Zhu F Z, Wang Y Y, Cui J Y, et al. Target detection for remote sensing based on the enhanced YOLOv4 with improved BiFPN[J]. Egypt J Remote Sens Space Sci, 2023, 26(2): 351−360. doi: 10.1016/j.ejrs.2023.04.003

    [18]

    Zhai X X, Huang Z H, Li T, et al. YOLO-Drone: an optimized YOLOv8 network for tiny UAV object detection[J]. Electronics, 2023, 12(17): 3664. doi: 10.3390/electronics12173664

    [19]

    Zhou F Y, Deng H G, Xu Q G, et al. CNTR-YOLO: improved YOLOv5 based on ConvNext and transformer for aircraft detection in remote sensing images[J]. Electronics, 2023, 12(12): 2671. doi: 10.3390/electronics12122671

    [20]

    Zhu B Y, Lv Q B, Yang Y B, et al. Gradient structure information-guided attention generative adversarial networks for remote sensing image generation[J]. Remote Sens, 2023, 15(11): 2827. doi: 10.3390/rs15112827

    [21]

    Xiao J S, Guo H W, Yao Y T, et al. Multi-scale object detection with the pixel attention mechanism in a complex background[J]. Remote Sens, 2022, 14(16): 3969. doi: 10.3390/rs14163969

    [22]

    Wu J J, Su L M, Lin Z W, et al. Object detection of flexible objects with arbitrary orientation based on rotation-adaptive YOLOv5[J]. Sensors, 2023, 23(10): 4925. doi: 10.3390/s23104925

    [23]

    Yang X, Yan J C, Liao W L, et al. SCRDet++: detecting small, cluttered and rotated objects via instance-level feature denoising and rotation loss smoothing[J]. IEEE Trans Pattern Anal Mach Intell, 2023, 45(2): 2384−2399. doi: 10.1109/TPAMI.2022.3166956

    [24]

    Zhang H Y, Liu J. Direction estimation of aerial image object based on neural network[J]. Remote Sens, 2022, 14(15): 3523. doi: 10.3390/rs14153523

    [25]

    Yu H T, Tian Y J, Ye Q X, et al. Spatial transform decoupling for oriented object detection[C]//Proceedings of the 38th AAAI Conference on Artificial Intelligence, 2024: 6782–6790. https://doi.org/10.1609/aaai.v38i7.28502.

    [26]

    Li C, Yao A B. KernelWarehouse: towards parameter-efficient dynamic convolution[Z]. arXiv: 2308.08361, 2023. https://doi.org/10.48550/arXiv.2308.08361.

    [27]

    Ma S L, Xu Y. MPDIoU: a loss for efficient and accurate bounding box regression[Z]. arXiv: 2307.07662, 2023. https://doi.org/10.48550/arXiv.2307.07662.

    [28]

    Wang J W, Xu C, Yang W, et al. A normalized Gaussian Wasserstein distance for tiny object detection[Z]. arXiv: 2110.13389, 2021. https://doi.org/10.48550/arXiv.2110.13389.

    [29]

    Sandler M, Howard A, Zhu M L, et al. MobileNetV2: inverted residuals and linear bottlenecks[C]//Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018: 4510–4520.

    [30]

    Konstantinov A V, Utkin L V, Lukashin A A, et al. Neural attention forests: transformer-based forest improvement[C]//Proceedings of the Seventh International Scientific Conference “Intelligent Information Technologies for Industry”, 2023. https://doi.org/10.1007/978-3-031-43789-2_14.

    [31]

    Tan M X, Le Q V. EfficientNet: rethinking model scaling for convolutional neural networks[C]//Proceedings of the 36th International Conference on Machine Learning, 2019: 6105–6114.

    [32]

    Law H, Deng J. CornerNet: detecting objects as paired keypoints[C]//Proceedings of the 15th European Conference on Computer Vision (ECCV), 2018: 734–750. https://doi.org/10.1007/978-3-030-01264-9_45.

    [33]

    于傲泽, 魏维伟, 王平, 等. 基于分块复合注意力的无人机小目标检测算法[J]. 航空学报, 2023: 1–12. https://doi.org/10.7527/S1000-6893.2023.29148.

    Yu A Z, Wei W W, Wang P, et al. Small target detection algorithm for UAV based on patch-wise co-attention[J]. Acta Aeronaut Astronaut Sin, 2023: 1–12. https://doi.org/10.7527/S1000-6893.2023.29148.

    [34]

    Min L T, Fan Z M, Lv Q Y, et al. YOLO-DCTI: small object detection in remote sensing base on contextual transformer enhancement[J]. Remote Sens, 2023, 15(16): 3970. doi: 10.3390/rs15163970

  • 加载中

(9)

(3)

计量
  • 文章访问数: 
  • PDF下载数: 
  • 施引文献:  0
出版历程
收稿日期:  2024-03-20
修回日期:  2024-04-25
录用日期:  2024-04-26
刊出日期:  2024-06-25

目录

/

返回文章
返回