SSD-YOLO:一种基于新型抗噪与方向敏感注意力融合的合成孔径雷达船舶检测方法

SSD-YOLO: SAR ship detection via a novel anti-noise and direction-sensitive attention fusion approach

  • 摘要:
    目的 为解决合成孔径雷达(SAR)图像中的强噪声和目标尺度差异问题,提出一种改进型YOLOv8n模型(SSD-YOLO),旨在提升复杂海洋场景下的船舶检测性能。
    方法 该模型以YOLOv8n为基线,融合三大创新模块:SAR_SPPF模块利用Ghost卷积和双径池化高效抗噪;C2f_SimAM模块嵌入无参数SimAM注意力机制,强化目标响应;C2f_DSConv模块则采用方向敏感的深度可分离卷积,精细捕捉舰船纹理与方向信息。
    结果 在SSDD数据集上的实验结果表明,模型精确率97.0%,召回率96.4%,mAP@0.5达99.0%,mAP@0.5:0.95达74.3%。该模型参数量约3 M,FLOPs为7.7 G,保持了轻量化。在更具挑战性的HRSID数据集上的泛化实验表明,模型的mAP@0.5达到94.9%,进一步证明了其鲁棒性与泛化能力。消融实验也证明了各模块的有效性。
    结论 本研究通过融合轻量级抗噪、目标强化及方向敏感特征捕捉策略,为SAR舰船检测提供了一种高效、鲁棒的解决方案。

     

    Abstract:
    Objective Synthetic aperture radar (SAR) has become an indispensable tool for marine surveillance due to its all-weather, day-and-night imaging capabilities. However, automatic ship detection in SAR imagery remains highly challenging, primarily because of strong coherent speckle noise, drastic variations in target scale, and complex background interference in inshore scenarios. Existing lightweight models often struggle to maintain high detection accuracy and robustness under these varying conditions. To address these challenges, this paper proposes an improved lightweight detection model—SSD-YOLO—build upon the YOLOv8n architecture, designed to improve detection performance in complex marine scenarios.
    Method The SSD-YOLO model constructs a hierarchical "Reinforce-Purify-Fuse" synergistic framework by integrating three theoretically grounded innovative modules. First, to address the challenge of weak target responses in heavy noise, a parameter-free target response enhancement module (C2f_SimAM) is incorporated into the backbone network. Leveraging an energy-function-based SimAM attention mechanism inspired by neuroscience, this module adaptively calculates 3D attention weights, effectively enhancing the linear separability of target neurons while suppressing background clutter without introducing any additional model parameters. Second, a lightweight anti-noise feature aggregation module (SAR_SPPF) is deployed at the end of the backbone. Unlike traditional spatial pyramid pooling, this module employs Ghost Convolutions to improve computational efficiency and introduces a novel dual-path pooling strategy. Theoretical derivations based on the Gamma distribution properties of SAR speckle noise show that combining small-kernel average pooling (for smoothing background variance) with large-kernel max pooling (for retaining strong scattering peaks) maximizes the signal-to-noise ratio. Third, a direction-sensitive multi-scale fusion module (C2f_DSConv) is integrated into the neck network. This module replaces standard convolutions with direction-sensitive depthwise separable convolutions. By applying kernels oriented at 0°, 45°, and 90°, it effectively captures the anisotropic texture and geometric structures of ships, thereby significantly improving the model’s ability to detect densely distributed and multi-scale targets.
    Results Extensive experiments were conducted on the SSDD and HRSID datasets. On the SSDD dataset, the model achieved 97.0% precision, 96.4% recall, 99.0% mAP@0.5, and 74.3% mAP@0.5:0.95. Despite these substantial performance gains, the model remains highly lightweight, maintaining approximately 3.01M parameters and 7.9G FLOPs. Generalization experiments on the more challenging, high-resolution HRSID dataset demonstrated superior robustness, yielding an mAP@0.5 of 94.9%. Quantitative analysis of the feature map SNR further validated the internal rationale behind the proposed synergistic framework.
    Conclusion This study presents an efficient and robust solution for SAR ship detection by integrating lightweight anti-noise processing, target response enhancement, and direction-sensitive feature capture strategies.

     

/

返回文章
返回