刘金华, 周之平, 李克伟. 基于小波分解和注意力引导的伪装目标检测算法[J]. 南昌航空大学学报(自然科学版), 2025, 39(5): 31-41. DOI: 10.3969/j.issn.2096-8566.2025.05.004
引用本文: 刘金华, 周之平, 李克伟. 基于小波分解和注意力引导的伪装目标检测算法[J]. 南昌航空大学学报(自然科学版), 2025, 39(5): 31-41. DOI: 10.3969/j.issn.2096-8566.2025.05.004
Jinhua LIU, Zhiping ZHOU, Kewei LI. Camouflage Object Detection Based on Wavelet Decomposition and Attention Guidance[J]. Journal of nanchang hangkong university(Natural science edition), 2025, 39(5): 31-41. DOI: 10.3969/j.issn.2096-8566.2025.05.004
Citation: Jinhua LIU, Zhiping ZHOU, Kewei LI. Camouflage Object Detection Based on Wavelet Decomposition and Attention Guidance[J]. Journal of nanchang hangkong university(Natural science edition), 2025, 39(5): 31-41. DOI: 10.3969/j.issn.2096-8566.2025.05.004

基于小波分解和注意力引导的伪装目标检测算法

Camouflage Object Detection Based on Wavelet Decomposition and Attention Guidance

  • 摘要: 在极端复杂的场景下,现有伪装目标检测方法仍面临严重挑战,具体表现为不同尺度伪装目标的定位精度不足以及预测分割图的边界模糊问题。为了解决上述不足,本文提出一种融合小波分解与注意力引导的编码−解码检测网络,旨在实现伪装目标的精准识别与高精度分割。具体来说,在编码阶段,使用Swin-Transformer主干进行多层次编码并利用感受野模块进行特征优化处理。随后,提出频率分解和聚合模块对多层特征进行频域增强以辅助模型定位目标和增强空间细节。接着,通过级联的特征融合模块渐进融合局部信息和全局信息。最后,利用注意力引导边缘增强解码器对语义特征和边缘特征进行互补融合,以生成更准确的伪装目标预测图。实验结果表明,与21种基于深度学习的现有算法相比,本文提出的方法在3个公开的伪装目标检测数据集上具有明显的优势。

     

    Abstract: In extremely complex scenarios, existing camouflaged object detection methods still face significant challenges, which are specifically manifested in insufficient localization accuracy of camouflage objects at varying scales and blurred boundary problems in predicted segmentation maps. To address the above shortcomings, this paper innovatively proposes an encoder-decoder detection network integrating wavelet decomposition and attention guidance, aiming to achieve accurate recognition and high-precision segmentation of camouflage objects. Specifically, in the encoding stage, the Swin-Transformer backbone is used for multi-level encoding, and the receptive field module is utilized for feature optimization processing. Subsequently, a frequency decomposition and aggregation module is proposed to perform frequency-domain enhancement on multi-level features, so as to assist the model in locating targets and enhancing spatial details. Then, a cascaded feature fusion module is adopted to progressively fuse local and global information. Finally, an attention-guided edge-enhanced decoder is used to conduct complementary fusion of semantic features and edge features, thereby generating more accurate predicted maps of camouflage objects. Extensive experiment results demonstrate that compared with 21 existing deep learning-based algorithms, the proposed approach has obvious advantages on three public camouflage object detection datasets.

     

/

返回文章
返回