一种基于多尺度扩张注意力网络的肝脏CT图像分割算法

董华; 李彬彬; 方菁; 邱全林; 陈志华

doi:10.3969/j.issn.2096-8566.2026.01.006

一种基于多尺度扩张注意力网络的肝脏CT图像分割算法

A Multi-Scale Dilated Attention Algorithm for Liver CT Image Segmentation

摘要

摘要: 鉴于当前肝脏CT图像分割算法面临边界模糊不清以及小目标区域欠分割的挑战，本文提出一种基于U-Net改进的MSDA-Net（Multi-scale Dilated Attention Network）肝脏分割算法。首先，为实现多尺度特征的精准捕捉，在编码和解码环节将传统卷积替换为金字塔卷积；其次，在跳跃连接中引入多尺度注意力聚合模块，以此实现对多尺度特征的聚合；然后，在瓶颈层位置采用稠密连接的注意力扩张空间金字塔池化模块，构建更为密集的特征金字塔结构并扩大感受野范围，从而充分挖掘并利用丰富的上下文语义信息；最后，在公开数据集LiTS17上验证本文算法，结果显示该算法的Dice系数、IoU系数、召回率、精确度以及VOE系数依次为92.77%、89.19%、93.47%、93.46%和0.1079。相较于当前主流的肝脏分割算法，本文所提出的MSDA-Net算法在肝脏局部结构的精细分割任务中，能够准确捕捉肝脏边缘的细微特征，呈现出更为卓越的性能优势。

Abstract: To address the current challenges in liver CT image segmentation, such as blurred boundaries and the under-segmentation of small target regions, this paper proposes MSDA-Net (Multi-scale Dilated Attention Network), an improved liver segmentation algorithm based on the U-Net model. First, pyramid convolution replaces traditional convolution in both the encoder and decoder, enabling the precise capture of multi-scale features. Second, a multi-scale attention aggregation module is introduced into the skip connections to effectively aggregate features at different scales. Furthermore, a densely connected attentive dilated spatial pyramid pooling module is employed at the bottleneck layer to construct a denser feature pyramid structure and expand the receptive field, thereby fully mining and leveraging contextual semantic information. Finally, the proposed algorithm is validated on the public LiTS17 dataset. The results show that the Dice coefficient, IoU coefficient, recall, precision, and VOE coefficient are 92.77%, 89.19%, 93.47%, 93.46%, and 0.1079, respectively. Compared with current mainstream liver segmentation algorithms, the proposed MSDA-Net algorithm demonstrates improved performance in the fine segmentation of local liver structures by accurately capturing subtle features along liver edges.

HTML全文

参考文献(29)

施引文献

资源附件(0)