测绘通报 ›› 2026, Vol. 0 ›› Issue (2): 137-143.doi: 10.13474/j.cnki.11-2246.2026.0222

• 技术交流 • 上一篇    下一篇

基于改进U-Net3+模型的无人机正射影像语义分割

姜磊1, 梁聪1, 赵旭1, 王鹏1, 闫文凯1, 杨宏鼎2, 吴继忠2   

  1. 1. 中铁七局集团第三工程有限公司, 陕西 西安 710043;
    2. 南京工业大学测绘科学与技术学院, 江苏 南京 211816
  • 收稿日期:2025-07-08 发布日期:2026-03-12
  • 通讯作者: 吴继忠。E-mail:jzwu@njtech.edu.cn
  • 作者简介:姜磊(1982—),男,高级工程师,主要从事城市轨道交通测量的工作。E-mail:330352755@qq.com
  • 基金资助:
    自然资源部国土卫星遥感应用重点实验室开放基金科研项目(KLSMNR-K202307);中铁七局集团第三工程有限公司科技研究开发课题(GX2208)

Semantic segmentation of UAV orthophoto images based on improved U-Net3+ model

JIANG Lei1, LIANG Cong1, ZHAO Xu1, WANG Peng1, YAN Wenkai1, YANG Hongding2, WU Jizhong2   

  1. 1. The Third Engineering Co., Ltd., China Railway Seventh Group, Xi'an 710043, China;
    2. School of Geomatics Science and Technology, Nanjing Tech University, Nanjing 211816, China
  • Received:2025-07-08 Published:2026-03-12

摘要: 为解决U-Net3+模型在无人机正射影像语义分割时特征抽象层次不足与跨尺度特征冗余的问题,本文提出了一种改进的U-Net3+模型。改进模型引入基于残差网络架构的深度卷积神经网络ResNet50作为特征提取主干网络,同时引入卷积注意力模块作为轻量级注意力机制。试验结果表明:改进U-Net3+模型的总体准确率、平均交并比、F1分数比原始U-Net3+分别高出8.3%、2.6%和1.9%,且优于FCN、U-Net、U-Net++和DeepLab系列主流语义分割模型,改进U-Net3+模型在典型场景下表现出更强的特征区分能力和更高的准确性;仅引入ResNet50或CBAM无法达到最佳效果,ResNet50与CBAM的协同作用可显著增强模型在复杂场景下的识别能力。改进U-Net3+模型的分割精度有明显改善,为无人机正射影像语义分割提供了有效的技术解决方案。

关键词: 无人机正射影像, 语义分割, U-Net3+, ResNet50, 卷积注意力模块

Abstract: To address the limitations of insufficient feature abstraction and cross-scale feature redundancy in semantic segmentation of unmanned aerial vehicle (UAV)orthophoto images using the U-Net3+ model,this study proposes an improved U-Net3+ architecture.The improvement incorporates ResNet50,a deep convolutional neural network based on residual network,as the backbone for feature extraction.Simultaneously,the convolutional block attention module (CBAM)is integrated as a lightweight attention mechanism.Experimental results demonstrate that the proposed U-Net3+ model delivers significant improvements in segmentation performance,achieving an 8.3% increase in overall accuracy,2.6% in mean intersection over union,and 1.9% in F1-score compared to the original U-Net3+ model.The proposed model consistently outperforms established benchmarks,including FCN,U-Net,U-Net++,and the DeepLab series,across all evaluation metrics,demonstrating superior feature discrimination and segmentation accuracy in representative scene types.Moreover,the integration of either ResNet50 or CBAM alone results in moderate gains,their combined implementation leads to a notable synergistic effect,yielding the most effective results in segmentation tasks.The improved U-Net3+ model has significantly improved the segmentation accuracy,providing an effective technical solution for semantic segmentation of UAV orthophoto maps.

Key words: UAV orthophoto images, semantic segmentation, U-Net3+, ResNet50, convolutional block attention module

中图分类号: