基于改进U-Net3+模型的无人机正射影像语义分割

doi:10.13474/j.cnki.11-2246.2026.0222

测绘通报 ›› 2026, Vol. 0 ›› Issue (2): 137-143.doi: 10.13474/j.cnki.11-2246.2026.0222

基于改进U-Net3+模型的无人机正射影像语义分割

姜磊¹, 梁聪¹, 赵旭¹, 王鹏¹, 闫文凯¹, 杨宏鼎², 吴继忠²

1. 中铁七局集团第三工程有限公司, 陕西西安 710043;
2. 南京工业大学测绘科学与技术学院, 江苏南京 211816

收稿日期:2025-07-08 发布日期:2026-03-12
通讯作者: 吴继忠。E-mail:jzwu@njtech.edu.cn
作者简介:姜磊(1982—),男,高级工程师,主要从事城市轨道交通测量的工作。E-mail:330352755@qq.com
基金资助:
自然资源部国土卫星遥感应用重点实验室开放基金科研项目(KLSMNR-K202307);中铁七局集团第三工程有限公司科技研究开发课题(GX2208)

Semantic segmentation of UAV orthophoto images based on improved U-Net3+ model

JIANG Lei¹, LIANG Cong¹, ZHAO Xu¹, WANG Peng¹, YAN Wenkai¹, YANG Hongding², WU Jizhong²

1. The Third Engineering Co., Ltd., China Railway Seventh Group, Xi'an 710043, China;
2. School of Geomatics Science and Technology, Nanjing Tech University, Nanjing 211816, China

Received:2025-07-08 Published:2026-03-12

摘要/Abstract

摘要： 为解决U-Net3+模型在无人机正射影像语义分割时特征抽象层次不足与跨尺度特征冗余的问题,本文提出了一种改进的U-Net3+模型。改进模型引入基于残差网络架构的深度卷积神经网络ResNet50作为特征提取主干网络,同时引入卷积注意力模块作为轻量级注意力机制。试验结果表明:改进U-Net3+模型的总体准确率、平均交并比、F1分数比原始U-Net3+分别高出8.3%、2.6%和1.9%,且优于FCN、U-Net、U-Net++和DeepLab系列主流语义分割模型,改进U-Net3+模型在典型场景下表现出更强的特征区分能力和更高的准确性;仅引入ResNet50或CBAM无法达到最佳效果,ResNet50与CBAM的协同作用可显著增强模型在复杂场景下的识别能力。改进U-Net3+模型的分割精度有明显改善,为无人机正射影像语义分割提供了有效的技术解决方案。

关键词: 无人机正射影像, 语义分割, U-Net3+, ResNet50, 卷积注意力模块

Abstract: To address the limitations of insufficient feature abstraction and cross-scale feature redundancy in semantic segmentation of unmanned aerial vehicle (UAV)orthophoto images using the U-Net3+ model,this study proposes an improved U-Net3+ architecture.The improvement incorporates ResNet50,a deep convolutional neural network based on residual network,as the backbone for feature extraction.Simultaneously,the convolutional block attention module (CBAM)is integrated as a lightweight attention mechanism.Experimental results demonstrate that the proposed U-Net3+ model delivers significant improvements in segmentation performance,achieving an 8.3% increase in overall accuracy,2.6% in mean intersection over union,and 1.9% in F1-score compared to the original U-Net3+ model.The proposed model consistently outperforms established benchmarks,including FCN,U-Net,U-Net++,and the DeepLab series,across all evaluation metrics,demonstrating superior feature discrimination and segmentation accuracy in representative scene types.Moreover,the integration of either ResNet50 or CBAM alone results in moderate gains,their combined implementation leads to a notable synergistic effect,yielding the most effective results in segmentation tasks.The improved U-Net3+ model has significantly improved the segmentation accuracy,providing an effective technical solution for semantic segmentation of UAV orthophoto maps.

Key words: UAV orthophoto images, semantic segmentation, U-Net3+, ResNet50, convolutional block attention module

中图分类号:

P237

姜磊, 梁聪, 赵旭, 王鹏, 闫文凯, 杨宏鼎, 吴继忠. 基于改进U-Net3+模型的无人机正射影像语义分割[J]. 测绘通报, 2026, 0(2): 137-143.

JIANG Lei, LIANG Cong, ZHAO Xu, WANG Peng, YAN Wenkai, YANG Hongding, WU Jizhong. Semantic segmentation of UAV orthophoto images based on improved U-Net3+ model[J]. Bulletin of Surveying and Mapping, 2026, 0(2): 137-143.

/ 推荐

参考文献

[1] 胡功明,杨春成,徐立,等.改进U-Net的遥感图像语义分割方法[J].测绘学报,2023,52(6):980-989.
[2] ZHU Xiaoxiang,TUIA D,MOU Lichao,et al.Deep learning in remote sensing:a comprehensive review and list of resources[J].IEEE Geoscience and Remote Sensing Magazine,2017,5(4):8-36.
[3] MA Lei,LIU Yu,ZHANG Xueliang,et al.Deep learning in remote sensing applications:a meta-analysis and review[J].ISPRS Journal of Photogrammetry and Remote Sensing,2019,152:166-177.
[4] LI Er,FEMIANI J,XU Shibiao,et al.Robust rooftop extraction from visible band images using higher order CRF[J].IEEE Transactions on Geoscience and Remote Sensing,2015,53(8):4483-4495.
[5] CHEN Yang,FAN Rongshuang,YANG Xiucheng,et al.Extraction of urban water bodies from high-resolution remote-sensing imagery using deep learning[J].Water,2018,10(5):585.
[6] ZHANG Zhengxin,LIU Qingjie,WANG Yunhong.Road extraction by deep residual U-Net[J].IEEE Geoscience and Remote Sensing Letters,2018,15(5):749-753.
[7] XU Yongyang,XIE Zhong,FENG Yaxing,et al.Road extraction from high-resolution remote sensing imagery using deep learning[J].Remote Sensing,2018,10(9):1461.
[8] 王斌,陈占龙,吴亮,等.兼顾连通性的U-Net网络高分辨率遥感影像道路提取[J].遥感学报,2020,24(12):1488-1499.
[9] 王卓,闫浩文,禄小敏,等.一种改进U-Net的高分辨率遥感影像道路提取方法[J].遥感技术与应用,2020,35(4):741-748.
[10] 袁洲,郭海涛,卢俊,等.融合UNet++网络和注意力机制的高分辨率遥感影像变化检测算法[J].测绘科学技术学报,2021,38(2):155-159.
[11] 窦世卿,郑贺刚,徐勇,等.基于U-Net3+的高分遥感影像建筑物提取[J].测绘通报,2022(6):40-44.
[12] 金智文,王宁,肖坚星,等.基于改进U-Net的高分辨率正射影像图田间可行驶道路提取方法[J].农业机械学报,2025,56(2):155-163.
[13] HE Kaiming,ZHANG Xiangyu,REN Shaoqing,et al.Deep residual learning for image recognition[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Las Vegas:IEEE,2016:770-778.
[14] WOO S,PARK J,LEE J Y,et al.CBAM:convolutional block attention module[C]//Proceedings of 2018 Computer Vision-ECCV 2018.Cham:Springer,2018:3-19.
[15] LIAN Xugang,LI Yu,WANG Xiaobing,et al.Research on identification and location of mining landslide in mining area based on improved YOLO algorithm[J].Drones,2024,8(4):150.
[16] 马于博,张爱国,王浩宇,等.基于U-ConvHDNet模型的戈壁砾幕层提取[J].测绘通报,2025(4):68-74.

基于改进U-Net3+模型的无人机正射影像语义分割

Semantic segmentation of UAV orthophoto images based on improved U-Net3+ model

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	高书涵, 周超, 荣梦琪, 刘养东, 刘辉, 沈浩, 贾然, 刘传彬, 张洋, 刘嵘, 申抒含. 面向高压输电线路的三维点云语义分割[J]. 测绘通报, 2026, 0(2): 156-160.
[2]	刘丹莹, 夏既胜. 基于改进PSPNet网络的林窗信息提取方法[J]. 测绘通报, 2026, 0(1): 151-155,171.
[3]	罗惠恒, 杨磊, 文凡, 胡金艳, 王海羽, 李卓建, 高琛. 面向航空影像跨域语义分割的测试时自适应方法[J]. 测绘通报, 2026, 0(1): 100-107.
[4]	邓小龙, 黄志海, 郭波. 改进分割大模型的桥梁缆索损伤语义分割方法[J]. 测绘通报, 2025, 0(9): 112-117.
[5]	王靖凯, 葛星彤, 李兆博, 丁翔, 彭玲. 基于改进HRNet的高速公路路域内光伏板信息提取[J]. 测绘通报, 2025, 0(5): 74-78,99.
[6]	李建, 王健, 王雷, 李敏, 杨立克, 赵艺龙. 双重注意力机制的电力走廊点云语义分割[J]. 测绘通报, 2025, 0(4): 127-133.
[7]	马于博, 张爱国, 王浩宇, 刘帅琪, 靳镜宇, 沈占锋, 李均力. 基于U-ConvHDNet模型的戈壁砾幕层提取[J]. 测绘通报, 2025, 0(4): 68-74.
[8]	符强, 钟振, 纪元法, 任风华. 基于动态场景的实时语义SLAM算法[J]. 测绘通报, 2025, 0(4): 27-33.
[9]	张开洲, 马瑞峰, 贾鑫. 基于语义分割网络和特征匹配的复杂山区高速公路路面线性要素智能提取方法[J]. 测绘通报, 2025, 0(4): 164-169.
[10]	苟长龙, 庞敏, 杨扬. 改进的U-Net卷积网络在遥感影像地物分类中的应用[J]. 测绘通报, 2025, 0(3): 150-155.
[11]	赵兴旺, 赵妍, 刘超, 刘春阳. 基于改进的DeepLabV3+网络的Sentinel-1影像水体提取[J]. 测绘通报, 2025, 0(3): 66-70.
[12]	王超, 付强, 崔志芳, 唐甜. 融合多模态数据的河道遥感地物识别[J]. 测绘通报, 2025, 0(12): 178-183.
[13]	陈忠超, 孙俊英, 谭登澳, 李娟, 成其换, 杨铭珂. 基于视觉基础模型优化的U-Net++遥感语义分割方法[J]. 测绘通报, 2025, 0(12): 121-125,162.
[14]	赵效祖, 苟长龙, 杨扬. 基于CBAM增强的轻量级遥感影像语义分割方法[J]. 测绘通报, 2025, 0(10): 36-42.
[15]	张伟, 张朝龙, 王本林, 蔡安宁. 基于深度学习的多尺度无人机遥感图像道路提取[J]. 测绘通报, 2024, 0(6): 77-81.