Bulletin of Surveying and Mapping ›› 2025, Vol. 0 ›› Issue (12): 121-125,162.doi: 10.13474/j.cnki.11-2246.2025.1221

Previous Articles    

U-Net++ remote sensing semantic segmentation method optimized based on visual foundation models

CHEN Zhongchao, SUN Junying, TAN Deng'ao, LI Juan, CHENG Qihuan, YANG Mingke   

  1. The Second Surveying and Mapping Institude of Guizhou Province, Guiyang 550001, China
  • Received:2025-08-13 Published:2025-12-31

Abstract: To break through the bottlenecks of high dependence on samples and multi-scale feature fusion in previous remote sensing semantic segmentation,this paper proposes a remote sensing semantic segmentation method VF-UNet++,which integrates the visual foundation model SAM with the U-Net++ architecture.This method innovatively constructs a dual-stream feature collaborative modeling framework and designs a dual-stream feature interactive fusion mechanism to achieve cross-scale feature fusion.Meanwhile,aiming at the characteristics of remote sensing scenarios,it proposes an adaptation strategy of parameter freezing and domain knowledge injection,which effectively improves the generalization ability of the model under the condition of limited samples.Experiments based on the Inria Aerial Image Labeling Dataset demonstrate that VF-UNet++ outperforms the comparison models in metrics such as recall,F1-score,and mIoU.This method effectively tackles the challenges in transferring and adapting visual foundation models to the remote sensing domain,offering a reference paradigm for intelligent remote sensing interpretation under low-sample conditions.Additionally,it overcomes the limitation of inadequate fusion between global semantic features and local detailed features,achieving a dual enhancement in segmentation accuracy and model robustness within complex remote sensing scenarios.

Key words: remote sensing semantic segmentation, dual-stream feature collaborative modeling, domain adaptation optimization, channel attention mechanism

CLC Number: