测绘通报 ›› 2026, Vol. 0 ›› Issue (3): 57-61.doi: 10.13474/j.cnki.11-2246.2026.0310

• 学术研究 • 上一篇    下一篇

CSG-Net:一种融合域适应与视觉基础模型SAM的遥感影像建筑物足迹提取方法

王椰1, 张新长1, 姜明2, 阮永俭1   

  1. 1. 广州大学地理科学与遥感学院, 广东 广州 510006;
    2. 中山大学地理科学与规划学院, 广东 广州 510006
  • 收稿日期:2025-10-09 出版日期:2026-03-25 发布日期:2026-04-08
  • 通讯作者: 张新长。E-mail:zhangxc@gzhu.edu.cn
  • 作者简介:王椰(2001—),男,硕士生,主要研究方向为智慧城市与城市遥感。E-mail:2112301066@e.gzhu.edu.cn
  • 基金资助:
    国家自然科学基金(42371406;42401518;42571533);广东省自然科学基金(2025A1515011066)

CSG-Net: a method for building footprint extraction from remote sensing images by integrating domain adaptation and the visual foundation model SAM

WANG Ye1, ZHANG Xinchang1, JIANG Ming2, RUAN Yongjian1   

  1. 1. School of Geography and Remote Sensing, Guangzhou University, Guangzhou 510006, China;
    2. School of Geography and Planing, Sun at-sen University, Guangzhou University, Guangzhou 510006, China
  • Received:2025-10-09 Online:2026-03-25 Published:2026-04-08

摘要: 针对深度学习建筑物足迹提取模型在跨平台与跨分辨率应用中因域间分布不一致导致的泛化能力显著下降问题,本文提出了一种跨尺度几何精炼网络(CSG-Net),构建了一个“概率-几何”串联的伪标签精炼框架,旨在提升模型在无标签目标域中的适应性与提取精度。首先,通过计算模型双预测分支的Jensen-Shannon散度(JSD),实现对伪标签的不确定性度量与概率加权,以软性方式抑制不可靠区域的噪声;然后,引入基于segment anything model(SAM)分割结果的几何先验,通过重叠率分析对初始伪标签的边界进行硬性几何修正,从而生成高质量的训练目标。在跨尺度建筑物提取任务上的试验表明,CSG-Net的交并比(IoU)达到73.05%,显著优于Baseline(52.49%)及其他先进域适应方法,验证了本文框架在提升跨域稳健性和提取精度方面的有效性。

关键词: 遥感影像, 建筑物足迹, 语义分割, 域适应, segment anything model(SAM)

Abstract: To address the substantial decline in generalization performance of deep learning models for building footprint extraction across different platforms and spatial resolutions caused by inter-domain distribution inconsistency,this study introduces a cross-scale geometric-refined network (CSG-Net) aimed at enhancing both adaptability and extraction accuracy in unlabeled target domains.The proposed approach establishes a cascaded probabilistic-geometric pseudo-label refinement framework.Firstly,pseudo-label uncertainty is quantified by computing the Jensen-Shannon divergence (JSD) between the dual prediction branches of the model,and a probabilistic weighting scheme is applied to suppress noise in unreliable regions.Subsequently,geometric priors derived from the segmentation results of the segment anything model (SAM) are incorporated to perform explicit geometry-constrained refinement of pseudo-label boundaries based on overlap ratio analysis,thereby generating high-quality training targets.Extensive experiments on challenging cross-scale building extraction tasks demonstrate that CSG-Net achieves an intersection over union (IoU) of 73.05%,markedly surpassing the Baseline (52.49%) and outperforming other state-of-the-art domain adaptation approaches.These findings confirm the effectiveness of the proposed framework in improving cross-domain robustness and extraction accuracy.

Key words: remote sensing imagery, building footprint, semantic segmentation, domain adaptation, segment anything model (SAM)

中图分类号: