Bulletin of Surveying and Mapping ›› 2025, Vol. 0 ›› Issue (4): 27-33.doi: 10.13474/j.cnki.11-2246.2025.0405

Previous Articles    

Real-time semantic SLAM algorithm based on dynamic scenes

FU Qiang1,2, ZHONG Zhen1,2, JI Yuanfa1,2,3,4, REN Fenghua1,2   

  1. 1. Guangxi Key Laboratory of Precision Navigation Technology and Application, Guilin University of Electronic Technology, Guilin 541004, China;
    2. School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China;
    3. Joint Laboratory for International Cooperation on Spatio-Temporal Information and Intelligent Location Services, Guilin 541004, China;
    4. Guangxi Institute of Industry and Research Spatio-Temporal Information Technology Co., Ltd., Nanning 530023, China
  • Received:2024-08-01 Published:2025-04-28

Abstract: Aiming at the problems of traditional visual SLAM (simultaneous localization and mapping) in indoor dynamic environment with low localization accuracy, poor robustness and poor real-time performance after combining with deep learning as well as the inability to construct dense maps, this paper proposes an improved algorithm based on ORB-SLAM3. First, a lightweight SegFormer semantic segmentation network is used to identify dynamic objects present in the image, and then a mask image adaptive expansion method is added to automatically adjust the mask expansion range according to the total number of feature points to more effectively remove potential dynamic object feature points; second, the bag-of-words model is improved to enhance the loading and matching speed of the algorithm; and lastly, a dense map building thread is added to construct a map for removing dynamic features according to the mask information and keyframes to construct a dense point cloud map after removing dynamic features. The experimental results show that the algorithm in this paper can effectively remove dynamic object feature points in highly dynamic scenes, and improve the localization accuracy and robustness of the system,and the average processing speed is 20.3FPS, which basically meets the requirements of real-time operation.

Key words: VSLAM, ORB-SLAM3, semantic segmentation, dense mapping

CLC Number: