Bulletin of Surveying and Mapping ›› 2021, Vol. 0 ›› Issue (10): 108-113.doi: 10.13474/j.cnki.11-2246.2021.315

Previous Articles     Next Articles

Evaluation of the credibility of multi-source address elements: a case study of road feature

SUN Licai1,2,3,4, CHEN Yisong5, XIONG Jie5, LUO An2, WANG Yong1,2   

  1. 1. Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China;
    2. Chinese Academy of Surveying & Mapping, Beijing 100036, China;
    3. National-Local Joint Engineering Research Center of Technologies and Applications for National Geographic State Monitoring, Lanzhou 730070, China;
    4. Gansu Provincial Engineering Laboratory for National Geographic State Monitoring, Lanzhou 730070, China;
    5. China Telecom Corporation Limited Sichuan Branch, Chengdu 610015, China
  • Received:2021-01-18 Online:2021-10-25 Published:2021-11-13

Abstract: With the development of spontaneous geographic information and Chinese address element segmentation technology, the quality of address elements needs to be evaluated. Aiming at the problem that the quality of address elements produced by Chinese address text segmentation is difficult to effectively evaluate, this paper proposes a method for evaluating the credibility of address elements supported by multi-source data and network retrieval. Firstly, the Chinese word segmentation tool is used to segment the address elements and part-of-speech tagging. By analyzing the word frequency and part-of-speech combination mode, the credibility of the naming structure of the address elements is calculated. Then, based on large-scale address samples, road data, and POI data, excavate the data support of multi-source data to address elements, and calculate the data support. Then use the search engine to retrieve the address elements quickly, analyze the search results and quantity, and calculate the network credibility of the address elements. Finally, a comprehensive credibility calculation model for address elements is proposed to realize the comprehensive credibility calculation of address elements. Experimental results show that the model and method can not only efficiently and quickly calculate the credibility of address elements in Chinese address texts, but also effectively discover the remoteness and falsehood of address elements, which provides a reference for the automatic detection and standardization of address elements.

Key words: multi-source data, credibility evaluation, Chinese word segmentation, address element, information normalize

CLC Number: