测绘学报 ›› 2016, Vol. 45 ›› Issue (5): 623-630.doi: 10.11947/j.AGCS.2016.20150618

• 地图学与地理信息 • 上一篇    

顾及位置关系的网络POI地址信息标准化处理方法

王勇1,2, 刘纪平2, 郭庆胜1, 罗安2   

  1. 1. 武汉大学资源与环境科学学院, 湖北 武汉 430079;
    2. 中国测绘科学研究院, 北京 100830
  • 收稿日期:2015-12-08 修回日期:2016-03-22 出版日期:2016-05-20 发布日期:2016-05-30
  • 作者简介:王勇(1976-),男,副研究员,研究方向为网络地理信息获取与挖掘。E-mail: wangyong@casm.ac.cn
  • 基金资助:
    国家863计划(2012AA12A402;2013AA12A403);国家自然科学基金(41471384);国家测绘地理信息局公益科研专项(201512021;201512032)

The Standardization Method of Address Information for POIs from Internet Based on Positional Relation

WANG Yong1,2, LIU Jiping2, GUO Qingsheng1, LUO An2   

  1. 1. School of Resource and Environmental Sciences, Wuhan University, Wuhan 430079, China;
    2. Chinese Academy of Surveying and Mapping, Beijing 100830, ChinaAbstract
  • Received:2015-12-08 Revised:2016-03-22 Online:2016-05-20 Published:2016-05-30
  • Supported by:
    The National High-tech Research and Development Program of China (863 Program) (Nos.2012AA12A402;2013AA12A403);The National Natural Science Foundation of China (No.41471384);Research Projects of Public Welfare for Surveying and Mapping Industry(Nos. 201512021;201512032)

摘要: 针对互联网POI(兴趣点)地址信息中广泛存在的地址要素不完整、文字表达不一致等不规范现象,提出一种顾及位置关系的网络POI地址信息标准化处理方法,首先对POI信息进行切分提取并逐层匹配地址树模型;然后基于4种位置关系从标准POI库中选出相应集合,作为丰富和修正非标准POI地址要素的候选;最后通过最小粒度地址要素的回溯,实现POI地址信息的快速标准化处理。试验表明该方法可以获得较高的准确率,尤其适用于在互联网数据环境中的POI地址信息标准化。

关键词: 网络POI, 地址树, 位置关系, 地址标准化

Abstract: As points of interest (POI)on the internet, exists widely incomplete addresses and inconsistent literal expressions, a fast standardization processing method of network POIs address information based on spatial constraints was proposed. Based on the model of the extensible address expression, first of all, address information of POI was segmented and extracted. Address elements are updated by means of matching with the address tree layer by layer. Then, by defining four types of positional relations, corresponding set are selected from standard POI library as candidate for enrichment and amendment of non-standard address. At last, the fast standardized processing of POI address information was achieved with the help of backtracking address elements with minimum granularity. Experiments in this paper proved that the standardization processing of an address can be realized by means of this method with higher accuracy in order to build the address database.

Key words: POIs from internet, addresses tree, positional relation, standalization of address

中图分类号: