地图学与地理信息

地址树模型的中文地址提取方法

  • 亢孟军 ,
  • 杜清运 ,
  • 王明军
展开
  • 武汉大学资源与环境科学学院, 湖北 武汉 430079
亢孟军(1983-), 男, 讲师, 主要研究方向为电子地图、地理编码. E-mail: mengjunk@gmail.com

收稿日期: 2014-01-08

  修回日期: 2014-10-05

  网络出版日期: 2015-01-22

基金资助

国家自然科学基金(41201403)

A New Method of Chinese Address Extraction Based on Address Tree Model

  • KANG Mengjun ,
  • DU Qingyun ,
  • WANG Mingjun
Expand
  • School of Resources and Environmental Science, Wuhan University, Wuhan 430079, China

Received date: 2014-01-08

  Revised date: 2014-10-05

  Online published: 2015-01-22

Supported by

The National Natural Science Foundation of China(No.41201403)

摘要

地址是一种对个体地域空间位置信息的编码方法.在我国,由于城市快速发展,地址规划相对落后,非标准地址大量存在.本文在分析标准地址模型空间约束关系类型的基础上,提出了一种基于地址树模型的中文地址提取方法.该模型以拓扑关系作为空间约束关系是否一致的判断标准,可以从非标准地址中提取标准地址,并剔除非标准和错误地址元素.试验证明,该方法有较高的地址匹配率.

本文引用格式

亢孟军 , 杜清运 , 王明军 . 地址树模型的中文地址提取方法[J]. 测绘学报, 2015 , 44(1) : 99 -107 . DOI: 10.11947/j.AGCS.2015.20130205

Abstract

Address is a spatial location encoding method of individual geographical area. In China, address planning is relatively backward due to the rapid development of the city, resulting in the presence of large number of non-standard address. The space constrain relationship of standard address model is analyzed in this paper and a new method of standard address extraction based on the tree model is proposed, which regards topological relationship as consistent criteria of space constraints. With this method, standard address can be extracted and errors can be excluded from non-standard address. Results indicate that higher math rate can be obtained with this method.

参考文献

[1] ZHANG Xueying, ZHU Shaonan, ZHANG Chunju. Annotation of Geographical Named Entities in Chinese Text[J]. Acta Geodaetica et Cartographic Sinica, 2012,41(1):115-120.(张雪英, 朱少楠, 张春菊. 中文文本的地理命名实体标注 [J]. 测绘学报, 2012, 41(1): 115-120.)
[2] PALKOWSKY B,METACARTA I. A New Application Information Discovery:Geography Really Does Matter [C]//Proceedings of the SPE Annuual Technical Conference and Exhibition. Dallas:[s.n.],2005.
[3] ROONGPIBOONSOPIT D, KARIMI H A. Comparative Evaluation and Analysis of Online Geocoding Services [J]. International Geographical Information Science, 2010, 24(7): 1081-1100.
[4] ZHANG Xueying, LÜ Guonian, LI Boqiu,et al. Rule-based Approach to Semantic Resolution of Chinese Addresses[J]. Geoinfomation Science,2010,12(1):9-17.(张雪英, 闾国年, 李伯秋,等. 基于规则的中文地址要素解析方法 [J]. 地球信息科学学报, 2010, 12(1): 9-17.)
[5] ZANDBERGEN P A. A Comparison of Address Point, Parcel and Street Geocoding Techniques [J]. Computers, Environment and Urban Systems, 2008, 32(3): 214-232.
[6] GOLDBERG D W, WILSON J P, KNOBLOCK C A. From Text to Geographic Coordinates: the Current State of Geocoding [J]. URISA Journal, 2007, 19(1): 33-46.
[7] RUSHTON G, ARMSTRONG M P, GITTLER J, et al. Geocoding in Cancer Research: A Review [J]. American Journal of Preventive Medicine, 2006, 30(2): S16-S24.
[8] ZHU Jianwei, WANG Zemin. The Principle of Geocodifying and Its Solution on Localization[J]. Beijing Surveying and Mapping,2004(2):24-27.(朱建伟, 王泽民. 地理编码原理及其本地化解决方案 [J]. 北京测绘, 2004(2):24-27.)
[9] WANG Xiuming. Address Automatic Matching of Geographic Information System[J]. Journal of Minxi Vocational and Technical College,2007, 9(2): 75-77.(王秀明. 地理信息系统地址自动匹配 [J]. 闽西职业技术学院学报, 2007, 9(2): 75-77.)
[10] HU Qing, XU Jianhua, WANG Zhihai. Study on the Method of Address Automatically Matching in GIS Database[J]. Geomatics and Spatial Information Technology, 2008, 31(6): 50-52.(胡青, 徐建华, 王志海. GIS 数据库中地址自动匹配方法研究 [J]. 测绘与空间地理信息, 2008, 31(6): 50-52.)
[11] SUN Yafu, CHEN Wenbin. Address Matching Technology Based on Word Segmentation[C]//Proceedings of China Association of Geographic Information Systems Fourth Congress of the 11th Annual Meeting. Beijing:[s.n.], 2007: 114-125.(孙亚夫, 陈文斌. 基于分词的地址匹配技术 [C]//中国地理信息系统协会第四次会员代表大会暨第十一届年会论文集.北京:[s.n.], 2007: 114-125.)
[12] HUANG Song. Research on Chinese Address Coding Technology[D].Beijing:Beijing University,2005.(黄颂. 中文地址编码技术的研究 [D].北京:北京大学,2005.)
[13] CHU Yaping, YIN Junke, SUN Donghu. The Toponymy Basis Tutorial[M]. Beijing:Sinomap Press,1994.(褚亚平, 尹钧科, 孙冬虎. 地名学基础教程 [M]. 北京:中国地图出版社, 1994.)
[14] ZHANG Xueying, ZHANG Chuju, LÜ Guonian. Design and Analysis of a Classification Scheme of Geographical Named Entities[J]. Geoinformation Science, 2010, 12(2): 220-227.(张雪英, 张春菊, 闾国年. 地理命名实体分类体系的设计与应用分析 [J]. 地球信息科学, 2010, 12(2): 220-227.)
[15] CHEN Jianjun, ZHOU Chenhu, WANG Jinggui. Advances in the Study of the Geo-ontology[J]. Earth Science Frontiers, 2006, 13(3): 81-90.(陈建军, 周成虎, 王敬贵. 地理本体的研究进展与分析 [J]. 地学前缘, 2006, 13(3): 81-90.
[16] CHU Yaping. The City Names Commercialization of Geographic Names Legalization[J]. Chinese Toponym, 1996(1): 4-6.(褚亚平. 城市地名商品化与地名管理法制化 [J]. 中国地名, 1996(1): 4-6.)
[17] CHU Yaping. Urban Planning and Development Can not Ignore the Toponym Planning[J]. Beijing Planning Review, 2004(6): 112-113.(褚亚平. 城市规划发展不能忽略地名规划 [J]. 北京规划建设, 2004(6): 112-113.)
[18] QIN Xuexiu. Three Forms of Placename Data and Their Demand[J]. Bulletin of Surveying and Mapping, 2011(10): 68-69.(秦学秀. 地名数据的3种形式及其质量要求 [J]. 测绘通报, 2011(10): 68-69.)
[19] ZHANG Li. Analysis of Chinese Signposts Language Usage[J]. Lanzhou Academic Journal,2007(3): 206-208.(张黎. 我国地名标志语言文字使用状况分析 [J]. 兰州学刊, 2007(3): 206-208.)
[20] ESRI. ArcGIS Resource [EB/OL].[2013-07-12].http://help.arcgis.com/zh-cn/arcgisdesk-top.
[21] ZHAO Guozhou. To Talk about the Doorplate Reform[J]. Research and Exploration, 1998, 2(1):34-36.(赵国洲. 谈谈门牌改革 [J]. 决策探索, 1998, 2(1):34-36.)
[22] GUO Xiaolin. Discussion on the Management of Doorplate in City[J]. Shandong Economic Strategy Research, 2008(3): 61-62.(郭晓琳. 略论城市建设中的楼门牌设置与管理 [J]. 山东经济战略研究, 2008(3): 61-62.)
[23] LI Qimin. The Social Function of the City Doorplate[J]. Construction Science and Technology, 2002(2): 46-47.(李启明."城市门牌"的社会功能 [J]. 建设科技, 2002(2): 46-47.)
[24] LI Yongheng. The Integration Process of Macau Geography Information Data: Taking Street Door Number Data as an Example[J]. Geomatics World, 2013, (1):87-91.(李永恒. 澳门地理数据的整合进程——以街道门牌数据为例 [J]. 地理信息世界, 2013, (1):87-91.)
[25] HILL L, FREW J, ZHENG Q. Geographic Names: The Implementation of a Gazetteer in a Georeferenced Digital Library [J/OL].[2013-07-13].http://dblp.uni-trier.de/db/journals/dlib.
[26] HILL L L. Core Elements of Digital Gazetteers: Placenames, Categories, and Footprints [M]. Berlin:Springer, 2000: 280-290.
[27] EGENHOFER M J, HERRING J. A Mathematical Framework for the Definition of Topological Relationships [C]//Proceedings of the Fourth International Symposium on Spatial Data Handling, Zurich:[s.n.], 1990: 803-813.
[28] EGENHOFER M J, HERRING J. Categorizing Binary Topological Relations between Regions, Lines, and Points in Geographic Databases [R]. Orono:University of Marine,1999.
[29] DENG Min, LIU Wenbao, FENG Xuezhi. A Generic Model Describing Topological Relations among Area Objects in GIS[J]. Acta Geodaetica et Cartographica Sinica, 2005, 34(1): 85-90.(邓敏, 刘文宝, 冯学智. GIS 面目标间拓扑关系的形式化模型 [J]. 测绘学报, 2005, 34(1): 85-90.)
文章导航

/