测绘学报 ›› 2023, Vol. 52 ›› Issue (3): 478-489.doi: 10.11947/j.AGCS.2023.20210349

• 地图学与地理信息 • 上一篇    下一篇

语义驱动的地理实体关联网络构建与知识服务

凌朝阳1, 李锐1, 吴华意1, 李江3, 桂志鹏2   

  1. 1. 武汉大学测绘遥感信息工程国家重点实验室, 湖北 武汉 430079;
    2. 武汉大学遥感信息工程学院, 湖北 武汉 430079;
    3. 湖北省自然资源厅信息中心, 湖北 武汉 430071
  • 收稿日期:2021-06-28 修回日期:2021-10-17 发布日期:2023-04-07
  • 通讯作者: 李锐 E-mail:ruili@whu.edu.cn
  • 作者简介:凌朝阳(1999-),男,硕士生,研究方向为时空数据关联与挖掘。E-mail:lingzhaoyang@whu.edu.cn
  • 基金资助:
    国家自然科学基金(U20A2091);湖北省自然资源科技(ZRZY2021KJ13);武汉大学知卓时空智能研究基金(ZZJJ202204)

Semantic-driven construction of geographic entity association network and knowledge service

LING Zhaoyang1, LI Rui1, WU Huayi1, LI Jiang3, GUI Zhipeng2   

  1. 1. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China;
    2. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China;
    3. Information Center of Department of Natural Resources of Hubei Province, Wuhan 430071, China
  • Received:2021-06-28 Revised:2021-10-17 Published:2023-04-07
  • Supported by:
    The National Natural Science Foundation of China (No. U20A2091);Natural Resources Science and Technology Fund of Hubei Province (No. ZRZY2021KJ13);Zhizhuo Research Fund on Spatial-Temporal Artificial Intelligence (No. ZZJJ202204)

摘要: 知识服务是GIS的重要应用方向,海量文本数据中蕴含的丰富隐式地理信息的分析与挖掘成为热点研究问题。在自然资源管理领域,一定时空范围内的自然资源分布相对独立和分散,文本中的丰富语义信息零散、庞杂且高度非结构化,缺少有效的组织表达、关联整合与综合应用方案。本文面向自然资源管理领域的文本数据和自然资源实体,提出了语义驱动的地理实体表达框架,通过语义描述、空间位置、属性特征和时间演化四元组来组织表达文本内蕴的地理实体多域信息,并从概念、空间、属性和时间4个维度定义并表示实体间的多类语义关系;继而按照地理实体信息抽取、信息存储和语义关联构建等步骤,给出了多维度地理实体关联网络的构建方法,并设计了基于关联网络的知识问答服务算法;最后,以建设用地审批为例,利用审批过程电子文本数据,完成建设用地信息的实体化表达、建设用地实体关联网络的构建及知识问答服务的实现。试验与分析结果表明,本文的理论与方法能有效促进自然资源管理领域文本中地理信息的有机整合、充分关联与科学管理,为提升自然资源领域信息的应用与社会化服务水平提供切实可行的途径。

关键词: 文本数据, 语义驱动, 地理实体表达框架, 关联网络, 知识问答服务

Abstract: Knowledge service is an important application direction of GIS. The analysis and mining of the rich implicit geographic information contained in massive text data has become a hot research issue. In the field of natural resource management, the distribution of natural resources within a certain temporal and spatial range is relatively independent and scattered. The rich semantic information in the text is fragmented, complex and highly unstructured, lacking effective organization, integration, and comprehensive application solutions. Oriented to text data and natural resource geographic entities, this paper proposes a semantic-driven geographic entity expression framework. It organizes and expresses the multi-domain information of geographic entities through a four-tuple of semantic description, spatial location, attribute characteristics, and temporal evolution. It defines and describes the multiple types of relationships between entities from the four dimensions of concept, space, attributes and time. Following the steps of geographic entity information extraction, information storage and association construction, we give a method for constructing a multi-dimensional geographic entity association network. Then, we design a knowledge question answering algorithm based on the associated network. Finally, taking construction land approval as an example, using electronic text data of the approval process, we complete the materialized expression of construction land information, the construction of the geographic entity association network, and the realization of knowledge question answering service. The experiments and analysis show the theories and methods of this article can effectively promote the organic integration, full association and scientific management of geographic information in the text, and provide practical ways to improve application and social service level of information in the field of natural resources.

Key words: text data, semantic-driven, geographic entity expression framework, association network, knowledge question answering service

中图分类号: