Acta Geodaetica et Cartographica Sinica ›› 2016, Vol. 45 ›› Issue (5): 616-622.doi: 10.11947/j.AGCS.2016.20150181

Previous Articles     Next Articles

A Bootstrapping Based Approach for Open Geo-entity Relation Extraction

YU Li1,2, LU Feng1,3, LIU Xiliang1   

  1. 1. State Key Lab of Resources and Environmental Information System, The Institute of Geographic Sciences and Natural Resources Research, Beijing 100101, China;
    2. University of Chinese Academy of Sciences, Beijing 100101, China;
    3. Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing 210023, ChinaAbstract
  • Received:2015-04-07 Revised:2016-02-02 Online:2016-05-20 Published:2016-05-30
  • Supported by:
    The National Natural Science Foundation of China (No.41271408);The National High-Tech Research and Development Program of China (863 Program) (No.2013AA120305)

Abstract: Extracting spatial relations and semantic relations between two geo-entities from Web texts, asks robust and effective solutions. This paper puts forward a novel approach: firstly, the characteristics of terms (part-of-speech, position and distance) are analyzed by means of bootstrapping. Secondly, the weight of each term is calculated and the keyword is picked out as the clue of geo-entity relations. Thirdly, the geo-entity pairs and their keywords are organized into structured information. Finally, an experiment is conducted with Baidubaike and Stanford CoreNLP. The study shows that the presented method can automatically explore part of the lexical features and find additional relational terms which neither the domain expert knowledge nor large scale corpora need. Moreover, compared with three classical frequency statistics methods, namely Frequency, TF-IDF and PPMI, the precision and recall are improved about 5% and 23% respectively.

Key words: text mining, geo-entities, relation extraction, quantitative evaluation, bootstrapping

CLC Number: