Acta Geodaetica et Cartographica Sinica ›› 2024, Vol. 53 ›› Issue (9): 1817-1828.doi: 10.11947/j.AGCS.2024.20220720

• Cartography and Geoinformation • Previous Articles    

Research on knowledge extraction from street scene images based on hybrid intelligence

Wanzeng LIU1,2,3(), Hang CHEN2,4, Jiaxin REN2,5(), Zhaojiang ZHANG4, Ran LI1,2,3, Tingting ZHAO1,2,3, Xi ZHAI1,2,3, Xiuli ZHU1,2,3   

  1. 1.National Geomatics Center of China, Beijing 100830, China
    2.Key Laboratory of Spatio-temporal Information and Intelligent Services, Ministry of Natural Resources of China, Beijing 100830, China
    3.Hubei Luojia Laboratory, Wuhan 430079, China
    4.College of Mining and Geomatics Engineering, Hebei University of Engineering, Handan 056038, China
    5.School of Geosciences and Info-Physics, Central South University, Changsha 410083, China
  • Received:2022-12-31 Published:2024-10-16
  • Contact: Jiaxin REN E-mail:luwnzg@163.com;jaycecd@foxmail.com
  • About author:LIU Wanzeng (1970—), male, PhD, professorate senior engineer, majors in spatio-temporal knowledge service. E-mail: luwnzg@163.com
  • Supported by:
    The National Natural Science Foundation of China(42394062);The National Key Research and Development Program of China(2022YFB3904205);The Open Fund of Hubei Luojia Laboratory(220100037)

Abstract:

This study presents a hybrid intelligence-based approach, named K-CAPSNet, for extracting knowledge from streetscape images. To tackle the challenge of intelligent extraction of streetscape image objects, we develop a panoramic segmentation network with a joint attention mechanism that integrates both channel information and spatial information of streetscape images. This improves the object segmentation accuracy. Additionally, we incorporate streetscape knowledge, which is formed by people in production and life, into the streetscape image cognition process. We set the object marking threshold using a priori knowledge to optimize the segmentation results. Moreover, we utilize the a priori knowledge of streetscape images to verify the topological relationship between streetscape objects and to mine spatial relationship knowledge using depth information. Finally, we employ semantic templates to describe and express the type, number, and spatial relationship between streetscape objects. The experimental results demonstrate that our method outperforms the baseline network and significantly improves the quality of panoramic segmentation and recognition, thereby achieving better extraction and expression of the knowledge of streetscape images.

Key words: hybrid intelligence, prior knowledge, panoptic segmentation, scene cognition, attentional mechanisms, spatial relationships

CLC Number: