Acta Geodaetica et Cartographica Sinica ›› 2026, Vol. 55 ›› Issue (5): 881-893.doi: 10.11947/j.AGCS.2026.20250230

• Photogrammetry and Remote Sensing • Previous Articles     Next Articles

Road extraction method for heterogeneous data using sparse labels

Yuzhun LIN1(), Shuxiang WANG1, Jie RUI1, Fei JIN1(), Jianfang JIANG1, Xibing ZUO2, Xiao LIU1, Yujie ZOU1   

  1. 1.Information Engineering University, Zhengzhou 450001, China
    2.Troops31016, Beijing 100080, China
  • Received:2025-06-05 Revised:2026-04-26 Online:2026-06-23 Published:2026-06-23
  • Contact: Fei JIN E-mail:lyz120218@163.com;jf371@sina.com
  • About author:LIN Yuzhun (1993—), male, phD, associate professor, majors in intelligent processing of remote sensing data. E-mail: lyz120218@163.com
  • Supported by:
    The National Natural Science Foundation of China(42201443);The Natural Science Foundation for Young Students of Henan Province(252300423931)

Abstract:

Labelled data are essential for road extraction from optical images; however, creating high-quality labels is labor-and time-intensive. Moreover, the transferability of network models across different regions, sensors, and imaging times is limited, restricting their broader application in spatio-temporal contexts. To address this issue, we propose a road extraction method for heterogeneous data using sparse labels that combines optical imagery with OpenStreetMap (OSM) data. Sparse road labels are generated through raster processing and coordinate alignment with OSM vector data. Then, the segment anything model (SAM) and simple linear iterative clustering (SLIC) are integrated to extract multi-level image features, thereby facilitating label dissemination through object-level processing for initial optimization. Finally, a network model was trained using both optical images and rough optimization results, and it refined the label accuracy via image-label association mapping and was further optimized with OSM data as a buffer. Experimental validation using the RoadNet and Oklahoma datasets in conjunction with the four semantic segmentation networks UNet, D-LinkNet, MANet and UNetFormer demonstrated that our proposed method outperforms existing methods in terms of both quantitative accuracy and performance, especially in challenging areas of road extraction.

Key words: OSM data, optical image, weak supervision, sparse sample, road extraction

CLC Number: