Acta Geodaetica et Cartographica Sinica ›› 2018, Vol. 47 ›› Issue (6): 882-891.doi: 10.11947/j.AGCS.2018.20180040

Previous Articles     Next Articles

Machine Vision Special Issue: Building Match Graph Using Deep Convolution Feature for Structure from Motion

WAN Jie1,2, Alper YILMAZ2   

  1. 1. The Institute of Remote Sensing and Geographic Information System, the School of Earth and Space Sciences, Peking University, Beijing Key Laboratory of Spatial Information Integration and 3S Engineering Application, Beijing 100871, China;
    2. Department of Civil, Environment and Geodetic Engineering, Ohio State University, Ohio 43210, USA
  • Received:2018-01-23 Revised:2018-04-17 Online:2018-06-20 Published:2018-06-21
  • Supported by:
    The National Natural Science Foundation of China (No.41571432)

Abstract: Image matching in an unordered image dataset is quite time-consuming for structure from motion (SfM) due to image matching by comparing features and large number of matches between all image pairs. To reduce matching times, deep convolution feature (DCF) is proposed to create image match graph in this paper. Firstly, the convolutional feature map of an image is extracted using the VGG-16 convolutional neural network trained on ImageNet. Then, the sum pooling is used to process the feature map. Finally, the vector is normalized and used to represent the image. The similarities between an image and all other images is calculated by calculating the distances between these feature vectors. Thus, the match graph is constructed by selecting the top 10 images with highest similarities. The experiment results showed that the proposed DCF can create the match graph effectively, find the potential image pairs. On the Urban and South Building datasets, the results of the SfM reconstruction based on the match graph created by the proposed DCF are almost the same as those of the exhaustive matching, but the number of matches are reduced by 97.4% and 92.1%, respectively. At the same time, the match graph created by the proposed DCF is obviously better than the match graph crated by the DBoW3 in the most advanced SLAM system.

Key words: deep convolution feature, match graph, structure from motion, transfer learning

CLC Number: