融合深度特征的无人机影像SfM重建

doi:10.11947/j.AGCS.2024.20220636

测绘学报 ›› 2024, Vol. 53 ›› Issue (2): 321-331.doi: 10.11947/j.AGCS.2024.20220636

融合深度特征的无人机影像SfM重建

姜三^1,2, 刘凯¹, 李清泉², 江万寿³

1. 中国地质大学(武汉)计算机学院, 湖北武汉 430074;
2. 人工智能与数字经济广东省实验室(深圳), 广东深圳 518060;
3. 武汉大学测绘遥感信息工程国家重点实验室, 湖北武汉 430079

收稿日期:2022-11-08 修回日期:2024-01-10 发布日期:2024-03-08
通讯作者: 李清泉 E-mail:liqq@szu.edu.cn
作者简介:姜三(1987-),男,博士,副研究员,研究方向为多源影像匹配和三维重建的理论和方法。E-mail:jiangsan@cug.edu.cn
基金资助:
国家自然科学基金（42371442）；湖北省自然科学基金（2023AFB568）；人工智能与数字经济广东省实验室（深圳）开放课题资助（GML-KF-22-08）

Learned local features for SfM reconstruction of UAV images

JIANG San^1,2, LIU Kai¹, LI Qingquan², JIANG Wanshou³

1. School of Computer Science, China University of Geosciences, Wuhan 430074, China;
2. Guangdong Laboratory of Artificial Intelligence and Digital Economy (Shenzhen), Shenzhen 518060, China;
3. State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China

Received:2022-11-08 Revised:2024-01-10 Published:2024-03-08
Supported by:
The National Natural Science Foundation of China (No.42371442); The Hubei Provincial Natural Science Foundation of China (No.2023AFB568); The Open Research Fund from the Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ) (No.GML-KF-22-08)

摘要/Abstract

摘要： 可靠特征匹配是无人机影像运动恢复结构（SfM）的重要环节。近年来，深度学习被用于特征提取和匹配，在基准数据集表现优于SIFT等手工特征。但是，公开模型往往采用互联网照片进行训练和测试，鲜有用于无人机影像SfM三维重建的性能评价。利用多组不同特点的无人机数据集，本文对比分析手工特征和深度学习特征在无人机影像特征匹配和SfM三维重建的综合性能。试验结果表明，利用公开的预训练模型，结合手工特征的高精度定位和深度学习的特征描述能力，可实现更准确和完整的特征匹配，并在SfM三维重建中取得与SIFT等手工特征相当，甚至更优的性能。

关键词: 摄影测量, 三维重建, 运动恢复结构, 深度特征, 卷积神经网络

Abstract: Reliable feature matching plays an essential role in SfM (structure from motion) for UAV (unmanned aerial vehicle) images. Recently, deep learning-based methods have been used for feature detection and matching, which outperforms traditional handcrafted methods, e.g., SIFT, on benchmark datasets. However, few studies have reported their performance on UAV images as these models are trained and tested using internet photos. By using UAV datasets with varying features, this study evaluated both handcrafted and learned methods in terms of feature matching and SfM-based image orientation. The experimental results show that even with the pretrained public-available models, more accurate and complete feature matching can be obtained through the combination of high-precision localization of handcrafted detectors and the high representation ability of learned descriptors, which has competitive or better performance in SfM-based image orientation when compared with SIFT-like handcrafted methods.

Key words: photogrammetry, 3D reconstruction, structure from motion, learned feature, convolutional neural network

中图分类号:

P237

姜三, 刘凯, 李清泉, 江万寿. 融合深度特征的无人机影像SfM重建[J]. 测绘学报, 2024, 53(2): 321-331.

JIANG San, LIU Kai, LI Qingquan, JIANG Wanshou. Learned local features for SfM reconstruction of UAV images[J]. Acta Geodaetica et Cartographica Sinica, 2024, 53(2): 321-331.

参考文献

[1] JIANG San, JIANG Wanshou, WANG Lizhe. Unmanned aerial vehicle-based photogrammetric 3D mapping:a survey of techniques, applications, and challenges[J]. IEEE Geoscience and Remote Sensing Magazine, 2022, 10(2):135-171.
[2] 陈武, 姜三, 李清泉, 等. 无人机影像增量式运动恢复结构研究进展[J]. 武汉大学学报(信息科学版), 2022, 47(10):1662-1674. CHEN Wu, JIANG San, LI Qingquan,et al. Recent research of incremental structure from motion for unmanned aerial vehicle images[J]. Geomatics and Information Science of Wuhan University, 2022, 47(10):1662-1674.
[3] JIANG San, JIANG Wanshou, HUANG Wei, et al. UAV-based oblique photogrammetry for outdoor data acquisition and offsite visual inspection of transmission line[J]. Remote Sensing, 2017, 9(3):278.
[4] ZHENG J, FU H, LI W, et al. Growing status observation for oil palm trees using unmanned aerial vehicle (UAV) images[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2021, 173:95-121.
[5] 姜三, 许志海, 张峰, 等. 面向无人机倾斜影像的高效SfM重建方案[J]. 武汉大学学报(信息科学版), 2019, 44(8):1153-1161. JIANG San, XU Zhihai, ZHANG Feng, et al. Solution for efficient SfM reconstruction of oblique UAV images[J]. Geomatics and Information Science of Wuhan University, 2019, 44(8):1153-1161.
[6] 张力, 刘玉轩, 孙洋杰, 等. 数字航空摄影三维重建理论与技术发展综述[J]. 测绘学报, 2022, 51(7):1437-1457.DOI:10.11947/J.AGCS.2022.20220130. ZHANG Li, LIU Yuxuan, SUN Yangjie, et al. A review of developments in the theory and technology of three-dimensional reconstruction in digital aerial photogrammetry[J]. Acta Geodaetica et Cartographica Sinica, 2022, 51(7):1437-1457.DOI:10.11947/J.AGCS.2022.20220130.
[7] LOWE D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60(2):91-110.
[8] ARANDJELOVIC R, ZISSERMAN A. Three things everyone should know to improve object retrieval[C]//Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition. New York:ACM Press, 2012:2911-2918.
[9] DONG Jingming, SOATTO S. Domain-size pooling in local descriptors:DSP-SIFT[C]//Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition. Boston:IEEE, 2015:5097-5106.
[10] SUN Yanbiao, ZHAO Liang, HUANG Shoudong, et al. 2-SIFT:sift feature extraction and matching for large images in large-scale aerial photogrammetry[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2014, 91:1-16.
[11] SEDAGHAT A, EBADI H. Remote sensing image matching based on adaptive binning SIFT descriptor[J]. IEEE Transactions on Geoscience and Remote Sensing, 2015, 53(10):5283-5293.
[12] 范大昭, 董杨, 张永生. 卫星影像匹配的深度卷积神经网络方法[J]. 测绘学报, 2018, 47(6):844-853. DOI:10.11947/J.AGCS.2018.20170627. FAN Dazhao, DONG Yang, ZHANG Yongsheng. Satellite image matching method based on deep convolution neural network[J]. Acta Geodaetica et Cartographica Sinica, 2018, 47(6):844-853. DOI:10.11947/J.AGCS.2018.20170627.
[13] 蓝朝桢, 卢万杰, 于君明, 等. 异源遥感影像特征匹配的深度学习算法[J]. 测绘学报, 2021, 50(2):189-202.DOI:10.11947/J.AGCS.2021.20200048. LAN Chaozhen, LU Wanjie, YU Junming,et al. Deep learning algorithm for feature matching of cross modality remote sensing images[J]. Acta Geodaetica et Cartographica Sinica, 2021, 50(2):189-202.DOI:10.11947/J.AGCS.2021.20200048.
[14] JIN Yuhe, MISHKIN D, MISHCHUK A, et al. Image matching across wide baselines:from paper to practice[J]. International Journal of Computer Vision, 2021, 129(2):517-547.
[15] BALNTAS V, RIBA E, PONSA D, et al. Learning local feature descriptors with triplets and shallow convolutional neural networks[C]//Proceedings of 2016 British Machine Vision Conference 2016. York:British Machine Vision Association, 2016:3.
[16] TIAN Yurun, FAN Bin, WU Fuchao. L2-net:deep learning of discriminative patch descriptor in euclidean space[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu:IEEE, 2017:661-669.
[17] MISHCHUK A, MISHKIN D,RADENOVIĆ F, et al. Working hard to know your neighbor's margins:local descriptor learning loss[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. New York:ACM Press, 2017:4829-4840.
[18] LUO Zixin, SHEN Tianwei, ZHOU Lei, et al. GeoDesc:learning local descriptors by integrating geometry constraints[EB/OL].[2023-12-30]. https://arxiv.org/abs/1807.06294.pdf.
[19] DETONE D, MALISIEWICZ T, RABINOVICH A. SuperPoint:self-supervised interest point detection and description[C]//Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.Salt Lake City:IEEE, 2018:224-236.
[20] DUSMANU M, ROCCO I, PAJDLA T, et al. D2-net:a trainable CNN for joint description and detection of local features[C]//Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach:IEEE, 2019:8092-8101.
[21] LUO Zixin, ZHOU Lei, BAI Xuyang, et al. ASLFeat:learning local features of accurate shape and localization[C]//Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE, 2020:6589-6598.
[22] SARLIN P E, DETONE D, MALISIEWICZ T, et al. SuperGlue:learning feature matching with graph neural networks[C]//Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE, 2020:4938-4947.
[23] SUN Jiaming, SHEN Zehong, WANG Yuang, et al. LoFTR:detector-free local feature matching with transformers[C]//Proceedings of 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Nashville:IEEE, 2021:8922-8931.
[24] BALNTAS V, LENC K, VEDALDI A, et al. HPatches:a benchmark and evaluation of handcrafted and learned local descriptors[C]//Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition. Honolulu:IEEE, 2017:5173-5182.
[25] PHILBIN J, CHUM O, ISARD M, et al. Object retrieval with large vocabularies and fast spatial matching[C]//Proceedings of 2007 IEEE Conference on Computer Vision and Pattern Recognition.Minneapolis:IEEE, 2007:1-8.
[26] SCHONBERGER J L, FRAHM J M. Structure-from-motion revisited[C]//Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas:IEEE, 2016:4104-4113.

融合深度特征的无人机影像SfM重建

Learned local features for SfM reconstruction of UAV images

RichHTML

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	肖腾, 王鑫, 梅熙, 叶志伟, 颜青松, 邓非. 摄影测量局部场景稳健合并的并行式运动恢复结构方法[J]. 测绘学报, 2024, 53(2): 332-343.
[2]	廖钊宏, 张依晨, 杨飚, 林明春, 孙文博, 高智. 基于Swin Transformer-CNN的单目遥感影像高程估计方法及其在公路建设场景中的应用[J]. 测绘学报, 2024, 53(2): 344-352.
[3]	王建荣, 杨元喜, 卢学良, 缪毓喆. 光轴位置测量数据辅助立体影像无控定位技术[J]. 测绘学报, 2024, 53(1): 1-7.
[4]	孙一帆, 刘冰, 余旭初, 谭熊, 余岸竹. 图像级高光谱影像高分辨率特征网络分类方法[J]. 测绘学报, 2024, 53(1): 50-64.
[5]	任加新, 刘万增, 陈军, 张蓝, 陶远, 朱秀丽, 赵婷婷, 李然, 翟曦, 王海清, 周晓光, 侯东阳, 王勇. 知识引导的碎片化栅格地形图比例尺智能识别[J]. 测绘学报, 2024, 53(1): 146-157.
[6]	肖天元, 艾廷华, 余华飞, 杨敏, 刘鹏程. 地图综合图卷积神经网络点群简化方法[J]. 测绘学报, 2024, 53(1): 158-172.
[7]	安晓亚, 朱余德, 晏雄锋. 卷积神经网络支持下的建筑物选取方法[J]. 测绘学报, 2023, 52(9): 1574-1583.
[8]	何佳星, 郑南山, 丁锐, 张克非, 陈天悦. 粒子群优化卷积神经网络GNSS-IR土壤湿度反演方法[J]. 测绘学报, 2023, 52(8): 1286-1297.
[9]	雷臻, 张帆, 向瀚宇, 杨冲, 黄先锋. 多视角翻转拍摄物体的完整三维重建[J]. 测绘学报, 2023, 52(8): 1305-1316.
[10]	张蒙蒙, 李伟, 刘欢, 赵旭东, 陶然. 基于形态变换与空间逻辑聚合的高光谱森林树种分类[J]. 测绘学报, 2023, 52(7): 1202-1211.
[11]	顾小虎, 李正军, 缪健豪, 李星华, 沈焕锋. 高分遥感影像双通道并行混合卷积分类方法[J]. 测绘学报, 2023, 52(5): 798-807.
[12]	赵冰冰, 谭骁勇, 杨学习, 石岩, 邓敏. 融合地理条件驱动效应和图卷积的土地利用演化模拟CA模型[J]. 测绘学报, 2023, 52(5): 831-842.
[13]	余东行, 徐青, 赵传, 郭海涛, 卢俊, 林雨准, 刘相云. 注意力引导特征融合与联合学习的遥感影像场景分类[J]. 测绘学报, 2023, 52(4): 624-637.
[14]	张永显, 马国锐, 崔志祥, 张志军. 面向大视角差的无人机影像序列学习型特征匹配[J]. 测绘学报, 2023, 52(2): 230-243.
[15]	孙根云, 王鑫, 安娜, 张爱竹. 基于多源遥感的大尺度高分辨率不透水面深度学习提取方法[J]. 测绘学报, 2023, 52(2): 272-282.