A recognition method for building group pattern integrating deep graph infomax and multilayer perceptron

doi:10.11947/j.AGCS.2026.20250348

Acta Geodaetica et Cartographica Sinica, 2026, Vol. 55, Issue (3): 425-438. doi: 10.11947/j.AGCS.2026.20250348

• New Theories and Methods of Cartography in the Digital-Intelligent Era •

A recognition method for building group pattern integrating deep graph infomax and multilayer perceptron

Xiaomin LU¹,²,³, Zhiyi ZHANG¹,²,³, Haowen YAN¹,²,³, Yi HE¹,²,³, Xiaoning SU¹,²,³

1. Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China
2. National-Local Joint Engineering Research Center of Technologies and Applications for National Geographic State Monitoring, Lanzhou 730070, China
3. Key Laboratory of Science and Technology in Surveying & Mapping, Gansu Province, Lanzhou 730070, China

Received: 2025-08-28  Revised: 2026-03-03  Online: 2026-04-16  Published: 2026-04-16
About the author: LU Xiaomin (b. 1982), female, PhD, professor. Her research interests include map generalization, spatial pattern recognition, and intelligent computing of spatial relationships. E-mail: xiaominlu08@mail.lzjtu.cn
Supported by: The National Natural Science Foundation of China (42471476; 42161066); Key Program of Gansu Provincial Natural Science Foundation (24JRRA224)

Abstract:

Building group pattern recognition is a key problem in automated map generalization and urban spatial understanding. Existing methods are limited in pattern coverage, in their dependence on subjectively chosen thresholds, in generalization ability, and in their reliance on labeled samples. To address these limitations, this paper combines the unsupervised representation learning of deep graph infomax (DGI) with the classification capability of a multilayer perceptron (MLP) to build a multi-pattern recognition model for building groups, with the aim of achieving accurate recognition that generalizes well when few labeled samples are available. First, building groups are delineated and their geometric models constructed from the road network and the minimum spanning tree of the buildings. Next, features of individual buildings and global features of each group are extracted, and DGI is used for unsupervised graph representation learning: by maximizing the mutual information between graph-level and node-level representations, it captures the complex topological dependencies within a group and produces discriminative low-dimensional graph embeddings. Finally, the graph embedding and the global features are fused into a single feature vector and fed to an MLP classifier for end-to-end pattern discrimination, which assigns each group to one of four typical patterns: linear, curved, grid-like, or irregular. Experiments show that the method reaches a highest recognition accuracy of 99.20% on the test set. Even when the number of training samples is reduced substantially (for example, to only 20% of the labeled data), the model still achieves 97.85% accuracy with high recall, indicating greater robustness and data efficiency than the baseline models.

Key words: building group, pattern recognition, deep graph infomax, multilayer perceptron, deep learning
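The pipeline summarized in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the centroids, node features, weight matrices, and the two appended global features are made-up values, the encoder is a single graph-convolution layer with ReLU, and the weights are left untrained, so the code only evaluates the DGI objective once rather than optimizing it. The objective follows the way DGI is commonly formulated: node embeddings from the real graph are scored against a graph-level summary, embeddings from a feature-shuffled copy serve as negatives, and a binary cross-entropy separates the two. Using the summary vector as the graph embedding is also an assumption made for illustration.

```python
import math
import random

def mst_edges(points):
    """Prim's algorithm on the complete Euclidean graph of centroids."""
    in_tree, edges = {0}, []
    while len(in_tree) < len(points):
        i, j = min(
            ((a, b) for a in in_tree for b in range(len(points)) if b not in in_tree),
            key=lambda e: math.dist(points[e[0]], points[e[1]]),
        )
        edges.append((i, j))
        in_tree.add(j)
    return edges

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def normalized_adjacency(edges, n):
    """D^-1/2 (A + I) D^-1/2 for an undirected graph with self-loops."""
    a = [[float(i == j) for j in range(n)] for i in range(n)]
    for i, j in edges:
        a[i][j] = a[j][i] = 1.0
    deg = [sum(row) for row in a]
    return [[a[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def encode(a_hat, x, theta):
    """One graph-convolution layer: H = ReLU(A_hat X Theta)."""
    return [[max(0.0, v) for v in row] for row in matmul(matmul(a_hat, x), theta)]

def readout(h):
    """Graph-level summary s: sigmoid of the mean node embedding."""
    return [sigmoid(sum(col) / len(h)) for col in zip(*h)]

def score(h_i, w, s):
    """Bilinear discriminator D(h_i, s) = sigmoid(h_i^T W s)."""
    ws = [sum(wv * sv for wv, sv in zip(row, s)) for row in w]
    return sigmoid(sum(hv * v for hv, v in zip(h_i, ws)))

def dgi_loss(a_hat, x, theta, w, rng):
    """Binary cross-entropy between real and corrupted (node, summary) pairs."""
    h_pos = encode(a_hat, x, theta)
    x_neg = x[:]
    rng.shuffle(x_neg)  # corruption: permute node features, keep the edges
    h_neg = encode(a_hat, x_neg, theta)
    s = readout(h_pos)
    pos = sum(math.log(score(h, w, s)) for h in h_pos)
    neg = sum(math.log(1.0 - score(h, w, s)) for h in h_neg)
    return -(pos + neg) / (2 * len(x)), s

rng = random.Random(0)
centroids = [(0.0, 0.0), (10.0, 0.5), (20.0, 0.0), (30.0, 0.4)]  # made-up data
features = [[0.20, 0.50, 0.10], [0.30, 0.40, 0.10],
            [0.25, 0.45, 0.20], [0.90, 0.10, 0.70]]  # made-up node features
edges = mst_edges(centroids)
a_hat = normalized_adjacency(edges, len(centroids))
theta = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(3)]  # untrained
w = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(2)]      # untrained
loss, summary = dgi_loss(a_hat, features, theta, w, rng)
fused = summary + [0.8, 0.3]  # append two made-up global features
print(f"DGI loss: {loss:.4f}; fused vector length: {len(fused)}")
```

In a real setting the encoder and discriminator weights would be trained by minimizing this loss with an automatic-differentiation framework, and the fused vector would then be passed to an MLP classifier over the four pattern classes.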

CLC number: P208

Xiaomin LU, Zhiyi ZHANG, Haowen YAN, Yi HE, Xiaoning SU. A recognition method for building group pattern integrating deep graph infomax and multilayer perceptron[J]. Acta Geodaetica et Cartographica Sinica, 2026, 55(3): 425-438.

Link to this article: http://xb.chinasmp.com/CN/10.11947/j.AGCS.2026.20250348

http://xb.chinasmp.com/CN/Y2026/V55/I3/425

	Building group pattern class
Metric (split ratio), %	Linear	Curved	Grid-like	Irregular
Precision (8:1:1)	99.00	99.20	99.80	99.60
Recall (8:1:1)	96.19	99.05	99.00	99.26
Precision (6:2:2)	99.40	99.20	99.70	99.12
Recall (6:2:2)	98.10	99.05	99.36	99.63
Precision (4:3:3)	98.80	98.80	99.60	99.60
Recall (4:3:3)	96.51	97.78	99.14	99.51
Precision (2:4:4)	98.35	98.50	98.95	99.30
Recall (2:4:4)	94.76	97.86	98.71	98.15

Sample split ratio	8:1:1	6:2:2	4:3:3	2:4:4
Training set accuracy (%)	99.83	99.07	99.95	99.50
Test set accuracy (%)	99.20	98.90	98.34	97.85
Validation set accuracy (%)	98.80	99.10	98.40	97.55

Building group pattern class	Precision (%)	Recall (%)
Curved	95.38	97.00
Linear	97.44	95.00
Grid-like	97.42	94.50
Irregular	98.51	99.00

Sample split ratio	Building group pattern class	Precision (%)	Recall (%)
8:1:1	Linear	92.38	92.38
	Curved	88.04	77.14
	Grid-like	87.57	94.87
	Irregular	95.96	93.28
2:4:4	Linear	82.21	86.90
	Curved	83.29	75.36
	Grid-like	82.13	88.08
	Irregular	95.96	96.67

Sample split ratio	Building group pattern class	Precision (%)	Recall (%)
8:1:1	Linear	86.00	81.90
	Curved	80.65	71.43
	Grid-like	82.40	87.82
	Irregular	94.81	97.71
2:4:4	Linear	86.39	65.00
	Curved	79.18	82.38
	Grid-like	76.69	87.92
	Irregular	94.60	93.90

Labeled class (rows) / predicted class (columns)	Curved	Linear	Grid-like	Irregular
Curved	411	10	6	0
Linear	11	398	0	2
Grid-like	0	0	613	13
Irregular	4	2	0	531

Labeled class (rows) / predicted class (columns)	Curved	Linear	Grid-like	Irregular
Curved	194	1	5	0
Linear	10	190	0	0
Grid-like	6	0	191	3
Irregular	0	2	0	198
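Per-class precision and recall can be derived from a confusion matrix such as the one above. The sketch below uses the standard definitions (precision is the diagonal count divided by its column sum, recall is the diagonal count divided by its row sum) and assumes that rows are labeled classes and columns are predicted classes, as the table header indicates. Which of the reported precision and recall tables this matrix belongs to cannot be determined from this page, so the computed values are not claimed to reproduce any of them.

```python
LABELS = ["curved", "linear", "grid-like", "irregular"]

# Rows: labeled class; columns: predicted class (counts copied from the table).
CONFUSION = [
    [194, 1, 5, 0],
    [10, 190, 0, 0],
    [6, 0, 191, 3],
    [0, 2, 0, 198],
]

def per_class_metrics(matrix):
    """Return a list of (precision, recall) pairs, one per class."""
    n = len(matrix)
    col_sums = [sum(matrix[r][c] for r in range(n)) for c in range(n)]
    metrics = []
    for k in range(n):
        true_positive = matrix[k][k]
        row_sum = sum(matrix[k])
        precision = true_positive / col_sums[k] if col_sums[k] else 0.0
        recall = true_positive / row_sum if row_sum else 0.0
        metrics.append((precision, recall))
    return metrics

def accuracy(matrix):
    """Fraction of all samples that lie on the diagonal."""
    total = sum(sum(row) for row in matrix)
    return sum(matrix[k][k] for k in range(len(matrix))) / total

METRICS = dict(zip(LABELS, per_class_metrics(CONFUSION)))
for label, (precision, recall) in METRICS.items():
    print(f"{label:10s} precision={100 * precision:6.2f}%  recall={100 * recall:6.2f}%")
print(f"overall accuracy={100 * accuracy(CONFUSION):.2f}%")
```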

Sample split ratio	Building group pattern class	Precision (%)	Recall (%)
8:1:1	Linear	88.66	81.90
	Curved	95.19	94.29
	Grid-like	90.30	95.51
	Irregular	99.20	98.71
2:4:4	Linear	96.07	93.10
	Curved	74.02	80.71
	Grid-like	87.27	83.90
	Irregular	98.70	98.52

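Removed feature type	Removed feature	Test-set accuracy (%)
Individual feature	Area	92.91
	Perimeter	95.17
	Density	94.23
	Compactness	93.88
	Representative orientation	95.17
Global feature	Directional entropy	88.80
	Orthogonality index	92.59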
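To make the ablated features more concrete, the following sketch shows one plausible way to compute a few of them. The exact definitions used in the paper are not given on this page, so everything here is an assumption: compactness is taken as the isoperimetric quotient 4πA/P², a building's orientation is taken from its longest edge, and directional entropy is taken as the Shannon entropy of a histogram of orientations in [0°, 180°) with a hypothetical 18 bins. The function names are illustrative only.

```python
import math

def area_perimeter(polygon):
    """Shoelace area and perimeter of a simple polygon [(x, y), ...]."""
    twice_area = 0.0
    perimeter = 0.0
    for (x1, y1), (x2, y2) in zip(polygon, polygon[1:] + polygon[:1]):
        twice_area += x1 * y2 - x2 * y1
        perimeter += math.hypot(x2 - x1, y2 - y1)
    return abs(twice_area) / 2.0, perimeter

def compactness(polygon):
    """Isoperimetric quotient 4*pi*A / P^2 (equals 1 for a circle)."""
    area, perimeter = area_perimeter(polygon)
    return 4.0 * math.pi * area / (perimeter * perimeter)

def longest_edge_orientation(polygon):
    """Orientation in [0, 180) degrees of the polygon's longest edge."""
    best_length, best_angle = -1.0, 0.0
    for (x1, y1), (x2, y2) in zip(polygon, polygon[1:] + polygon[:1]):
        length = math.hypot(x2 - x1, y2 - y1)
        if length > best_length:
            best_length = length
            best_angle = math.degrees(math.atan2(y2 - y1, x2 - x1)) % 180.0
    return best_angle

def directional_entropy(orientations, bins=18):
    """Shannon entropy (bits) of a histogram of orientations in [0, 180)."""
    counts = [0] * bins
    for angle in orientations:
        counts[int((angle % 180.0) / (180.0 / bins)) % bins] += 1
    total = len(orientations)
    return -sum(c / total * math.log2(c / total) for c in counts if c)

square = [(0, 0), (1, 0), (1, 1), (0, 1)]
bar = [(0, 0), (4, 0), (4, 1), (0, 1)]
print(area_perimeter(square), compactness(square))
print(directional_entropy([longest_edge_orientation(bar)] * 5))
```

Under these assumed definitions, a group whose buildings all share one orientation has zero directional entropy, while a group with widely scattered orientations has high entropy, which is consistent with the intuition that such a feature helps separate regular from irregular patterns.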
