测绘学报 ›› 2026, Vol. 55 ›› Issue (3): 425-438.doi: 10.11947/j.AGCS.2026.20250348

• 数智时代地图学新理论与新方法 • 上一篇    下一篇

融合深度图信息最大化和多层感知机的建筑物群组模式识别方法

禄小敏1,2,3(), 张志义1,2,3, 闫浩文1,2,3, 何毅1,2,3, 苏小宁1,2,3   

  1. 1.兰州交通大学测绘与地理信息学院,甘肃 兰州 730070
    2.地理国情监测技术应用国家地方联合工程研究中心,甘肃 兰州 730070
    3.甘肃省测绘科学与技术重点实验室,甘肃 兰州 730070
  • 收稿日期:2025-08-28 修回日期:2026-03-03 出版日期:2026-04-16 发布日期:2026-04-16
  • 作者简介:禄小敏(1982—),女,博士,教授,研究方向为地图综合、空间模式识别和空间关系智能计算。E-mail:xiaominlu08@mail.lzjtu.cn
  • 基金资助:
    国家自然科学基金(42471476; 42161066);甘肃省自然科学基金重点项目(24JRRA224)

A recognition method for building group pattern integrating deep graph infomax and multilayer perceptron

Xiaomin LU1,2,3(), Zhiyi ZHANG1,2,3, Haowen YAN1,2,3, Yi HE1,2,3, Xiaoning SU1,2,3   

  1. 1.Faculty of Geomatics, Lanzhou Jiaotong University, Lanzhou 730070, China
    2.National-Local Joint Engineering Research Center of Technologies and Applications for National Geographic State Monitoring, Lanzhou 730070, China
    3.Key Laboratory of Science and Technology in Surveying & Mapping, Gansu Province, Lanzhou 730070, China
  • Received:2025-08-28 Revised:2026-03-03 Online:2026-04-16 Published:2026-04-16
  • About author:LU Xiaomin (1982—), female, PhD, professor, majors in map generalization, spatial pattern recognition and intelligent computing for spatial relationships. E-mail: xiaominlu08@mail.lzjtu.cn
  • Supported by:
    The National Natural Science Foundation of China(42471476; 42161066);Key Program of Gansu Provincial Natural Science Foundation(24JRRA224)

摘要:

建筑物群组模式识别是地图自动综合与城市空间理解等领域的关键问题。针对现有方法在识别模式覆盖度、阈值主观性、模型泛化能力及对标注样本依赖程度等方面的局限,本文融合深度图信息最大化(DGI)的无监督表示学习与多层感知机(MLP)的分类能力,构建一种面向建筑物群组的多模式识别模型,旨在探索少量标注样本条件下高精度、强泛化的建筑物群组多模式识别路径。首先,依据道路网与建筑物最小生成树完成群组划分与几何模型构建;然后,提取建筑物个体特征与群组全局特征,并引入DGI模型进行无监督图表示学习,通过最大化图级与节点级表示间的互信息,有效捕捉群组内隐含的复杂拓扑依赖关系,生成判别性强的低维图嵌入向量;最后,将图嵌入与全局特征融合为统一特征向量,输入多层感知机(MLP)分类器实现端到端模式判别,从而完成对直线型、曲线型、格网型及不规则型4类典型建筑物群组模式的自动识别。试验结果表明,本文方法在测试集上的最高识别精度达到99.20%;即使在训练样本数量显著减少的情况下(如仅使用20%的标注数据),模型仍可保持97.85%的识别精度与较高的召回率,体现出优于对比模型的稳健性与数据利用效率。

关键词: 建筑物群组, 模式识别, 深度图信息最大化, 多层感知机, 深度学习

Abstract:

Building group pattern recognition is a key issue in fields such as map automatic generalization and urban spatial understanding. To address the limitations of existing methods in terms of pattern coverage, threshold subjectivity, model generalization capability, and reliance on labeled samples, this paper proposes a recognition model that integrates deep graph infomax (DGI) and a multilayer perceptron (MLP), aiming to explore a high-accuracy, strongly generalized approach for recognizing multiple building group patterns under limited labeled samples. First, building groups are partitioned and geometric models are constructed based on the road network and the minimum spanning tree of buildings. Next, individual building features and global group features are extracted, and the DGI model is introduced for unsupervised graph representation learning. By maximizing the mutual information between graph-level and node-level representations, the model effectively captures the complex topological dependencies within groups, generating discriminative low-dimensional graph embeddings. Finally, the graph embeddings and global features are fused into a unified feature vector, which is fed into an MLP classifier for end-to-end pattern discrimination, enabling automatic recognition of four typical building group patterns: linear, curved, grid-like, and irregular. The experimental results indicate that the highest recognition accuracy of the proposed method on the test set reaches 99.20%. Even with a significant reduction in the number of training samples (e.g., using only 20% of the labeled data), the model can still maintain a recognition accuracy of 97.85% along with a high recall rate, demonstrating superior robustness and data utilization efficiency compared to the baseline models.

Key words: building group, pattern recognition, deep graph infomax, multilayer perceptron, deep learning

中图分类号: