Acta Geodaetica et Cartographica Sinica ›› 2023, Vol. 52 ›› Issue (9): 1595-1608.doi: 10.11947/j.AGCS.2023.20220239

• Cartography and Geoinformation • Previous Articles     Next Articles

Multi-pedestrian trajectory prediction method based on multi-view 3D simulation video learning

CAO Xingwen1,2, ZHENG Hongwei1,2, LIU Ying1,2, WU Mengquan3, WANG Lingyue1,2, BAO Anming1,2, CHEN Xi1,2   

  1. 1. State Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi 830011, China;
    2. College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China;
    3. School of Resources and Environmental Engineering, Ludong University, Yantai 264025, China
  • Received:2022-04-06 Revised:2023-06-08 Published:2023-10-12
  • Supported by:
    The Key Research and Development Program of Xinjiang Uygur Autonomous Region (Nos. 2022B03001-3;2022B03021-3);The National Natural Science Foundation of China (No. NSFCU-U1803120);The Western Light Talents Training Program of CAS (No. 2021-XBQNXZ-012);The National Natural Science Foundation of China (No. 42071385)

Abstract: Multi-pedestrian trajectory prediction is one of the key factors in integrating urban geographic information system and intelligent transportation. To address the problems of insufficient training data, difficult labeling, and low accuracy of pedestrian trajectory prediction in multi-view scenes for existing methods, we propose a novel multi-pedestrian trajectory prediction method based on multi-view 3D simulation video learning. First, a simulation simulator is used to generate the required multi-view pedestrian trajectory annotation data. Then, we mix up the trajectory of the selected view and the adversarial trajectory by a convex combination function to generate the enhanced adversarial trajectory. Next, an advanced detection and tracking algorithm is used to encode and track pedestrian appearance information. Furthermore, the enhanced trajectory and coding information are used as the feature input of a graph attention recurrent neural network to model pedestrian interaction. Finally, the pedestrian trajectory is decoded by a position decoder to extract pedestrian motion characteristics, and multi-pedestrian trajectory prediction is completed. The ADE and FDE accuracies of our method on the ETH/UCY fixed-view dataset are 0.41 and 0.82, respectively. The ADE accuracy on the ActEV/VIRAT and Argoverse multi-view datasets is 17.74 and 65.4, and the FDE accuracy is 34.96 and 172.8.

Key words: 3D simulation, deep learning, trajectory prediction, object tracking, urban geographic information system

CLC Number: