Acta Geodaetica et Cartographica Sinica ›› 2021, Vol. 50 ›› Issue (9): 1203-1210.doi: 10.11947/j.AGCS.2021.20210191

• Smart Surveying and Mapping • Previous Articles     Next Articles

Multi-agent cooperative control for traffic signal on geographic road network

ZHENG Ye1,2, GUO Renzhong1,2, MA Ding1,2, ZHAO Zhigang1,2, LI Xiaoming1,2   

  1. 1. Research Institute for Smart Cities, School of Architecture and Urban Planning, Shenzhen University, Shenzhen 518060, China;
    2. Key Laboratory of Spatial Information Smart Sensing and Services, Shenzhen 518060, China
  • Received:2021-04-13 Revised:2021-06-21 Published:2021-10-09
  • Supported by:
    The National Key Research and Development Program of China (Nos. 2018YFB2100700; 2019YFB2103104; 2019YFB210310); China Postdoctoral Science Foundation (No. 2019M663070)

Abstract: Urban traffic efficiency is one of the key factors affecting urban productivity and is also a crucial topic in the process of smart city construction. With the development of computer technology, artificial intelligence, especially reinforcement learning, plays an increasingly important role in traffic signal control. Currently, traffic signal control based on reinforcement learning is mainly used for the optimization for simple scenarios, such as single road intersection or urban arterial road, not yet for regional coordinated control on an urban geographic road network. This paper is motivated to fill this gap by proposing a two-layered agent cooperative control approach based on reinforcement learning. The first layer implements a coarse-tuning training at a single intersection, where the agents make the single intersection non-blocking by observing the queue length for each lane; In the second layer, the coarse-tuning-trained agent models are put into the geographic network to execute the cooperative fine-tuning training at multiple intersections. This paper conducts the optimization-orientated traffic coordination through a case study of a middle school area in Ningbo. The results show that our control approach is superior to the traditional fixed timing scheme in terms of the passage efficiency.

Key words: geographic road network, traffic signal control, cooperative control, reinforcement learning

CLC Number: