Good characterization of road traffic correlations among urban roads can help improve the traffic-related applications,such as traffic interpolation and short-term traffic forecasting. Previous studies model the traffic correlations between two roads by their spatial or topological distances. However,the distance-based methods neglect the spatio-temporal heterogeneity of traffic influence among roads. In this paper,we integrate GPS-enabled vehicle operating travel routes and word embedding techniques in Natural Language Processing (NLP) domain to quantify traffic correlations of road segments in different time intervals. Firstly,the corresponding relationships between transportation elements (i.e.,road segments,travel routes) and NLP terms (i.e.,words,documents) are established. Secondly,the real-valued vectors of road segments are trained from massive travel routes using a word-embedding model called "Word2Vec". Thirdly,the traffic correlation between two roads is measured by the cosine similarity of their vectors. Finally,the results are evaluated using real traffic condition data. Results of a case study using a large-scale taxi trajectory dataset in Beijing show that:①road segments that have stronger traffic correlations are also more similar in their traffic conditions measured by roads' average travel speeds,proving that our approach is capable of quantifying road segment traffic correlations and detecting their spatial heterogeneity;②road segments' traffic correlations are stronger on workday rush hours and holidays than on weekends and workday non-rush hours,proving that our approach is capable of detecting temporal variations. Our approach and analysis provide methodological and theoretical basis for transportation related applications using NLP and machine learning models.
[1] 陆锋,郑年波,段滢滢,等.出行信息服务关键技术研究进展与问题探讨[J].中国图象图形学报,2009,14(7):1219-1229. LU Feng,ZHENG Nianbo,DUAN Yingying,et al.Travel Information Services:State of the Art and Discussion on Crucial Technologies[J].Journal of Image and Graphics,2009,14(7):1219-1229.
[2] 欧阳俊,陆锋,刘兴权,等.基于多核混合支持向量机的城市短时交通预测[J].中国图象图形学报,2010,15(11):1688-1695. OUYANG Jun,LU Feng,LIU Xingquan,et al.Short-term Urban Traffic Forecasting based on Multi-Kernel SVM Model[J].Journal of Image and Graphics,2010,15(11):1688-1695.
[3] WANG Junjie,WEI Dong,HE Kun,et al.Encapsulating Urban Traffic Rhythms into Road Networks[J].Scientific Reports,2014,4:4141.
[4] KAMARIANAKIS Y,PRASTACOS P.Space-time Modeling of Traffic Flow[J].Computers & Geosciences,2005,31(2):119-133.
[5] VLAHOGIANNI E I,KARLAFTIS M G,GOLIAS J C.Optimized and Meta-optimized Neural Networks for Short-term Traffic Flow Prediction:A Genetic Approach[J].Transportation Research Part C:Emerging Technologies,2005,13(3):211-234.
[6] MIN Xinyu,HU Jianming,ZHANG Zuo.Urban Traffic Network Modeling and Short-term Traffic Flow Forecasting based on GSTARIMA Model[C]//Proceedings of the 13th International IEEE Conference on Intelligent Transportation Systems (ITSC).Funchal,Portugal:IEEE,2010:1535-1540.
[7] WANG J,CHENG T,HEYDECKER B,et al.STARIMA for Journey Time Prediction in London[C]//Proceedings of the 5th IMA Conference on Mathematics in Transport.London,UK:IMA,2010.
[8] DING Qingyan,WANG Xifu,ZHANG Xiuyuan,et al.Forecasting Traffic Volume with Space-time ARIMA Model[J].Advanced Materials Research,2011,156-157:979-983.
[9] ZOU Haixiang,YUE Yang,LI Qingquan,et al.An Improved Distance Metric for the Interpolation of Link-based Traffic Data Using Kriging:A Case Study of A Large-scale Urban Road Network[J].International Journal of Geographical Information Science,2012,26(4):667-689.
[10] WHITTAKER J,GARSIDE S,LINDVELD K.Tracking and Predicting A Network Traffic Process[J].International Journal of Forecasting,1997,13(1):51-61.
[11] STATHOPOULOS A,KARLAFTIS M G.A Multivariate State Space Approach for Urban Traffic Flow Modeling and Prediction[J].Transportation Research Part C:Emerging Technologies,2003,11(2):121-135.
[12] MIN Wanli,WYNTER L.Real-time Road Traffic Prediction with Spatio-temporal Correlations[J].Transportation Research Part C:Emerging Technologies,2011,19(4):606-616.
[13] CHENG Tao,HAWORTH J,WANG Jiaqiu.Spatio-temporal Autocorrelation of Road Network Data[J].Journal of Geographical Systems,2012,14(4):389-413.
[14] ZOU Haixiang,YUE Yang,LI Qingquan,2014.Explaining the Urban Traffic State from Road Network Structure and Spatial Variance:Empirical Approach Using Floating Car Data[C]//Proceedings of the 93rd Transportation Research Record Annual Meeting.Washington D.C.,USA.
[15] JIANG Bin.Street Hierarchies:A Minority of Streets Account for A Majority of Traffic Flow[J].International Journal of Geographical Information Science,2009,23(8):1033-1048.
[16] LIU Xiliang,LU Feng,ZHANG Hengcai,et al.Intersection Delay Estimation from Floating Car Data Via Principal Curves:A Case Study on Beijing's Road Network[J].Frontiers of Earth Science,2013,7(2):206-216.
[17] 段滢滢,陆锋.基于道路结构特征识别的城市交通状态空间自相关分析[J].地球信息科学学报,2012,14(6):768-774. DUAN Yingying,LU Feng.Spatial Autocorrelation of Urban Road Traffic Based on Road Network Characterization[J].Journal of Geo-Information Science,2012,14(6):768-774.
[18] 刘康,段滢滢,陆锋.基于拓扑与形态特征的城市道路交通状态空间自相关分析[J].地球信息科学学报,2014,16(3):390-395. LIU Kang,DUAN Yingying,LU Feng.Spatial Autocorrelation Analysis of Urban Road Traffic Based on Topological and Geometric Properties[J].Journal of Geo-Information Science,2014,16(3):390-395.
[19] BENGIO Y,DUCHARME R,VINCENT P,et al.A Neural Probabilistic Language Model[J].Journal of Machine Learning Research,2003,3:1137-1155.
[20] MIKOLOV T,CHEN Kai,CORRADO G,et al.Efficient Estimation of Word Representations in Vector Space[C]//Proceedings of Workshop at International Conference on Learning Representations.2013:1-12.
[21] DEERWESTER S,DUMAIS S T,FURNAS G W,et al.Indexing by Latent Semantic Analysis[J].Journal of the American Society for Information Science,1990,41(6):391-407.
[22] 仇培元,陆锋,张恒才,等.蕴含地理事件微博客消息的自动识别方法[J].地球信息科学学报,2016,18(7):886-893. QIU Peiyuan,LU Feng,ZHANG Hengcai,et al.Automatic Identification Method of Micro-blog Messages Containing Geographical Events[J].Journal of Geo-Information Science,2016,18(7):886-893.
[23] PENNINGTON J,SOCHER R,MANNING C D.Glove:Global Vectors for Word Representation[C]//Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP).Doha,Qatar:Association for Computational Linguistics,2014,14:1532-1543.
[24] RUMELHART D E,HINTON G E,WILLIAMS R J.Learning Representations by Back-propagating Errors[J].Nature,1986,323(6088):533-536.
[25] LIU Xiliang,LIU Kang,LI Mingxiao,et al.A ST-CRF Map-Matching Method for Low-frequency Floating Car Data[J].IEEE Transactions on Intelligent Transportation Systems,2017,18(5):1241-1254.
[26] SAKOE H,CHIBA S.Dynamic Programming Algorithm Optimization for Spoken Word Recognition[J].IEEE Transactions on Acoustics,Speech,and Signal Processing,1978,26(1):43-49.