Abstract
Twitter has become one of the most popular social media platforms, evidently stirred by a very popular trend of event detection with many applications, including delay detection and traffic congestion on the public transport network. In this paper, we propose a Twitter-based railway delay detection method based on topic propagation analysis of geo-tagged tweets between railway stations. In particular, we aim to discover delay events and to predict train delays due to traffic accidents by analyzing topic propagation using railway network topology of real space. To realize this, first, we construct the topology of the railway network (the physical space) as a graph in which nodes are railway stations and edges are represented as routes between them. Then, we extract the topology of the social network that is mapped on the railway network, based on topic propagation analysis of accident delays between stations and by analyzing geo-tagged tweets of each station with a neural network. This allows us to observe the influence of delays on railway stations even if there are a few tweets on them and to predict stations affected by delays with the tweets which contain indirect topics about delays such as “crowded!” and “raining!”. Overall, this paper proposes the method which enables us to analyze the topic propagation of geo-tagged tweets in order to predict accident delays by considering the railway topology of real space. In addition, we also evaluate the performance of the proposed method on datasets derived from Twitter with the actual delay information from 488 stations of 62 routes in Tokyo area in Japan.
Similar content being viewed by others
Notes
The MeCab Japanese morphological analyzer: https://taku910.github.io/mecab/
References
Twitter: http://twitter.com/
Foursquare: https://foursquare.com/
Tumblr: https://www.tumblr.com/
Tokyo Metro Subway Map: http://www.tokyometro.jp/en/subwaymap/pdf/rosen_en_1702.pdf
Twitter Streaming API: https://dev.twitter.com/streaming/overview
Google Places API v3: https://developers.google.com/place
World Urbanization Prospects (2014) The 2014 revision population database, vol ST/ESA/SE.A/352. United Nations
Ardon S, Bagchi A, Mahanti A, Ruhela A, Seth A, Tripathy RM, Triukose S (2013) Spatio-temporal and events based analysis of topic popularity in twitter. In: Proceedings of the 22nd ACM international conference on information & knowledge management, CIKM ’13, pp 219–228. https://doi.org/10.1145/2505515.2505525. http://doi.acm.org/10.1145/2505515.2505525
Auxilia R, Gandhi M (2016) Earthquake reporting system development by tweet analysis with approach earthquake alarm systems. European Journal of Applied Sciences 8(3):176–180. https://doi.org/10.5829/idosi.ejas.2016.8.3.23003
Carvalho J, Marques M, Costeira JP (2017) Understanding people flow in transportation hubs. IEEE Trans Intell Transp Syst 19(10):1–10
Daly EM, Lecue F, Bicer V (2013) Westland row why so slow?: Fusing social media and linked data sources for understanding real-time traffic conditions. In: Proceedings of the 2013 international conference on intelligent user interfaces, IUI ’13, pp 203–212. https://doi.org/10.1145/2449396.2449423
D’Andrea E, Ducange P, Lazzerini B, Marcelloni F (2015) Real-time detection of traffic from twitter stream analysis. IEEE Trans Intell Transp Syst 16(4):2269–2283. https://doi.org/10.1109/TITS.2015.2404431
Dong G, Yang W, Zhu F, Wang W (2017) Discovering burst patterns of burst topic in twitter. Comput Electr Eng 58(C):551–559. https://doi.org/10.1016/j.compeleceng.2016.06.012
Eleta I, Golbeck J (2014) Multilingual use of twitter: social networks at the language frontier. Comput Hum Behav 41:424–432
Endarnoto SK, Pradipta S, Nugroho AS, Purnama J (2011) Traffic condition information extraction & visualization from social media twitter for android mobile application. In: Proceedings of the international conference on electronics engineering and informatics, ICEEI ’11, pp 1–4. https://doi.org/10.1109/ICEEI.2011.6021743
Goonetilleke O, Sellis T, Zhang X, Sathe S (2014) Twitter analytics: a big data management perspective. ACM SIGKDD Explorations Newsletter 16(1):11–20. https://doi.org/10.1145/2674026.2674029. http://doi.acm.org/10.1145/2674026.2674029
Günnemann N, Pfeffer J (2015) Finding non-redundant multi-word events on twitter. In: Proceedings of the 2015 IEEE/ACM international conference on advances in social networks analysis and mining 2015, ASONAM ’15, pp 520–525. https://doi.org/10.1145/2808797.2809390
Gutierrez C, Figueiras P, Oliveira P, Costa R, Jardim-Goncalves R (2015) Twitter mining for traffic events detection. In: IEEE science and information conference 2015, SAI 2015. https://doi.org/10.1109/SAI.2015.7237170
Itoh M, Yoshinaga N, Toyoda M (2016) Spatio-temporal event visualization from a geo-parsed microblog stream. In: Companion publication of the 21st international conference on intelligent user interfaces, IUI ’16 Companion, pp 58–61. https://doi.org/10.1145/2876456.2879486. http://doi.acm.org/10.1145/2876456.2879486
Kabalan B, Leurent F, Christoforou Z, Dubroca-Voisin M (2017) Framework for centralized and dynamic pedestrian management in railway stations. Transportation Research Procedia 27:712–719. https://doi.org/10.1016/j.trpro.2017.12.091
Kalloubi F, Nfaoui EH, El Beqqali O (2017) Harnessing semantic features for large-scale content-based hashtag recommendations on microblogging platforms. International Journal on Semantic Web & Information Systems 13(1):48–67. https://doi.org/10.4018/IJSWIS.2017010104
Lee R, Sumiya K (2010) Measuring geographical regularities of crowd behaviors for twitter-based geo-social event detection. In: Proceedings of the 2nd ACM SIGSPATIAL international workshop on location based social networks, LBSN ’10. ACM, New York, pp 1–10. https://doi.org/10.1145/1867699.1867701
Lee R, Wakamiya S, Sumiya K (2011) Discovery of unusual regional social activities using geo-tagged microblogs. World Wide Web 14(4):321–349. https://doi.org/10.1007/s11280-011-0120-x
Liu M, Fu K, Lu CT, Chen G, Wang H (2014) A search and summary application for traffic events detection based on twitter data. In: Proceedings of the 22nd ACM SIGSPATIAL international conference on advances in geographic information systems, SIGSPATIAL ’14, pp 549–552. https://doi.org/10.1145/2666310.2666366. http://doi.acm.org/10.1145/2666310.2666366
Mallela D, Ahlers D, Pera MS (2017) Mining twitter features for event summarization and rating. In: Proceedings of the international conference on web intelligence, WI ’17, pp 615–622. https://doi.org/10.1145/3106426.3106487
Morioka M, Kuramochi K, Mishina Y, Akiyama T, Taniguchi N (2015) City management platform using big data from people and traffic flows. Hitachi Review 64(1):53
Nugroho R, Zhao W, Yang J, Paris C, Nepal S (2017) Using time-sensitive interactions to improve topic derivation in twitter. World Wide Web 20(1):61–87. https://doi.org/10.1007/s11280-016-0417-x
Ozkurt C, Camci F (2009) Automatic traffic density estimation and vehicle classification for traffic surveillance systems using neural networks. Mathematical and Computational Application 14(3):187–196. https://doi.org/10.3390/mca14030187
Pla F, Hurtado LF (2016) Language identification of multilingual posts from twitter: a case study. Knowl Inf Syst 51(3):1–25. https://doi.org/10.1007/s10115-016-0997-x
Raghavi KC, Chinnakotla MK, Shrivastava M (2015) “answer ka type kya he?”: Learning to classify questions in code-mixed language. In: Proceedings of the 24th international conference on World Wide Web, WWW ’15 companion. ACM, New York, pp 853–858. https://doi.org/10.1145/2740908.2743006
Ritter A, Mausam, Etzioni O, Clark S (2012) Open domain event extraction from twitter. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’12, pp 1104–1112. https://doi.org/10.1145/2339530.2339704. http://doi.acm.org/10.1145/2339530.2339704
Sakaki T, Okazaki M, Matsuo Y (2013) Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans Knowl Data Eng 25(4):919–931. https://doi.org/10.1109/TKDE.2012.29
Stilo G, Velardi P (2014) Time makes sense: event discovery in twitter using temporal similarity. In: Proceeidngs of the 2014 IEEE/WIC/ACM international joint conferences on Web Intelligence (WI) and intelligent agent technologies (IAT) - Volume 02, WI-IAT ’14, pp 186–193. https://doi.org/10.1109/WI-IAT.2014.97
Sureesha B, Priyadarshini V (2016) Monitoring and analysis of dynamic traffic analyzer using twitter. IEEE Trans Intell Transp Syst 7(4):136–139
Wakamiya S, Lee R, Sumiya K (2011) Crowd-powered tv viewing rates: measuring relevancy between tweets and tv programs. In: International conference on database systems for advanced applications. Springer, pp 390–401
Wakamiya S, Lee R, Sumiya K (2011) Towards better tv viewing rates: Exploiting crowd’s media life logs over twitter for tv rating. In: Proceedings of the 5th international conference on ubiquitous information management and communication, ICUIMC ’11. ACM, New York, pp 39:1–39:10. https://doi.org/10.1145/1968613.1968661
Wang S, Zhang X, Cao J, He L, Stenneth L, Yu PS, Li Z, Huang Z (2017) Computing urban traffic congestions by incorporating sparse gps probe data and social media data. ACM Trans Inf Syst (TOIS) 35 (4):40:1–40:30. https://doi.org/10.1145/3057281
Wang Y, Yasui G, Hosokawa Y, Kawai Y, Akiyama T, Sumiya K (2014) Location-based microblog viewing system synchronized with web pages. In: 2014 IEEE 33rd international symposium on reliable distributed systems workshops (SRDSW). IEEE, pp 70–75. https://doi.org/10.1109/SRDSW.2014.18
Wang Y, Yasui G, Kawai Y, Akiyama T, Sumiya K, Ishikawa Y (2016) Dynamic mapping of dense geo-tweets and web pages based on spatio-temporal analysis. In: Proceedings of the 31st annual ACM symposium on applied computing, SAC ’16, pp 1170–1173. https://doi.org/10.1145/2851613.2851985. http://doi.acm.org/10.1145/2851613.2851985
Yuan Y, Lint HV, Wageningen-Kessels FV, Hoogendoorn S (2014) Network-wide traffic state estimation using loop detector and floating car data. J Intell Transp Syst Technol Plann Oper 18(1):41–50. https://doi.org/10.1080/15472450.2013.773225
Zhao F, Zhu Y, Jin H, Yang LT (2016) A personalized hashtag recommendation approach using lda-based topic model in microblog environment, vol 65, pp 196–206. https://doi.org/10.1016/j.future.2015.10.012
Zheng Y (2015) Methodologies for cross-domain data fusion: an overview. IEEE Transactions on Big Data 1 (1):16–34. https://doi.org/10.1109/TBDATA.2015.2465959
Funding
This work was partially supported by SCOPE of the Ministry of Internal Affairs and Communications of Japan (#171507010), JSPS KAKENHI Grant Numbers 16H01722, 17K12686, 15K00162, and 17H01822.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, Y., Siriaraya, P., Kawai, Y. et al. Twitter-based traffic delay detection based on topic propagation analysis using railway network topology. Pers Ubiquit Comput 23, 233–247 (2019). https://doi.org/10.1007/s00779-019-01204-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00779-019-01204-5