[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

VRE: a versatile, robust, and economical trajectory data system

Published: 01 August 2022 Publication History

Abstract

Managing massive trajectory data from various moving objects has always been a demanding task. A desired trajectory data system should be versatile in its supported query types and distance functions, of low storage cost, and be consistently efficient on processing trajectory data of different properties. Unfortunately, none of the existing systems can meet the above three criteria at the same time. To this end, we propose VRE, a versatile, robust, and economical trajectory data system.VRE separates the storage from the processing. In the storage layer, we propose a novel segment-based storage model that takes advantage of the strengths of both point-based and trajectory-based storage models. VRE supports these three storage models and ten storage schemas upon them. With the secondary index, VRE reduces the storage cost up to 3x. In the processing layer, we first propose a two-stage processing framework and a pushdown strategy to alleviate full trajectory transmission cost. Then, we design a unified pruning strategy for five widely used trajectory distance functions and numerous tailored processing algorithms for five advanced queries. Extensive experiments are conducted to verify the design choice and efficiency of VRE, from which we present some key insights that are crucial to both VRE and future trajectory system's design.

References

[1]
July, 2022. AIS Dataset. https://marinecadastre.gov/ais/.
[2]
July, 2022. GeoMesa. https://www.geomesa.org/.
[3]
July, 2022. HBase. https://hbase.apache.org/.
[4]
July, 2022. OSM Trace. https://www.openstreetmap.org/traces.
[5]
July, 2022. Porto Dataset. https://www.kaggle.com/c/pkdd-15-predict-taxi-service-trajectory-i/data.
[6]
July, 2022. PostGIS. https://postgis.net/.
[7]
July, 2022. PostgreSQL Database. http://www.postgresql.org/.
[8]
July, 2022. VRE's technical report. https://github.com/hailanwhu/vre_artifacts.
[9]
July, 2022. Zstandard. https://facebook.github.io/zstd/.
[10]
Pankaj K. Agarwal, Rinat Ben Avraham, Haim Kaplan, and Micha Sharir. 2013. Computing the Discrete Fréchet Distance in Subquadratic Time. In SODA. 156--167.
[11]
Ablimit Aji, Fusheng Wang, Hoang Vo, Rubao Lee, Qiaoling Liu, Xiaodong Zhang, and Joel H. Saltz. 2013. Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce. PVLDB 6, 11 (2013), 1009--1020.
[12]
Louai Alarabi. 2018. Summit: a scalable system for massive trajectory data management. In SIGSPATIAL. 612--613.
[13]
Louai Alarabi, Mohamed F. Mokbel, and Mashaal Musleh. 2018. ST-Hadoop: a MapReduce framework for spatio-temporal data. GeoInformatica 22, 4 (2018), 785--813.
[14]
Mohamed S. Bakli, Mahmoud Attia Sakr, and Esteban Zimányi. 2019. Distributed moving object data management in MobilityDB. In BigSpatial@SIGSPATIAL. 1:1--1:10.
[15]
Christian Böhm, Gerald Klump, and Hans-Peter Kriegel. 1999. XZ-Ordering: A Space-Filling Curve for Objects with Spatial Extension. In SSD (Lecture Notes in Computer Science), Vol. 1651. 75--90.
[16]
Sotiris Brakatsoulas, Dieter Pfoser, Randall Salas, and Carola Wenk. 2005. On Map-Matching Vehicle Tracking Data. In VLDB. 853--864.
[17]
Lei Chen, M. Tamer Özsu, and Vincent Oria. 2005. Robust and Fast Similarity Search for Moving Object Trajectories. In SIGMOD. 491--502.
[18]
Philippe Cudré-Mauroux, Eugene Wu, and Samuel Madden. 2010. TrajStore: An adaptive storage system for very large trajectory data sets. In ICDE. 109--120.
[19]
Xin Ding, Lu Chen, Yunjun Gao, Christian S. Jensen, and Hujun Bao. 2018. Ul-TraMan: A Unified Platform for Big Trajectory Data Management and Analytics. PVLDB 11, 7 (2018), 787--799.
[20]
Ahmed Eldawy and Mohamed F. Mokbel. 2015. SpatialHadoop: A MapReduce framework for spatial data. In ICDE. 1352--1363.
[21]
Chenjuan Guo, Bin Yang, Jilin Hu, and Christian S. Jensen. 2018. Learning to Route with Sparse Trajectory Sets. In ICDE. 1073--1084.
[22]
Scott T. Leutenegger, J. M. Edgington, and Mario Alberto López. 1997. STR: A Simple and Efficient Algorithm for R-Tree Packing. In ICDE. 497--506.
[23]
Ruiyuan Li, Huajun He, Rubin Wang, Yuchuan Huang, Junwen Liu, Sijie Ruan, Tianfu He, Jie Bao, and Yu Zheng. 2020. JUST: JD Urban Spatio-Temporal Data Engine. In ICDE. 1558--1569.
[24]
Ruiyuan Li, Huajun He, Rubin Wang, Sijie Ruan, Tianfu He, Jie Bao, Junbo Zhang, Liang Hong, and Yu Zheng. 2021. TrajMesa: A Distributed NoSQL-Based Trajectory Data Management System. TKDE (2021), 1--1.
[25]
Ruiyuan Li, Huajun He, Rubin Wang, Sijie Ruan, Yuan Sui, Jie Bao, and Yu Zheng. 2020. TrajMesa: A Distributed NoSQL Storage Engine for Big Trajectory Data. In ICDE. 2002--2005.
[26]
Yin Lou, Chengyang Zhang, Yu Zheng, Xing Xie, Wei Wang, and Yan Huang. 2009. Map-matching for low-sampling-rate GPS trajectories. In SIGSPATIAL. 352--361.
[27]
Sarana Nutanong, Edwin H. Jacox, and Hanan Samet. 2011. An Incremental Hausdorff Distance Calculation Algorithm. PVLDB 4, 8 (2011), 506--517.
[28]
Zeyuan Shang, Guoliang Li, and Zhifeng Bao. 2018. DITA: Distributed In-Memory Trajectory Analytics. In SIGMOD. 725--740.
[29]
MingJie Tang, Yongyang Yu, Qutaibah M. Malluhi, Mourad Ouzzani, and Walid G. Aref. 2016. LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data. PVLDB 9, 13 (2016), 1565--1568.
[30]
Michail Vlachos, Dimitrios Gunopulos, and George Kollios. 2002. Discovering Similar Multidimensional Trajectories. In ICDE. 673--684.
[31]
Sheng Wang, Zhifeng Bao, J. Shane Culpepper, and Gao Cong. 2021. A Survey on Trajectory Data Management, Analytics, and Learning. ACM Comput. Surv. 54, 2 (2021), 39:1--39:36.
[32]
Zheng Wang, Cheng Long, Gao Cong, and Yiding Liu. 2020. Efficient and Effective Similar Subtrajectory Search with Deep Reinforcement Learning. PVLDB 13, 11 (2020), 2312--2325.
[33]
Dong Xie, Feifei Li, and Jeff M. Phillips. 2017. Distributed Trajectory Similarity Search. PVLDB 10, 11 (2017), 1478--1489.
[34]
Dong Xie, Feifei Li, Bin Yao, Gefei Li, Liang Zhou, and Minyi Guo. 2016. Simba: Efficient In-Memory Spatial Analytics. In SIGMOD. 1071--1085.
[35]
Can Yang and Gyözö Gidófalvi. 2018. Fast map matching, an algorithm integrating hidden Markov model with precomputation. International Journal of Geographical Information Science 32, 3 (2018), 547--570.
[36]
Byoung-Kee Yi, H. V. Jagadish, and Christos Faloutsos. 1998. Efficient Retrieval of Similar Time Sequences Under Time Warping. In ICDE. 201--208.
[37]
Jia Yu, Zongsi Zhang, and Mohamed Sarwat. 2019. Spatial data management in apache spark: the GeoSpark perspective and beyond. GeoInformatica 23, 1 (2019), 37--78.
[38]
Haitao Yuan and Guoliang Li. 2019. Distributed In-memory Trajectory Similarity Search and Join on Road Network. In ICDE. 1262--1273.
[39]
Haitao Yuan, Guoliang Li, Zhifeng Bao, and Ling Feng. 2020. Effective Travel Time Estimation: When Historical Trajectories over Road Networks Matter. In SIGMOD. 2135--2149.
[40]
Ping Zhang, Zhifeng Bao, Yuchen Li, Guoliang Li, Yipeng Zhang, and Zhiyong Peng. 2018. Trajectory-driven Influential Billboard Placement. In KDD. 2748--2757.
[41]
Bolong Zheng, Lianggui Weng, Xi Zhao, Kai Zeng, Xiaofang Zhou, and Christian S. Jensen. 2021. REPOSE: Distributed Top-k Trajectory Similarity Search with Local Reference Point Tries. CoRR abs/2101.08929 (2021).

Cited By

View all
  • (2024)BT-Tree: A Reinforcement Learning Based Index for Big Trajectory DataProceedings of the ACM on Management of Data10.1145/36771302:4(1-27)Online publication date: 30-Sep-2024
  • (2024)Let's Speak Trajectories: A Vision to Use NLP Models for Trajectory Analysis TasksACM Transactions on Spatial Algorithms and Systems10.1145/365647010:2(1-25)Online publication date: 1-Jul-2024
  • (2023)Effective and Efficient Route Planning Using Historical Trajectories on Road NetworksProceedings of the VLDB Endowment10.14778/3603581.360359116:10(2512-2524)Online publication date: 1-Jun-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment  Volume 15, Issue 12
August 2022
551 pages
ISSN:2150-8097
Issue’s Table of Contents

Publisher

VLDB Endowment

Publication History

Published: 01 August 2022
Published in PVLDB Volume 15, Issue 12

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)57
  • Downloads (Last 6 weeks)5
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)BT-Tree: A Reinforcement Learning Based Index for Big Trajectory DataProceedings of the ACM on Management of Data10.1145/36771302:4(1-27)Online publication date: 30-Sep-2024
  • (2024)Let's Speak Trajectories: A Vision to Use NLP Models for Trajectory Analysis TasksACM Transactions on Spatial Algorithms and Systems10.1145/365647010:2(1-25)Online publication date: 1-Jul-2024
  • (2023)Effective and Efficient Route Planning Using Historical Trajectories on Road NetworksProceedings of the VLDB Endowment10.14778/3603581.360359116:10(2512-2524)Online publication date: 1-Jun-2023
  • (2023)ST4ML: Machine Learning Oriented Spatio-Temporal Data Processing at ScaleProceedings of the ACM on Management of Data10.1145/35889411:1(1-28)Online publication date: 30-May-2023

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media