[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3637528.3671548acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

LaDe: The First Comprehensive Last-mile Express Dataset from Industry

Published: 24 August 2024 Publication History

Abstract

Real-world last-mile express datasets are crucial for research in logistics, supply chain management, and spatio-temporal data mining. Despite a plethora of algorithms developed to date, no widely accepted, publicly available last-mile express dataset exists to support research in this field. In this paper, we introduce LaDe, the first publicly available last-mile express dataset with millions of packages from the industry. LaDe has three unique characteristics: (1)Large-scale. It involves 10,677k packages of 21k couriers over 6 months of real-world operation. (2)Comprehensive information. It offers original package information, task-event information, as well as couriers' detailed trajecotries and road networks. (3)Diversity. The dataset includes data from various scenarios, including package pick-up and delivery, and from multiple cities, each with its unique spatio-temporal patterns due to their distinct characteristics such as populations. We verify LaDe on three tasks by running several classical baseline models per task. We believe that the large-scale, comprehensive, diverse feature of LaDe can offer unparalleled opportunities to researchers in the supply chain community, data mining community, and beyond. The dataset and code is publicly available at https://huggingface.co/datasets/Cainiao-AI/LaDe.

Supplemental Material

MP4 File - LaDe: The First Comprehensive Last-mile Express Dataset from Industry
We introduce LaDe, the first publicly available last-mile express dataset with millions of packages from the industry. LaDe has three unique characteristics: (1) Large-scale. It involves 10,677k packages of 21k couriers over 6 months of real-world operation. (2) Comprehensive information. It offers original package information, task-event information, as well as couriers' detailed trajecotries and road networks. (3) Diversity. The dataset includes data from various scenarios, including package pick-up and delivery, and from multiple cities, each with its unique spatio-temporal patterns due to their distinct characteristics such as populations.

References

[1]
Paul Almasan, José Suárez-Varela, Krzysztof Rusek, Pere Barlet-Ros, and Albert Cabellos-Aparicio. 2022. Deep reinforcement learning meets graph neural networks: exploring a routing optimization use case. Computer Communications 196 (2022), 184--194.
[2]
Lei Bai, Lina Yao, Can Li, Xianzhi Wang, and Can Wang. 2020. Adaptive graph convolutional recurrent network for traffic forecasting. Advances in neural information processing systems 33 (2020), 17804--17815.
[3]
Irwan Bello, Hieu Pham, Quoc V Le, Mohammad Norouzi, and Samy Bengio. 2017. Neural combinatorial optimization with reinforcement learning. In ICLR.
[4]
Rishi Bommasani, Drew A Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, et al. 2021. On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258 (2021).
[5]
Nils Boysen, Stefan Fedtke, and Stefan Schwerdfeger. 2021. Last-mile delivery concepts: a survey from an operational research perspective. Or Spectrum 43 (2021), 1--58.
[6]
Ulrich Breunig, Roberto Baldacci, Richard F Hartl, and Thibaut Vidal. 2019. The electric two-echelon vehicle routing problem. Computers & Operations Research 103 (2019), 198--210.
[7]
Tianyue Cai, Huaiyu Wan, Fan Wu, Haomin Wen, Shengnan Guo, Lixia Wu, Haoyuan Hu, and Youfang Lin. 2023. M 2 g4rtp: A multi-level and multi-task graph model for instant-logistics route and time joint prediction. In 2023 IEEE 39th International Conference on Data Engineering (ICDE). IEEE, 3296--3308.
[8]
Chao Chen, Sen Yang, Yasha Wang, Bin Guo, and Daqing Zhang. 2020. Crowd-Express: a probabilistic framework for on-time crowdsourced package deliveries. IEEE transactions on big data 8, 3 (2020), 827--842.
[9]
Jeongwhan Choi, Hwangyong Choi, Jeehyun Hwang, and Noseong Park. 2022. Graph neural controlled differential equations for traffic forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 36. 6367--6374.
[10]
Arthur Cruz de Araujo and Ali Etemad. 2021. End-to-End Prediction of Parcel Delivery Time with Deep Learning for Smart-City Applications. IEEE Internet of Things Journal (2021).
[11]
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. 2009. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition. Ieee, 248--255.
[12]
Boya Du, Shaochuan Lin, Jiong Gao, Xiyu Ji, Mengya Wang, Taotao Zhou, Hengxu He, Jia Jia, and Ning Hu. 2023. BASM: A Bottom-up Adaptive Spatiotemporal Model for Online Food Ordering Service. In 2023 IEEE 39th International Conference on Data Engineering (ICDE). IEEE, 3549--3562.
[13]
Tao-Yang Fu and Wang-Chien Lee. 2020. Trembr: Exploring road networks for trajectory representation learning. ACM Transactions on Intelligent Systems and Technology (TIST) 11, 1 (2020), 1--25.
[14]
Chengliang Gao, Fan Zhang, Guanqun Wu, Qiwan Hu, Qiang Ru, Jinghua Hao, Renqing He, and Zhizhao Sun. 2021. A Deep Learning Method for Route and Time Prediction in Food Delivery Service. In KDD. 2879--2889.
[15]
Shengnan Guo, Youfang Lin, Ning Feng, Chao Song, and Huaiyu Wan. 2019. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI conference on artificial intelligence, Vol. 33. 922--929.
[16]
Shuihua Han, Ling Zhao, Kui Chen, Zong-wei Luo, and Deepa Mishra. 2017. Appointment scheduling and routing optimization of attended home delivery system with random customer behavior. European Journal of Operational Research 262, 3 (2017), 966--980.
[17]
Mahyar Jahangiriesmaili, Sina Bahrami, and Matthew J Roorda. 2017. Solution of two-echelon facility location problems by approximation methods. Transportation Research Record 2610, 1 (2017), 1--9.
[18]
Shenggong Ji, Yu Zheng, Zhaoyuan Wang, and Tianrui Li. 2019. Alleviating users' pain of waiting: Effective task grouping for online-to-offline food delivery services. In The World Wide Web Conference. 773--783.
[19]
Manas Joshi, Arshdeep Singh, Sayan Ranu, Amitabha Bagchi, Priyank Karia, and Puneet Kala. 2022. Food Match: Batching and Matching for Food Delivery in Dynamic Road Networks. ACM Transactions on Spatial Algorithms and Systems (TSAS) 8, 1 (2022), 1--25.
[20]
Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. 2017. Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems 30 (2017).
[21]
Ashu Kedia, Diana Kusumastuti, and Alan Nicholson. 2020. Locating collection and delivery points for goods' last-mile travel: A case study in New Zealand. Transportation Research Procedia 46 (2020), 85--92.
[22]
Maurice G Kendall. 1938. A new measure of rank correlation. Biometrika 30, 1/2 (1938), 81--93.
[23]
Xijun Li, Weilin Luo, Mingxuan Yuan, Jun Wang, Jiawen Lu, Jie Wang, Jinhu Lü, and Jia Zeng. 2021. Learning to optimize industry-scale dynamic pickup and delivery problems. In 2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE, 2511--2522.
[24]
Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In International Conference on Learning Representations.
[25]
Stanley Frederick WT Lim and Jagjit Singh Srai. 2018. Examining the anatomy of last-mile distribution in e-commerce omnichannel retailing: A supply network configuration approach. International Journal of Operations & Production Management (2018).
[26]
Shaochuan Lin, Jiayan Pei, Taotao Zhou, Hengxu He, Jia Jia, and Ning Hu. 2023. Exploring the Spatio temporal Features of Online Food Recommendation Service. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 3354--3358.
[27]
Yan Lin, Huaiyu Wan, Shengnan Guo, and Youfang Lin. 2021. Pre-training context and time aware location embeddings from spatial-temporal trajectories for user next location prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 4241--4248.
[28]
Dachuan Liu, Jin Wang, Shuo Shang, and Peng Han. 2022. Msdr: Multi-step dependency relation networks for spatial temporal forecasting. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 1042--1050.
[29]
Elżbieta Macioszek. 2018. First and last mile delivery-problems and issues. In Advanced Solutions of Transport Systems for Growing Mobility: 14th Scientific and Technical Conference" Transport Systems. Theory & Practice 2017" Selected Papers. Springer, 147--154.
[30]
Riccardo Mangiaracina, Alessandro Perego, Arianna Seghezzi, and Angela Tumino. 2019. Innovative solutions to increase last-mile delivery efficiency in B2C e-commerce: a literature review. International Journal of Physical Distribution & Logistics Management (2019).
[31]
Xiaowei Mao, Huaiyu Wan, Haomin Wen, Fan Wu, Jianbin Zheng, Yuting Qiang, Shengnan Guo, Lixia Wu, Haoyuan Hu, and Youfang Lin. 2023. GMDNet: A Graph-Based Mixture Density Network for Estimating Packages' Multimodal Travel Time Distribution. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 4561--4568.
[32]
Xiaowei Mao, Haomin Wen, Hengrui Zhang, Huaiyu Wan, Lixia Wu, Jianbin Zheng, Haoyuan Hu, and Youfang Lin. 2023. DRL4Route: A Deep Reinforcement Learning Framework for Pick-up and Delivery Route Prediction. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 4628--4637.
[33]
Daniel Merchán, Jatin Arora, Julian Pachon, Karthik Konduri, Matthias Winkenbach, Steven Parks, and Joseph Noszek. 2022. 2021 Amazon Last Mile Routing Research Challenge: Data Set. Transportation Science (2022).
[34]
John Nerbonne, Wilbert Heeringa, and Peter Kleiweg. 1999. Edit distance and dialect proximity. Time Warps, String Edits and Macromolecules: The theory and practice of sequence comparison 15 (1999).
[35]
John Olsson, Daniel Hellström, and Henrik Pålsson. 2019. Framework of last mile logistics research: A systematic review of the literature. Sustainability 11, 24 (2019), 7131.
[36]
MID Ranathunga, AN Wijayanayake, and DHH Niwunhella. 2021. Solution approaches for combining first-mile pickup and last-mile delivery in an e-commerce logistic network: A systematic literature review. In 2021 International Research Conference on Smart Computing and Systems Engineering (SCSE), Vol. 4. IEEE, 267--275.
[37]
Meera Ratnagiri, Clare O'Dwyer, Logan E Beaver, Heeseung Bang, Behdad Chalaki, and Andreas A Malikopoulos. 2022. A scalable last-mile delivery service: From simulation to scaled experiment. In 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC). IEEE, 4163--4168.
[38]
Sijie Ruan, Cheng Long, Jie Bao, Chunyang Li, Zisheng Yu, Ruiyuan Li, Yuxuan Liang, Tianfu He, and Yu Zheng. 2020. Learning to generate maps from trajectories. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 890--897.
[39]
Sijie Ruan, Cheng Long, Zhipeng Ma, Jie Bao, Tianfu He, Ruiyuan Li, Yiheng Chen, Shengnan Wu, and Yu Zheng. 2022. Service Time Prediction for Delivery Tasks via Spatial Meta-Learning. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3829--3837.
[40]
Sijie Ruan, Cheng Long, Xiaodu Yang, Tianfu He, Ruiyuan Li, Jie Bao, Yiheng Chen, Shengnan Wu, Jiangtao Cui, and Yu Zheng. 2022. Discovering Actual Delivery Locations from Mis-Annotated Couriers' Trajectories. In 2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE, 3241--3253.
[41]
Sijie Ruan, Zi Xiong, Cheng Long, Yiheng Chen, Jie Bao, Tianfu He, Ruiyuan Li, Shengnan Wu, Zhongyuan Jiang, and Yu Zheng. 2020. Doing in one go: delivery time inference based on couriers' trajectories. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 2813--2821.
[42]
Jelena Simeunović, Baptiste Schubnel, Pierre-Jean Alet, and Rafael E Carrillo. 2021. Spatio-temporal graph neural networks for multi-site PV power forecasting. IEEE Transactions on Sustainable Energy 13, 2 (2021), 1210--1220.
[43]
Junxian Song, Rong Wen, Chi Xu, and Joel Wei En Tay. 2019. Service Time Prediction for Last-Yard Delivery. In 2019 IEEE International Conference on Big Data (Big Data). IEEE, 3933--3938.
[44]
Eiichi Taniguchi, Russell G Thompson, and Ali G Qureshi. 2020. Modelling city logistics using recent innovative technologies. Transportation Research Procedia 46 (2020), 3--12.
[45]
Yulia Vakulenko, Daniel Hellström, and Klas Hjort. 2018. What's in the parcel locker? Exploring customer value in e-commerce last mile delivery. journal of Business Research 88 (2018), 421--427.
[46]
Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R Bowman. 2018. GLUE: A multi-task benchmark and analysis platform for natural language understanding. arXiv preprint arXiv:1804.07461 (2018).
[47]
Haomin Wen, Youfang Lin, Xiaowei Mao, Fan Wu, Yiji Zhao, Haochen Wang, Jianbin Zheng, Lixia Wu, Haoyuan Hu, and Huaiyu Wan. 2022. Graph2Route: A Dynamic Spatial-Temporal Graph Neural Network for Pick-up and Delivery Route Prediction. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 4143--4152.
[48]
Haomin Wen, Youfang Lin, FanWu, HuaiyuWan, Shengnan Guo, Lixia Wu, Chao Song, and Yinghui Xu. 2021. Package Pick-up Route Prediction via Modeling Couriers' Spatial-Temporal Behaviors. In ICDE. IEEE, 2141--2146.
[49]
Haomin Wen, Youfang Lin, Fan Wu, Huaiyu Wan, Zhongxiang Sun, Tianyue Cai, Hongyu Liu, Shengnan Guo, Jianbin Zheng, Chao Song, et al. 2023. Enough Waiting for the Couriers: Learning to Estimate Package Pick-up Arrival Time from Couriers' Spatial-Temporal Behaviors. ACM Transactions on Intelligent Systems and Technology 14, 3 (2023), 1--22.
[50]
Haomin Wen, Youfang Lin, Lixia Wu, Xiaowei Mao, Tianyue Cai, Yunfeng Hou, Shengnan Guo, Yuxuan Liang, Guangyin Jin, Yiji Zhao, et al. 2023. A Survey on Service Route and Time Prediction in Instant Delivery: Taxonomy, Progress, and Prospects. arXiv preprint arXiv:2309.01194 (2023).
[51]
Haomin Wen, Youfang Lin, Yutong Xia, Huaiyu Wan, Qingsong Wen, Roger Zimmermann, and Yuxuan Liang. 2023. Diffstg: Probabilistic spatio-temporal graph forecasting with denoising diffusion models. In Proceedings of the 31st ACM International Conference on Advances in Geographic Information Systems. 1--12.
[52]
Fan Wu and Lixia Wu. 2019. DeepETA: A Spatial-Temporal Sequential Neural Network Model for Estimating Time of Arrival in Package Delivery System. In Proceedings of the AAAI Conference on Artificial Intelligence. 774--781.
[53]
Lixia Wu, Jianlin Liu, Junhong Lou, Haoyuan Hu, Jianbin Zheng, Haomin Wen, Chao Song, and Shu He. 2023. G2PTL: A Pre-trained Model for Delivery Address and its Applications in Logistics System. KDD 23 Urban Computing Workshop (2023).
[54]
Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, Xiaojun Chang, and Chengqi Zhang. 2020. Connecting the dots: Multivariate time series forecasting with graph neural networks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 753--763.
[55]
Zonghan Wu, Shirui Pan, Guodong Long, Jing Jiang, and Chengqi Zhang. 2019. GraphWaveNet for Deep Spatial-Temporal Graph Modeling. In IJCAI. 1907--1913.
[56]
Bing Yao, Caitlin McLean, and Hui Yang. 2019. Robust optimization of dynamic route planning in same-day delivery networks with one-time observation of new demand. Networks 73, 4 (2019), 434--452.
[57]
Huaxiu Yao, Fei Wu, Jintao Ke, Xianfeng Tang, Yitian Jia, Siyu Lu, Pinghua Gong, Jieping Ye, and Zhenhui Li. 2018. Deep multi-view spatial-temporal network for taxi demand prediction. In Proceedings of the AAAI conference on artificial intelligence, Vol. 32.
[58]
Bing Yu, Haoteng Yin, and Zhanxing Zhu. 2018. Spatio-temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting. In IJCAI.
[59]
Mingxuan Yue, Tianshu Sun, Fan Wu, Lixia Wu, Yinghui Xu, and Cyrus Shahabi. 2021. Learning a Contextual and Topological Representation of Areas-of-Interest for On-Demand Delivery Application. In Machine Learning and Knowledge Discovery in Databases: Applied Data Science Track: European Conference, ECML PKDD 2020, Ghent, Belgium, September 14-18, 2020, Proceedings, Part IV. Springer, 52--68.
[60]
Yuxiang Zeng, Yongxin Tong, and Lei Chen. 2019. Last-mile delivery made practical: An efficient route planning framework with theoretical guarantees. Proceedings of the VLDB Endowment 13, 3 (2019), 320--333.
[61]
Junbo Zhang, Yu Zheng, and Dekang Qi. 2017. Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction. In AAAI. 1655--1661.
[62]
Yan Zhang, Yunhuai Liu, Genjian Li, Yi Ding, Ning Chen, Hao Zhang, Tian He, and Desheng Zhang. 2019. Route prediction for instant delivery. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 3, 3 (2019), Article 124.

Index Terms

  1. LaDe: The First Comprehensive Last-mile Express Dataset from Industry
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
      August 2024
      6901 pages
      ISBN:9798400704901
      DOI:10.1145/3637528
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 24 August 2024

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. benchmark
      2. courier trajectory
      3. dataset
      4. last-mile delivery

      Qualifiers

      • Research-article

      Funding Sources

      • Guangzhou-HKUST(GZ) Joint Funding Program

      Conference

      KDD '24
      Sponsor:

      Acceptance Rates

      Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 130
        Total Downloads
      • Downloads (Last 12 months)130
      • Downloads (Last 6 weeks)44
      Reflects downloads up to 10 Dec 2024

      Other Metrics

      Citations

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media