Data-Driven Leak Localization in Urban Water Distribution Networks Using Big Data for Random Forest Classifier
<p>Hanoi network pattern demands.</p> "> Figure 2
<p>Hanoi network with pattern demands in nodes.</p> "> Figure 3
<p>Net3 network with two sensor layouts.</p> "> Figure 4
<p>Flowchart of data generation and the machine learning algorithm used for the prediction of leak location.</p> "> Figure 5
<p>Prediction of leak location for 30 day measurements with percentage of predicted leak nodes.</p> ">
Abstract
:1. Introduction
2. Materials and Methods
2.1. Problem Statement
2.2. Benchmark Water Supply Networks
2.3. Random Forest Classifier
3. Results
3.1. Data Influence
3.2. Variation of Base Demand and Emitter Coefficient
3.3. Sensor Layout Influence
3.4. Feature Influence
3.5. Application of the Prediction Method
4. Discussion
5. Conclusions
- Greatest prediction model accuracy was achieved for the largest leaks, with the smallest demand variation. With the increase in demand variation, prediction model accuracy considerably decreased.
- Model accuracy increased significantly when the top three and five network nodes with the greatest certainty of being leak nodes were considered to narrow down the leak location region.
- Investigation of the application of the proposed methodology on a small-sized network showed that in the majority of records, true leak location was detected, where in other cases neighbor nodes were chosen.
- Investigation of a greater number of inputs should be conducted to increase model accuracy under greater demand variation, or multiple prediction models should be used for different demand ranges.
- Validation of the proposed methodology should be conducted on real water distribution networks.
- Randomness should be incorporated into other model uncertainties, such as pipe diameter and pipe roughness.
- Further investigation should be conducted to explore other algorithms with an increased number of inputs and an optimized number of features to further increase model accuracy.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Jacobsz, S.W.; Jahnke, S.I. Leak detection on water pipelines in unsaturated ground by discrete fiber optic sensing. Struct. Health Monit. 2020, 19, 1219–1236. [Google Scholar] [CrossRef]
- Nkemeni, V.; Mieyeville, F.; Tsafack, P. A Distributed Computing Solution Based on Distributed Kalman Filter for Leak Detection in WSN-Based Water Pipeline Monitoring. Sensors 2020, 20, 5204. [Google Scholar] [CrossRef]
- Wu, Y.; Liu, S.; Wu, X.; Liu, Y.; Guan, Y. Burst detection in district metering areas using a data driven clustering algorithm. Water Res. 2016, 100, 28–37. [Google Scholar] [CrossRef] [PubMed]
- Rajeswaran, A.; Narasimhan, S.; Narasimhan, S. A graph partitioning algorithm for leak detection in water distribution networks. Comput. Chem. Eng. 2018, 108, 11–23. [Google Scholar] [CrossRef] [Green Version]
- Cody, R.A.; Dey, P.; Narasimhan, S. Linear prediction for leak detection in water distribution networks. J. Pipeline Syst. Eng. Pract. 2020, 11, 04019043. [Google Scholar] [CrossRef]
- Bohorquez, J.; Alexander, B.; Simpson, A.R.; Lambert, M.F. Leak detection and topology identification in pipelines using fluid transients and artificial neural networks. J. Water Resour. Plan. Manag. 2020, 146, 04020040. [Google Scholar] [CrossRef]
- Arifin, B.; Li, Z.; Shah, S.L.; Meyer, G.A.; Colin, A. A novel data-driven leak detection and localization algorithm using the Kantorovich distance. Comput. Chem. Eng. 2018, 108, 300–313. [Google Scholar] [CrossRef]
- Wang, F.; Lin, W.; Liu, Z.; Wu, S.; Qiu, X. Pipeline leak detection by using time-domain statistical features. IEEE Sens. J. 2017, 17, 6431–6442. [Google Scholar] [CrossRef]
- Lang, X.; Li, P.; Hu, Z.; Ren, H.; Li, Y. Leak detection and location of pipelines based on LMD and least squares twin support vector machine. IEEE Access 2017, 5, 8659–8668. [Google Scholar] [CrossRef]
- Zadkarami, M.; Shahbazian, M.; Salahshoor, K. Pipeline leakage detection and isolation: An integrated approach of statistical and wavelet feature extraction with multi-layer perceptron neural network (MLPNN). J. Loss Prev. Process. Ind. 2016, 43, 479–487. [Google Scholar] [CrossRef]
- Adegboye, M.A.; Fung, W.K.; Karnik, A. Recent advances in pipeline monitoring and oil leakage detection technologies: Principles and approaches. Sensors 2019, 19, 2548. [Google Scholar] [CrossRef] [Green Version]
- Blesa, J.; Nejjari, F.; Sarrate, R. Robust sensor placement for leak location: Analysis and design. J. Hydroinform. 2015, 18, 136–148. [Google Scholar] [CrossRef] [Green Version]
- Soldevila, A.; Blesa, J.; Tornil-Sin, S.; Fernandez-Canti, R.M.; Puig, V. Sensor placement for classifier-based leak localization in water distribution networks using hybrid feature selection. Comput. Chem. Eng. 2018, 108, 152–162. [Google Scholar] [CrossRef] [Green Version]
- Khorshidi, M.S.; Nikoo, M.R.; Taravatrooy, N.; Sadegh, M.; Al-Wardy, M.; Al-Rawas, G.A. Pressure sensor placement in water distribution networks for leak detection using a hybrid information-entropy approach. Inf. Sci. 2020, 516, 56–71. [Google Scholar] [CrossRef]
- Raei, E.; Shafiee, M.E.; Nikoo, M.R.; Berglund, E. Placing an ensemble of pressure sensors for leak detection in water distribution networks under measurement uncertainty. J. Hydroinform. 2019, 21, 223–239. [Google Scholar] [CrossRef]
- Wu, Y.; Liu, S. A review of data-driven approaches for burst detection in water distribution systems. Urban Water J. 2017, 14, 972–983. [Google Scholar] [CrossRef]
- Chan, T.K.; Chin, C.S.; Zhong, X. Review of current technologies and proposed intelligent methodologies for water distributed network leakage detection. IEEE Access 2018, 6, 78846–78867. [Google Scholar] [CrossRef]
- Zaman, D.; Tiwari, M.K.; Gupta, A.K.; Sen, D. A review of leakage detection strategies for pressurised pipeline in steady-state. Eng. Fail. Anal. 2020, 109, 104264. [Google Scholar] [CrossRef]
- Zhou, M.; Pan, Z.; Liu, Y.; Zhang, Q.; Cai, Y.; Pan, H. Leak Detection and Location Based on ISLMD and CNN in a Pipeline. IEEE Access 2019, 7, 30457–30464. [Google Scholar] [CrossRef]
- Pérez-Pérez, E.; López-Estrada, F.; Valencia-Palomo, G.; Torres, L.; Puig, V.; Mina-Antonio, J. Leak diagnosis in pipelines using a combined artificial neural network approach. Control Eng. Pract. 2021, 107, 104677. [Google Scholar] [CrossRef]
- Mounce, S.; Boxall, J.; Machell, J. Development and verification of an online artificial intelligence system for detection of bursts and other abnormal flows. J. Water Resour. Plan. Manag. 2010, 136, 309–318. [Google Scholar] [CrossRef]
- Jensen, T.N.; Puig, V.; Romera, J.; Kallesøe, C.S.; Wisniewski, R.; Bendtsen, J.D. Leakage localization in water distribution using data-driven models and sensitivity analysis. Ifac Pap. 2018, 51, 736–741. [Google Scholar] [CrossRef]
- Zhang, Q.; Wu, Z.Y.; Zhao, M.; Qi, J.; Huang, Y.; Zhao, H. Leakage zone identification in large-scale water distribution systems using multiclass support vector machines. J. Water Resour. Plan. Manag. 2016, 142, 04016042. [Google Scholar] [CrossRef]
- Soldevila, A.; Blesa, J.; Tornil-Sin, S.; Duviella, E.; Fernandez-Canti, R.M.; Puig, V. Leak localization in water distribution networks using a mixed model-based/data-driven approach. Control Eng. Pract. 2016, 55, 162–173. [Google Scholar] [CrossRef] [Green Version]
- Soldevila, A.; Fernandez-Canti, R.M.; Blesa, J.; Tornil-Sin, S.; Puig, V. Leak localization in water distribution networks using Bayesian classifiers. J. Process Control 2017, 55, 1–9. [Google Scholar] [CrossRef] [Green Version]
- Quiñones-Grueiro, M.; Verde, C.; Prieto-Moreno, A.; Llanes-Santiago, O. An unsupervised approach to leak detection and location in water distribution networks. Int. J. Appl. Math. Comput. Sci. 2018, 28, 283–295. [Google Scholar] [CrossRef] [Green Version]
- Zhou, X.; Tang, Z.; Xu, W.; Meng, F.; Chu, X.; Xin, K.; Fu, G. Deep learning identifies accurate burst locations in water distribution networks. Water Res. 2019, 166, 115058. [Google Scholar] [CrossRef] [PubMed]
- Sun, C.; Parellada, B.; Puig, V.; Cembrano, G. Leak localization in water distribution networks using pressure and data-driven classifier approach. Water 2020, 12, 54. [Google Scholar] [CrossRef] [Green Version]
- Javadiha, M.; Blesa, J.; Soldevila, A.; Puig, V. Leak localization in water distribution networks using deep learning. In Proceedings of the 2019 6th International Conference on Control, Decision and Information Technologies (CoDIT), Paris, France, 23–26 April 2019; pp. 1426–1431. [Google Scholar]
- Soldevila, A.; Blesa, J.; Fernandez-Canti, R.M.; Tornil-Sin, S.; Puig, V. Data-driven approach for leak localization in water distribution networks using pressure sensors and spatial interpolation. Water 2019, 11, 1500. [Google Scholar] [CrossRef] [Green Version]
- Grbčić, L.; Lučin, I.; Kranjčević, L.; Družeta, S. Water supply network pollution source identification by random forest algorithm. J. Hydroinform. 2020, 22, 1521–1535. [Google Scholar] [CrossRef]
- Lučin, I.; Grbčić, L.; Čarija, Z.; Kranjčević, L. Machine-Learning Classification of a Number of Contaminant Sources in an Urban Water Network. Sensors 2021, 21, 245. [Google Scholar] [CrossRef] [PubMed]
- Rossman, L.A. EPANET 2: Users Manual. 2000. Available online: https://epanet.es/wp-content/uploads/2012/10/EPANET_User_Guide.pdf (accessed on 6 September 2020).
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Centre for Water Systems, University of Exeter. Benchmarks. Available online: http://emps.exeter.ac.uk/engineering/research/cws/downloads/benchmarks/ (accessed on 6 November 2020).
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Data Inputs | 100,000 | 200,000 | 300,000 | 450,000 | 500,000 |
---|---|---|---|---|---|
Accuracy | 88% | 93% | 96% | 97% | 98% |
Top 3 | 98% | 99% | 99% | 99% | 99% |
Base Demand Variation | ±2.5% | ±5% | ±10% | ±15% | ±20% |
---|---|---|---|---|---|
Accuracy | 82% | 69% | 57% | 49% | 44% |
Top 3 | 98% | 93% | 86% | 81% | 76% |
Top 5 | 99% | 98% | 95% | 92% | 89% |
Base Demand Variation | ±2.5% | ±5% | ±10% |
---|---|---|---|
Accuracy | 62% | 49% | 36% |
Top 3 | 92% | 80% | 65% |
Top 5 | 98% | 90% | 77% |
Emitter Coefficient Range | 1–5 | 5–10 | 5–15 | 10–15 |
---|---|---|---|---|
Accuracy | 38% | 67% | 71% | 82% |
Top 3 | 67% | 92% | 93% | 98% |
Top 5 | 81% | 98% | 98% | 99% |
Emitter Coefficient Range | 1–5 | 5–10 | 5–15 | 10–15 |
---|---|---|---|---|
Accuracy | 32% | 51% | 52% | 62% |
Top 3 | 59% | 83% | 84% | 92% |
Top 5 | 72% | 92% | 94% | 98% |
Sensor Locations | Demand Variation | |||
---|---|---|---|---|
No Variation | ±2.5% | ±5% | ||
117, 143, 181, 213 | Accuracy | 98% | 51% | 37% |
Top 3 | 99% | 83% | 69% | |
Top 5 | 99% | 92% | 79% | |
117, 181 | Accuracy | 96% | 41% | 27% |
Top 3 | 99% | 71% | 52% | |
Top 5 | 99% | 83% | 64% | |
115, 119, 187, 209 | Accuracy | 98% | 54% | 37% |
Top 3 | 99% | 84% | 70% | |
Top 5 | 99% | 94% | 83% | |
119, 209 | Accuracy | 97% | 40% | 27% |
Top 3 | 99% | 71% | 53% | |
Top 5 | 99% | 85% | 67% |
Hanoi Network | Net3 Network | |||
---|---|---|---|---|
Number of Features | 194 | 50 | 388 | 100 |
Accuracy | 82% | 81% | 62% | 60% |
Top 3 | 98% | 98% | 92% | 91% |
Top 5 | 99% | 99% | 98% | 97% |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Lučin, I.; Lučin, B.; Čarija, Z.; Sikirica, A. Data-Driven Leak Localization in Urban Water Distribution Networks Using Big Data for Random Forest Classifier. Mathematics 2021, 9, 672. https://doi.org/10.3390/math9060672
Lučin I, Lučin B, Čarija Z, Sikirica A. Data-Driven Leak Localization in Urban Water Distribution Networks Using Big Data for Random Forest Classifier. Mathematics. 2021; 9(6):672. https://doi.org/10.3390/math9060672
Chicago/Turabian StyleLučin, Ivana, Bože Lučin, Zoran Čarija, and Ante Sikirica. 2021. "Data-Driven Leak Localization in Urban Water Distribution Networks Using Big Data for Random Forest Classifier" Mathematics 9, no. 6: 672. https://doi.org/10.3390/math9060672
APA StyleLučin, I., Lučin, B., Čarija, Z., & Sikirica, A. (2021). Data-Driven Leak Localization in Urban Water Distribution Networks Using Big Data for Random Forest Classifier. Mathematics, 9(6), 672. https://doi.org/10.3390/math9060672