Abstract
Various algorithms are utilized for the purpose of classification, with Decision Trees being one of the most popular. This is due to their easily understandable structure and simple way of operation, which led to their adoption in a variety of applications. However, the usage of Decision Trees often becomes challenging due to the fact that users need to be familiar with Machine Learning, have programming skills or knowledge of specialized scientific software. The present paper aims to address these issues by presenting AutoDTrees, a web-based application that offers the ability to utilize the Decision Trees in a simple and fast way. This can be done through a user-friendly interface, while an open-source Web API is provided for developers. AutoDTrees allows any user to select the preferred dataset and define parameters, in order to build Decision Tree models. Then, the model effectiveness can be evaluated by using the k-fold cross-validation and presenting detailed metrics. Users are then able to save the pre-trained model and reuse it for predicting unclassified instances or visualizing the Decision Tree. AutoDTrees was evaluated in terms of user experience using the System Usability Scale (SUS), with the results indicating that it can be a useful tool for a wide range of users, regardless of their experience level.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
As already mentioned, categorical features are not available due to the Scikit-learn implementation.
References
Akinola, S., Oyabugbe, O.: Accuracies and training times of data mining classification algorithms: an empirical comparative study. J. Softw. Eng. Appl. 8(9), 470–477 (2015). https://doi.org/10.4236/jsea.2015.89045
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001). https://doi.org/10.1023/A:1010933404324
Breiman, L., Friedman, J., Stone, C.J., Olshen, R.: Classification and Regression Trees. Chapman and Hall/CRC (1984)
Brooke, J.: SUS: a quick and dirty usability scale. Usability Eval. Ind. 189, 4–7 (1995)
Edelstein, H.A.: Introduction to Data Mining and Knowledge Discovery Third Edition. Two Crows Corporation (1999)
Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. stat. 29, 1189–1232 (2001)
Gratsos, K., Ougiaroglou, S., Margaris, D.: kClusterHub: an automl-driven tool for effortless partition-based clustering over varied data types. Future Internet 15(10), 341 (2023). https://doi.org/10.3390/fi15100341
He, X., Zhao, K., Chu, X.: AutoML: a survey of the state-of-the-art. Knowl. Based Syst. 212, 106622 (2021). https://doi.org/10.1016/j.knosys.2020.106622
Kyrkos, E.: Business Intelligence and Data Mining. Kallipos, Open Academic Editions (2015)
Malliaridis, K., Ougiaroglou, S., Dervos, D.A.: WebApriori: a web application for association rules mining. In: Kumar, V., Troussas, C. (eds.) Intelligent Tutoring Systems, pp. 371–377. Springer International Publishing, Cham (2020). https://doi.org/10.1007/978-3-030-49663-0_44
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1(1), 81–106 (1986). https://doi.org/10.1023/A:1022643204877
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (1993)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zografos, M., Ougiaroglou, S. (2024). Simplifying Decision Tree Classification Through the AutoDTrees Web Application and Service. In: Sifaleras, A., Lin, F. (eds) Generative Intelligence and Intelligent Tutoring Systems. ITS 2024. Lecture Notes in Computer Science, vol 14799. Springer, Cham. https://doi.org/10.1007/978-3-031-63031-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-63031-6_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-63030-9
Online ISBN: 978-3-031-63031-6
eBook Packages: Computer ScienceComputer Science (R0)