Analysis of Machine Learning Classification Approaches for Predicting Students’ Programming Aptitude
<p>Paper folding questions compiled from [<a href="#B58-sustainability-15-12917" class="html-bibr">58</a>].</p> "> Figure 2
<p>Exploratory data analysis of 7 features by different classes.</p> "> Figure 3
<p>ML–based classification model used in this study.</p> "> Figure 4
<p>(<b>a</b>) Cubic SVM CM; (<b>b</b>) Cubic SVM ROC curve.</p> "> Figure 5
<p>(<b>a</b>) Fine DT CM; (<b>b</b>) Fine DT ROC curve.</p> "> Figure 6
<p>(<b>a</b>) Medium KNN CM; (<b>b</b>) Medium KNN ROC curve.</p> "> Figure 7
<p>(<b>a</b>) QD CM; (<b>b</b>) QD ROC curve.</p> ">
Abstract
:1. Introduction
1.1. Research Purpose
1.2. Research Questions
2. Related Work
2.1. Machine Learning
2.2. Spatial Skills
2.3. Blocked-Based Programming
3. Materials and Methods
3.1. Data Collection
3.2. Spatial Test
3.3. Code.org
3.4. Methodology
4. Results and Discussion
4.1. Results
4.2. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Code.org. More İnformation, History, and Philosophy. Available online: https://code.org/ (accessed on 10 June 2023).
- Lin, P.-Y.; Chai, C.-S.; Jong, M.S.-Y.; Dai, Y.; Guo, Y.; Qin, J. Modeling the structural relationship among primary students’ motivation to learn artificial intelligence. Comput. Educ. Artif. Intell. 2021, 2, 100006. [Google Scholar] [CrossRef]
- Grover, S.; Pea, R.; Cooper, S. Designing for deeper learning in a blended computer science course for middle school students. Comput. Sci. Educ. 2015, 25, 199–237. [Google Scholar] [CrossRef]
- Grover, S.; Pea, R.; Cooper, S. Factors influencing computer science learning in middle school. In Proceedings of the 47th ACM Technical Symposium on Computing Science Education, Memphis, TN, USA, 2–5 March 2016; pp. 552–557. [Google Scholar]
- Tukiainen, M.; Mönkkönen, E. Programming Aptitude Testing as a Prediction of Learning to Program. In Proceedings of the PPIG, London, UK, 18–21 June 2002; p. 4. [Google Scholar]
- Bockmon, R.; Cooper, S.; Gratch, J.; Dorodchi, M. (re) validating cognitive introductory computing instruments. In Proceedings of the 50th ACM Technical Symposium on Computer Science Education, Minneapolis, MN, USA, 27 February–2 March 2019; pp. 552–557. [Google Scholar]
- Bockmon, R.; Cooper, S.; Koperski, W.; Gratch, J.; Sorby, S.; Dorodchi, M. A cs1 spatial skills intervention and the impact on introductory programming abilities. In Proceedings of the 51st ACM Technical Symposium on Computer Science Education, Portland, OR, USA, 11–14 March 2020; pp. 766–772. [Google Scholar]
- Halpern, D.F. Sex Differences in Cognitive Abilities; Psychology Press: London, UK, 2013. [Google Scholar]
- Jones, S.; Burnett, G. Spatial ability and learning to program. Hum. Technol. Interdiscip. J. Hum. ICT Environ. 2008, 4, 47–61. [Google Scholar] [CrossRef]
- Vora, D.R.; Kamatchi, R. Predicting students’ academic performance: Levy search of cuckoo-based hybrid classification. Int. J. Grid Util. Comput. 2020, 11, 568–585. [Google Scholar] [CrossRef]
- Satyavathy, G.; Vagini, M.K.J.; Deepa, T. Combıned Algorıthm Based on Predıctıon of Dıabetıcs. J. Data Acquis. Process. 2023, 38, 322. [Google Scholar]
- Hu, J.; Niu, H.; Carrasco, J.; Lennox, B.; Arvin, F. Voronoi-based multi-robot autonomous exploration in unknown environments via deep reinforcement learning. IEEE Trans. Veh. Technol. 2020, 69, 14413–14423. [Google Scholar] [CrossRef]
- Zeineddine, H.; Braendle, U.; Farah, A. Enhancing prediction of student success: Automated machine learning approach. Comput. Electr. Eng. 2021, 89, 106903. [Google Scholar] [CrossRef]
- Christian, T.M.; Ayub, M. Exploration of classification using NBTree for predicting students’ performance. In Proceedings of the 2014 İnternational Conference on Data and Software Engineering (ICODSE), Bandung, Indonesia, 26–27 November 2014; pp. 1–6. [Google Scholar]
- Guo, B.; Zhang, R.; Xu, G.; Shi, C.; Yang, L. Predicting students performance in educational data mining. In Proceedings of the 2015 İnternational Symposium on Educational Technology (ISET), Wuhan, China, 27–29 July 2015; pp. 125–128. [Google Scholar]
- Harvey, J.L.; Kumar, S.A. A practical model for educators to predict student performance in K-12 education using machine learning. In Proceedings of the 2019 IEEE Symposium Series on Computational Intelligence (SSCI), Xiamen, China, 6–9 December 2019; pp. 3004–3011. [Google Scholar]
- Kaya, F.H. Identifying the Factors Affecting Students’ Academic Achievement Using Machine Learning Algorithms. Master’s Thesis, Konya Teknik Üniversitesi, Konya, Turkey, 2022. [Google Scholar]
- Suhaimi, N.M.; Abdul-Rahman, S.; Mutalib, S.; Hamid, N.A.; Hamid, A. Review on predicting students’ graduation time using machine learning algorithms. Int. J. Mod. Educ. Comput. Sci. 2019, 11, 1–13. [Google Scholar] [CrossRef]
- Alom, B.M.; Courtney, M. Educational data mining: A case study perspectives from primary to university education in australia. Int. J. Inf. Technol. Comput. Sci. 2018, 10, 1–9. [Google Scholar] [CrossRef]
- Kovacic, Z. Early prediction of student success: Mining students’ enrolment data. In Proceedings of the Informing Science + Information Technology Education Joint Conference, Cassino, Italy, 19–24 June 2010. [Google Scholar]
- Yadav, S.K.; Bharadwaj, B.; Pal, S. Mining Education data to predict student’s retention: A comparative study. arXiv 2012, arXiv:1203.2987. [Google Scholar]
- Pandey, M.; Taruna, S. A multi-level classification model pertaining to the student’s academic performance prediction. Int. J. Adv. Eng. Technol. 2014, 7, 1329. [Google Scholar]
- De Morais, A.M.; Araujo, J.M.; Costa, E.B. Monitoring student performance using data clustering and predictive modelling. In Proceedings of the 2014 IEEE Frontiers in Education Conference (FIE) Proceedings, Madrid, Spain, 22–25 October 2014; pp. 1–8. [Google Scholar]
- Kolo, D.K.; Adepoju, S.A. A decision tree approach for predicting students academic performance. Int. J. Educ. Manag. Eng. 2015, 5, 12–19. [Google Scholar]
- Sikder, M.F.; Uddin, M.J.; Halder, S. Predicting students yearly performance using neural network: A case study of BSMRSTU. In Proceedings of the 2016 5th International Conference on Informatics, Electronics and Vision (ICIEV), Dhaka, Bangladesh, 13–14 May 2016; pp. 524–529. [Google Scholar]
- Saa, A.A. Educational data mining & students’ performance prediction. Int. J. Adv. Comput. Sci. Appl. 2016, 7, 212–220. [Google Scholar]
- Hsieh, Y.-Z.; Su, M.-C.; Jeng, Y.-L. The jacobian matrix-based learning machine in student. In Proceedings of the Emerging Technologies for Education: Second International Symposium, SETE 2017, Cape Town, South Africa, 20–22 September 2017; pp. 469–474. [Google Scholar]
- Han, M.; Tong, M.; Chen, M.; Liu, J.; Liu, C. Application of ensemble algorithm in students’ performance prediction. In Proceedings of the 2017 6th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI), Hamamatsu, Japan, 9–13 July 2017; pp. 735–740. [Google Scholar]
- Tampakas, V.; Livieris, I.E.; Pintelas, E.; Karacapilidis, N.; Pintelas, P. Prediction of students’ graduation time using a two-level classification algorithm. In Proceedings of the Technology and Innovation in Learning, Teaching and Education: First International Conference, TECH-EDU 2018, Thessaloniki, Greece, 20–22 June 2018; pp. 553–565. [Google Scholar]
- Hussain, S.; Dahan, N.A.; Ba-Alwib, F.M.; Ribata, N. Educational data mining and analysis of students’ academic performance using WEKA. Indones. J. Electr. Eng. Comput. Sci. 2018, 9, 447–459. [Google Scholar] [CrossRef]
- Miguéis, V.L.; Freitas, A.; Garcia, P.J.; Silva, A. Early segmentation of students according to their academic performance: A predictive modelling approach. Decis. Support Syst. 2018, 115, 36–51. [Google Scholar] [CrossRef]
- Salal, Y.; Abdullaev, S.; Kumar, M. Educational data mining: Student performance prediction in academic. Int. J. Eng. Adv. Technol. 2019, 8, 54–59. [Google Scholar]
- Berens, J.; Schneider, K.; Görtz, S.; Oster, S.; Burghoff, J. Early Detection of Students at Risk–Predicting Student Dropouts Using Administrative Student Data and Machine Learning Methods; Leibniz-Informationszentrum Wirtschaft: Kiel, Germany, 2018. [Google Scholar]
- Liao, S.N.; Zingaro, D.; Thai, K.; Alvarado, C.; Griswold, W.G.; Porter, L. A robust machine learning technique to predict low-performing students. ACM Trans. Comput. Educ. (TOCE) 2019, 19, 1–19. [Google Scholar] [CrossRef]
- Chen, X.; Xie, H.; Hwang, G.-J. A multi-perspective study on artificial intelligence in education: Grants, conferences, journals, software tools, institutions, and researchers. Comput. Educ. Artif. Intell. 2020, 1, 100005. [Google Scholar] [CrossRef]
- Chen, X.; Zou, D.; Xie, H.; Cheng, G.; Liu, C. Two decades of artificial intelligence in education. Educ. Technol. Soc. 2022, 25, 28–47. [Google Scholar]
- Hwang, G.-J.; Xie, H.; Wah, B.W.; Gašević, D. Vision, Challenges, Roles and Research İssues of Artificial Intelligence in Education; Elsevier: Amsterdam, The Netherlands, 2020; Volume 1, p. 100001. [Google Scholar]
- Chen, X.; Xie, H.; Zou, D.; Hwang, G.-J. Application and theory gaps during the rise of artificial intelligence in education. Comput. Educ. Artif. Intell. 2020, 1, 100002. [Google Scholar] [CrossRef]
- Rastrollo-Guerrero, J.L.; Gómez-Pulido, J.A.; Durán-Domínguez, A. Analyzing and predicting students’ performance by means of machine learning: A review. Appl. Sci. 2020, 10, 1042. [Google Scholar] [CrossRef]
- Akmeşe, Ö.F.; Kör, H.; Erbay, H. Use of machine learning techniques for the forecast of student achievement in higher education. Inf. Technol. Learn. Tools 2021, 82, 297–311. [Google Scholar]
- Pallathadka, H.; Wenda, A.; Ramirez-Asís, E.; Asís-López, M.; Flores-Albornoz, J.; Phasinam, K. Classification and prediction of student performance data using various machine learning algorithms. Mater. Today Proc. 2023, 80, 3782–3785. [Google Scholar] [CrossRef]
- Siddique, A.; Jan, A.; Majeed, F.; Qahmash, A.I.; Quadri, N.N.; Wahab, M.O.A. Predicting academic performance using an efficient model based on fusion of classifiers. Appl. Sci. 2021, 11, 11845. [Google Scholar] [CrossRef]
- Bognár, L.; Fauszt, T. Factors and conditions that affect the goodness of machine learning models for predicting the success of learning. Comput. Educ. Artif. Intell. 2022, 3, 100100. [Google Scholar] [CrossRef]
- Bacci, S.; Bertaccini, B. A Mixture Hidden Markov Model to Mine Students’ University Curricula. Data 2022, 7, 25. [Google Scholar] [CrossRef]
- Alboaneen, D.; Almelihi, M.; Alsubaie, R.; Alghamdi, R.; Alshehri, L.; Alharthi, R. Development of a web-based prediction system for students’ academic performance. Data 2022, 7, 21. [Google Scholar] [CrossRef]
- Parkinson, J.; Cutts, Q. Chairs’ AWARD: Investigating the relationship between spatial skills and computer science. ACM Inroads 2019, 10, 64–73. [Google Scholar] [CrossRef]
- Carroll, J.B. Human cognitive abilities: A survey of factor-analytic studies//Review. Can. J. Exp. Psychol. 1993, 47, 763. [Google Scholar]
- Parkinson, J.; Cutts, Q.; Draper, S. Relating spatial skills and expression evaluation. In Proceedings of the United Kingdom & Ireland Computing Education Research Conference, Glasgow, UK, 3–4 September 2020; pp. 17–23. [Google Scholar]
- Parkinson, J.; Cutts, Q. The effect of a spatial skills training course in introductory computing. In Proceedings of the 2020 ACM Conference on Innovation and Technology in Computer Science Education, Trondheim, Norway, 17–18 June 2020; pp. 439–445. [Google Scholar]
- Cooper, S.; Wang, K.; Israni, M.; Sorby, S. Spatial skills training in introductory computing. In Proceedings of the Eleventh Annual İnternational Conference on İnternational Computing Education Research, Omaha, NE, USA, 9–13 August 2015; pp. 13–20. [Google Scholar]
- Kinnunen, P.; Malmi, L. CS minors in a CS1 course. In Proceedings of the Fourth International Workshop on Computing Education Research, Sydney, Australia, 6–7 September 2008; pp. 79–90. [Google Scholar]
- Lahtinen, E.; Ala-Mutka, K.; Järvinen, H.-M. A study of the difficulties of novice programmers. ACM Sigcse Bull. 2005, 37, 14–18. [Google Scholar] [CrossRef]
- Kim, J.A.; Kim, H.J. Flipped learning of scratch programming with code.org. In Proceedings of the 2017 9th International Conference on Education Technology and Computers, Barcelona, Spain, 20–22 December 2017; pp. 68–72. [Google Scholar]
- Kalelioğlu, F.; Jeong, A.; Kim, H.J. A new wayFlipped learning of teachingscratch programming skills to K-12 students: Code.with code.org. In Proceedings of the 2017 9th International Conference on Education Technology and Computers in Human Behavior, Barcelona, Spain, 20–22 December 2017; Volume 52, p. 200-21068-21072. [Google Scholar]
- Farrell, C. Predicting (and creating) success in CS1. Issues Inf. Syst. 2006, 7, 259–263. [Google Scholar]
- Doane, W.E. Predicting Student Performance in İntroductory Computer Programming Courses; State University of New York: Albany, NY, USA, 2008. [Google Scholar]
- Kolikant, Y.-D.; Pollack, S. Improving mathematically oriented programming skills in Computer Science studies. In Proceedings of the 32nd Annual Frontiers in Education, Boston, MA, USA, 6–9 November 2002; p. T1G. [Google Scholar]
- Fincher, S.; Baker, B.; Box, I.; Cutts, Q.; de Raadt, M.; Haden, P.; Hamer, J.; Hamilton, M.; Lister, R.; Petre, M. Programmed to Succeed?: A Multi-National, Multi-İnstitutional Study of İntroductory Programming Courses; University of Kent: Canterbury, UK, 2005. [Google Scholar]
- Du, J.; Wimmer, H.; Rada, R. “Hour of Code”: Can It Change Students’ Attitudes Toward Programming? J. Inf. Technol. Educ. Innov. Pract. 2016, 15, 53. [Google Scholar] [CrossRef] [PubMed]
- Lambić, D.; Đorić, B.; Ivakić, S. Investigating the effect of the use of code. org on younger elementary school students’ attitudes towards programming. Behav. Inf. Technol. 2021, 40, 1784–1795. [Google Scholar] [CrossRef]
- Kalelioğlu, F. A new way of teaching programming skills to K-12 students: Code.org. Comput. Hum. Behav. 2015, 52, 200–210. [Google Scholar] [CrossRef]
- Mythili, M.; Shanavas, A.M. An Analysis of students’ performance using classification algorithms. IOSR J. Comput. Eng. 2014, 16, 63–69. [Google Scholar] [CrossRef]
- Anuradha, C.; Velmurugan, T. A comparative analysis on the evaluation of classification algorithms in the prediction of students performance. Indian J. Sci. Technol. 2015, 8, IPL057. [Google Scholar] [CrossRef]
- Almarabeh, H. Analysis of students’ performance by using different data mining classifiers. Int. J. Mod. Educ. Comput. Sci. 2017, 9, 9. [Google Scholar] [CrossRef]
- Zafari, M.; Sadeghi-Niaraki, A.; Choi, S.-M.; Esmaeily, A. A practical model for the evaluation of high school student performance based on machine learning. Appl. Sci. 2021, 11, 11534. [Google Scholar] [CrossRef]
- Bagunaid, W.; Chilamkurti, N.; Veeraraghavan, P. AISAR: Artificial Intelligence-Based Student Assessment and Recommendation System for E-Learning in Big Data. Sustainability 2022, 14, 10551. [Google Scholar] [CrossRef]
- Triayudi, A.; Widyarto, W.O. Educational data mining analysis using classification techniques. J. Phys. Conf. Ser. 2021, 1933, 012061. [Google Scholar] [CrossRef]
- Frank, M.R.; Autor, D.; Bessen, J.E.; Brynjolfsson, E.; Cebrian, M.; Deming, D.J.; Feldman, M.; Groh, M.; Lobo, J.; Moro, E. Toward understanding the impact of artificial intelligence on labor. Proc. Natl. Acad. Sci. USA 2019, 116, 6531–6539. [Google Scholar] [CrossRef] [PubMed]
Puzzle | Line Number | Content | Concept | Class |
---|---|---|---|---|
L1 | 2 | forward | sequence | 1 |
L2 | 3 | forward | sequence | |
L3 | 4 | forward-direction | sequence | |
L4 | 5 | forward-direction | sequence | |
L5 | 8 | forward-direction | sequence | |
L6 | 2 | forward-repeat | loop | 2 |
L7 | 3 | forward-direction-repeat | loop | |
L8 | 5 | forward-direction-2 repeat | loop | |
L9 | 3 | loop in loop | loop | |
L10 | 2 | conditional loop | while loop | |
L11 | 4 | conditional loop | while loop | |
L12 | 5 | conditional loop (with 5-line code) | while loop | |
L13 | 5 | conditional loop (with 5-line code) | while loop | |
L14 | 3 | if loop- | if | 3 |
L15 | 5 | if loop- | if | |
L16 | 4 | if loop- | if | |
L17 | 4 | if loop- | if | |
L18 | 4 | else if | else if | |
L19 | 4 | else if | else if | |
L20 | 4 | else if | else if |
Classification Method | Accuracy | Kappa | Precision | Recall | F-Score |
---|---|---|---|---|---|
Cubic SVM | 94.8% | 91.5% | 93.6% | 94.8% | 94.1% |
Fine DT | 89.0% | 82.5% | 88.2% | 87.2% | 87.6% |
Medium KNN | 80.8% | 70.0% | 79.9% | 76.4% | 77.8% |
QD | 84.3% | 75.0% | 83.5% | 82.0% | 82.6% |
Paper | Purpose | Algorithm | Accuracy | Data |
---|---|---|---|---|
[62] | To examine and evaluate the performance of school students using classification algorithms. | RF | 89.23% | 100 |
[63] | To apply classification algorithms to the prediction of students’ performance on semester-ending university exams. | NB | 70.00% | 250 |
[64] | To investigate and evaluate the performance of university students using various classification methods. | Bayes net | 92.00% | 225 |
[65] | To examine and evaluate the performance of high school students | SVM | 78.00% | 459 |
[66] | To investigate and evaluate the performance of students | J48 | 97.21% | 32,593 |
[67] | To investigate and evaluate the performance of university students | SVM | 80.00% | 340 |
The current study | To classify the coding abilities of middle school children using spatial test and Code.org platform information. | Cubic SVM | 94.80% | 400 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Çetinkaya, A.; Baykan, Ö.K.; Kırgız, H. Analysis of Machine Learning Classification Approaches for Predicting Students’ Programming Aptitude. Sustainability 2023, 15, 12917. https://doi.org/10.3390/su151712917
Çetinkaya A, Baykan ÖK, Kırgız H. Analysis of Machine Learning Classification Approaches for Predicting Students’ Programming Aptitude. Sustainability. 2023; 15(17):12917. https://doi.org/10.3390/su151712917
Chicago/Turabian StyleÇetinkaya, Ali, Ömer Kaan Baykan, and Havva Kırgız. 2023. "Analysis of Machine Learning Classification Approaches for Predicting Students’ Programming Aptitude" Sustainability 15, no. 17: 12917. https://doi.org/10.3390/su151712917
APA StyleÇetinkaya, A., Baykan, Ö. K., & Kırgız, H. (2023). Analysis of Machine Learning Classification Approaches for Predicting Students’ Programming Aptitude. Sustainability, 15(17), 12917. https://doi.org/10.3390/su151712917