Educational Data Mining: Employing Machine Learning Techniques and Hyperparameter Optimization to Improve Students’ Academic Performance

Authors

  • Mohamed Bellaj Higher Normal School (ENS), Abdelmalek Essaadi University, Tetuan, Morocco. https://orcid.org/0000-0002-7057-6107
  • Ahmed Ben Dahmane Higher Normal School (ENS), Abdelmalek Essaadi University, Tetuan, Morocco. https://orcid.org/0000-0003-3843-4800
  • Said Boudra Laboratory of Chemistry, Applied Biology and Biotechnology, Faculty of Science, CRMEF, Tangier, Morocco.
  • Mohammed Lamarti Sefian Higher Normal School (ENS), Abdelmalek Essaadi University, Tetuan, Morocco. https://orcid.org/0000-0001-8270-2660

DOI:

https://doi.org/10.3991/ijoe.v20i03.46287

Keywords:

educational data mining, machine learning algorithms, students’ performance, hyper-parameter tuning, cross-validation.

Abstract


Educational data mining (EDM) is a specialized field within data mining that focuses on extracting valuable insights from academic data across high school and university levels. A common practice in EDM involves predicting students’ grades to identify at-risk individuals and improve the efficiency of academic tasks. This knowledge benefits students, parents, and institutions equally. Early detection enables interventions that improve student performance. The literature presents various prediction strategies, each with its own unique advantages and disadvantages. This study aims to comprehensively evaluate the methods, tools, and applications of machine learning (ML) and data mining (DM) in education. The main goal is to improve the accuracy of predicting academic achievements by employing eight widely recognized ML algorithms: naïve bayes (NB), k-nearest neighbors (KNN), support vector machine (SVM), random forest (RF), logistic regression (LR), extreme gradient boost (XGBOOST), and ensemble voting classifier (EVC). The focus is on improving data quality by eliminating instances of noise. Performance evaluation involves assessing parameters such as accuracy, precision, F-measure, and recall. Incorporating cross-validation and hyperparameter tuning improves classification accuracy. The ML models outperform other ensemble approaches, providing a valuable tool for predicting student performance and assisting educators in making proactive decisions through timely alerts.

Downloads

Published

2024-02-27

How to Cite

Bellaj, M., Ben Dahmane, A., Boudra , S. ., & Lamarti Sefian, M. . (2024). Educational Data Mining: Employing Machine Learning Techniques and Hyperparameter Optimization to Improve Students’ Academic Performance. International Journal of Online and Biomedical Engineering (iJOE), 20(03), pp. 55–74. https://doi.org/10.3991/ijoe.v20i03.46287

Issue

Section

Papers