Diabetes Prediction: Optimization of Machine Learning through Feature Selection and Dimensionality Reduction

Authors

  • Abd Allah Aouragh Faculty of Sciences and Techniques, Hassan 1st University Settat, Morocco https://orcid.org/0000-0003-1355-8639
  • Mohamed Bahaj
  • Fouad Toufik

DOI:

https://doi.org/10.3991/ijoe.v20i08.47765

Keywords:

Diabetes, Machine Learning, Balancing, Feature Selection, Dimensionality Reduction, Grid Search

Abstract


Diabetes, a pervasive global health concern, presents diagnostic challenges due to its nuanced onset and far-reaching implications. Traditional diagnostic approaches, reliant on time-consuming assessments, necessitate a paradigm shift towards more efficient methodologies. In response, this study introduces a diagnostic support system leveraging the power of optimized machine learning algorithms. Addressing class imbalance within a dataset comprising 768 records, our methodology intricately weaves together feature selection, dimensionality reduction techniques, and grid search optimization. Specifically, the Extra Trees model, fine-tuned via grid search, emerges as the most potent, showcasing remarkable performance metrics: an accuracy score of 92.5%, an F1-score of 93.7%, and an AUC-ROC of 92.47%. These findings underscore the pivotal role of machine learning in reshaping diabetes diagnosis, offering transformative possibilities for global healthcare enhancement.

Author Biographies

Mohamed Bahaj

Mohamed Bahaj is a professor of computer sciences at the Faculty of Sciences and Techniques, Hassan First University, Settat, Morocco, where he conducts research in Artificial Intelligence, Software Engineering, and Data Mining.

Fouad Toufik

Fouad Toufik is a professor of computer sciences at the Higher School of Technology SALE, Mohammed V University, Morocco. His research interests focus on artificial intelligence, big data, and database architectures

Downloads

Published

2024-05-21

How to Cite

Aouragh, A. A., Bahaj, M., & Toufik, F. (2024). Diabetes Prediction: Optimization of Machine Learning through Feature Selection and Dimensionality Reduction. International Journal of Online and Biomedical Engineering (iJOE), 20(08), pp. 100–114. https://doi.org/10.3991/ijoe.v20i08.47765

Issue

Section

Papers