Enhancing Classification Performance through FeatureBoostThyro: A Comparative Study of Machine Learning Algorithms and Feature Selection

Authors

  • Deepali Omprakash Bhende G. H. Raisoni University, Saikheda, MP https://orcid.org/0009-0001-4937-8915
  • Gopal Sakarkar 1) Department of Computer Science, G. H. Raisoni University, Saikheda, MP
  • Punam Khandar Department of Computer Applications, RCOEM, Nagpur
  • Satyajit S Uparkar Department of Computer Applications, RCOEM, Nagpur
  • Arvind Bhave IIC MCA Department, RTMNU, Nagpur

DOI:

https://doi.org/10.3991/ijoe.v20i04.45413

Keywords:

Machine Learning, Thyroid Disorder, Feature Selection, Ensemble Model, Information Gain, Logistic Regression, Recursive Feature elimination

Abstract


Early-stage prediction of a disease is an important and challenging task. The application of machine learning techniques is playing an important role in this era. Thyroid is one of the chronic endocrine diseases, and approximately 42 million people in India are affected by this disease. This paper presents a comprehensive investigation into the enhancement of classification performance through the novel ‘FeatureBoostThyro’ (FBT) model. The study evaluates various machine learning algorithms, including stochastic gradient descent (SGD), K nearest neighbor (KNN), logistic regression (LR), naive bayes (NB), and support vector machine (SVM), in conjunction with diverse feature selection methods. The research systematically explores the impact of feature selection techniques such as information gain, relief F, chi-square, gini index, forward selection, backward selection, recursive feature elimination, and LASSO on model performance across the chosen algorithms. The analysis reveals notable variations in performance metrics, including accuracy, precision, recall, and F1-score, providing valuable insights into the interplay between algorithm and feature selection. One main contribution of this research is the introduction of the FBT model, which consistently outperforms other models across various feature selection methods, making it a promising tool for addressing complex classification tasks. The findings contribute to a broader understanding of model selection and optimization in machine learning applications. The proposed model undergoes evaluation using two distinct datasets: the primary dataset acquired from Lata Mangeshkar Hospital in Nagpur and the secondary dataset obtained from the UCI dataset.

Author Biographies

Deepali Omprakash Bhende, G. H. Raisoni University, Saikheda, MP

Department of Computer Science

Gopal Sakarkar, 1) Department of Computer Science, G. H. Raisoni University, Saikheda, MP

2) Dr. Vishwanath Karad, MIT World Peace University, Paud Road, Pune, India

Downloads

Published

2024-03-04

How to Cite

Bhende, D. O., Sakarkar, G., Khandar, P., Uparkar, S. S., & Bhave, A. (2024). Enhancing Classification Performance through FeatureBoostThyro: A Comparative Study of Machine Learning Algorithms and Feature Selection . International Journal of Online and Biomedical Engineering (iJOE), 20(04), pp. 29–42. https://doi.org/10.3991/ijoe.v20i04.45413

Issue

Section

Papers