A Comparative Study of Anemia Classification Algorithms for International and Newly CBC Datasets


  • Safa S. Abdul-Jabbar Lecturer in Department of Computer Sciences, College of science for women, University of Baghdad https://orcid.org/0000-0002-4063-7917
  • Alaa k. Farhan Computer Science Department, University of Technology, Baghdad , Iraq
  • Alexander S. Luchinin Federal State-Financed Scientific Institution Kirov Research/ Institute of Hematology and Blood Transfusion under the Federal Medical Biological Agency, Kirov, Russia https://orcid.org/0000-0002-5016-210X




Anemia Diagnosis Model, Data Analytics Tools, Analytics Methodologies, Hematology, CBC Dataset


Data generated from modern applications and the internet in healthcare is extensive and rapidly expanding. Therefore, one of the significant success factors for any application is understanding and extracting meaningful information using digital analytics tools. These tools will positively impact the application's performance and handle the challenges that can be faced to create highly consistent, logical, and information-rich summaries. This paper contains three main objectives: First, it provides several analytics methodologies that help to analyze datasets and extract useful information from them as preprocessing steps in any classification model to determine the dataset characteristics. Also, this paper provides a comparative study of several classification algorithms by testing 12 different classifiers using two international datasets to provide an accurate indicator of their efficiency and the future possibility of combining efficient algorithms to achieve better results. Finally, building several CBC datasets for the first time in Iraq helps to detect blood diseases from different hospitals. The outcome of the analysis step is used to help researchers to select the best system structure according to the characteristics of each dataset for more organized and thorough results. Also, according to the test results, four algorithms achieved the best accuracy (Logitboost, Random Forest, XGBoost, Multilayer Perceptron). Then use the Logitboost algorithm that achieved the best accuracy to classify these new datasets. In addition, as future directions, this paper helps to investigate the possibility of combining the algorithms to utilize benefits and overcome their disadvantages.




How to Cite

Abdul-Jabbar, S. S., Farhan , A. K., & Luchinin, A. S. (2023). A Comparative Study of Anemia Classification Algorithms for International and Newly CBC Datasets. International Journal of Online and Biomedical Engineering (iJOE), 19(06), pp. 141–157. https://doi.org/10.3991/ijoe.v19i06.38157