Improving the Accuracy of Oncology Diagnosis: A Machine Learning-Based Approach to Cancer Prediction

Authors

DOI:

https://doi.org/10.3991/ijoe.v20i11.49139

Keywords:

machine learning, cancer, prediction, tumor, models

Abstract


Cancer ranks among the most lethal illnesses worldwide, and predicting its onset can be a crucial factor in enhancing people’s quality of life by taking preventive measures to improve treatment and survival. This study conducted comparative research to determine the machine learning model with the highest accuracy for tumor type classification, distinguishing between malignant (cancer) and benign tumors. The models evaluated include decision tree (DT), naive bayes (NB), extra trees classifier (ETM), random forest (RF), K-means clustering (K-means), logistic regression (LR), adaptive boosting (AdaBoost), gradient boosting (GB), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost) to identify the one with the best accuracy. The models were trained using a dataset of 569 records and a total of 32 variables, containing patient information and tumor characteristics. The study was structured into sections, such as related studies, descriptions of the models, case study development, results, discussion, and conclusions. The models’ performance was evaluated based on metrics of precision, sensitivity, accuracy, and F1 score. Following the training, the results positioned the XGBoost model as having the best performance, achieving 98% precision, accuracy, sensitivity, and F1 score.

Author Biographies

Michael Cabanillas-Carbonell, Universidad Privada del Norte

Michael Cabanillas-Carbonell, is an Engineer and Master's in Systems Engineering, pursuing a PhD in Systems Engineering and Telecommunications at the Polytechnic University of Madrid. Conference Chair of the Engineering International Research Conference IEEE Peru EIRCON. Research Professor and International lecturer specializing in software development, artificial intelligence, machine learning, business intelligence, and augmented reality. He’s authored more than 25 scientific articles indexed in IEEE Xplore and Scopus (Email: mcabanillas@ieee.org).

Joselyn Zapata-Paulini, Universidad Continental

Joselyn Zapata-Paulini, is Bachelor in Systems Engineering and Computer Science from the Universidad de Ciencias y Humanidades, Master in Science with environmental management and sustainable development at the Universidad Continental, Peru. She has several international publications. Specialized in the areas of augmented reality, virtual reality, machine learning and the internet of things. Author of scientific articles indexed in IEEE Xplore, Scopus, and WoS (Email: 70994337@continental.edu.pe).

Downloads

Published

2024-08-08

How to Cite

Cabanillas-Carbonell, M., & Zapata-Paulini, J. (2024). Improving the Accuracy of Oncology Diagnosis: A Machine Learning-Based Approach to Cancer Prediction. International Journal of Online and Biomedical Engineering (iJOE), 20(11), pp. 102–122. https://doi.org/10.3991/ijoe.v20i11.49139

Issue

Section

Papers