Artificial Intelligence-Based Surveillance of Tuberculosis in South Africa Using Google Trends Data
DOI:
https://doi.org/10.3991/ijoe.v21i12.56193
Keywords:
Tuberculosis, Disease Prediction, Artificial Intelligence, Digital Epidemiology, Google Trends, South Africa
Abstract
Tuberculosis (TB) remains a significant global public health challenge and the deadliest infectious disease, according to the World Health Organization (WHO) Global TB Report of 2024. This study explores integrating Google Trends (GT) data with machine learning (ML) methods to forecast TB incidence in South Africa. Pearson correlation analysis identified eight TB-related search terms with moderate to strong correlations to official surveillance data from the National Institute for Communicable Diseases (NICD) from 2012 to 2021. Four ML models were compared using rolling-window cross-validation: partial least squares (PLS), LASSO regression, support vector machine (SVM), and long short-term memory (LSTM) networks. The PLS model achieved superior performance, significantly outperforming the more complex deep learning approaches. These findings demonstrate that simpler linear models can effectively leverage GT data to complement traditional TB surveillance systems in South Africa.
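The two methodological steps named in the abstract, Pearson-based screening of search terms and rolling-window cross-validation, can be illustrated with a minimal sketch. It is not the authors' pipeline: the series values, the term names, the 0.5 correlation threshold, the window length, and all function names are hypothetical, and a one-predictor least-squares line stands in for the PLS, LASSO, SVM, and LSTM models compared in the study.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def select_terms(trends, cases, threshold=0.5):
    """Keep search terms whose |r| with the case series meets the threshold.

    The 0.5 cut-off is an assumption, not a value taken from the paper.
    """
    selected = {}
    for term, series in trends.items():
        r = pearson(series, cases)
        if abs(r) >= threshold:
            selected[term] = r
    return selected


def rolling_windows(n_obs, train_size, horizon=1):
    """Yield (train_indices, test_indices) for a fixed-length sliding window."""
    start = 0
    while start + train_size + horizon <= n_obs:
        train = list(range(start, start + train_size))
        test = list(range(start + train_size, start + train_size + horizon))
        yield train, test
        start += horizon


def fit_line(x, y):
    """Ordinary least squares for one predictor; returns (slope, intercept)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    return slope, my - slope * mx


def rolling_rmse(x, y, train_size, horizon=1):
    """Out-of-sample RMSE of the stand-in linear model over all windows."""
    squared_errors = []
    for train, test in rolling_windows(len(y), train_size, horizon):
        slope, intercept = fit_line([x[i] for i in train], [y[i] for i in train])
        squared_errors += [(slope * x[i] + intercept - y[i]) ** 2 for i in test]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical toy data: one informative search term and one unrelated term.
cases = [10, 12, 15, 14, 18, 20, 19, 23]
trends = {
    "tb symptoms": [2 * c + 1 for c in cases],
    "weather": [5, 3, 5, 3, 5, 3, 5, 3],
}

selected = select_terms(trends, cases)
error = rolling_rmse(trends["tb symptoms"], cases, train_size=4)
```

Each window trains only on observations that precede its test point, so the error estimate respects the time ordering of the surveillance series. Substituting a multivariate model such as PLS would change only the fitting step inside `rolling_rmse`.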
License
Copyright (c) 2025 Nqobile S. Hlatshwayo, Seun O. Olukanmi

This work is licensed under a Creative Commons Attribution 4.0 International License.

