Artificial Intelligence-Based Surveillance of Tuberculosis in South Africa Using Google Trends Data

Authors

DOI:

https://doi.org/10.3991/ijoe.v21i12.56193

Keywords:

Tuberculosis, Disease Prediction, Artificial Intelligence,, Digital Epidemiology, Google Trends, South Africa

Abstract


Tuberculosis (TB) remains a significant global public health challenge and the deadliest infectious disease, according to the World Health Organization (WHO) Global TB Report of 2024. This study explores integrating Google Trends (GT) data with machine learning (ML) methods to forecast TB incidence in South Africa. Pearson correlation analysis identified eight TB-related search terms with moderate to strong correlations to official surveillance data from the National Institute for Communicable Diseases (NICD) between 2012–2021. Four ML models were compared using rolling-window cross-validation: partial least squares (PLS), LASSO regression, support vector machine (SVM), and long short-term memory (LSTM) networks. The PLS model achieved superior performance, significantly outperforming more complex deep learning approaches. These findings demonstrate that simpler linear models can effectively leverage GT data to complement traditional TB surveillance systems in South Africa.

Downloads

Published

2025-10-10

How to Cite

Hlatshwayo, N. S., & Olukanmi, S. O. (2025). Artificial Intelligence-Based Surveillance of Tuberculosis in South Africa Using Google Trends Data. International Journal of Online and Biomedical Engineering (iJOE), 21(12), 94–105. https://doi.org/10.3991/ijoe.v21i12.56193

Issue

Section

Papers