Depression Prediction Using Enhanced Machine Learning Pipeline

Authors

  • Seyed Ebrahim Hosseini School of IT, Whitecliffe, New Zealand https://orcid.org/0000-0002-2947-5582
  • Shahbaz Pervez School of IT, Whitecliffe, New Zealand https://orcid.org/0000-0003-0232-2356
  • Paula Luz Manalo School of IT, Whitecliffe, New Zealand
  • Muhammad Javed Iqbal UET Taxila, Taxila, Pakistan
  • Muazma Shahbaz School of IT, Whitecliffe, New Zealand

DOI:

https://doi.org/10.3991/ijoe.v22i01.57957

Keywords:

Depression, Machine Learning, Feature Selection, Synthetic Minority Oversampling Technique

Abstract


Depression is a common mental health problem that has a big impact on a person’s mood, conduct, and ability to function in general. Existing studies have problems while dealing with a large number of features and selecting suitable algorithms at different stages of model development. However, current complex systems are very difficult to understand and use by doctors. The objective of this study is to propose suitable feature extraction and classification models to predict depression that are both cost-effective and easy to understand. The study concludes that the most important features were contentment with the surroundings, financial stress, sleeplessness, anxiety, and psychological issues such as conflict, abuse, and feeling inferior. The findings suggest that Boruta with logistic regression (LR) had the best accuracy of 93.3%, which is better than existing methods.

Downloads

Published

2026-01-22

How to Cite

Hosseini, S. E., Pervez, S., Manalo, P. L., Iqbal, M. J., & Shahbaz, M. (2026). Depression Prediction Using Enhanced Machine Learning Pipeline. International Journal of Online and Biomedical Engineering (iJOE), 22(01), pp. 147–163. https://doi.org/10.3991/ijoe.v22i01.57957

Issue

Section

Papers