Swin Transformer with Auxiliary Mask Supervision for Stroke Lesion Segmentation in Brain MRI

Authors

  • Batyrkhan Omarov Narxoz University, Almaty, Kazakhstan; International Information Technology University, Almaty, Kazakhstan; Al-Farabi Kazakh National University, Almaty, Kazakhstan https://orcid.org/0000-0002-8341-7113
  • Zhanseri Ikram Narxoz University, Almaty, Kazakhstan https://orcid.org/0009-0001-8059-6590

DOI:

https://doi.org/10.3991/ijoe.v21i14.57029

Keywords:

Brain stroke, Swin transformers, Weakly supervision, MRI Imaging, Medical Image Segmentation, Image Segmentation, , Deep Learning, Vision Transformer

Abstract


Accurate segmentation of stroke lesions in brain magnetic resonance imaging (MRI) is critical for early diagnosis and effective intervention. Existing convolutional neural networks (CNNs) have shown promising results but often struggle with global contextual reasoning and generalization in the presence of small, diffuse, or anatomically variable lesions. To address these limitations, we introduce a novel segmentation framework that integrates a Swin Transformer backbone with an auxiliary supervision mechanism based on bounding box-derived pseudo masks. Unlike prior transformer-based models that rely solely on end-to-end attention, our method introduces intermediate supervision via an auxiliary branch, which guides early layers to focus on lesion-relevant regions using weak annotations. This dual-path strategy enhances spatial representation learning while mitigating the annotation burden typically required for full supervision. Evaluated on the ISLES 2024 dataset, one of the most challenging benchmarks for ischemic lesion segmentation, the proposed model achieves superior performance in dice similarity, precision, and recall when compared to recent state-of-the-art CNN and vision transformer architectures. Qualitative results further highlight its robustness in capturing diverse lesion morphologies. By combining weak supervision with transformerbased learning, our approach contributes a scalable and annotation-efficient solution to neuroimaging, advancing the field of automated stroke diagnosis with improved accuracy and clinical feasibility.

Downloads

Published

2025-12-12

How to Cite

Omarov, B., & Ikram, Z. (2025). Swin Transformer with Auxiliary Mask Supervision for Stroke Lesion Segmentation in Brain MRI. International Journal of Online and Biomedical Engineering (iJOE), 21(14), pp. 122–137. https://doi.org/10.3991/ijoe.v21i14.57029

Issue

Section

Papers