Decoding brain tumor insights: Evaluating CAM variants with 3D U-Net for segmentation
Main Article Content
Abstract
Brain tumor segmentation is critical for effective diagnosis and treatment planning. While, conventional manual segmentation techniques are seen inefficient and variable, highlighting the need for automated methods. This study enhances medical image analysis, particularly in brain tumor segmentation by improving the explainability and accuracy of deep learning models, which are essential for clinical trust. Using the 3D U-Net architecture with the BraTS 2020 dataset, the study achieved precise localization and detailed segmentation with the mean recall values of 0.8939 for Whole Tumor (WT), 0.7941 for Enhancing Tumor (ET), and 0.7846 for Tumor Core (TC). The Dice coefficients were 0.9065 for WT, 0.8180 for TC, and 0.7715 for ET. By integrating explainable AI techniques, such as Class Activation Mapping (CAM) and its variants (Grad-CAM, Grad-CAM++, and Score-CAM), the study ensures high segmentation accuracy and transparency. Grad-CAM, in this case, provided the most reliable and detailed visual explanations, significantly enhancing model interpretability for clinical applications. This approach not only enhances the accuracy of brain tumor segmentation but also builds clinical trust by making model decisions more transparent and understandable. Finally, the combination of 3D U-Net and XAI techniques supports more effective diagnosis, treatment planning, and patient care in brain tumor management.
Downloads
Article Details
This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright
Open Access authors retain the copyrights of their papers, and all open access articles are distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution and reproduction in any medium, provided that the original work is properly cited.
The use of general descriptive names, trade names, trademarks, and so forth in this publication, even if not specifically identified, does not imply that these names are not protected by the relevant laws and regulations.
While the advice and information in this journal are believed to be true and accurate on the date of its going to press, neither the authors, the editors, nor the publisher can accept any legal responsibility for any errors or omissions that may be made. The publisher makes no warranty, express or implied, with respect to the material contained herein.
This work is licensed under a Creative Commons Attribution 4.0 International License.
References
D. Patel, D. Patel, R. Saxena, and T. Akilan, Multi-class brain tumor segmentation using graph attention network, in 2023 8th International Conference on Signal and Image Processing (ICSIP), IEEE, 2023, pp. 196–201.
T. Magadza and S. Viriri, Deep learning for brain tumor segmentation: A survey of state-of-the-art, Journal of Imaging 7 (2) (2021) 19.
H. A. Nugroho, T. Kirana, V. Pranowo, and A. H. T. Hutami, Optic cup segmentation using adaptive threshold and morphological image processing, Commun. Sci. Technol. 4(2) (2019) 63-67.
S. Bauer, R.Wiest, L.-P. Nolte, and M. Reyes, A survey of mri-based medical image analysis for brain tumor studies, Phys. Med. Biol. 58 (13) (2013) R97–R129.
B. H. Menze, et al., The multimodal brain tumor image segmentation benchmark (BRATS), IEEE Trans. Med. Imaging 34(10) (2015) 1993–2024.
A. B. Abdusalomov, M. Mukhiddinov, and T. K. Whangbo, Brain tumor detection based on deep learning approaches and magnetic resonance imaging, Cancers, 15(16) (2023) 4172.
U. Baid, et al., A novel approach for fully automatic intra-tumor segmentation with 3d u-net architecture for gliomas, Front. Comput. Neurosci. 14 (feb 2020).
L. Weninger, O. Rippel, S. Koppers, and D. Merhof, Segmentation of Brain Tumors and Patient Survival Prediction: Methods for the BraTS 2018 Challenge, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 11384 LNCS, 2019, pp. 3–12.
O. Cicek, A. Abdulkadir, S. S. Lienkamp, T. Brox, and O. Ronneberger, 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 9901 LNCS, 2016, pp. 424–432.
N. Siddique, S. Paheding, C. P. Elkin, and V. Devabhaktuni, U-net and its variants for medical image segmentation: A review of theory and applications, IEEE Access 9 (2021) 82031–82057.
Z. Zhou, M. M. R. Siddiquee, N. Tajbakhsh, and J. Liang, Unet++: Redesigning skip connections to exploit multiscale features in image segmentation (2019). doi:10.48550/ARXIV.1912.05074.
F. Isensee, P. F. Jager, S. A. A. Kohl, J. Petersen, and K. H. Maier-Hein, Automated design of deep learning methods for biomedical image segmentation (2019). doi:10.48550/ARXIV.1904.08128.
J. B. Abraham, Malaria parasite segmentation using U-Net: Comparative study of loss functions, Commun. Sci. Technol. 4(2) (2019) 57-62.
W. Samek, T.Wiegand, and K.-R. Muller, Explainable artificial intelligence: Understanding, visualizing and interpreting deep learning models, 2017. doi:10.48550/ARXIV.1708.08296.
F. A. Zaman, X. Wu, W. Xu, M. Sonka, and R. Mudumbai, Trust, but verify: Robust image segmentation using deep learning, in 2023 57th Asilomar Conference on Signals, Systems, and Computers, IEEE, 2023, pp. 1070–1074.
B. Zhou, A. Khosla, A. Lapedriza, A. Oliva, and A. Torralba, Learning deep features for discriminative localization, 2015. doi:10.48550/ARXIV.1512.04150.
R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, Grad-cam: Visual explanations from deep networks via gradient-based localization, in 2017 IEEE International Conference on Computer Vision (ICCV), IEEE, 2017, pp. 618–626.
A. Chattopadhay, A. Sarkar, P. Howlader, and V. N. Balasubramanian, Gradcam++: Generalized gradient-based visual explanations for deep convolutional networks, in 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), IEEE, 2018, pp. 839–847.
H.Wang, et al., Score-cam: Score-weighted visual explanations for convolutional neural networks, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), IEEE, 2020, pp. 111–119.
P. Natekar, A. Kori, and G. Krishnamurthi, Demystifying brain tumor segmentation networks: Interpretability and uncertainty analysis, Front. Comput. Neurosci. 14 (feb 2020) 6.
H. Saleem, A. R. Shahid, and B. Raza, Visual interpretability in 3d brain tumor segmentation network, Comput. Biol. Med. 133 (2021) 104410.
Y. Liu, et al., Mixed-UNet: Refined class activation mapping for weakly-supervised semantic segmentation with multi-scale inference, Front. Comput. Sci. 4 (2022) 135.
L. Zhu, et al., A multi-task two-path deep learning system for predicting the invasiveness of craniopharyngioma, Comput. Methods Programs Biomed. 216 (2022) 106651.
S. Bakas, et al., Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the brats challenge (2018). doi:10.48550/ARXIV.1811.02629.
V. Petsiuk, A. Das, and K. Saenko, Rise: Randomized input sampling for explanation of black-box models (2018). doi:10.48550/ARXIV.1806.07421.
R. Raza, U. I. Bajwa, Y. Mehmood, M. W. Anwar, and M. H. Jamal, dResUnet: 3d deep residual u-net based brain tumor segmentation from multimodal MRI, Biomed. Signal Process. Control 79 (2023) 103861.
J. Sun, Y. Peng, D. Li, and Y. Guo, Segmentation of the Multimodal Brain Tumor Images Used Res-U-Net, Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 12658 LNCS (2021) 263–273.
Q. Zuo, S. Chen, and Z. Wang, R2AU-Net: Attention Recurrent Residual Convolutional Neural Network for Multimodal Medical Image Segmentation, Secur. Commun. Netw. 2021 (2021) 1-10.
M.U. Saeed, et al., RMU-Net: A Novel Residual Mobile U-Net Model for Brain Tumor Segmentation from MR Images, Electronics 10 (2021) 1962.
N. M. AboElenein, P. Songhao, and A. Afifi, IRDNU-Net: Inception residual dense nested u-net for brain tumor segmentation, Multimed. Tools Appl. 81 (2022) 24041–24057.
J. Hu, X. Gu, Z.Wang, and X. Gu, Mixture of calibrated networks for domain generalization in brain tumor segmentation, Knowl. Based Syst. 270 (2023) 110520.
M. Lerma and M. Lucas, Grad-CAM++ is equivalent to Grad-CAM with positive gradients, in 24th Irish Machine Vision and Image Processing Conference (IMVIP), Irish Pattern Recognition and Classification Society, 2022, pp. 113-120.
N. O. Pinciroli Vago, F. Milani, P. Fraternali, and R. da Silva Torres, Comparing CAM Algorithms for the Identification of Salient Image Features in Iconography Artwork Analysis, Journal of Imaging 7 (2021) 106.