Supervised machine learning models for cancer prognosis and treatment response prediction: A systematic review of algorithm performance, feature importance, and clinical deployment

Hidayath Ali Baig Mohammed

doi:10.55529/jaimlnn.61.88.99

Authors

Hidayath Ali Baig Mohammed Department of Computing and Information Sciences, College of Computing and Information Sciences, University of Technology and Applied Sciences, Sur Branch, Sultanate of Oman.

Keywords:

Supervised Machine Learning, Cancer Prognosis, Treatment Response, Random Forest, XGBoost, SHAP.

Abstract

Recently, Supervised machine learning (SML) has become an exciting paradigm in clinical oncology for building prediction models based on the available clinical, genomic, imaging and treatment data, to predict outcomes and responses to cancer treatment. Although numerous studies in SML have been published, there is no systematic evaluation of the performance of the algorithms, the extent of consistency across SML studies and algorithms, the quality of calibration, or the readiness for clinical implementation. This review aims to bridge this gap by summarising the findings of 36 studies in a variety of cancers.

Methods: We searched for the PubMed/MEDLINE, Embase, IEEE Xplore, Web of Science and the ACM Digital Library for publications between January 2018 and January 2025, according to the PRISMA 2020 guidelines and registered in PROSPERO (CRD42025412104). Cancer prognosis and/or prediction of treatment response studies that developed or had models validated by other groups were included. A five domain PROBAST-AI quality assessment was used.

Results: 36 studies with 34 different oncology datasets comprising of more than 3.1 million patient records were eligible. Random Forest was the most frequently deployed algorithm (n = 24, 67%), followed by XGBoost (n = 21, 58%) and SVM (n = 16, 44%). Median best-reported AUC was 0.908 (IQR: 0.887–0.927). In 33 out of 36 studies (92%), the SML models outperformed clinical staging and the average AUC gain was 0.108. Tumour stage and a number of key biomarkers turned out to be consistently important predictors. There were significant methodological gaps in reporting calibration in just 41% of studies.

Conclusions: Conventional oncological models are not as effective as SML models, and the SML models provide clinically meaningful performance improvements that are consistent. However, there are gaps in the prospective validation, reporting of features used for the result, as well as standardized representations of the features' importance. A checklist of 16 items (SML-ONCO-Report) is proposed to help overcome the reporting failures. The systematic results here reported are actionable to inform clinical trials for the use of SML in oncology.

Supervised machine learning models for cancer prognosis and treatment response prediction: A systematic review of algorithm performance, feature importance, and clinical deployment

Authors

Keywords:

Abstract

Published

How to Cite

Issue

Section

Similar Articles

SidebarMenu

Downloads

Current Issue

Information

Make a Submission