Abstract

Title

Machine learning-based models of toxicity in prostate cancer ultra-hypofractionated radiotherapy

Authors

Matteo Pepa1, Mattia Zaffaroni1, Stefania Volpe1,2, Giulia Marvaso1,2, Johannes Lars Isaksson1, Simona Barzaghi3, Federica Benigni3, Marta Callegari3, Alessia Gismundi3, Francesco Maria La Fauci4, Giulia Corrao1,2, Matteo Augugliaro1, Federica Cattani4, Guido Baroni3, Elena De Momi3, Roberto Orecchia5, Barbara Alicja Jereczek-Fossa1,2

Authors Affiliations

1IEO European Institute of Oncology IRCCS, Division of Radiation Oncology, Milan, Italy; 2University of Milan, Department of Oncology and Hemato-Oncology, Milan, Italy; 3Politecnico di Milano, Department of Electronics Information and Bioengineering (DEIB), Milan, Italy; 4IEO European Institute of Oncology IRCCS, Medical Physics Unit, Milan, Italy; 5IEO European Institute of Oncology IRCCS, Scientific Direction, Milan, Italy

Purpose or Objective

In the last decades, radiotherapy (RT) treatments have become safer and more effective, allowing for dose escalation to the target volume without jeopardizing the sparing of surrounding organs at risk (OARs). However, effective toxicity prediction tools are essential in the era of tailored treatments. The purpose of the study is to test machine learning (ML)-based predictive models of toxicity in prostate cancer (PCa) patients (pts) treated with ultra-hypofractionated RT regimens.

Materials and Methods

Two cohorts of 61 and 186 non-metastatic low-intermediate risk PCa pts (from AIRC IG-13218 prospective trial and “Give me Five” retrospective trial, respectively), who underwent ultra- hypofractionated RT (35 Gy/5 fractions), were considered (Ethics Committee Notification UID 2410). Dosimetric and clinical features were used to train different ML models to predict genitourinary (GU) and gastrointestinal (GI) acute toxicities. The area under the receiver operating characteristic curve (AUC) was used to compare the model performances.

Results

Separate analyses of the three groups (61, 186 and 247) were carried out. Bagged trees outperformed all the others on the 61 cohort (Fig. 1a), with an AUC of 0.75, while SVM resulted the best algorithm on the 186 (Fig 1b) and 247 cohort (Fig 1c), with an AUC of 0.94 and 0.66, respectively. Overall, the best performing algorithm was LR, since, among the 8 best results, 4 were achieved with this one. The most predictive features were found to be T-stage, age, OARs volume and DVH punctual values. No significant correlation with the outcome was found for sub-areas under the DVH. Overall, the models achieved better AUC values when the two subgroups of pts were considered separately. The study presents some limitations, such as the relatively low occurrence of toxicity events, often resulting in a scarce capability of identifying true positives, and low AUC values in certain configurations.

Figure 1



Conclusion

In the era of personalised medicine and tailored treatments, a model that accurately predicts toxicity could represent a useful clinical tool for better patients’ selection. The study shows that the performance of the predictive models is highly dependent on the choice of classifier, features and training/testing sets. Differences between groups can be due to the different characteristics of trials. Further analyses should focus on features and classifiers selection, to improve the performances and generalizability of the models.