Session Item

Saturday
August 01
11:40 - 12:00
Training The Next Generation of Brachytherapists
465
Plenary Session
14:55 - 15:03
Standardization influences repeatability and prognostic value of radiomic features
PH-0652

Abstract

Standardization influences repeatability and prognostic value of radiomic features
Authors: Fornacon-wood|, Isabella(1)*[isabella.fornacon-wood@postgrad.manchester.ac.uk];O'Connor|, James(1);Faivre-Finn|, Corinne (1);Price|, Gareth(1);
(1)The University of Manchester, Division of Cancer Sciences, Manchester, United Kingdom;
Show Affiliations
Purpose or Objective

Radiomic features have shown potential as prognostic biomarkers for personalizing radiotherapy treatment. However, standardization issues hinder clinical translation. The image biomarker standardization initiative (IBSI) makes recommendations to address these issues. In this work we assessed the benefit of IBSI compliance for radiomic feature calculation packages in lung and head and neck (H&N) cancer datasets. The impact of package choice on predicting survival for patients with H&N cancer was also investigated.

Material and Methods

Four widely used radiomics software packages were evaluated: LIFEx, PyRadiomics and CERR (all of which are IBSI compliant) and IBEX (not IBSI-compliant). 17 radiomic features common to all packages were calculated on the planning CT scan for the GTV of small cell lung cancer (N=37) and H&N cancer (N=111) patients treated with radiotherapy (Table 1). Agreement between features across packages was assessed using the intraclass correlation coefficient (ICC). The ICC and confidence intervals (CI) were stratified to indicate poor (ICC CI < 0.5), moderate (0.5 < ICC CI < 0.75), good (0.75 < ICC CI < 0.9) and excellent (ICC CI > 0.9) agreement. The agreement between features using default package parameters versus setting the same parameters (64 grey levels, Chebyshev distance 1) for the IBSI-compliant packages was also assessed. The impact of different packages on the prognostic value of the 17 radiomic features was investigated by comparing univariable cox models predicting survival for patients with H&N cancer. The Benjamini-Hochberg method was applied.

Results

The agreement of features between all four packages was only excellent for volume, and the maximum and mean intensity in both datasets (Table 1). Stratification of agreement was consistent across H&N and lung cancer for 10/17 features. Where disagreement occurred the agreement tended to be worse in the H&N dataset. When comparing IBSI-compliant packages only, the agreement was improved only when the same parameters were used per package. A calculation error for sphericity was identified in one package which is in the process of being corrected by the vendor following these results.

6 texture features were significant (p<0.05) for survival when calculated in LIFEx, PyRadiomics and CERR, but not IBEX. Figure 1 presents Kaplan-Meier analysis for the feature GLCM joint entropy calculated in each package, split on the median. The risk stratification is significant for GLCM joint entropy calculated in LIFEx, PyRadiomics and CERR (p=0.01, 0.013, 0.0095 respectively), but not in IBEX (p=0.64).

Conclusion

IBSI-compliance improves feature repeatability between software packages, given the same parameters are used, and this improves prognostic model quality. It is important that different software packages are transparent about the precise features they calculate. Further, investigators should be aware of potential uncertainties introduced by differences in the way mathematically identical features are implemented.