Milea Timbergen

135 Comparison with radiologists As described in the methods, for the comparison with radiologists, a location-matched cohort consisting of all extremity DTFs and an equal amount of extremity non-DTF was used. To this end, all 20 extremity DTFs and 20 randomly selected extremity non-DTFs were included in the location-matched cohort. The performance of radiomics and the radiologists in this cohort is shown in Table 4: model 1 and 5-6 were omitted from the results for brevity. The AUCs of the r0adiomics models (model 2: 0.93; model 3: 0.88; model4: 0.98) were generally higher than both radiologists 1 (0.80) and 2 (0.88). This is confirmed by the ROC curves in Figure 4. Cohen’s kappa between the two radiologists was 0.40, indicating intermediate observer agreement. A DeLong power analysis of the AUCs resulted in a power of only 0.1. Due to the limited power, the p-values of the DeLong test were omitted. Figure 4. Receiver operating characteristic curves of the radiomics models based on age and sex (model 2); imaging (model 3); and imaging, age and sex (model 4); and those of the radiologists (Rad1 and Rad2), in the location-matched cohort. 5

RkJQdWJsaXNoZXIy ODAyMDc0