Lan Zhu, Haipeng Dong, Jing Sun, Lingyun Wang, Yue Xing, Yangfan Hu, Junjie Lu, Jiarui Yang, Jingshen Chu, Chao Yan, Fei Yuan, Jingyu Zhong
Eur Radiol . 2024 Jul 24. doi: 10.1007/s00330-024-10976-1. Online ahead of print.
Objectives: To evaluate the robustness of radiomics features among photon-counting detector CT (PCD-CT) and dual-energy CT (DECT) systems.
Methods: A texture phantom consisting of twenty-eight materials was scanned with one PCD-CT and four DECT systems (dual-source, rapid kV-switching, dual-layer, and sequential scanning) at three dose levels twice. Thirty sets of virtual monochromatic images at 70 keV were reconstructed. Regions of interest were delineated for each material with a rigid registration. Ninety-three radiomics were extracted per PyRadiomics. The test-retest repeatability between repeated scans was assessed by Bland-Altman analysis. The intra-system reproducibility between dose levels, and inter-system reproducibility within the same dose level, were evaluated by intraclass correlation coefficient (ICC) and concordance correlation coefficient (CCC). Inter-system variability among five scanners was assessed by coefficient of variation (CV) and quartile coefficient of dispersion (QCD).
Results: The test-retest repeatability analysis presented that 97.1% of features were repeatable between scan-rescans. The mean ± standard deviation ICC and CCC were 0.945 ± 0.079 and 0.945 ± 0.079 for intra-system reproducibility, respectively, and 86.0% and 85.7% of features were with ICC > 0.90 and CCC > 0.90, respectively, between different dose levels. The mean ± standard deviation ICC and CCC were 0.157 ± 0.174 and 0.157 ± 0.174 for inter-system reproducibility, respectively, and none of the features were with ICC > 0.90 or CCC > 0.90 within the same dose level. The inter-system variability suggested that 6.5% and 12.8% of features were with CV < 10% and QCD < 10%, respectively, among five CT systems.
Conclusion: The radiomics features were non-reproducible with significant variability in values among different CT techniques.
Clinical relevance statement: Radiomics features are non-reproducible with significant variability in values among photon-counting detector CT and dual-energy CT systems, necessitating careful attention to improve the cross-system generalizability of radiomic features before implementation of radiomics analysis in clinical routine.
Key points: CT radiomics stability should be guaranteed before the implementation in the clinical routine. Radiomics robustness was on a low level among photon-counting detectors and dual-energy CT techniques. Limited inter-system robustness of radiomic features may impact the generalizability of models.