Abstract

Objectives

This study aims to evaluate the impact of image preprocessing methods, including traditional and artificial intelligence (AI)-based techniques, on the performance of MRI-based radiomics for predicting tumour aggressiveness in papillary thyroid carcinoma (PTC).

Methods

We retrospectively analysed MRI data from 69 patients with PTC, acquired between January 2011 and April 2023, alongside corresponding histopathology. MRI scans underwent N4 bias field correction and resampling using 10 traditional methods and an AI-based technique, synthetic multi-orientation resolution enhancement (SMORE). Radiomic features were extracted from the original and preprocessed images. Recursive feature elimination with random forests was used for feature selection, and predictive models were developed using XGBoost. The performance of the model was assessed by calculating the area under the receiver operating characteristic curve (AUC) across 1000 iterations.

Results

The combination of the correction of the bias field of N4 with SMORE resampling produced the highest mean AUC (0.75), significantly outperforming all traditional resampling methods (P<.001). The lowest mean AUC (0.66) was observed with nearest neighbour resampling. Texture-based radiomic features, particularly the standard deviation of the grey-level co-occurrence matrix autocorrelation, were frequently selected in models using SMORE-resampled images.

Conclusions

Preprocessing techniques critically influence the predictive performance of MRI-based radiomics in PTC. The combination of N4 bias field correction and SMORE resampling enhances accuracy, highlighting the necessity of optimizing preprocessing pipelines.

Advances in knowledge

This study demonstrates the superiority of AI-driven preprocessing techniques, such as SMORE, in improving MRI radiomic models, paving the way for enhanced clinical decision-making in PTC management.

Introduction

Radiomics is a data-centric field involving the high-throughput extraction of quantitative features from medical images,1 including computed tomography2,3 and magnetic resonance imaging (MRI).4 These image features capture the shape, histogram, and textural and higher order statistics features from a specified region of interest (ROI) based on the pixel’s spatial distribution of signal intensities,5,6 which are typically not perceptible to human eyes. Despite the rapid growth of radiomic studies, the lack of standardization and test-retest studies have hampered its translation into radiological practice.7,8 Recently, an International Image Biomarker standardization Initiative (IBSI) recommended the use of a set of standardized image features, which enabled verification and calibration across different software.9

A prerequisite for image feature extraction from qualitative anatomical MR images is the preprocessing step, including the normalization of signal intensities, which are represented in arbitrary units and slightly differ between patients, introducing variations in the contrast, as they are highly dependent on MRI data acquisition, scanner field strength, and the image reconstruction method.10 MR images may also suffer from geometric distortion, ghosting, and susceptibility artefacts depending on the organ of study.11,12 These factors can lead to measurement biases and inconsistencies in image feature values.13,14 To account for these inconsistencies, normalization methods have been applied to MR images, including N4 bias field correction, which reduces the inhomogeneity within tissue, and Z-Score, which scales each feature to have 0 mean and a variance of 1.15–18 In addition, image resampling to a unified voxel volume has been investigated in radiomics analyses.19,20 Traditional resampling methods, including linear, nearest neighbour, B-spline, Gaussian, and windowed sinc, are readily available across various image processing software platforms.21 Most recently, Zhao et al developed a novel synthetic multi-orientation resolution enhancement (SMORE) method using convolutional neural networks (CNNs) that restores image quality by improving resolution and reducing aliasing in MR images.22 SMORE method has been shown to be visually and quantitatively superior to previously reported methods.

Radiomics has promised to improve clinical decision-making, including for patients with cancer.23–25 In particular, radiomics and machine learning (ML) approaches have been increasingly applied to different imaging modalities for thyroid cancer.26–29 For instance, Hu et al evaluated the utility of image features obtained from multi-contrast MR images (T2weighted (w), contrast-enhanced (CE)-T1w, and diffusion-weighted (DW)) for the detection of lymph node metastases (LNM) in papillary thyroid cancer (PTC),30 while Dai et al predicted tumour aggressiveness using CE-T1w, T2w, DW images.31 Wei et al utilized CE-T1w, T2w, and DW images to predict extrathyroidal extension.32 Although these studies made valuable contributions, they did not incorporate data preprocessing steps in their methodologies. In contrast, Wang et al performed N4 bias field correction and image intensity rescale prior to feature extraction and reported that ML-based multiparametric MR (T2w and CE-T1w) images and apparent diffusion coefficient map radiomics could accurately distinguish aggressive from non-aggressive PTC preoperatively.29 These studies form a foundation for further radiomic investigations in PTC.

PTC has various subtypes, each exhibiting distinct histological, imaging and prognostic characteristics.33 The presence of local invasion, extra-glandular invasion, LNM or distant organ metastasis defines invasive thyroid cancer.33,34 Surgical resection remains the primary treatment for most aggressive thyroid cancers, often with poor prognoses.35 Despite imaging advances, pathological tissue examination remains the gold standard for diagnosis. Hence, early diagnosis and characterization of thyroid cancer aggressiveness holds paramount importance. However, its efficacy is influenced by various factors, including nodule size and location, presence of calcification and liquefaction.36

There is an unmet need to expand the use of the radiomics feature with the ML approach and explore its implications for improving risk stratification in PTC. To the best of our knowledge, this study is the first to investigate the potential of an MR radiomics pipeline that consists of several image preprocessing and harmonization steps ahead of feature extraction for PTC. We assessed the impact of 10 traditional image resampling techniques and Artificial Intelligence (AI)-based SMORE sampling on the performance of radiomics signatures. Traditional resampling techniques included Nearest neighbour, linear, Gaussian, label Gaussian, Hamming windowed sinc, cosine windowed sinc, Welch windowed Sinc, Lanczos windowed sinc, and Blackman windowed sinc, available in SimpleITK.37 This study aims to investigate the impact of the standardization of MR images using traditional and AI preprocessing methods that are important towards developing novel radiomics-based ML models to predict tumour aggressiveness of PTC.

Methods

Patient data

The present study retrospectively collected 71 patients (n = 71) evaluated for thyroidectomy who were enrolled after thyroid nodule FNA demonstrated either PTC or suspicion of thyroid cancer and had preoperative MRI exams between January 2011 and April 2023. Inclusion criteria required that patients have both MRI imaging and tissue pathology results. Baseline T2w images were analysed in this study, rather than multiparametric images, to identify the best combination of preprocessing and standardization methods for radiomics analysis. The study was approved by our institutional review board.

MR data acquisition

MRI exams were performed on a 3.0 T MR scanner (Discovery MR 750, GE Healthcare, Milwaukee, WI) and 1.5 T (Optima MR450w, GE Healthcare, Milwaukee, WI) with a neurovascular phased‐array coil before surgery. T2w, fat-suppressed acquisition (2D multislice) images were acquired with the following MR acquisition parameters: repetition time/echo time, TR/TE = 2500-5066 ms/77-124 ms, slice thickness = 5 mm, and FOV = 2024 mm covering the entire thyroid gland (25-30 slices).

Histopathology

Pathology details have been described previously by Lu et al.38 Surgical specimens of the PTC were taken after radical thyroidectomy or lobectomy under the supervision of a pathologist with more than 10 years of experience. Paraffin-embedded tissue blocks were obtained for each surgically resected specimen by sectioning each tumour and staining the sections with haematoxylin and eosin. The pathologist evaluated tumour aggressiveness using 6 established histopathologic criteria, including tall cell subtype, extrathyroidal extension (ETE), vascular or capsular invasion, necrosis, distant metastases, and regional metastases.39 The presence of any one of these features in tumours was deemed aggressive. Figure 1 shows an example of aggressive and non-aggressive histopathology images.

T2 weighted MR and histopathologic images of patients with and without aggressive tumour features. Top Row: The H&E shows NIFTP, noninvasive follicular thyroid neoplasm with papillary-like nuclear features, showing noninvasive well circumscribed borders and follicular growth pattern. Bottom Row: The H&E shows papillary thyroid carcinoma, tall cell variant as well as the negative IHC for oncogenic NRASQ61R, an activating mutation of RAS oncogene.
Figure 1.

T2 weighted MR and histopathologic images of patients with and without aggressive tumour features. Top Row: The H&E shows NIFTP, noninvasive follicular thyroid neoplasm with papillary-like nuclear features, showing noninvasive well circumscribed borders and follicular growth pattern. Bottom Row: The H&E shows papillary thyroid carcinoma, tall cell variant as well as the negative IHC for oncogenic NRASQ61R, an activating mutation of RAS oncogene.

Image preprocessing and analysis pipeline

The tumours were segmented on T2w images by neuroradiologists with over ten years of experience.40 As patient data were prospectively collected, acquisition parameters varied among each patient’s scans, potentially influencing radiomic feature values. Given the lack of standardization in MRI intensity values across T2w scans, multiple preprocessing steps were required to ensure comparability between patients, as described below.

The pipeline (Figure 2) begins with N4 bias field correction to mitigate low-frequency intensity non-uniformity in the MRI data, as this bias can distort the overall signal intensity.41 Images were resampled using traditional techniques available in SimpleITK as well as a CNN-based super-resolution algorithm to achieve isotropic scans based on the original in-plane resolution of each scan.22 The SMORE algorithm was implemented using the published self-supervised version without any modifications. CNN-based resampled images were further linearly resampled to a fixed isotropic resolution to ensure the same resolution across all patient scans. IBSI-compatible pyCERR software (github.com/cerr/pycerr) was used to extract radiomics features.42

Workflow for preprocessing MR image pipeline.
Figure 2.

Workflow for preprocessing MR image pipeline.

After N4-bias field correction of image intensities, the images were standardized to robust Z-scores using median and mean absolute deviation of intensities within respective regions of interest. Robust Z-score = 0.6745*(xi − x˜)/MAD; where xi is voxel intensity, x˜ and MAD are the median and mean absolute deviation of voxel intensities within the tumour ROI. Figure 3 shows examples of the SMORE MR scans.

T1w images and fat-suppressed T2-weighted images for 2 patients demonstrate STIR hyperintense tumour in the thyroid isthmus (patient 1, ROI) and T2 heterogeneous tumour in the right thyroid lower pole (patient 2, ROI). Image quality for both patients is maintained between N4 bias-field corrected images and SMORE images, noting tumour visibility without significant artefacts.
Figure 3.

T1w images and fat-suppressed T2-weighted images for 2 patients demonstrate STIR hyperintense tumour in the thyroid isthmus (patient 1, ROI) and T2 heterogeneous tumour in the right thyroid lower pole (patient 2, ROI). Image quality for both patients is maintained between N4 bias-field corrected images and SMORE images, noting tumour visibility without significant artefacts.

Radiomics features included first-order statistics and higher-order textures based on grey-level co-occurrence matrix (GLCM), Run Length matrix (RLM), Size Zone matrix (SZM), neighbourhood grey Level dependence matrix (NGLDM), and neighbourhood grey Tone difference matrix (NGTDM). To calculate texture features, the images were discretized using a bin width of 0.05 with 1st bin corresponding to the Z-score of −5. A voxel offset of 1 was used for GLCM features, and a patch size of 3 × 3×3 was used for NGLDM and NGTDM texture features.

To assess the efficacy of the preprocessing pipeline, radiomic features were extracted from 3 image types: original T2w images, N4-bias field-corrected T2w images, and N4-bias field-corrected T2w images resampled using ten traditional resampling methods using Simple ITK Python library (Nearest neighbour, Linear, Gaussian, label Gaussian, Hamming windowed sinc, cosine windowed sinc, Welch windowed sinc, Lanczos windowed sinc, and Blackman windowed sinc) and AI-based SMORE resampling. The radiomic features generated from each of these image types were used separately to model PTC aggressiveness.

Statistical analysis

Following the preprocessing steps, patients were randomly split into 75% training and 25% testing cohorts. Feature selection was performed using Recursive Feature Elimination (RFE) with Random Forests,43 followed by hyperparameter tuning using RandomizedSearchCV to optimize model performance,44 which included 5-fold cross validation. Top 5 features were retained in the final model for each experiment. Performance was measured using the area under the receiver operating characteristic curve (AUC) statistics on the testing cohort. The process was repeated 1000 times to ensure robust results and mitigate overfitting.

In addition, we compared AUC values from the different models using paired t-tests or Wilcoxon signed-rank tests, depending on the normality of the data, to determine the optimal preprocessing technique.45 Normality was checked using the Shapiro-Wilk test. These comparisons were used to select the most effective resampling method and preprocessing approach for subsequent analysis. All statistical analyses were conducted using Python, leveraging XGBoost (an open-source software library which provides a regularizing gradient boosting framework) for model building.46

Results

Patient characteristics

Out of 71 patients, 2 patients were further excluded in this study. One patient had susceptibility artefact in the thyroid lobe; therefore, no tumour was visible to delineate, and the other patient had chronic lymphocytic thyroiditis. For this latter patient, only features suggesting thyroiditis were seen on T2w images and the tumour was not visible. The remaining 69 patients were included in further analyses. The median patient age was 43 years, ranging between 19 and 71 years. Overall, 43 (62.3%) patients had tumours with aggressive features (Table 1). The neuroradiologists reviewed standard anatomical T1w images, and tumours were visible in only 32% of these patients. The standardization pipeline was applied to T2w images.

Table 1.

Patient characteristics.

Characteristicn (%)
Age at diagnosis, years (median)43 (19-71)
Gender (male)23 (33.3%)
Aggressive features based on pathology
Tall cell25 (36.76%)
Extrathyroidal extension27 (39.71%)
Necrosis0 (0%)
Vascular and/or tumour capsular invasion10 (14.71%)
Regional metastases30 (44.12%)
Distant metastases0 (0%)
Characteristicn (%)
Age at diagnosis, years (median)43 (19-71)
Gender (male)23 (33.3%)
Aggressive features based on pathology
Tall cell25 (36.76%)
Extrathyroidal extension27 (39.71%)
Necrosis0 (0%)
Vascular and/or tumour capsular invasion10 (14.71%)
Regional metastases30 (44.12%)
Distant metastases0 (0%)
Table 1.

Patient characteristics.

Characteristicn (%)
Age at diagnosis, years (median)43 (19-71)
Gender (male)23 (33.3%)
Aggressive features based on pathology
Tall cell25 (36.76%)
Extrathyroidal extension27 (39.71%)
Necrosis0 (0%)
Vascular and/or tumour capsular invasion10 (14.71%)
Regional metastases30 (44.12%)
Distant metastases0 (0%)
Characteristicn (%)
Age at diagnosis, years (median)43 (19-71)
Gender (male)23 (33.3%)
Aggressive features based on pathology
Tall cell25 (36.76%)
Extrathyroidal extension27 (39.71%)
Necrosis0 (0%)
Vascular and/or tumour capsular invasion10 (14.71%)
Regional metastases30 (44.12%)
Distant metastases0 (0%)

AUC performance across preprocessing methods

The performance of radiomic signatures across the different preprocessing techniques was assessed using AUC on the testing cohorts. Figure 4 presents the AUC values from the various preprocessing pipelines across the runs. The highest-performing preprocessing techniques included N4 bias field correction with SMORE resampling, where the SMORE L-1x1x1 and SMORE L-1x1x5 scan sets achieved the highest mean AUC of 0.75. In contrast, the lowest performance was seen with N4 bias field correction followed by nearest-neighbour (NN) resampling, yielding a mean AUC of 0.66.

Box-and-whisker plots of AUC values for predicting the aggressiveness of PTC using the different sets of sampling methods. The red line represents the highest mean AUC across the scan sets.
Figure 4.

Box-and-whisker plots of AUC values for predicting the aggressiveness of PTC using the different sets of sampling methods. The red line represents the highest mean AUC across the scan sets.

Comparison of resampling techniques

Wilcoxon signed-rank tests and paired t-tests were used to assess performance differences between each pair of preprocessing methods. Notably, N4 SMORE L-1x1x5 significantly outperformed N4 NN (P<1e9) and other traditional resampling methods, such as B-spline and linear (P<.001). A paired t-test revealed significant improvements in AUC when using advanced techniques such as SMORE compared to traditional methods like Gaussian (P = .003) and original T2w images (P<1e6). In contrast, no significant difference was observed between N4 SMORE and N4 SMORE L-1x1x1, indicating that variations in resolutions of SMORE sampling performed similarly well. Figure 5 summarizes the results of these comparisons, with significant P-values indicating where one preprocessing method outperformed another.

Statistical differences in AUC values across the sets of scans. ns = not significant; *= P<.05; **= P<.01; ***= P<.00001.
Figure 5.

Statistical differences in AUC values across the sets of scans. ns = not significant; *= P<.05; **= P<.01; ***= P<.00001.

Selected features across preprocessing methods

We also examined the most frequently selected features across different preprocessing pipelines. Figure 6 highlights the features most commonly selected by RFE across the runs. The most frequently selected feature was the standard deviation of GLCM autoCorr across different directions, which was chosen 12140/15000 times (80.9%) across all pipelines, with particularly high selection in models using N4 SMORE L-1x1x5 (960 times) and N4 SMORE L-1x1x1 (920 times). Other top features included the standard deviation of GLCM haralickCorr across all directions and GTDM Busyness. The features standard deviation of GLCM jointMax across all directions and GLRLM GrayLevel Variance were predominantly selected in models that used SMORE-based methods, emphasizing the potential value of second-order texture features in predicting PTC aggressiveness.

Most selected radiomic features F1: GLCM autoCorr, F2: GLCM haralick Corr, F3: GTDM Busyness, F4: GLCM jointMax, F5: GLRLM GrayLevel variance. F4 and F5 were almost unique to the models of SMORE scan sets.
Figure 6.

Most selected radiomic features F1: GLCM autoCorr, F2: GLCM haralick Corr, F3: GTDM Busyness, F4: GLCM jointMax, F5: GLRLM GrayLevel variance. F4 and F5 were almost unique to the models of SMORE scan sets.

Several features, such as the standard deviation of GLCM invar across all directions and Shape Compactness1, also showed high selection consistency across different preprocessing methods, indicating their robustness in various pipelines. Interestingly, some features like GLCM jointMax and GLRLM short Run Emphasis were more frequently selected in advanced resampling methods, suggesting that specific preprocessing techniques can enhance feature relevance in prediction models.

Discussion

In this study, we investigated the influence of traditional and AI-based image preprocessing and resampling techniques on the performance of MRI-based radiomic features in predicting PTC aggressiveness. Results revealed that the highest average performance was achieved by scans preprocessed with N4 bias field correction combined with SMORE resampling with unification of voxel size across patients. These approaches outperformed other resampling techniques, including traditional methods like Gaussian resampling and linear resampling, in terms of AUC performance. Nearest neighbour resampling yielded the lowest performance among the tested techniques.

The superior performance of SMORE-based resampling can be attributed to its ability to enhance spatial resolution and mitigate aliasing artefacts, resulting in improved radiomic feature extraction.22 N4 bias field correction likely contributed to this improvement by correcting for low-frequency intensity non-uniformity, enhancing the signal-to-noise ratio, and improving the discernibility of radiomic features.41

Notably, texture-based features such as GLCM autoCorr emerged as the most consistently selected feature across multiple preprocessing methods. This feature, which captures the autocorrelation of grey-level co-occurrence matrices, played a crucial role in the predictive models, demonstrating its potential in the classification of PTC aggressiveness.47 The prominence of second-order texture features, particularly those derived from the GLCM matrix, suggests that SMORE resampling enhances the extraction of these higher-order statistical measures, potentially improving model performance compared to traditional resampling techniques.

Interestingly, first-order features, which were more commonly selected in the original and N4 bias-corrected scans, were less frequently selected following SMORE resampling. This observation suggests that SMORE may improve the capture of textural characteristics, while first-order statistics may become less relevant in these enhanced images. Additionally, the variability in intensity values introduced by SMORE resampling might affect the relevance of certain first-order features, which rely heavily on image intensity values.40

Among the most frequently selected features were GLCM haralickCorr and GTDM Busyness. These features have previously been associated with the quantification of tumour heterogeneity, and their consistent selection across different resampling pipelines highlights their relevance in PTC aggressiveness prediction.1 The ability of these features to reflect the structural complexity and heterogeneity of tumours further underscores the value of texture analysis in radiomic studies.

Our findings suggest that while SMORE resampling outperformed other methods, bias field correction alone also yielded strong predictive performance, highlighting the importance of addressing MRI intensity non-uniformity 16. The improvement with SMORE resampling may be due to the increase in in-plane resolution to 1 × 1 mm and the enhancement of the z-axis resolution from 5-6 to 1 mm. However, the interpolation required to add slices between the originals may introduce noise and reduce the predictive capability of some radiomic features. This aligns with previous research, which has suggested that image resampling may not always improve radiomic model performance.48

Our results underscore the potential of radiomic features in predicting the aggressiveness of PTC, offering valuable clinical insights that could enhance patient management. However, further research is essential to validate these findings, particularly through the use of external datasets, to enhance the generalizability of radiomic signatures in clinical practice.

Despite the rigorous statistical analysis employed in this study, a minor limitation should be acknowledged. The small sample size limits the generalizability of our results. Future studies should incorporate larger, multi-institutional datasets with standardized imaging protocols to improve the reliability and applicability of radiomic models in clinical settings.

Conclusion

In conclusion, this study demonstrates that traditional and AI-based image preprocessing and resampling techniques impact the performance of radiomic features in predicting PTC aggressiveness from T2 MR images. AI-based SMORE resampling, particularly in combination with linear resampling, emerged as the optimal choice among the methods examined. These findings highlight the importance of selecting appropriate preprocessing pipelines in radiomic studies to emphasize the potential of radiomic features as powerful tools in the analysis of MRI in clinical decision-making for PTC.

Author contributions

A. Ibrahim and R. Paudyal contributed equally to this work. A. Ibrahim, A. Shukla-Dave, and A. Apte conceived the experiments; R. Paudyal, N. Katabi, A.R. Shaha, and R.M. Tuttle collected the data; A. Ibrahim, A. Shaha, and V. Hatzoglou conducted the experiments; A. Ibrahim and A. Apte analysed the results; B. Zhao, R.J. Wong, R.M. Tuttle, L.H. Schwartz provided the resources; A. Ibrahim, R. Paudyal, A. Shukla-Dave and A. Apte wrote and reviewed the manuscript. All authors reviewed and approved the submitted manuscript.

Funding

This work was supported by NIH grants R21CA176660-01A1, NIH U01 CA211205, and in part through the NIH/NCI Cancer Center Support grants P30 CA008748 and P50 CA172012.

Conflicts of interest

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: L.H.S. reports a relationship with Bristol Myers Squibb Co that includes: board membership and funding grants. L.H.S. reports a relationship with Merck Co Inc that includes: board membership. L.H.S. reports a relationship with Regeneron Pharmaceuticals Inc that includes: board membership. All other authors have no competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data availability

Data can be requested by contacting the corresponding author.

References

1

Gillies
RJ
,
Kinahan
PE
,
Hricak
H.
 
Radiomics: images are more than pictures, they are data
.
Radiology
.
2016
;
278
:
563
-
577
. .

2

Boas
FE
,
Fleischmann
D.
 
CT artifacts: causes and reduction techniques
.
Imaging Med
.
2012
;
4
:
229
-
240
. .

3

Ibrahim
A
,
Barufaldi
B
,
Refaee
T
, et al.  
MaasPenn radiomics reproducibility score: a novel quantitative measure for evaluating the reproducibility of CT-Based handcrafted radiomic features
.
Cancers (Basel)
.
2022
;
14
:
1599
. .

4

Shahzadi
I
,
Zwanenburg
A
,
Lattermann
A
, et al.  
Analysis of MRI and CT-based radiomics features for personalized treatment in locally advanced rectal cancer and external validation of published radiomics models
.
Sci Rep
.
2022
;
12
:
10192
. .

5

Rogers
W
,
Seetha
ST
,
Refaee
TAG
, et al.  
Radiomics: from qualitative to quantitative imaging
.
Br J Radiol
.
2020
;
93
:
20190948
. .

6

Papanikolaou
N
,
Matos
C
,
Koh
DM.
 
How to develop a meaningful radiomic signature for clinical use in oncologic patients
.
Cancer Imaging
.
2020
;
20
:
33
. .

7

Granzier
RWY
,
Ibrahim
A
,
Primakov
S
, et al.  
Test-Retest data for the assessment of breast MRI radiomic feature repeatability
.
J Magn Reson Imaging
.
2022
;
56
:
592
-
604
. .

8

Brown
AM
,
Nagala
S
,
McLean
MA
, et al.  
Multi-institutional validation of a novel textural analysis tool for preoperative stratification of suspected thyroid tumours on diffusion-weighted MRI
.
Magn Reson Med
.
2016
;
75
:
1708
-
1716
. .

9

Zwanenburg
A
,
Vallières
M
,
Abdalah
MA
, et al.  
The image biomarker standardisation initiative: standardized quantitative radiomics for high-throughput image-based phenotyping
.
Radiology
.
2020
;
295
:
328
-
338
. .

10

Kociołek
M
,
Strzelecki
M
,
Obuchowicz
R.
 
Does image normalisation and intensity resolution impact texture classification?
 
Comput Med Imaging Graph
.
2020
;
81
:
101716
. .

11

Naser
MA
,
Wahid
KA
,
Ahmed
S
, et al.  
Quality assurance assessment of intra-acquisition diffusion-weighted and T2-weighted magnetic resonance imaging registration and contour propagation for head and neck cancer radiotherapy
.
Med Phys
.
2023
;
50
:
2089
-
2099
. .

12

Lu
Y
,
Hatzoglou
V
,
Banerjee
S
, et al.  
Repeatability investigation of reduced field-of-view diffusion-weighted magnetic resonance imaging on thyroid glands
.
J Comput Assist Tomogr
.
2015
;
39
:
334
-
339
. .

13

Foltyn-Dumitru
M
,
Schell
M
,
Rastogi
A
, et al.  
Impact of signal intensity normalisation of MRI on the generalizability of radiomic-based prediction of molecular glioma subtypes
.
Eur Radiol
.
2024
;
34
:
2782
-
2790
. .

14

Peerlings
J
,
Woodruff
HC
,
Winfield
JM
, et al.  
Stability of radiomics features in apparent diffusion coefficient maps from a multi-centre test-retest trial
.
Sci Rep
.
2019
;
9
:
4800
. .

15

Simmons
A
,
Tofts
PS
,
Barker
GJ
,
Arridge
SR.
 
Sources of intensity nonuniformity in spin echo images at 1.5 T
.
Magn Reson Med
.
1994
;
32
:
121
-
128
. .

16

Reinhold
JC
,
Dewey
BE
,
Carass
A
,
Prince
JL.
 
Evaluating the impact of intensity normalisation on MR image synthesis
.
Proc SPIE Int Soc Opt Eng
.
2019
;
10949
:
109493
H. .

17

Carré
A
,
Klausner
G
,
Edjlali
M
, et al.  
Standardisation of brain MR images across machines and protocols: bridging the gap for MRI-based radiomics
.
Sci Rep
.
2020
;
10
:
12340
. .

18

Demircioğlu
A.
 
The effect of feature normalisation methods in radiomics
.
Insights Imaging
.
2024
;
15
:
2
. .

19

Ibrahim
A
,
Refaee
T
,
Primakov
S
, et al.  
The effects of In-Plane spatial resolution on CT-Based radiomic features’ stability with and without ComBat harmonisation
.
Cancers (Basel)
.
2021
;
13
:
1848
. .

20

Ibrahim
A
,
Refaee
T
,
Primakov
S
, et al.  
Reply to Orlhac F., Buvat I. Comment on “ibrahim et al. The effects of In-Plane spatial resolution on CT-Based radiomic features’ stability with and without ComBat harmonisation. Cancers 2021, 13, 1848.”
 
Cancers (Basel)
.
2021
;
13
:
3080
. .

21

Thévenaz
P
,
Blu
T
,
Unser
M.
 
Interpolation revisited [medical images application]
.
IEEE Trans Med Imaging
.
2000
;
19
:
739
-
758
. .

22

Zhao
C
,
Dewey
BE
,
Pham
DL
, et al.  
SMORE: a self-supervised anti-aliasing and super-resolution algorithm for MRI using deep learning
.
IEEE Trans Med Imaging
.
2020
;
40
:
805
-
817
. .

23

Lambin
P
,
Leijenaar
RTH
,
Deist
TM
, et al.  
Radiomics: the bridge between medical imaging and personalized medicine
.
Nat Rev Clin Oncol
.
2017
;
14
:
749
-
762
. .

24

Nie
K
,
Al-Hallaq
H
,
Li
XA
, et al.  
NCTN assessment on current applications of radiomics in oncology
.
Int J Radiat Oncol Biol Phys
.
2019
;
104
:
302
-
315
. .

25

Tomaszewski
MR
,
Gillies
RJ.
 
The biological meaning of radiomic features
.
Radiology
.
2021
;
298
:
505
-
516
. .

26

Li
X
,
Zhang
S
,
Zhang
Q
, et al.  
Diagnosis of thyroid cancer using deep convolutional neural network models applied to sonographic images: a retrospective, multicohort, diagnostic study
.
Lancet Oncol
.
2019
;
20
:
193
-
201
. (18)30762-9.

27

Shayesteh
SP
,
Nazari
M
,
Salahshour
A
, et al. Machine learning based malignancy prediction in thyroid nodules: radiomics analysis of ultrasound images. In 2020 IEEE Nuclear Science Symposium and Medical Imaging Conference (NSS/MIC),
2020
:
1
-
2
. .

28

Hu
Y
,
Huang
Z
,
Xie
Y
,
Wang
X.
 
In benign and malignant thyroid nodules First-Order CT texture analysis to identify the value of
.
J Clin Radiol
.
2019
;
38
:
422
-
425
.

29

Wang
H
,
Song
B
,
Ye
N
, et al.  
Machine learning-based multiparametric MRI radiomics for predicting the aggressiveness of papillary thyroid carcinoma
.
Eur J Radiol
.
2020
;
122
:
108755
. .

30

Hu
W
,
Wang
H
,
Wei
R
, et al.  
MRI-Based radiomics analysis to predict preoperative lymph node metastasis in papillary thyroid carcinoma
.
Gland Surg
.
2020
;
9
:
1214
-
1226
. https://gs.amegroups.org/article/view/54819.

31

Dai
Z
,
Wei
R
,
Wang
H
, et al.  
Multimodality MRI-Based radiomics for aggressiveness prediction in papillary thyroid cancer
.
BMC Med Imaging
.
2022
;
22
:
54
. .

32

Wei
R
,
Wang
H
,
Wang
L
, et al.  
Radiomics based on multiparametric MRI for extrathyroidal extension feature prediction in papillary thyroid cancer
.
BMC Med Imaging
.
2021
;
21
:
20
. .

33

Tuttle
RM
,
Leboeuf
R
,
Martorella
AJ.
 
Papillary thyroid cancer: monitoring and therapy
.
Endocrinol Metab Clin North Am
.
2007
;
36
:
753
-
778
, vii. .

34

Tuttle
RM
,
Alzahrani
AS.
 
Risk stratification in differentiated thyroid cancer: from detection to final Follow-Up
.
J Clin Endocrinol Metab
.
2019
;
104
:
4087
-
4100
. .

35

Haugen
BR
,
Alexander
EK
,
Bible
KC
, et al.  
2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the American thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer
.
Thyroid
.
2016
;
26
:
1
-
133
. .

36

Polyzos
SA
,
Anastasilakis
AD.
 
Clinical complications following thyroid fine-needle biopsy: a systematic review
.
Clin Endocrinol (Oxf)
.
2009
;
71
:
157
-
165
. .

37

Lowekamp
BC
,
Chen
DT
,
Ibáñez
L
,
Blezek
D.
 
The design of SimpleITK
.
Front Neuroinform
.
2013
;
7
:
45
. .

38

Biberacher
V
,
Schmidt
P
,
Keshavan
A
, et al.  
Intra- and interscanner variability of magnetic resonance imaging based volumetry in multiple sclerosis
.
Neuroimage
.
2016
;
142
:
188
-
197
. .

39

Ganly
I
,
Ibrahimpasic
T
,
Rivera
M
, et al.  
Prognostic implications of papillary thyroid carcinoma with tall-cell features
.
Thyroid
.
2014
;
24
:
662
-
670
. .

40

Nyúl
LG
,
Udupa
JK.
 
On standardizing the MR image intensity scale
.
Magn Reson Med
.
1999
;
42
:
1072
-
1081
. .

41

Raffelt
D
,
Dhollander
T
,
Tournier
JD
, et al. Bias field correction and intensity normalisation for quantitative analysis of apparent fibre density. In Proceedings of International Society Magnetic Resonance in Medicine, Vol.
25
. International Society Magnetic Resonance in Medicine;
2017
:
3541
.

42

Apte
AP
,
Iyer
A
,
Locastro
E
, et al. pyCERR – a python-based computational environment for radiological research. In International Conference on the Use of Computers in Radiation Therapy, ICCR,
2024
.

43

Devetyarov
D
,
Nouretdinov
I.
Prediction with confidence based on a random forest classifier. In Artificial Intelligence Applications and Innovations: 6th IFIP WG 12.5 International Conference, AIAI 2010, October 6-7,
2010
. Proceedings 6, Springer,
37
-
44
.

44

Bergstra
J
,
Bengio
Y.
 
Random search for hyper-parameter optimization
.
J Mach Learn Res
.
2012
;
13
:
281
-
305
. https://jmlr.org/papers/v13/bergstra12a.html.

45

Ma
H
,
Bandos
AI
,
Rockette
HE
,
Gur
D.
 
On use of partial area under the ROC curve for evaluation of diagnostic performance
.
Stat Med
.
2013
;
32
:
3449
-
3458
. .

46

Li
H
,
Zhu
Y
,
Burnside
ES
, et al.  
Quantitative MRI radiomics in the prediction of molecular classifications of breast cancer subtypes in the TCGA/TCIA data set
.
NPJ Breast Cancer
.
2016
;
2
:
16012
. .

47

Van Griethuysen
JJM
,
Fedorov
A
,
Parmar
C
, et al.  
Computational radiomics system to decode the radiographic phenotype
.
Cancer Res
.
2017
;
77
:
e104
-
e107
. .

48

Demircioğlu
A.
 
The effect of data resampling methods in radiomics
.
Sci Rep
.
2024
;
14
:
2858
. .

Author notes

Abdalla Ibrahim and Ramesh Paudyal contributed equally to this work.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.