-
PDF
- Split View
-
Views
-
Cite
Cite
Jasbir Dhaliwal, Xiaoxuan Liu, Jim Reigle, Mihika Sharma, Oscar Lopez-Nunez, Tom Walters, Margaret Collins, Jeffrey Hyams, Iram Siddiqui, Lee Denson, Anil Jegga, Surya Prasath, DEVELOPMENT OF AN OPTIMAL MACHINE LEARNING MODEL USING TREATMENT NAÏVE DIAGNOSTIC PATHOLOGY IMAGES TO PREDICT STEROID-FREE CLINICAL REMISSION AT ONE YEAR IN PEDIATRIC ULCERATIVE COLITIS, Inflammatory Bowel Diseases, Volume 29, Issue Supplement_1, February 2023, Pages S56–S57, https://doi.org/10.1093/ibd/izac247.109
- Share Icon Share
Abstract
The majority of children with ulcerative colitis present with extensive colitis at diagnosis, but the response to therapy is heterogenous. Identifying the optimal window for biologic treatment and risk stratification of patients remains an unmet need.
To develop a pathology based histomic model to predict corticosteroid free clinical remission (CSF) with mesalamine alone at one year.
292 hematoxylin and eosin diagnostic treatment naïve rectal mucosal biopsies from the multi-center PROTECT study were digitized. Whole slide images (WSIs) underwent two-step pre-processing a)stain normalization and b)informative patch selection (size 512x512). We trained machine learning (ML) models using 250 histomic features (texture, color, histogram and nuclei features) with 5-fold cross-validation, for patch-level classification. Feature importance was determined by the Gini index. We re-trained the classifier using the top features. Slide-level prediction was defined by threshold voting. Performance metrics at the patch and WSI level was evaluated.
A total of 161 patients underwent high-throughput RNA sequencing to define rectal gene expression. We undertook unsupervised weighted gene co-expression network analysis (WGCNA) to discover networks of co-expressed genes with shared biologic functions correlated with histomic features, histological traits, and outcomes.
187571 informative patches from 292 patients (Male:55%; Age:12.7y (IQR:11-15); CSF remission:41%) were trained on 23 ML classifiers. The best model trained on 250 features at the patch-level was random forest (RF). At a remission ratio threshold of 0.48, WSI area under the receiver operator curve (AUROC) was 0.90 (95%CI:0.70, 1.00), accuracy 90.4%, precision 90.9% and recall 84.7%. 18 top features were identified and trained, and the corresponding WSI AUROC was 0.87 (95%CI:0.72, 1.00), accuracy 90.1%, precision 89.4%, and recall 83.9% (Fig. 1). We re-trained the 18 features on an independent real-world dataset of 131 UC patients and the model WSI AUROC was 0.85 (95%CI:0.74, 1.00) and an accuracy of 88.5%.

Histomic feature importance represented by the SHapley Additive exPlanations (SHAP) values. The figure shows the direction of the relationship between a variable and outcome. Positive SHAP-values are indicative of clinical remission. As demonstrated by the color bar, values with higher importance are shown in red, while lower values are shown in blue.
Of the 13 modules identified by WGCNA analysis, six modules significantly correlated with clinical and/or histomic features (Fig. 2). Two of the gene co-expression modules were negatively associated with baseline clinical, endoscopic, and histologic measures of severity, and positively associated with nuclei features (Otsu area, perimeter, and equivalent diameter) and the outcome measure of CSF remission. Intersection of genes with adult single cell RNA seq data demonstrated enrichment for enterocytes (SLC26A3) and extracellular matrix (IHH)

Module trait relationship for selected histomic, histological and phenotypic traits with outcome variables
We developed a predictive model for UC disease course using histomic features from standard of care pre-treatment pathology images. Characterization of the underlying molecular basis of the histomic features is ongoing.
- phenotype
- gene expression
- extracellular matrix
- adrenal corticosteroids
- biopsy
- ulcerative colitis
- endoscopy
- glucocorticoids
- adult
- cell nucleus
- child
- colitis
- color
- disease progression
- enterocytes
- eosine yellowish-(ys)
- genes
- hematoxylin
- mesalamine
- mental recall
- sequence analysis, rna
- steroids
- diagnosis
- mineralocorticoids
- mucous membrane
- pathology
- juvenile ulcerative colitis
- stratification
- standard of care
- outcome measures
- protect trial
- outcome variable
- protect trial
- protect trial
- precision
- performance measures
- disease remission
- datasets
- machine learning
- area under the roc curve
- rna-seq