Classifying histopathological growth patterns for resected colorectal liver metastasis with a deep learning analysis

Abstract

Background

Histopathological growth patterns are one of the strongest prognostic factors in patients with resected colorectal liver metastases. Development of an efficient, objective and ideally automated histopathological growth pattern scoring method can substantially help the implementation of histopathological growth pattern assessment in daily practice and research. This study aimed to develop and validate a deep-learning algorithm, namely neural image compression, to distinguish desmoplastic from non-desmoplastic histopathological growth patterns of colorectal liver metastases based on digital haematoxylin and eosin-stained slides.

Methods

The algorithm was developed using digitalized whole-slide images obtained in a single-centre (Erasmus MC Cancer Institute, the Netherlands) cohort of patients who underwent first curative intent resection for colorectal liver metastases between January 2000 and February 2019. External validation was performed on whole-slide images of patients resected between October 2004 and December 2017 in another institution (Radboud University Medical Center, the Netherlands). The outcomes of interest were the automated classification of dichotomous hepatic growth patterns, distinguishing between desmoplastic hepatic growth pattern and non-desmoplatic growth pattern by a deep-learning model; secondary outcome was the correlation of these classifications with overall survival in the histopathology manual–assessed histopathological growth pattern and those assessed using neural image compression.

Results

Nine hundred and thirty-two patients, corresponding to 3.641 whole-slide images, were reviewed to develop the algorithm and 870 whole-slide images were used for external validation. Median follow-up for the development and the validation cohorts was 43 and 29 months respectively. The neural image compression approach achieved significant discriminatory power to classify 100% desmoplastic histopathological growth pattern with an area under the curve of 0.93 in the development cohort and 0.95 upon external validation. Both the histopathology manual–scored histopathological growth pattern and neural image compression-classified histopathological growth pattern achieved a similar multivariable hazard ratio for desmoplastic versus non-desmoplastic growth pattern in the development cohort (histopathology manual score: 0.63 versus neural image compression: 0.64) and in the validation cohort (histopathology manual score: 0.40 versus neural image compression: 0.48).

Conclusions

The neural image compression approach is suitable for pathology-based classification tasks of colorectal liver metastases.

Introduction

Colorectal cancer (CRC) is the third most common cancer and second cause of cancer mortality worldwide^1,2. Approximately one-third of these patients are afflicted with metastatic disease, with the liver representing the most predominant metastatic site^3,4. The presence of CRC distant metastases itself does not preclude potentially curative treatment^5–12. Although half of all patients with colorectal liver metastases (CRLM) may now be eligible for local treatment¹³, the results are still unsatisfactory, with only a quarter of patients achieving a long-term cure^14,15. This has garnered a longstanding interest in the prediction of prognosis and treatment effect, with the ultimate goal of guiding patient selection and improving outcome¹⁶.

In the search for new biomarkers, histological evaluation of liver metastases has emerged as a promising candidate. Light-microscopic evaluation of resected metastases allows for the determination of distinct histopathological growth patterns (HGPs)¹⁷. The most clinically relevant distinction between HGPs is desmoplastic versus non-desmoplastic HGP, according to the Rotterdam 50% cut-off. A desmoplastic HGP is recognized with an approximate two-fold reduction in mortality and cancer recurrence^18,19. Beside prognosis, several studies suggest that HGP is also predictive for treatment effect^2,20,21. Although HGPs have been shown to describe the biological properties of the tumour relating to therapy response and prognosis, they are not routinely scored yet. Expertise is required because there are several caveats in scoring²². Moreover, as HGP scoring requires a pathologist to score the full interface between the liver and the tumour cell by cell, the task is time-consuming. The lack of an efficient, objective and ideally automated HGP classification method substantially limits the implementation of HGPs in daily practice and research.

Developments in the application of artificial intelligence, and specifically deep learning, to high-resolution digitalized whole-slide images (WSI) has led to a rapidly growing research field at the interface of medical and computer sciences^23,24. Several deep-learning models are already approaching or even surpassing dedicated pathologists in histology-based marker determination tasks^25–32. Moreover, deep-learning models can predict prognosis by learning directly from the histology slides, effectively creating novel AI-based computational biomarkers³².

This study aims to assess whether a novel state-of-the-art deep-learning approach can be employed for the automated classification of the desmoplastic HGP in resected CRLM.

Methods

The current study adheres to the REporting recommendations for tumour MARKer prognostic studies (REMARK)³³. Institutional ethical review was obtained from both the medical ethics committee of the Erasmus Medical Centre (MEC-2018-1743), which granted a waiver for (renewed) informed consent, and the Ethical Committee of the Radboud University Medical Centre (MEC 2015–1637).

Patient cohorts and sample preparation

The patient cohort used for development consisted of patients undergoing surgical treatment of CRLM at the Erasmus MC Cancer Institute, Rotterdam, the Netherlands, between January 2000 and February 2019. For external validation purposes patients treated in a similar time frame (October 2004 to December 2017) at a different centre, the Radboud University Medical Centre, Nijmegen, the Netherlands, were selected. All available haematoxylin and eosin–stained slides of all resection specimens were requested from the respective pathology departments and subsequently digitalized. Patients were included only if they underwent first curative intent CRLM resection (that is, resection specimens for recurrent disease were excluded, and patients had to have had curative intent local treatment of all known cancerous disease at time of first liver surgery). Follow-up was obtained through the electronic patient record as patients are scheduled for regular follow-up after resection.

Histopathological growth patterns determination

All slides were scanned at the pathology department of the Radboud UMC using a 3DHistech P1000 scanner at a spatial resolution of 0.25 µm/pixel. Digital assessment of all WSI was performed by a trained observer (DJH) to confirm slide content and assess WSI quality.

The HGP was previously determined in accordance with international consensus guidelines within the context of retrospective cohort studies^18,19,34. The HGPs represent distinct histomorphological tumour–liver interface phenotypes of resected liver metastasis (Fig. S1), and can be grossly divided into two classes. The desmoplastic HGP is characterized by a broad band of desmoplastic stroma barring tumour–liver cell contact, and often displays a dense lymphocytic infiltrate peripherally to this desmoplastic stroma. The non-desmoplastic types most often exhibit cell-to-cell contact between tumour and liver cells, with the replacement of hepatocytes by tumour cells retaining the liver-cell plate architecture, that is the ‘replacement’ HGP. Although HGPs can appear in conjunction, we performed classification of the dichotomous presence of any non-desmoplastic HGP (Fig. S1) rather than relative abundance for the development and validation of the model, as this best distinguishes prognosis and is therefore clinically most relevant^17–19.

Neural image compression algorithm with multitask learning and attention pooling

For the classification of WSI we developed a neural image compression (NIC) algorithm with a supervised multitask-learning encoder framework (Fig. 1), building upon previous work³⁵. The multitask NIC pipeline consists of two steps.

Fig. 1

Neural image compression pipeline (A) with a supervised multitask learning encoder framework and convolutional neural networks classifier (B) Neural image compression with attention pipeline. First the slide is compressed, then classified. The classification architecture consists of four 1 × 1 convolutional layers and a final linear layer starting with a 1 × 1 convolution reducing the input channels from 2048 to 512 (conv1-512). H and W stand for height and width of the image respectively. H' and W' are the height and width of the compressed images respectively with H' << than H and W' << than W.

Open in new tab Download slide

First, subregions of the entire gigapixel WSI are compressed into low-dimensional embedding vectors using a convolutional neural network (CNN), the encoder. These vectors are subsequently organized to form a compressed representation of the WSI, maintaining the spatial arrangement of the original WSI. The encoder model is responsible for gleaning high-level discriminatory information contained in the WSI for a variety of downstream tasks, while simultaneously suppressing image noise and spurious correlations^35,36. To improve the extraction of high-level discriminatory factors that are transferable between a variety of tasks, we initially developed a supervised multitask learning architecture and trained an encoder on four histopathological tasks³⁵. This approach demonstrated increased performance when compared to an unsupervised single-task framework. Independently, another author developed a similar multitask encoder, trained on 22 classification tasks and with validated performance increase compared to non-histopathological pretrained encoders³⁷. In this work, we therefore use the new multitask encoder³⁷, which compresses a tile of size 256 × 256 × 3 into a vector of size 2048. As input, we use here tiles at resolution 5 × (2 µm/px).

Second, a second CNN is trained on the entire compressed WSI as input to predict an outcome of interest, for example the HGP. For the CNN classifier, we adapted the attention-based architecture introduced in previous works (Fig. 1)^38,39. In the context of neural networks, the term ‘attention’ refers to the capability of a network to learn to focus, that is to attend to specific regions of the input image. Using attention allows neural networks to make efficient use of training data as well as provide visually interpretable outputs via so-called ‘attention maps’. In one of the authors' previous works, they demonstrated the performance advantage of attention on a task for lung cancer subtyping compared to a convolutional architecture without attention.⁴⁰ After a single layer, an attention block is applied, resulting in a score for each compressed tile. It follows a matrix multiplication of the attention map with the output of the first layer (‘attention pooling’), resulting in a single vector which is then fed to the final classification layer. In the attention block, a dropout rate of 0.25 was used. The attention maps were used to visualize what is relevant for the network’s prediction and thus contributes to the interpretability of the model.

Experimental setup

Following the compression of the slides using the encoder model, we trained the CNN with cross-entropy loss minimization to predict the image label of interest (that is the HGP). Development was performed using a five-fold cross-validation (three folds for training, one for validation, one for testing). The training was done with balanced sampling, batch size of one, and early stopping with 25 epoch patience using the validation ROC-AUC (receiver operating characteristic area under the curve) as stopping criteria. External validation was performed on previously unseen slides of the Nijmegen cohort by averaging the predictions of the five models. A patient-level score was subsequently obtained by averaging the scores of all slides belonging to a single patient.

Outcomes of interest

The primary outcome of interest was the classification of dichotomous hepatic growth patterns, distinguishing between desmoplastic hepatic growth pattern and non-desmoplastic growth pattern by a deep-learning model. The secondary outcome was the correlation of these classifications with overall survival, irrespective of the underlying cause of death.

Statistical analysis

All statistical analyses were performed using the R project for statistical computing (https://www.r-project.org/). A complete case analysis was performed because of a low percentage of missing data (<5%) and large sample size. Categorical variables are reported as absolute numbers with corresponding percentages and non-parametric ordinal or numerical variables as medians with corresponding interquartile ranges, and were compared using the chi² or Kruskall Wallis tests respectively. Assessment of HGP classifier performance was done through ROC curve analysis with the slide-level ensemble score and observer-based HGP as the predictor and label respectively and the AUC with corresponding 95% c.i. as the performance metric. Given the class imbalance (roughly 80% of patients have a non-desmoplastic HGP), the optimality criteria were modified according to the prevalence of desmoplastic samples in the development cohort as proposed by others⁴¹. This threshold was subsequently applied in the external validation cohort to the patient-level ensemble scores, using the balanced accuracy as a performance metric⁴². Kaplan–Meier and Cox proportional regression survival analyses were performed to assess the prognostic value of the histopathology observer–based HGP and NIC-classified HGP. Multivariable models were corrected for age, sex, pT-stage, pN-stage, right-sided colorectal cancer, disease-free interval, number of liver metastasis, diameter of largest liver metastasis, preoperative carcinoembryonic antigen level and extrahepatic disease.

Results

Of 1254 patients treated at the development institution, 965 met the inclusion criteria and 932 were eligible for analysis. On the other hand, of 305 patients treated at the validation centre, 294 were eligible for analysis. A timeline of patients’ enrolment over the years at the development and the validation centres is presented respectively in Fig. S2 and Fig. S3.

Patient and treatment characteristics of the original and validation cohort are provided in Table S1. The development cohort comprised a total of 3.641 WSI from 932 patients (median follow-up time 43 months) undergoing first curative intent surgical treatment for CRLM. For external validation, a total of 870 WSI from 294 patients were available (median follow-up time 29 months). Fifty-five per cent of the patients in the development cohort received neo-adjuvant chemotherapy and 72.1% in the validation cohort. pT-stage did not differ significantly between the two cohorts (P = 0.94); however, a higher proportion of pN0-stage primary tumour was observed in the development cohort (P = 0.02). No statistically significant difference in HGP proportions was observed between the two cohorts.

Automated HGP classification

Using a five-fold cross-validation the NIC classifier achieved an AUC of 0.93 (95% c.i. 0.93 to 0.94) in the original cohort to classify the slide-level HGP (Fig. 2). Applying the optimal threshold for the ensemble score (0.69) based on the ROC curve/Youden’s J statistic (Fig. 2) resulted in a patient-level sensitivity of 82%, a specificity of 93% and a balanced accuracy of 88% (Fig. 2, Table 1). Upon external validation in the 870 previously unseen WSI of the validation cohort the NIC classifier achieved a similar AUC of 0.95 (95% c.i. 0.93 to 0.96) to classify the slide-level HGP (Fig. 2). Application of the optimal threshold from the development cohort achieved a patient-level sensitivity of 87%, a specificity of 91% and a balanced accuracy of 89% when compared to the observer-based HGP (Fig. 2).

Fig. 2

ROC curves of the automated histopathological growth pattern (HGP) classification in the original (a) and in the external validation cohort (b)

Open in new tab Download slide

Table 1

Open in new tab

NIC HGP classification performance in the development and validation cohorts

	TP	TN	FP	FN	Sens.	Spec.	PPV	NPV	Bal. Acc.
Development—patient level (n = 932)	180	662	51	39	82%	93%	78%	94%	88%
Validation—patient level (n = 294)*	52	213	21	8	87%	91%	71%	96%	89%

	TP	TN	FP	FN	Sens.	Spec.	PPV	NPV	Bal. Acc.
Development—patient level (n = 932)	180	662	51	39	82%	93%	78%	94%	88%
Validation—patient level (n = 294)*	52	213	21	8	87%	91%	71%	96%	89%

^*According to the predefined classification cut-off determined in the development cohort. Bal. Acc., balanced accuracy; FN, false negative; FP, false positive; NIC, neural image compression; NPV, negative predictive value; PPV, positive predictive value; Sens., sensitivity; Spec., specificity; TN, true negative; TP, true positive.

Table 1

Open in new tab

NIC HGP classification performance in the development and validation cohorts

	TP	TN	FP	FN	Sens.	Spec.	PPV	NPV	Bal. Acc.
Development—patient level (n = 932)	180	662	51	39	82%	93%	78%	94%	88%
Validation—patient level (n = 294)*	52	213	21	8	87%	91%	71%	96%	89%

	TP	TN	FP	FN	Sens.	Spec.	PPV	NPV	Bal. Acc.
Development—patient level (n = 932)	180	662	51	39	82%	93%	78%	94%	88%
Validation—patient level (n = 294)*	52	213	21	8	87%	91%	71%	96%	89%

Survivals

Table 2 reports the survival estimates and regression results for the observer-based and the NIC-classified HGP in both the development and external validation cohort, and Fig. 3 and Fig. 4 display the respective overall survival (OS) curves with stratification for chemo-naïve and pretreated. Overall, the NIC-classified HGP exhibited similar prognostic impact on OS as the histopathology observer–based HGP, also upon external validation. For example, the adjusted hazards ratio (95% c.i.) for desmoplastic versus non-desmoplastic patients based on the NIC-classified HGP was 0.64 (0.51 to 0.79) in the original cohort and 0.48 (0.28 to 0.83) upon external validation, compared to 0.63 (0.50 to 0.79) and 0.40 (0.22 to 0.75) respectively for the observer-based HGP (Table 1). Figure S4 shows examples of attention maps of four different histological slides of liver tissue samples paired with their corresponding attention maps. The attention maps are generated using predictive models to visualize the areas of importance for classifying the hepatic growth pattern, thus providing insights into the model's decision-making process. An initial analysis of the attention maps shows that the model is indeed mainly focusing on the tumour–stroma border to determine the HGP.

Fig. 3

Overall survival (OS) curves for the observer-based (a–c) and neural image compression (NIC) (d–f) classified histopathological growth pattern (HGP) in the development cohort and stratified for pretreatment (c,f) and chemo-naïve (b,e) patients

Open in new tab Download slide

Fig. 4

Overall survival (OS) curves for the observer-based (a–c) and neural image compression (NIC) (d–f) classified histopathological growth pattern (HGP) in the validation cohort and stratified for pretreatment (c,f) and chemo-naïve (b,e) patients

Open in new tab Download slide

Table 2

Open in new tab

Survival analyses on the ground-truth and NIC-classified HGP

Desmoplastic versus non-desmoplastic	Non-desmoplastic 5-year OS (95% c.i.)	Desmoplastic 5-year OS (95% c.i.)	Desmoplastic versus non-desmoplastic
			Univariable HR (95% c.i.)	Multivariable HR (95% c.i.)*
Development cohort (n = 932)
Ground-truth HGP	40% (36,44)	63% (57,70)	0.57 (0.47,0.70)	0.63 (0.50,0.79)
NIC-classified HGP	40% (37,44)	60% (54,67)	0.61 (0.50,0.75)	0.64 (0.51,0.79)
Validation cohort (n = 294)
Ground-truth HGP	64% (58,71)	80% (70,91)	0.51 (0.30,0.86)	0.40 (0.22,0.75)
NIC-classified HGP	66% (60,72)	73% (63,84)	0.64 (0.41,1.02)	0.48 (0.28,0.83)

Desmoplastic versus non-desmoplastic	Non-desmoplastic 5-year OS (95% c.i.)	Desmoplastic 5-year OS (95% c.i.)	Desmoplastic versus non-desmoplastic
			Univariable HR (95% c.i.)	Multivariable HR (95% c.i.)*
Development cohort (n = 932)
Ground-truth HGP	40% (36,44)	63% (57,70)	0.57 (0.47,0.70)	0.63 (0.50,0.79)
NIC-classified HGP	40% (37,44)	60% (54,67)	0.61 (0.50,0.75)	0.64 (0.51,0.79)
Validation cohort (n = 294)
Ground-truth HGP	64% (58,71)	80% (70,91)	0.51 (0.30,0.86)	0.40 (0.22,0.75)
NIC-classified HGP	66% (60,72)	73% (63,84)	0.64 (0.41,1.02)	0.48 (0.28,0.83)

*Corrected for age, sex, primary tumour location, pT-stage, pN-stage, disease-free interval, number of CRLM, diameter of largest CRLM, preoperative CEA, and extrahepatic disease. CEA, carcinoembryonic antigen; CRLM, colorectal liver metastasis; HGP, histopathological growth pattern; NIC, neural image compression; OS, overall survival.

Table 2

Open in new tab

Survival analyses on the ground-truth and NIC-classified HGP

Desmoplastic versus non-desmoplastic	Non-desmoplastic 5-year OS (95% c.i.)	Desmoplastic 5-year OS (95% c.i.)	Desmoplastic versus non-desmoplastic
			Univariable HR (95% c.i.)	Multivariable HR (95% c.i.)*
Development cohort (n = 932)
Ground-truth HGP	40% (36,44)	63% (57,70)	0.57 (0.47,0.70)	0.63 (0.50,0.79)
NIC-classified HGP	40% (37,44)	60% (54,67)	0.61 (0.50,0.75)	0.64 (0.51,0.79)
Validation cohort (n = 294)
Ground-truth HGP	64% (58,71)	80% (70,91)	0.51 (0.30,0.86)	0.40 (0.22,0.75)
NIC-classified HGP	66% (60,72)	73% (63,84)	0.64 (0.41,1.02)	0.48 (0.28,0.83)

Desmoplastic versus non-desmoplastic	Non-desmoplastic 5-year OS (95% c.i.)	Desmoplastic 5-year OS (95% c.i.)	Desmoplastic versus non-desmoplastic
			Univariable HR (95% c.i.)	Multivariable HR (95% c.i.)*
Development cohort (n = 932)
Ground-truth HGP	40% (36,44)	63% (57,70)	0.57 (0.47,0.70)	0.63 (0.50,0.79)
NIC-classified HGP	40% (37,44)	60% (54,67)	0.61 (0.50,0.75)	0.64 (0.51,0.79)
Validation cohort (n = 294)
Ground-truth HGP	64% (58,71)	80% (70,91)	0.51 (0.30,0.86)	0.40 (0.22,0.75)
NIC-classified HGP	66% (60,72)	73% (63,84)	0.64 (0.41,1.02)	0.48 (0.28,0.83)

Discussion

In this study the authors developed and validated a deep-learning–based pipeline with compression and attention to classify HGP on a large data set of digitalized WSI of resected CRLM without manual input from a clinician. The developed NIC classifier performed similarly across the development and previously unseen external validation cohort, achieving high levels of classifier performance and demonstrating generalizability with a balanced accuracy of ≥ 88%. In addition, the NIC-classified HGP demonstrated similar prognostic impact in terms of OS when compared to observer-based pathologist determination with the added benefit of faster output.

Literature shows that HGP is an independent prognostic factor for survival and there are studies suggesting HGP as a predictive factor for therapeutic effectiveness, making it a clinically highly relevant biomarker^2,15,21,43. It is of the utmost importance that such a biomarker is objective and reproducible, independent of the scoring physician. It is known that scoring of HGPs has several caveats, so expertise is necessary. The results of this study demonstrate high levels of HGP classification performance in both the development and validation cohorts (AUC ≥ 0.93), suggesting the development of an objective and reproducible clinically relevant scoring method that can automatically be determined. This will substantially help the implementation of HGPs in daily practice and research.

The attention maps illustrate that the NIC model concentrates on various regions of the slide. By analysing these maps, we can understand which features or regions of the slide are most influential in the model’s assessment of the HGP. This can be particularly useful for identifying potential areas for improvement in the model or for validating that the model is focusing on clinically relevant regions or even potentially to discover new histopathological biomarkers.

Although promising, these results also suggest the limits of the NIC classification pipeline with incorporation of even larger data sets and different immunohistochemical staining. This study includes only two tertiary university hospitals with very high performance both in the development and validation cohorts, which could be a sign of model overfitting to these specific data sets. Additional development and validation cohorts of different centres in multiple countries could improve this model even further and alleviate this problem. A recent study has suggested that a more granular, non-dichotomous approach could potentially offer enhanced prognostic value and stratify patient survival even further than the dichotomous classification. Lastly, this study did not explore how the deep-learning model could be seamlessly integrated into existing clinical workflows. Addressing the practical challenges of implementation in daily clinical practice is essential for ensuring the model’s effective use in real-world settings. Further research is necessary to validate these results⁴⁴.

In conclusion, these experimental results show that automated NIC-based models are promising to objectively classify HGP following surgical treatment of CRLM.

Funding

This study was partly funded by a grant from Stichting Coolsingel, Rotterdam, the Netherlands, and by the European Union through the Horizon 2020 framework under grant agreement No. 825292 (ExaMode, htttp://www.examode.eu/).

Acknowledgements

D.J.H. and W.A. are contributed equally. F.C. and C.V. are jointly supervised this work.

Disclosure

David Tellez is now affiliated with Aiosyn BV, the Netherlands. Jeroen van der Laak was a member of the advisory boards of Philips, the Netherlands and ContextVision, Sweden, and received research funding from Philips, the Netherlands, ContextVision, Sweden, and Sectra, Sweden in the last 5 years. He is chief scientific officer and a shareholder of Aiosyn BV, the Netherlands. Francesco Ciompi was Chair of the Scientific and Medical Advisory Board of TRIBVN Healthcare, France, and received advisory board fees from TRIBVN Healthcare, France in the last 5 years. He is a shareholder of Aiosyn BV, the Netherlands.

All other authors have no conflict of interests to declare.

Supplementary material

Supplementary material is available at BJS Open online.

Data availability

All clinical data and corresponding digital WSI used in this study are not publicly available but can be requested from and may be provided at the discretion of the corresponding authors of each respective centre and under the provision of appropriate data and material transfer agreements.

All developed deep-learning models are published online and are freely accessible at https://grand-challenge.org/algorithms/colorectal-liver-metastases-survival-prediction upon request. We also provide the code to create the correct input (slide as.tif or.svs file with correct background mask as.tif file) for the algorithm: https://grand-challenge.org/algorithms/tissue-segmentation-and-packing. The source code for NIC is available at https://github.com/DIAGNijmegen/pathology-whole-slide-learning.

Author contributions

Diederik Höppener (Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Validation, Visualization, Writing—original draft, Writing—review & editing), Witali Aswolinskiy (Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Visualization, Writing—original draft, Writing—review & editing), Zhen Qian (Visualization, Writing—review & editing), David Tellez (Writing—review & editing), Pieter Nierop (Writing—review & editing), Martijn Starmans (Writing—review & editing), Iris Nagtegaal (Writing—review & editing), Michail Doukas (Writing—review & editing), Johannes De Wilt (Writing—review & editing), Dirk Grünhagen (Writing—review & editing), Jeroen van der Laak (Writing—review & editing), Peter Vermeulen (Writing—review & editing), Francesco Ciompi (Conceptualization, Funding acquisition, Project administration, Supervision, Writing—review & editing), and Cornelis Verhoef (Conceptualization, Funding acquisition, Project administration, Supervision, Writing—review & editing)

References

Bray

Ferlay

Soerjomataram

Siegel

Torre

Jemal

Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries

CA Cancer J Clin

2018

;

394

–

424

Buisman

van der Stok

Galjart

Vermeulen

Balachandran

Coebergh van den Braak

RRJ

et al.

Histopathological growth patterns as biomarker for adjuvant systemic chemotherapy in patients with resected colorectal liver metastases

Clin Exp Metastasis

2020

;

593

–

605

Manfredi

Lepage

Hatem

Coatmeur

Faivre

Bouvier

Epidemiology and management of liver metastases from colorectal cancer

Ann Surg

2006

;

244

254

–

259

Engstrand

Nilsson

Strömberg

Jonas

Freedman

Colorectal cancer liver metastases—a population-based study on incidence, management and survival

BMC Cancer

2018

;

Gootjes

Buffart

Tol

Burger

Grunhagen

van der Stok

et al.

The ORCHESTRA trial: a phase III trial of adding tumor debulking to systemic therapy versus systemic therapy alone in multi-organ metastatic colorectal cancer (mCRC)

J Clin Oncol

2016

;

TPS788

–

TPS788

Google Scholar

Crossref

WorldCat

Moris

Ronnekleiv-Kelly

Rahnemai-Azar

Felekouras

Dillhoff

Schmidt

et al.

Parenchymal-sparing versus anatomic liver resection for colorectal liver metastases: a systematic review

J Gastrointest Surg

2017

;

1076

–

1085

Capussotti

Muratore

Baracchi

Lelong

Ferrero

Regge

et al.

Portal vein ligation as an efficient method of increasing the future liver remnant volume in the surgical treatment of colorectal metastases

Arch Surg

2008

;

143

978

–

982

;

discussion 982

Sandström

Røsok

Sparrelid

Larsen

Larsson

Lindell

et al.

ALPPS improves resectability compared with conventional two-stage hepatectomy in patients with advanced colorectal liver metastasis: results from a Scandinavian multicenter randomized controlled trial (LIGRO trial)

Ann Surg

2018

;

267

833

–

840

Bismuth

Adam

Lévi

Farabos

Waechter

Castaing

et al.

Resection of nonresectable liver metastases from colorectal cancer after neoadjuvant chemotherapy

Ann Surg

1996

;

224

509

–

520

;

discussion 520–502

Huiskens

van Gulik

van Lienden

Engelbrecht

Meijer

van Grieken

et al.

Treatment strategies in colorectal cancer patients with initially unresectable liver-only metastases, a study protocol of the randomised phase 3 CAIRO5 study of the Dutch Colorectal Cancer Group (DCCG)

BMC Cancer

2015

;

365

Stang

Fischbach

Teichmann

Bokemeyer

Braumann

A systematic review on the clinical benefit and role of radiofrequency ablation as treatment of colorectal liver metastases

Eur J Cancer

2009

;

1748

–

1756

Mahadevan

Blanck

Lanciano

Peddada

Sundararaman

D'Ambrosio

et al.

Stereotactic body radiotherapy (SBRT) for liver metastasis—clinical outcomes from the international multi-institutional RSSearch® patient registry

Radiat Oncol

2018

;

Meyer

Olthof

Grünhagen

de Hingh

de Wilt

JHW

Verhoef

et al.

Treatment of metachronous colorectal cancer metastases in the Netherlands: a population-based study

Eur J Surg Oncol

2022

;

1104

–

1109

Tomlinson

Jarnagin

DeMatteo

Fong

Kornprat

Gonen

et al.

Actual 10-year survival after resection of colorectal liver metastases defines cure

J Clin Oncol

2007

;

4575

–

4580

Buisman

Giardiello

Kemeny

Steyerberg

Höppener

Galjart

et al.

Predicting 10-year survival after resection of colorectal liver metastases; an international study including biomarkers and perioperative treatment

Eur J Cancer

2022

;

168

–

Kanas

Taylor

Primrose

Langeberg

Kelsh

Mowat

et al.

Survival after liver resection in metastatic colorectal cancer: review and meta-analysis of prognostic factors

Clin Epidemiol

2012

;

283

–

301

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

Latacz

Höppener

Bohlok

Leduc

Tabariès

Fernández Moro

et al.

Histopathological growth patterns of liver metastasis: updated consensus guidelines for pattern scoring, perspectives and recent mechanistic insights

Br J Cancer

2022

;

127

988

–

1013

Galjart

Nierop

PMH

van der Stok

van den Braak

RRJC

Höppener

Daelemans

et al.

Angiogenic desmoplastic histopathological growth pattern as a prognostic marker of good outcome in patients with colorectal liver metastases

Angiogenesis

2019

;

355

–

368

Höppener

Galjart

Nierop

PMH

Buisman

van der Stok

Coebergh van den Braak

RRJ

et al.

Histopathological growth patterns and survival after resection of colorectal liver metastasis: an external validation study

JNCI Cancer Spectr

2021

;

pkab026

Nierop

PMH

Galjart

Höppener

van der Stok

Coebergh van den Braak

RRJ

Vermeulen

et al.

Salvage treatment for recurrences after first resection of colorectal liver metastases: the impact of histopathological growth patterns

Clin Exp Metastasis

2019

;

109

–

118

Frentzas

Simoneau

Bridgeman

Vermeulen

Foo

Kostaras

et al.

Vessel co-option mediates resistance to anti-angiogenic therapy in liver metastases

Nat Med

2016

;

1294

–

1302

van Dam

van der Stok

Teuwen

Van den Eynden

Illemann

Frentzas

et al.

International consensus guidelines for scoring the histopathological growth patterns of liver metastasis

Br J Cancer

2017

;

117

1427

–

1441

van der Laak

Litjens

Ciompi

Deep learning in histopathology: the path to the clinic

Nat Med

2021

;

775

–

784

Echle

Rindtorff

Brinker

Luedde

Pearson

Kather

Deep learning in cancer pathology: a new generation of clinical biomarkers

Br J Cancer

2021

;

124

686

–

696

Bulten

Pinckaers

van Boven

Vink

de Bel

van Ginneken

et al.

Automated deep-learning system for Gleason grading of prostate cancer using biopsies: a diagnostic study

Lancet Oncol

2020

;

233

–

241

Nagpal

Foote

Tan

Liu

Chen

Steiner

et al.

Development and validation of a deep learning algorithm for Gleason grading of prostate cancer from biopsy specimens

JAMA Oncol

2020

;

1372

–

1380

Coudray

Ocampo

Sakellaropoulos

Narula

Snuderl

Fenyö

et al.

Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning

Nat Med

2018

;

1559

–

1567

Ehteshami Bejnordi

Mullooly

Pfeiffer

Fan

Vacek

Weaver

et al.

Using deep convolutional neural networks to identify and classify tumor-associated stroma in diagnostic breast biopsies

Mod Pathol

2018

;

1502

–

1512

Mercan

Mehta

Bartlett

Shapiro

Weaver

Elmore

Assessment of machine learning of breast pathology structures for automated differentiation of breast cancer and high-risk proliferative lesions

JAMA Netw Open

2019

;

e198777

Yan

Liu

Automatic classification of ovarian cancer types from cytological images using deep convolutional neural networks

Biosci Rep

2018

;

BSR20180289

Hekler

Utikal

Enk

Solass

Schmitt

Klode

et al.

Deep learning outperformed 11 pathologists in the classification of histopathological melanoma images

Eur J Cancer

2019

;

118

–

Skrede

O-J

De Raedt

Kleppe

Hveem

Liestøl

Maddison

et al.

Deep learning for prediction of colorectal cancer outcome: a discovery and validation study

Lancet

2020

;

395

350

–

360

McShane

Altman

Sauerbrei

Taube

Gion

Clark

REporting recommendations for tumour MARKer prognostic studies (REMARK)

Br J Cancer

2005

;

387

–

391

Höppener

Nierop

PMH

Herpel

Rahbari

Doukas

Vermeulen

et al.

Histopathological growth patterns of colorectal liver metastasis exhibit little heterogeneity and can be determined with a high diagnostic accuracy

Clin Exp Metastasis

2019

;

311

–

319

Tellez

Höppener

Verhoef

Grünhagen

Nierop

Drozdzal

et al.

Extending unsupervised neural image compression with supervised multitask learning

Proc Machine Learning Res

2020

;

121

770

–

783

Google Scholar

OpenURL Placeholder Text

WorldCat

Tellez

Litjens

Bándi

Bulten

Bokhorst

Ciompi

et al.

Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology

Med Image Anal

2019

;

101544

Mormont

Geurts

Marée

Multi-task pre-training of deep neural networks for digital pathology

IEEE J Biomed Health Inform

2020

;

412

–

421

Google Scholar

Crossref

WorldCat

Maximilian

Jakub

Max

Attention-based Deep Multiple Instance Learning. In: Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden. PMLR,

2018

Williamson

DFK

Chen

Barbieri

Mahmood

Data-efficient and weakly supervised computational pathology on whole-slide images

Nat Biomed Eng

2021

;

555

–

570

Witali

David

Gabriel

Lieke van der

Monika

L-S

Jeroen van der

et al.

Neural image compression for non-small cell lung cancer subtype classification in H&E stained whole-slide images. In: Proc SPIE, International Society for Optics and Photonics, 2021, San Diego Convention Center/San Diego, California, United States

2021

Perkins

Schisterman

The inconsistency of “optimal” cutpoints obtained using two criteria based on the receiver operating characteristic curve

Am J Epidemiol

2006

;

163

670

–

675

Kleppe

Skrede

De Raedt

Liestøl

Kerr

Danielsen

Designing deep learning studies in cancer diagnostics

Nat Rev Cancer

2021

;

199

–

211

Zaharia

Veen

Lea

Kanani

Alexeeva

Søreide

Histopathological growth pattern in colorectal liver metastasis and the tumor immune microenvironment

Cancers (Basel)

2022

;

181

Fernández Moro

Geyer

Harrizi

Hamidi

Söderqvist

Kuznyecov

et al.

An idiosyncratic zonated stroma encapsulates desmoplastic liver metastases and originates from injured liver

Nat Commun

2023

;

5024

This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact [email protected].

Download all slides

Month:	Total Views:
October 2024	124
November 2024	507
December 2024	191
January 2025	69
February 2025	72
March 2025	101
April 2025	59
May 2025	5

Article Contents

Classifying histopathological growth patterns for resected colorectal liver metastasis with a deep learning analysis

Abstract

Introduction

Methods

Patient cohorts and sample preparation

Histopathological growth patterns determination

Neural image compression algorithm with multitask learning and attention pooling

Experimental setup

Outcomes of interest

Statistical analysis

Results

Automated HGP classification

Survivals

Discussion

Funding

Acknowledgements

Disclosure

Supplementary material

Data availability

Author contributions

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Most Read

Most Cited

Article Contents

Classifying histopathological growth patterns for resected colorectal liver metastasis with a deep learning analysis

Abstract

Introduction

Methods

Patient cohorts and sample preparation

Histopathological growth patterns determination

Neural image compression algorithm with multitask learning and attention pooling

Experimental setup

Outcomes of interest

Statistical analysis

Results

Automated HGP classification

Survivals

Discussion

Funding

Acknowledgements

Disclosure

Supplementary material

Data availability

Author contributions

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Most Read

Most Cited

This Feature Is Available To Subscribers Only