Personalized treatment selection via product partition models with covariates

TABLE 1

Predictive performance: mean across 50 replicated datasets.

	Scenario 1a			Scenario 1b
	MOT	%ΔMTU	NPC	MOT	%ΔMTU	NPC
pam-bp	14.0600	0.0192	10.2600	14.2400	−0.0106	13.2000
	(3.2351)	(0.3447)	(1.9672)	(2.9593)	(0.3429)	(2.2039)
km-bp	13.4200	0.1130	11.4000	13.4000	0.0750	13.9600
	(2.8074)	(0.3038)	(2.5314)	(2.6108)	(0.3076)	(2.2584)
hc-bp	12.8600	0.1520	12.0200	12.4400	0.1418	12.6800
	(3.1429)	(0.3642)	(2.8961)	(3.1374)	(0.3403)	(2.7807)
dm-int	12.6000	0.1756	13.8200	13.2800	0.0740	12.9600
	(3.4934)	(0.3536)	(2.9877)	(3.7310)	(0.3851)	(3.0902)
t-ppmx	10.0000	0.3933	15.1600	10.7800	0.3339	14.4280
	(3.2451)	(0.3080)	(2.2800)	(3.2968)	(0.3362)	(2.8646)
	Scenario 2a			Scenario 2b
	MOT	%ΔMTU	NPC	MOT	%ΔMTU	NPC
pam-bp	14.1600	0.0145	10.1600	14.2000	−0.0068	13.2200
	(3.2474)	(0.3405)	(2.2439)	(2.9966)	(0.3487)	(2.2341)
km-bp	13.3600	0.1136	12.4800	13.5200	0.0689	13.7800
	(2.9190)	(0.3082)	(2.7198)	(2.5414)	(0.3037)	(2.2883)
hc-bp	12.9600	0.1223	11.5400	12.4400	0.1430	12.7600
	(3.5165)	(0.3846)	(11.54)	(3.1112)	(0.3344)	(2.7372)
dm-int	12.9600	0.1223	11.5400	13.0400	0.1021	13.0200
	(3.5165)	(0.3847)	(2.8082)	(3.5798)	(0.3627)	(2.9657)
t-ppmx	10.6200	0.3578	15.3800	10.6000	0.3497	14.4400
	(3.3313)	(0.3347)	(2.5446)	(3.1880)	(0.3269)	(2.8224)
	Scenario 3a			Scenario 3b
	MOT	%ΔMTU	NPC	MOT	%ΔMTU	NPC
pam-bp	13.9800	0.0275	11.8600	14.5000	−0.0635	14.1400
	(3.3654)	(0.3469)	(2.6955)	(2.9433)	(0.3310)	(2.8856)
km-bp	13.3600	0.1159	11.6000	13.7000	0.0405	13.8000
	(2.8909)	(0.3055)	(2.6954)	(2.9014)	(0.3363)	(2.5873)
hc-bp	12.6600	0.1621	11.5000	12.9800	0.0859	12.3800
	(3.2740)	(0.3684)	(2.8158)	(3.3715)	(0.3647)	(2.3724)
dm-int	12.7400	0.1616	13.9000	12.8400	0.0957	13.6000
	(3.7461)	(0.3747)	(3.1445)	(3.1646)	(0.3548)	(2.9207)
t-ppmx	10.2600	0.3610	15.0400	10.8600	0.3244	14.8400
	(3.6411)	(0.3352)	(2.4320)	(3.1234)	(0.3281)	(2.6677)

SDs are in parentheses. In each scenario and for each index, the best performance is in bold.

7 CASE STUDY OF LOW-GRADE GLIOMA

Glioma is the most frequent brain tumor: it makes up approximately 30% of all brain and central nervous system tumors and 80% of all malignant brain tumors (Goodenberger and Jenkins, 2012). Gliomas are classified as grades I-IV based on histological criteria established by the World Health Organization (WHO). Grade I tumors are generally circumscribed benign tumors with favorable prognoses, while grades II-IV comprise more aggressive tumors (diffuse gliomas). Grade II and grade III gliomas are usually referred to as low-grade gliomas (LGGs), which may eventually progress to grade IV, high-grade gliomas. Most LGG patients undergo resection and then receive radiotherapy and/or chemotherapy. Nonetheless, these standard procedures have proved to be largely inadequate (Claus et al., 2015). LGG exhibits significant molecular heterogeneity, and many research efforts are now devoted to developing precision medicine for these patients (Olar and Sulman, 2015; Ius et al., 2018).

We apply our method to the dataset analyzed in Ma et al. (2019), where clinical data and protein expression of patients affected by lower-grade glioma are collected from the TCGA (2023) data portal. The publicly available data underwent careful preprocessing, thoroughly documented in Ma et al. (2019) and summarized in Supplementary Material F. The resulting LGG dataset comprises patients who received standard and advanced treatments. A treatment qualifies as advanced if it includes targeted therapies or radiotherapy. Each group comprises 79 patients balanced in the covariates to account for potential selection bias. Following Ma et al. (2019), we defined the tumor response for the LGG dataset using the RECIST (2008) criteria. In our analysis, tumor response is formulated in 3 ordinal levels: progressive disease (PD), partial response/stable disease (PS), and complete response (CR).
Utility weights for treatment selection with ordinal outcomes are elicited as |$\boldsymbol \omega = (0, 40, 100)^\top$|, so that the ordinal response reflects the clinical importance of each level (Ma et al., 2016). We evaluate the robustness of our method to weight elicitation in Supplementary Material I. Finally, we analyze the same 23 predictive and 2 prognostic protein expressions considered in Ma et al. (2019). See Supplementary Material F for more details, including the list of predictive and prognostic proteins.

TCGA data do not provide the true optimal treatment, so only the NPC measure, among those discussed in Section 6.1, can be used. We additionally employ an empirical summary measure (ESM; Song and Pepe, 2004) to evaluate the relative increase in the population response rate attributable to a treatment allocation method compared to random allocation. Let |$Y$| be the binary outcome variable, taking value 0 for nonrespondents and 1 for respondents. We define the treatment contrast as |$\Delta (\boldsymbol X, \boldsymbol Z) = Pr(Y=1|A=2, \boldsymbol X, \boldsymbol Z)-Pr(Y=1|A=1, \boldsymbol X, \boldsymbol Z)$|, where |$A \in \lbrace 1, 2\rbrace$| denotes the nontargeted and targeted treatments, respectively. Denoting by |$Pr(Y=1|A_r)$| the probability of being a respondent under a randomized treatment assignment, we obtain the relative increase in the population response rate under a personalized treatment selection rule as |$\mathrm{ESM}= \lbrace Pr(Y=1|A=2, \Delta (\boldsymbol X, \boldsymbol Z)\gt 0)\times Pr(\Delta (\boldsymbol X, \boldsymbol Z)\gt 0)+ Pr(Y=1|A=1, \Delta (\boldsymbol X, \boldsymbol Z)\lt 0)\times Pr(\Delta (\boldsymbol X, \boldsymbol Z)\lt 0)\rbrace -Pr(Y=1|A_r)$|; see Supplementary Material E for more details. Note that we based this summary measure on only 2 response categories, responders (CR) and nonresponders (PD + PS), whereas we used all 3 levels of the ordinal outcome in the data analysis to implement personalized treatment selection.
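As an illustration of the ESM definition above, the following minimal Python sketch computes a plug-in estimate from observed responses, received treatments, and estimated contrasts. It is not the authors' implementation: the function name `empirical_summary_measure` and its arguments are hypothetical, and the randomized-allocation rate Pr(Y = 1|A_r) is approximated here by the equal-weight average of the two arm-specific response rates, which assumes 1:1 randomization.

```python
import numpy as np

def empirical_summary_measure(y, a, delta):
    """Plug-in estimate of the ESM (hypothetical helper, for illustration).

    y     : 0/1 responder indicator (here CR = 1, PD or PS = 0)
    a     : received treatment, coded 1 (standard) or 2 (advanced)
    delta : estimated treatment contrast Delta(X, Z) for each patient
    """
    y = np.asarray(y, dtype=float)
    a = np.asarray(a)
    delta = np.asarray(delta, dtype=float)

    def response_rate(mask):
        # Observed response rate in a subgroup; 0 if the subgroup is empty.
        return y[mask].mean() if mask.any() else 0.0

    pos, neg = delta > 0, delta < 0
    # Response rate when each patient receives the treatment favored by Delta.
    personalized = (response_rate((a == 2) & pos) * pos.mean()
                    + response_rate((a == 1) & neg) * neg.mean())
    # Assumption: Pr(Y = 1 | A_r) under 1:1 randomization is the average
    # of the two arm-specific response rates.
    randomized = 0.5 * response_rate(a == 1) + 0.5 * response_rate(a == 2)
    return personalized - randomized

# Toy data, invented purely to exercise the function.
y = [1, 0, 1, 0, 1, 0, 0, 0]
a = [2, 2, 1, 1, 2, 1, 2, 1]
delta = [0.3, 0.2, -0.1, -0.4, 0.5, 0.2, -0.3, -0.2]
esm = empirical_summary_measure(y, a, delta)
```

Only patients whose received treatment agrees with the sign of the estimated contrast contribute to the personalized term, which is why the measure can be computed without knowing the true optimal treatment.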

7.1 Results

In this section, we apply the proposed method to the LGG dataset alongside the approach proposed by Ma et al. (2019). Table 2 reports the NPC and ESM summary measures computed from assignments obtained using a 10-fold cross-validation strategy. We ran the algorithm for 12 000 iterations with a burn-in period of 2000 iterations; chains were thinned by keeping every fifth sampled value. We report MCMC diagnostic checks in Supplementary Material F.
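The sampling and validation settings just described can be made concrete with a small Python sketch. It only reproduces the bookkeeping (which draws are retained and how patients might be split into folds) and makes two assumptions that the text does not state: thinning starts at the first post-burn-in draw, and the 10 folds are formed by randomly permuting the pooled 158 patients (79 per treatment). The helper name `kfold_indices` is hypothetical.

```python
import numpy as np

n_iter, burn_in, thin = 12_000, 2_000, 5

# Zero-based indices of retained draws: drop the burn-in, keep every fifth.
kept = list(range(burn_in, n_iter, thin))
n_kept = len(kept)  # (12000 - 2000) / 5 = 2000 retained draws

def kfold_indices(n, k=10, seed=0):
    """Split patient indices 0..n-1 into k disjoint, nearly equal folds."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n), k)

# Assumption: folds are drawn from the pooled sample of 79 + 79 patients.
folds = kfold_indices(158, k=10)
fold_sizes = [len(f) for f in folds]
```

Under these assumptions each chain contributes 2000 posterior draws, and each fold holds out 15 or 16 patients whose treatment assignments are then gathered across folds.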

TABLE 2

Predictive performance: metrics are obtained by gathering 10-fold cross-validation results.

	NPC	ESM
pam-bp	48	0.0553
km-bp	45	0.0384
hc-bp	48	0.0285
dm-int	64	0.0746
t-ppmx	69	0.1008

For each index, the best performance is in bold.

The proposed t-ppmx outperforms competing methods in terms of both NPC and ESM. These results are consistent with those obtained in our simulation studies, especially in scenarios featuring significant heterogeneity and a moderate number of predictive covariates (Scenarios 2a and 3a). In particular, t-ppmx attains an ESM of 0.1008, whereas the best ESM among the 2-stage procedures is 0.0553 (pam-bp). Patients show pronounced heterogeneity, particularly those assigned to Treatment 2. The absence of a sharp separation between clusters indicates substantial uncertainty in the clustering. Patients assigned to Treatment 1 form more homogeneous clusters, but the low co-clustering probabilities still indicate large variability in cluster assignments.

7.2 Cluster analysis

Here, we investigate the composition of the identified clusters and characterize the profiles of co-clustered patients. Following Wade and Ghahramani (2018), we use the variation of information loss function to estimate the optimal partition over the space of partitions. In particular, we obtain a partition of the 79 patients who received the standard treatment (Treatment 1) into 10 groups, with cluster sizes ranging from 1 to 38. Similarly, patients who received the advanced treatment (Treatment 2) are grouped into 10 clusters, with cluster sizes ranging from 1 to 34. Figure 1 reports the heatmap of the averaged co-occurrence matrices. We use T1G1|$,\ldots,$| T1G10 to denote the groups of patients treated with the standard treatment (pane A of Figure 1) and T2G1|$,\ldots,$| T2G10 to denote the groups of patients who received the advanced treatment (pane B). Our PPMx model provides homogeneous clusters in terms of predictive covariates; indeed, it substantially reduces the within-group variance for each predictive covariate (see Figure 4 in Supplementary Material G). We deem clusters with fewer than 8 members residual clusters and exclude them from the following analysis.
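To make the two ingredients of this step concrete, the sketch below computes an averaged co-occurrence matrix from sampled partitions and the variation of information (VI) between two partitions. It is a simplified illustration rather than the procedure of Wade and Ghahramani (2018): the point estimate here is simply the sampled partition with the smallest average VI to all draws, and the function names are hypothetical.

```python
import numpy as np

def co_clustering_matrix(draws):
    """Averaged co-occurrence matrix.

    draws : (n_draws, n_patients) array of cluster labels,
            one row per retained MCMC draw.
    """
    draws = np.asarray(draws)
    # same[t, i, j] is True when patients i and j share a cluster in draw t.
    same = draws[:, :, None] == draws[:, None, :]
    return same.mean(axis=0)

def variation_of_information(c1, c2):
    """VI distance (in bits) between two partitions given as label vectors."""
    c1, c2 = np.asarray(c1), np.asarray(c2)
    _, i1 = np.unique(c1, return_inverse=True)
    _, i2 = np.unique(c2, return_inverse=True)
    # Contingency table of joint cluster memberships.
    joint = np.zeros((i1.max() + 1, i2.max() + 1))
    np.add.at(joint, (i1, i2), 1.0)
    p12 = joint / c1.size

    def entropy(p):
        p = p[p > 0]
        return -np.sum(p * np.log2(p))

    # VI = 2 H(c1, c2) - H(c1) - H(c2)
    return 2.0 * entropy(p12.ravel()) - entropy(p12.sum(axis=1)) - entropy(p12.sum(axis=0))

def vi_point_estimate(draws):
    """Sampled partition minimizing the average VI to all draws (simplification)."""
    draws = np.asarray(draws)
    expected = [np.mean([variation_of_information(c, d) for d in draws]) for c in draws]
    return draws[int(np.argmin(expected))]

# Toy example: three draws of a partition of four patients.
draws = np.array([[0, 0, 1, 1], [0, 0, 1, 1], [0, 1, 1, 1]])
psm = co_clustering_matrix(draws)
best = vi_point_estimate(draws)
```

The entries of the co-occurrence matrix are the proportions of draws in which two patients share a cluster, which is the quantity a heatmap such as Figure 1 displays; values far from 0 and 1 signal uncertainty in the clustering.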

FIGURE 1

Heatmap of averaged co-occurrence matrix for patients who received Treatment 1 (pane A) and Treatment 2 (pane B).

To characterize the groups, we consider the cluster-specific mean for predictive biomarkers. Figure 2 shows that cluster-specific means in T1G2, T1G4, and T2G6 strongly depart from the population value. Moreover, T1G2 and T1G4 share the underexpression of a common set of proteins, namely SF2, RBM15, and KU80, while presenting opposite trends in the expression of AKT, YAP, BC2, and GSK3. Cluster-specific means in T2G1, T2G2, and T2G3 are very close for almost all the proteins. Notably, T2G3 features a sharp underexpression of AKT with respect to the mean values expressed in T2G1 and T2G2. Finally, patients in T2G6 show underexpression of SF2, AKT, RBM15, and KU80, in addition to the overexpression of GSK3. It is important to note that T1G4 and T2G6 exhibit similar protein expression patterns. Specifically, both groups display underexpression of SF2, AKT, RBM15, and KU80, as well as overexpression of GSK3. The role of these proteins in tumor progression is not entirely understood. Nonetheless, most of these proteins have been implicated in gliomas’ oncogenesis and developmental processes (Mills et al., 2011; Li et al., 2020).

A better characterization of the groups of interest can be achieved by evaluating the cluster-specific response probability. To obtain meaningful cluster-specific parameters, we consider the a posteriori estimated clustering |$\hat{\mathcal {P}}^{a}_{n^a}$| as fixed and, conditional on it, we obtain the a posteriori distribution of |$\boldsymbol \pi ^{a\star }_j$|s. Response probabilities are summarized by the posterior distributions |$\boldsymbol \pi _{j}^{a\star }\mid \hat{\mathcal {P}}^{a}_{n^a}, \boldsymbol x^a$| displayed in Figure 3.
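The comparison of cluster-specific means with the population value can be sketched as follows. This is an illustrative computation, not the code behind Figure 2: the function name `cluster_mean_departures` is hypothetical, and scaling the departures by the population standard deviation is an assumption made here so that proteins are comparable. Clusters with fewer than 8 members are skipped, mirroring the residual-cluster rule stated in the text.

```python
import numpy as np

def cluster_mean_departures(x, labels, min_size=8):
    """Standardized departure of cluster means from the population mean.

    x        : (n_patients, n_proteins) matrix of predictive biomarkers
    labels   : estimated cluster label for each patient
    min_size : clusters with fewer members are treated as residual and skipped
    """
    x = np.asarray(x, dtype=float)
    labels = np.asarray(labels)
    pop_mean = x.mean(axis=0)
    # Assumption: departures are scaled by the population standard deviation.
    pop_sd = x.std(axis=0, ddof=1)
    out = {}
    for g in np.unique(labels):
        members = labels == g
        if members.sum() < min_size:
            continue
        out[g.item()] = (x[members].mean(axis=0) - pop_mean) / pop_sd
    return out

# Toy data: one protein, two large clusters and one residual cluster.
x = np.concatenate([np.zeros((10, 1)), 2 * np.ones((10, 1)), np.ones((3, 1))])
labels = np.array([0] * 10 + [1] * 10 + [2] * 3)
departures = cluster_mean_departures(x, labels)
```

Negative entries correspond to underexpression relative to the population and positive entries to overexpression, which is how statements such as "underexpression of AKT" can be read off for each retained cluster.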

FIGURE 2

Group-specific mean of predictive biomarkers for patients who received Treatment 1 (pane A) and Treatment 2 (pane B).

FIGURE 3

Ternary plot of the posterior density of group-specific response probabilities for patients who received Treatment 1 (row A) and Treatment 2 (row B).

Figure 3 displays the response probabilities for patients who underwent the standard treatment (row A). Patients in T1G1 are those who most benefit from the standard treatment, as the posterior distribution of the response probability is concentrated toward the CR vertex. On the other hand, patients in T1G4 and T1G2 are more likely to experience a partial response or no response to the treatment. Response probabilities for patients who received Treatment 2 (Figure 3, row B) clearly characterize these clusters of patients, too. Notably, the posterior densities exhibit an evident skewness toward the vertices of the ternary plot. If we consider T2G1, T2G2, and T2G3, our model successfully uses predictive biomarkers along with the response to the treatment to cluster patients. In fact, t-ppmx is able to distinguish between patients who may be similar in terms of their covariates (see Figure 2, pane B) but have different responses to treatment, which is a common phenomenon in cancer genetics. Nonetheless, it is also important to notice that discrimination of respondents may be accomplished based on AKT protein underexpression. Among the Treatment 2 clusters, T2G6 is characterized by unique patterns in terms of posterior probabilities and cluster-specific means. In a comparative analysis, it is noteworthy that T2G6 and T1G4 exhibit a shared set of under/upregulated proteins: SF2, AKT, RBM15, and KU80 are underexpressed, and GSK3 is overexpressed. Consequently, T1G4 and T2G6 subjects can be regarded as individuals with closely related genetic profiles who underwent different treatment modalities. Intriguingly, these patients have shown limited response to the standard procedure (Treatment 1), whereas they appear to be responsive to the advanced treatment (Treatment 2).

Interpretation of the within-cluster expression levels needs particular care. Interactions among the proteins and the relationships between proteins’ expression and tumor progression are highly complex. Nonetheless, our analysis provides a proof of concept that the proposed method can empirically identify subgroups in heterogeneous populations. Notably, the proposed method quantifies the group-specific deviation from a population baseline accounting for the variability in the clustering and, in contrast to what is typically done with regression models, does not require us to prespecify the functional form of the association between predictive covariates and the outcome variable.

8 DISCUSSION

We have proposed a novel Bayesian approach that, given a set of predictive and prognostic biomarkers, suggests the best-suited treatment for each patient. The model clusters patients into homogeneous groups with respect to their predictive markers, separately for each treatment. Cluster-level effects adjust the baseline probability of response to treatment obtained by prognostic factors. As a key innovative feature of the proposed approach, model-based clustering and treatment assignment are jointly estimated from the data, that is, treatment selection fully accounts for patients’ heterogeneity. Simulation studies and the analysis of LGG data showed that the proposed method is well suited for predictions in scenarios of practical relevance, for example, in the presence of considerable heterogeneity. Moreover, our approach leads to a precise characterization of the clusters of patients supported by the data, identifying the group of patients more likely to benefit from targeted treatments.

In its current version, the model is designed to be used after the biomarker discovery phase, that is, after identifying relevant prognostic and predictive biomarkers. This limitation could be addressed by adopting variable selection approaches in the Bayesian framework. Nevertheless, while the use of the latter methods is straightforward when selecting prognostic biomarkers entering the likelihood, variable selection methods for PPMs are part of our ongoing research (see, for instance, Barcella et al., 2017). In this regard, we would like to highlight that, although assuming that the prognostic and predictive biomarkers are known may be restrictive in certain scenarios, and we could recast the proposed model to accommodate this lack of knowledge, this assumption remains practically very relevant. In fact, the major drawback of a model that simultaneously performs biomarker discovery and treatment selection would be the absence of a confirmatory process. Biomarkers can lead to targeted therapy and serve as useful prognostic and predictive factors of clinical outcomes. Nonetheless, to serve these purposes, biomarkers need to be validated on a completely independent dataset not used during development.

Acknowledgments

We thank J. Ma for providing us with the companion R code of Ma et al. (2019).

Funding

The first and third authors were partially supported by the “Dipartimenti Eccellenti 2023-2027” ministerial funds (Italy). All authors were partially supported by the grant “CLUstering: Bayesian Partition Models for Precise Medicine (CluB: PMx²),” funded by Fondo di Beneficienza di Intesa San Paolo (Italy), and the last author was partially supported by the Tuscany Health Ecosystem (THE) grant, funded by Ministero dell’Università e della Ricerca.

Conflict of interest

None declared.

Data availability

The data that support the findings in this paper are available in the National Cancer Institute Genomic Data Commons (NCI GDC) data portal at https://portal.gdc.cancer.gov/. These data were derived from the following resources available in the public domain: https://portal.gdc.cancer.gov/projects/TCGA-LGG.

References

Agresti A. (2019). An Introduction to Categorical Data Analysis. Hoboken, New Jersey: John Wiley & Sons.

Argiento R., Bianchini I., Guglielmi A. (2016). A blocked Gibbs sampler for NGG-mixture models via a priori truncation. Statistics and Computing, 26, 641–661.

Argiento R., Corradin R., Guglielmi A., Lanzarone E. (2022). Clustering blood donors via mixtures of product partition models with covariates. arXiv, arXiv:2210.08297, preprint: not peer reviewed.

Barcella W., De Iorio M., Baio G. (2017). A comparative review of variable selection techniques for covariate dependent Dirichlet process mixture models. Canadian Journal of Statistics, 45, 254–273.

Bedard P. L., Hansen A. R., Ratain M. J., Siu L. L. (2013). Tumour heterogeneity in the clinic. Nature, 501, 355–364.

Bonetti M., Gelber R. D. (2000). A graphical method to assess treatment–covariate interactions using the Cox model on subsets of the data. Statistics in Medicine, 19, 2595–2609.

Carvalho C. M., Polson N. G., Scott J. G. (2010). The horseshoe estimator for sparse signals. Biometrika, 97, 465–480.

Chen J., Li H. (2013). Variable selection for sparse Dirichlet-multinomial regression with an application to microbiome data analysis. The Annals of Applied Statistics, 7, 418–442.

Claus E. B., Walsh K. M., Wiencke J. K., Molinaro A. M., Wiemels J. L., Schildkraut J. M. et al. (2015). Survival and low-grade glioma: the emergence of genetic information. Neurosurgical Focus, 38, E6.

Corsini N., Viroli C. (2022). Dealing with overdispersion in multivariate count data. Computational Statistics & Data Analysis, 170, 107447.

De Blasi P., Favaro S., Lijoi A., Mena R. H., Prünster I., Ruggiero M. (2013). Are Gibbs-type priors the most natural generalization of the Dirichlet process? IEEE Transactions on Pattern Analysis and Machine Intelligence, 37, 212–229.

Favaro S., Teh Y. W. (2013). MCMC for normalized random measure mixture models. Statistical Science, 28, 335–359.

Gnedin A., Pitman J. (2006). Exchangeable Gibbs partitions and Stirling triangles. Journal of Mathematical Sciences, 138, 5674–5685.

Goodenberger M. L., Jenkins R. B. (2012). Genetics of adult glioma. Cancer Genetics, 205, 613–621.

Hartigan J. A. (1990). Partition models. Communications in Statistics - Theory and Methods, 19, 2745–2756.

Ius T., Ciani Y., Ruaro M. E., Isola M., Sorrentino M., Bulfoni M. et al. (2018). An NF-κB signature predicts low-grade glioma prognosis: a precision medicine approach based on patient-derived stem cells. Neuro-Oncology, 20, 776–787.

Kosorok M. R., Laber E. B. (2019). Precision medicine. Annual Review of Statistics and Its Application, 6, 263–286.

Lee J., Thall P. F., Lim B., Msaouel P. (2022). Utility-based Bayesian personalized treatment selection for advanced breast cancer. Journal of the Royal Statistical Society Series C: Applied Statistics, 71, 1605–1622.

Li S.-Z., Hu Y.-Y., Zhao J.-L., Zang J., Fei Z., Han H. et al. (2020). Downregulation of FHL1 protein in glioma inhibits tumor growth through PI3K/AKT signaling. Oncology Letters, 19, 3781–3788.

Lijoi A., Mena R. H., Prünster I. (2007). Controlling the reinforcement in Bayesian non-parametric mixture models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 69, 715–740.

Ma J., Hobbs B. P., Stingo F. C. (2015). Statistical methods for establishing personalized treatment rules in oncology. BioMed Research International, 2015, 670691.

Ma J., Hobbs B. P., Stingo F. C. (2018). Integrating genomic signatures for treatment selection with Bayesian predictive failure time models. Statistical Methods in Medical Research, 27, 2093–2113.

Ma J., Stingo F. C., Hobbs B. P. (2016). Bayesian predictive modeling for genomic based personalized treatment selection. Biometrics, 72, 575–583.

Ma J., Stingo F. C., Hobbs B. P. (2019). Bayesian personalized treatment selection strategies that integrate predictive with prognostic determinants. Biometrical Journal, 61, 902–917.

Mills C. N., Nowsheen S., Bonner J. A., Yang E. S. (2011). Emerging roles of glycogen synthase kinase 3 in the treatment of brain tumors. Frontiers in Molecular Neuroscience, 4, 47.

Monti S., Tamayo P., Mesirov J., Golub T. (2003). Consensus clustering: a resampling-based method for class discovery and visualization of gene expression microarray data. Machine Learning, 52, 91–118.

Müller P., Quintana F., Rosner G. L. (2011). A product partition model with regression on covariates. Journal of Computational and Graphical Statistics, 20, 260–278.

Neal R. M. (2000). Markov chain sampling methods for Dirichlet process mixture models. Journal of Computational and Graphical Statistics, 9, 249–265.

Olar A., Sulman E. P. (2015). Molecular markers in low-grade glioma–toward tumor reclassification. Seminars in Radiation Oncology, 25, 155–163.

Page G. L., Quintana F. A. (2016). Spatial product partition models. Bayesian Analysis, 11, 265–298.

Page G. L., Quintana F. A. (2018). Calibrating covariate informed product partition models. Statistics and Computing, 28, 1009–1031.

Pocock S. J., Assmann S. E., Enos L. E., Kasten L. E. (2002). Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: current practice and problems. Statistics in Medicine, 21, 2917–2930.

Quintana F. A., Iglesias P. L. (2003). Bayesian clustering and product partition models. Journal of the Royal Statistical Society Series B: Statistical Methodology, 65, 557–574.

RECIST, N. C. I. (2008). RECIST 1.1. http://www.recist.com/. [Accessed 31 August 2023].

Simon R. (2010). Clinical trial designs for evaluating the medical utility of prognostic and predictive biomarkers in oncology. Personalized Medicine, 7, 33–47.

Song X., Pepe M. S. (2004). Evaluating markers for selecting a patient’s treatment. Biometrics, 60, 874–883.

TCGA, N. C. I. (2023). Genomic Data Commons Data Portal. https://portal.gdc.cancer.gov/. [Accessed 31 August 2023].

Wade S., Ghahramani Z. (2018). Bayesian cluster analysis: point estimation and credible balls (with discussion). Bayesian Analysis, 13, 559–626.