Utility-Based Bayesian Personalized Treatment Selection for Advanced Breast Cancer

Lee, Juhee; Thall, Peter F.; Lim, Bora; Msaouel, Pavlos

doi:10.1111/rssc.12582

Abstract

A Bayesian method is proposed for personalized treatment selection in settings where data are available from a randomized clinical trial with two or more outcomes. The motivating application is a randomized trial that compared letrozole plus bevacizumab to letrozole alone as first-line therapy for hormone receptor-positive advanced breast cancer. The combination treatment arm had larger median progression-free survival time, but also a higher rate of severe toxicities. This suggests that the risk-benefit trade-off between these two outcomes should play a central role in selecting each patient's treatment, particularly since older patients are less likely to tolerate severe toxicities. To quantify the desirability of each possible outcome combination for an individual patient, we elicited from breast cancer oncologists a utility function that varied with age. The utility was used as an explicit criterion for quantifying risk-benefit trade-offs when making personalized treatment selections. A Bayesian nonparametric multivariate regression model with a dependent Dirichlet process prior was fit to the trial data. Under the fitted model, a new patient's treatment can be selected based on the posterior predictive utility distribution. For the breast cancer trial dataset, the optimal treatment depends on the patient's age, with the combination preferable for patients 70 years or younger and the single agent preferable for patients older than 70.

Bayesian nonparametrics, dependent Dirichlet process, multivariate probit regression, precision medicine, statistical decision making, utility function

1 INTRODUCTION

Nearly all published clinical trial results focus on statistical inferences about effects of treatments and patient prognostic variables on clinical outcomes. This may fall short of what is needed by practicing physicians to make informed treatment decisions for individual patients. In many settings, estimated effects on efficacy and toxicity lead to conflicting treatment choices, and the relative desirability of two treatments also may vary with patient prognostic variables. Our motivating dataset, which illustrates this class of problems, arose from a phase III study of targeted agents for treating hormone receptor-positive advanced breast cancer, reported by Dickler et al. (2016). Patients were randomized between letrozole plus bevacizumab (L + B) and letrozole plus placebo (L). The primary efficacy endpoint was progression-free survival (PFS) time, defined as the time from the treatment to disease progression or death from any cause. Due to safety concerns, 21 different types of toxicity were monitored, including the type and grade (0 = none to 5 = fatal) of each occurrence. A statistically significant PFS improvement was seen with L + B compared to L (one-sided p-value = 0.016), with estimated median PFS 20.2 months (95% confidence interval, CI, 17.0–24.1) with L + B compared to 15.6 months (95% CI 12.9 – 19.7) with L. Consideration of toxicities led to the opposite conclusion, with 46.8% of patients treated with L + B experiencing severe (grade ≥ 3) toxicities compared to 14.2% with L.

Considering each outcome alone, selecting an optimal treatment is straightforward, since longer PFS and less toxicity each is more desirable. This leads to the problematic conclusions that L + B is preferable in terms of PFS but L is preferable in terms of toxicity. Thus, when considering these two outcomes together, as must be done in practice by a physician when choosing between the treatments for an individual patient, decision making is not straightforward. In our analyses, rather than dichotomizing toxicity severity, we will use total toxicity burden (TTB) (Bekele & Thall, 2004; Le-Rademacher et al., 2020) to summarize each patient's adverse events. In general, for K toxicities $z = (z_{1}, \dots, z_{K})$ ⁠, where each $z_{k}$ is a grade in {0, …, J − 1}, we define the scaled TTB to be $q = \sum_{k = 1}^{K} z_{k} / {K \times (J - 1)},$ which takes on values between 0 and 1. Figure 1 illustrates the TTB distributions and Kaplan–Meier estimates of the PFS survival function for each treatment arm in the breast cancer dataset, with L + B represented by blue and L by red. The questions that we will address in this paper are how one may use the available data to choose between L + B and L for a new breast cancer patient, and how this may be done in other, similar settings.

FIGURE 1

For the breast cancer data, histograms of scaled total toxicity burden (TTB) are given in panels (a) and (b). Panel (c) illustrates Kaplan–Meier estimates of survival functions, S(t). Blue and red represent treatments, letrozole plus bevacizumab (L + B) and letrozole plus placebo (L) respectively. [Colour figure can be viewed at https://dbpia.nl.go.kr]

Open in new tab Download slide

We assume that each patient's data can be summarized as a vector, y, of clinical outcomes, a vector, x, of prognostic covariates, and a treatment indicator variable, τ. A Bayesian regression model, f( y | τ, x, θ) is assumed and fit to the data, where θ denotes the model's parameter vector. For the breast cancer trial data, y = $(y_{1}, y_{2})$ with $y_{1}$ = PFS time and $y_{2}$ = TTB. Using the breast cancer trial data for illustration, we extend the usual statistical process of data analysis by connecting it with medical decision making by practicing physicians. To do this, we first construct a family of utility functions, with each utility assigning numerical desirability scores U(y, x) to all y = (PFS time, TTB) pairs for a patient with prognostic variables x. In a given setting, a physician and patient may choose a particular utility function from the family that best represents the patient's subjective trade-offs. Because the trade-off between PFS and TTB may vary with x, the desirability of a particular pair of treatment options may not be the same for all patients, and this may lead to different treatment preferences for two patients having different x.

Decision analysis based on utility functions certainly is not new. This has been studied and applied in many areas, including business (e.g. Loewenstein et al., 1989; Pennings & Smidts, 2003), engineering (e.g. Bagočius et al., 2014; Chen et al., 1998) and operations research (e.g. Roy et al., 2017; Walsh et al., 2004). Two papers of a five-part primer on medical decision analysis are given by Detsky et al. (1997) and Naglie et al. (1997). However, formal utility-based decision procedures are seldom included in statistical data analysis reports. Our application of the methodology to the breast cancer dataset illustrates how a utility function and Bayesian statistical model can be used to choose between two treatments for a patient with given prognostic variables. To establish the idea that outcome utilities can be used as practical tools for decision making, and illustrate the range of potential applications, Supplementary Section 2 provides examples of utility functions for different types of outcome vectors. These include a one-dimensional ordinal outcome, binary (response, toxicity) indicators, and the two ordinal categorical outcomes (disease status, toxicity severity).

For the breast cancer data analysis, we develop a robust Bayesian regression model, f( y | τ, x, θ), that assumes latent patient frailties to account for association among the elements of y and z, and describes how each outcome varies as a function of treatment, τ, and baseline covariates, x. We formulate a joint Bayesian nonparametric (BNP) multivariate regression model that includes a vector of continuous latent variables defined to represent ordinal toxicity outcomes, using the dependent Dirichlet process (DDP) (MacEachern, 1999). We use a linear DDP developed by De Iorio et al. (2004, 2009). DDP models are highly flexible and provide a robust basis for inferences about regression relationships. BNP models have been applied to a broad range of statistical problems, including density estimation, regression, clustering and survival analysis. See, for example, Müller et al. (2015) for general applications, Mitra and Müller (2015) for applications in biostatistics, or Müller and Mitra (2013); Thall et al. (2017) for overviews and illustrations. Given a Bayesian model f( y | τ, x, θ) and utility function U( y, x), we use posterior predictive utility distributions as a basis for deciding between treatments for a new patient with prognostic variables x. We choose the treatment that yields the greatest posterior mean utility.

The remainder of the paper is organized as follows. Section 2.1 formulates a bivariate regression model for clinical outcomes PFS time and TTB, and provides a predictive distribution of (PFS, TTB) for a future patient as a function of the patient's covariates and each potential treatment that may be given to the patient. Section 2.2 provides computational details for implementation. In Section 3, we describe a utility function that varies with PFS, TTB and covariates in order to represent a personalized risk-benefit trade-off between PFS and TTB. The utility function is constructed using separate contributions from PFS time and TTB, with each contribution constrained so that it is logically consistent and reflects elicited expert opinion. The predictive distribution allows one to estimate the utility of each treatment for a future patient with given covariates, and provides a way to compute the probability that each treatment is preferred for the future patient. In Section 4, we illustrate the methodology by applying it to make personalized treatment selections based on the breast cancer trial data. In Section 5, a simulation study is presented to illustrate general properties of the proposed decision-making approach. We close with a brief discussion in Section 6.

2 A BNR MODEL

2.1 Sampling distribution and prior specification

Let $t_{i} \in R^{+}$ denote PFS time during the follow-up period $(0, c_{i}]$ ⁠, for patient i = 1, …, n. The observed time of failure (progression or death) or independent administrative censoring at $c_{i}$ is $t_{i}^{o} = \min (t_{i}, c_{i}),$ with $δ_{i} = 1$ if PFS time was observed, $t_{i} \leq c_{i}$ ⁠, and $δ_{i} = 0$ if censored, $t_{i} > c_{i}$ ⁠. Denote the ordinal variable $z_{i, k} \in {0, 1, \dots, J - 1}$ for the maximum grade that patient i experienced of toxicity type k = 1, …, K. Censoring is assumed to be independent of $t_{i}$ ⁠, toxicity occurrences, and covariates. Denote the ith patient's vector of baseline covariates by $x_{i} = (x_{i, 1}, \dots, x_{i, P}),$ and the observed data by $𝒟 = {(t_{i}^{o}, δ_{i}, z_{i}, τ_{i}, x_{i}), i = 1, \dots, n}$ ⁠, where $z_{i} = (z_{i, 1}, \dots, z_{i, K})$ ⁠. In the breast cancer dataset, there are n = 340 patients after removing three patients having $t^{o} = 0$ ⁠, K = 21 toxicity categories and J = 6 severity grades, where grade 0 = no occurrence of that toxicity type and 5 = death. If a subject died due to a toxicity type k occurrence, the corresponding $z_{i, k} = 5$ was recorded with observed survival time $t_{i}^{o} = t_{i}$ and $δ_{i} = 1$ ⁠. We include P = 3 prognostic covariates, $x_{1}$ = age, an indicator $x_{2}$ of measurable disease at baseline, and an indicator $x_{3}$ of whether the patient's disease free interval prior to trial entry was greater than 24 months, in addition to the indicator τ of treatment L + B.

To construct a model that accounts for heterogeneity between patients not explained by the covariates, we introduce real-valued (K + 1)−dimensional multivariate normal latent frailty vectors, $s_{i} = {(s_{i, 0}, s_{i, 1}, \dots, s_{i, K})}^{'}$ for i = 1, …, n. We assume $s_{i} | Ω \overset{iid}{\sim} N_{K + 1} (0, Ω)$ and $Ω \sim Inv-Wishart (a_{Ω}, Ω_{0})$ ⁠. Following Chib and Greenberg (1998), we construct a multinomial probit model for the ordinal toxicity outcomes $z_{i}$ by introducing the unobserved real-valued latent variables ${\tilde{z}}_{i} = ({\tilde{z}}_{i, 1}, \dots, {\tilde{z}}_{i, K})$ ⁠, where ${\tilde{z}}_{i, k} \in R$ ⁠, and define $z_{i, k} = j$ if and only if $u_{k, j} < {\tilde{z}}_{i, k} \leq u_{k, j + 1}$ ⁠, where $u_{k, 0} < u_{k, 1} < \dots < u_{k, J}$ denote toxicity type-specific cutoffs for each k. This is a common modelling strategy that uses real-valued latent variables to induce a tractable multivariate distribution on a vector of observed ordinal categorical variables, and greatly facilitates computation. For logarithm transformed PFS time, $\tilde{t} = \log (t),$ latent variables ${\tilde{z}}_{i},$ treatment $τ_{i},$ and covariates $x_{i}$ ⁠, we assume

({\tilde{t}}_{i}, {\tilde{z}}_{i}) | τ_{i}, x_{i}, s_{i} \overset{indep}{\sim} h ({\tilde{t}}_{i}, {\tilde{z}}_{i} | τ_{i}, x_{i}, s_{i}), where ({\tilde{t}}_{i}, {\tilde{z}}_{i}) \in R^{K + 1} .

(1)

We take a BNP approach for modelling h in (1) that allows flexible regression structures by assuming the DDP (MacEachern, 1999), which is a family of random probability distributions indexed by (τ, x). Specifically, we use a linear-DDP that induces covariate dependence through linear regression structures (De Iorio et al., 2004, 2009). Denoting ${\tilde{x}}_{i}^{'} = (1, τ_{i}, x_{i}^{'})$ ⁠, $β = {(β_{0}, β_{τ}, β_{1}, \dots, β_{P})}^{'}$ and $α_{k} = {(α_{k, 0}, α_{k, τ}, α_{k, 1}, \dots, α_{k, P})}^{'}$ ⁠, we assume the simple parametric linear combinations $η_{0} ({\tilde{x}}_{i}) = β^{'} {\tilde{x}}_{i}$ and $η_{k} ({\tilde{x}}_{i}) = α_{k}^{'} {\tilde{x}}_{i}$ ⁠, for each k = 1, …, K. We denote $η ({\tilde{x}}_{i}) = (η_{0} ({\tilde{x}}_{i}), \dots, η_{K} ({\tilde{x}}_{i}))$ and construct a model for h through a convolution with a normal kernel

h ({\tilde{t}}_{i}, {\tilde{z}}_{i} | τ_{i}, x_{i}, s_{i}) = \int ϕ_{K + 1} ({\tilde{t}}_{i}, {\tilde{z}}_{i} | η ({\tilde{x}}_{i}) + s_{i}, \sum) d G (β, α_{1}, \dots, α_{K}),

(2)

where $ϕ_{d} (\cdot | a, B)$ is the density function of the d−variate normal distribution with mean vector a and d × d covariance matrix B > 0. We use the Dirichlet process (DP) as a prior for the random mixing distribution G in (2). This gives a DP mixture of multivariate normal linear models,

h ({\tilde{t}}_{i}, {\tilde{z}}_{i} | τ_{i}, x_{i}, s_{i}) = \sum_{m = 1}^{\infty} w_{m} ϕ_{K + 1} ({\tilde{t}}_{i}, {\tilde{z}}_{i} | η_{m} ({\tilde{x}}_{i}) + s_{i}, \sum),

(3)

where $η_{m} ({\tilde{x}}_{i}) = (η_{m, 0} ({\tilde{x}}_{i}), \dots, η_{m, K} ({\tilde{x}}_{i}))$ with $η_{m, 0} ({\tilde{x}}_{i}) = β_{m}^{'} {\tilde{x}}_{i}$ and $η_{m, k} ({\tilde{x}}_{i}) = α_{m, k}^{'} {\tilde{x}}_{i}$ ⁠. The weights ${w_{m}}$ are constructed via Sethuraman's (1994) so-called ‘stick-breaking’ process by assuming $w_{m} / \prod_{m^{'} = 1}^{m - 1} (1 - w_{m^{'}}) \overset{iid}{\sim} Be (1, ξ)$ ⁠, with fixed ξ > 0. For the covariate and treatment effect parameters in (3), we assume

β_{m} \overset{iid}{\sim} {MVN}_{P + 2} (\overline{β}, κ^{2} I_{P + 2}) and α_{m, k} | {\overline{α}}_{k}, V \overset{iid}{\sim} {MVN}_{P + 2} ({\overline{α}}_{k}, V)

(4)

with $\overline{β}$ and $κ^{2}$ fixed, where ${MVN}_{P + 2}$ represents a (P + 2)-dimensional multivariate normal distribution. In (4), $\overline{β} = ({\overline{β}}_{0}, {\overline{β}}_{τ}, {\overline{β}}_{1}, \dots, {\overline{β}}_{P})$ and ${\overline{α}}_{k} = ({\overline{α}}_{k, 0}, {\overline{α}}_{k, τ}, {\overline{α}}_{k, 1}, \dots, {\overline{α}}_{k, P})$ are (P + 2)-dimensional mean vectors and $V = diag [v_{p}^{2}]$ is a (P + 2) × (P + 2) matrix. We assume ${\overline{α}}_{k, p} \overset{indep}{\sim} N ({\overset{=}{α}}_{p}, v_{α}^{2})$ with fixed ${\overset{=}{α}}_{p}$ and $v_{α}^{2}$ ⁠, and $v_{p}^{2} \overset{iid}{\sim} IG (a_{v}, b_{v})$ with fixed $a_{v}$ and $b_{v}$ for p = 0, …, P. The hierarchical structure for $α_{m, k}$ enables the model to borrow information across the toxicity categories. The model in (3) incorporates τ and x linearly in the mean of each normal summand. Due to the fact that the distribution of $(\tilde{t}, \tilde{z})$ is a weighted average of multivariate normal distributions, each with its own linear term, the model accounts for possible effects of τ and x on $(\tilde{t}, \tilde{z})$ that can be nonlinear and quite complex, including interactions between two or more variables in (τ, x). This construction thus provides a flexible modelling framework for inference and prediction, avoiding restrictive assumptions of linearity or additivity in the covariate effects. This facilitates accurate decision making by avoiding restrictions imposed by conventional parametric models, such as the proportional hazards model. We let $\sum = diag (σ_{t}^{2}, σ_{z}^{2}, \dots, σ_{z}^{2})$ ⁠, which implies conditional independence between ${\tilde{t}}_{i}$ and ${\tilde{z}}_{i, k}$ given $s_{i}$ ⁠. Due to the conditional independence, the marginal distributions from (3) can be expressed as the weighted averages

\begin{aligned} {\tilde{t}}_{i} | τ_{i}, x_{i}, s_{i, 0} & \overset{indep}{\sim} f ({\tilde{t}}_{i} | τ_{i}, x_{i}, s_{i, 0}) = \sum_{m = 1}^{\infty} w_{m} ϕ_{1} ({\tilde{t}}_{i} | η_{m, 0} ({\tilde{x}}_{i}) + s_{i, 0}, σ_{t}^{2}), \\ {\tilde{z}}_{i, k} | τ_{i}, x_{i}, s_{i, k} & \overset{indep}{\sim} g_{k} ({\tilde{z}}_{i, k} | τ_{i}, x_{i}, s_{i, k}) = \sum_{m = 1}^{\infty} w_{m} ϕ_{1} ({\tilde{z}}_{i, k} | η_{m, k} ({\tilde{x}}_{i}) + s_{i, k}, σ_{z}^{2}) \end{aligned}

(5)

That is, f and each $g_{k}$ also is a linear DDP mixture distribution. The marginal distribution of each $z_{i, k}$ is obtained by integrating over the latent variables,

P (z_{i, k} = j | τ_{i}, x_{i}, s_{i, k}) = \sum_{m = 1}^{\infty} w_{m} \int_{u_{k, j}}^{u_{k, j + 1}} ϕ_{1} (\tilde{z} | η_{m, k} ({\tilde{x}}_{i}) + s_{i, k}, σ_{z}^{2}) d \tilde{z} .

(6)

Marginalizing by averaging over s in (3), the resulting DP mixture model, $h (\tilde{t}, \tilde{z} | τ, x)$ ⁠, has covariance matrix ∑ + Ω. Thus, Ω induces dependence between PFS time and the toxicities within each patient, in addition to explaining additional variability between patients not explained by τ and x.

To ensure identifiability in the multivariate ordinal regression model, we fix $σ_{z}^{2}$ and set $u_{k, 1} = 0$ for all k. We also set $u_{k, 0} = - \infty$ ⁠, and $u_{k, J} = \infty$ ⁠, and $P (z_{i, k} < 0) = 0$ and $P (z_{i, k} \leq J - 1) = 1$ ⁠. We let the cut-offs $u_{k, j}$ ⁠, j = 2, …, J−1 be random for flexibility, by defining $u_{k, j} = u_{k, j - 1} + e_{k, j - 1}$ k = 1, …, K and with error terms $e_{k, j} \overset{iid}{\sim} Ga (a_{e}, b_{e})$ ⁠, for j = 2, …, J − 1. Lastly, we assume $σ_{t}^{2} \sim IG (a_{t}, b_{t})$ ⁠.

2.2 Posterior inference

Collecting terms, $θ = (w_{m}, β_{m}, σ_{t}^{2}, α_{m, k}, e, {\overline{α}}_{k, p}, v_{p}^{2}, Ω)$ is the vector of all model parameters, and $\tilde{θ} = (M, \overline{β}, a_{t}, b_{t}, κ^{2}, {\overset{=}{α}}_{p}, v_{α}^{2}, a_{v}, b_{v}, a_{Ω}, Ω_{0})$ is the vector of all fixed hyper-parameters. Given $\tilde{θ}$ and data $𝒟$ ⁠, the joint posterior of θ and the patient-specific random effects $s = {s_{i}, i = 1, \dots, n}$ is

p (θ, s | 𝒟, \tilde{θ}) \propto \{\prod_{i = 1}^{n} p ({\tilde{t}}_{i}^{o}, δ_{i}, z_{i} | τ_{i}, x_{i}, s_{i}, θ, \tilde{θ}) \times p (s_{i} | θ, \tilde{θ})\} p (θ | \tilde{θ}),

(7)

where the joint likelihood of the observed data for the i^th patient is the product

\begin{matrix} p ({\tilde{t}}_{i}^{o}, δ_{i}, z_{i} | τ_{i}, x_{i}, s_{i}, θ, \tilde{θ}) & = {f ({\tilde{t}}_{i} | τ_{i}, x_{i}, s_{i, 0}, β, σ_{t}^{2})}^{δ_{i}} {1 - F ({\tilde{t}}_{i} | τ_{i}, x_{i}, s_{i, 0}, β, σ_{t}^{2})}^{1 - δ_{i}} \\ \times \prod_{k = 1}^{K} p (z_{ik} | τ_{i}, x_{i}, s_{i, k}, α_{k}, u_{k}) . \end{matrix}

We use Markov chain Monte Carlo (MCMC) simulation to generate posterior samples of the parameter and latent variable vectors, (θ, s). For computational convenience, we approximate the DDP in (5) by truncating the infinite number of mixture components of F and $G_{k}$ to the finite value M. The final weight is set to $w_{M} = 1 - \sum_{m = 1}^{M - 1} w_{m}$ to ensure that F and $G_{k}$ are proper distributions. For sufficiently large M, the truncated sum produces inferences virtually identical to those with the infinite sum (Ishwaran & James, 2001; Rodriguez & Dunson, 2011). As discussed in Rodriguez and Dunson (2011), if there is a discrepancy between the posterior distributions under the truncated and infinite sums, then the model is sensitive to the choice of M. Any value of M that has a small value for $w_{M}$ is sufficiently large to produce a negligible discrepancy. We examined the posterior distribution of $w_{M}$ ⁠, and assessed sensitivity of the model to several different M values for the breast cancer dataset. We found that the truncated process is robust to the choice of M, if M is sufficiently large. This led us to use M = 15 for the data analysis and simulation studies. Computational details are given in Supplementary Section 1.1. A computer program ‘utility-analysis’ for fitting the proposed model is available from https://users.soe.ucsc.edu/juheelee/.

3 UTILITY FUNCTIONS FOR PFS AND TTB

In this section, we describe how a utility function was constructed for the breast cancer data analysis. While we focus on the case where y consists of PFS time and TTB, the methodology may be applied generally in settings where y is a single variable, a bivariate binary or ordinal variable, or some combination of two or more discrete and continuous outcomes. Examples are given in Supplementary Section 2.

Given the reduction of the K-dimensional toxicity vector z to the scaled TTB q, 0 ≤ q ≤ 1, we will construct a utility function for the pair (t, q). A departure of our utility formulation from previous published outcome utilities is that we construct U so that it varies with covariates x as well as the outcomes (t, q). We let τ = 1 for L + B and τ = 0 for L, so choosing τ for a future patient with a covariates $x^{new}$ is the target of our decision analysis. To ensure a consistent utility function that quantifies trade-offs between t and q for each x, we require

U (t, q, x) > U (t, q^{'}, x) if q < q^{'} for any t, and U (t, q, x) < U (t^{'}, q, x) if t < t^{'} for any q .

That is, considered individually with the other outcome variable fixed, smaller TTB and longer PFS each must be more desirable.

The form of the utility function given here, and the numerical values that it takes on, were obtained based on the consensus of two medical oncologists who are co-authors of this paper, PM and BL, one of whom is a breast cancer subspecialist. The first step of our construction was to specify a parametric total utility function, which we defined generally as the product

U_{tot} (t, q, x) = U_{PFS} (t, x) \times U_{TTB} (q, x),

(8)

subject to the constraints $0 \leq U_{PFS} (t, x) \leq U_{\max}$ and $0 < U_{TTB} (q, x) \leq 1$ ⁠. For a given x, the utility component $U_{TTB} (q, x)$ acts multiplicatively to decrease the utility component $U_{PFS} (t, x)$ ⁠, and one may regard multiplying by $U_{TTB} (q, x)$ as penalizing the PFS utility, where the magnitude of the penalty is determined by q.

To apply this to the breast cancer dataset, we constructed a functional form for (8) to reflect this particular treatment setting. We denote the prognostic covariates by $x_{1} = Age$ ⁠, an indicator $x_{2}$ of measurable disease, and an indicator $x_{3}$ of whether the patient's disease free interval prior to trial entry was > 24 months, with all three included in the regression models for outcomes t and z. While the randomization for the breast cancer trial was stratified by $x_{2}$ and $x_{3}$ to improve precision, based on clinical experience PM and BL decided that neither $x_{2}$ nor $x_{3}$ should have any effect on the utility function, whereas $x_{1} = Age$ is included in $U_{TTB} .$ This is because, in clinical practice, the utility gained by greater PFS is similar regardless of age group, while older patients tend to care more about maintaining a good quality of life, quantified by a lower TTB. Thus, older patients are less likely than younger patients to accept a higher level of toxicity for the same PFS benefit. Accordingly, we only used the prognostic covariate $x_{1} = Age$ in the utility function, and constructed $U_{tot} (t, q, x) = U_{tot} (t, q, A g e)$ so that, for any PFS time t, a given value q > 0 for TTB decreases the utility for an older patient more than for a young patient. We also assumed that $U_{PFS} (t, A g e) = U_{PFS} (t)$ ⁠. We set $U_{\max} = 100$ since the domain (0, 100) is easy to interpret, and set $U_{TTB} (0, A g e) = 1$ for any Age. Thus, if a patient has no toxicity (q = 0) then $U_{tot} (t, 0, A g e) = U_{PFS} (t)$ ⁠. For example, if $q = 0.50, U_{PFS} (36) = 80,$ and $U_{TTB} (0.50, 60) = 0.50$ ⁠, then the utility of 60-month PFS time and TTB = 0.5 is $U_{tot} (36, 0.50, 60) = 80 \times 0.5 = 40$ for a 60-year-old patient, whereas $U_{TTB} (0.50, 40) = 0.70$ for a 40-year-old patient gives the much higher total utility $U_{tot} (36, 0.50, 40) = 80 \times 0.7 = 56$ ⁠.

For the PFS component, the clinicians PM and BL specified the particular values $U_{PFS} (24) = 50$ ⁠, $U_{PFS} (48)$ = 95, and required $\lim_{t \to \infty} U_{PFS} (t) = 100$ for patients with any Age. This was based on the clinical experience that, in the hormone receptor-positive metastatic breast cancer setting, the increase in utility with increasing PFS is linear for up to 4 years, after which there is generally not much increase in utility with increasing PFS. This is due, in part, to the availability of newer regimens that can be given as salvage therapy to patients whose disease progresses after 4 years. To reflect this, we constructed the following parametric function for $U_{PFS} (t)$ ⁠:

U_{PFS} (t) = \{\begin{matrix} U_{0} {(\frac{t}{t_{0}})}^{a} & if t < t_{0} \\ \frac{U_{\max}}{1 + \exp (- b_{1} t)} & if t \geq t_{0}, \end{matrix}

(9)

and we set $t_{0}$ = 48 months, $U_{0}$ = 95, and $U_{\max}$ = 100. This function increases in t up to 48 months, with a small additional increase for t > 48 months. Since $U_{PFS} (24) = 50$ ⁠, the equation $95 \times {(24 / 48)}^{a} = 50$ gives a = 0.926. Similarly, since $U_{PFS} (48) = 100 / {1 + \exp (- b_{1} 48)} = 95,$ this gives $b_{1} = 0.061$ ⁠. The resulting function $U_{PFS} (t)$ is plotted in Figure 2(a).

FIGURE 2

Illustration of utility functions. Panel (a) has $U_{PFS} (t)$ ⁠, the utility function of progression-free survival (PFS) time with t = PFS time. Panels (b)–(d) have the total utility functions $U_{tot} (t, q, A g e)$ ⁠, where t = PFS time and q = total toxicity burden (TTB). [Colour figure can be viewed at https://dbpia.nl.go.kr]

Open in new tab Download slide

We defined $U_{TTB} (q, A g e)$ to vary with Age so that it decreases at a faster rate for older Age. PM and BL established numerical values of $U_{TTB}$ in [0, 1] for each pair of (q, Age) values specified on a grid. These are tabulated in Supplementary Table 2. We constructed a parametric function that closely approximates these elicited TTB utilities by exploring various functional forms, and chose

U_{TTB} (q, A g e) = \exp {- q^{2} / (2 g^{2} (A g e))}, for 0 \leq q \leq 1,

(10)

where $g (A g e) = \exp (c_{0} + c_{1} A g e)$ ⁠. As q increases, the total utility given in (8) decreases by a factor of the above exponential function of $q^{2},$ and $U_{PFS}$ is penalized in $U_{tot}$ using this exponential value. If no toxicity occurs, that is, q = 0, then $U_{TTB} (0, A g e) = 1$ and $U_{tot} (t, q, A g e) = U_{PFS} (t)$ ⁠, as required. The function $U_{TTB} (q, A g e)$ has an inflection point at q = g(Age), and for any q > g(Age), $U_{tot}$ decreases to 0 very quickly. We defined g(Age) to be a decreasing function of Age so that $U_{TTB} (q, A g e)$ decreases in q faster for larger values of Age. To obtain this, we restricted $c_{1} < 0$ and calibrated the values of $c_{0}$ and $c_{1}$ using the elicited numerical utilities. The numerical values $c_{0} = 0.823$ and $c_{1} = - 0.05$ yield a good approximation. Details are given in Supplementary Section 2. Figure 2b,c compares $U_{tot} (t, q, A g e)$ (solid line) to the elicited values (dots connected by dotted lines) for Age = 50, 65 and 85. In each plot, different colours represent different values of q. The figure illustrates how $U_{tot} (t, q, A g e)$ decreases with q, and how the magnitude of decrease changes with Age. The specific utility function described here may be questioned due to its subjectivity. However, ordering the consequences of decisions is inherently subjective and is necessary for decision making, and using an explicit utility function that is constructed based on expert knowledge produces meaningful decisions.

To use this structure for individualized treatment selection, we exploit the Bayesian model to compute the posterior predictive (PP) distribution of $(\tilde{t}, z)$ for a new patient with prognostic covariates $x^{new} = (x_{1}^{new}, x_{2}^{new}, x_{3}^{new})$ assuming a particular τ is given to the patient, similar to the examples in Supplementary Section 2,

p (\tilde{t}, z | τ, x^{new}, 𝒟) = \int \int p (\tilde{t}, z | θ, s^{new}, x^{new}, τ) p (θ, s^{new} | 𝒟) d s^{new} d θ .

(11)

By averaging the likelihood of the new patient's future outcomes $(\tilde{t}, z)$ over the joint posterior distribution of $(θ, s^{new})$ ⁠, the PP distribution in (11) provides a fully model-based criterion for making inferences to compare treatments, with appropriate quantification of uncertainty. The PP distribution of the utility, $p {U_{tot} (t, q, A g e^{new}) | τ, x^{new}, 𝒟},$ for a new patient with prognostic covariates $x^{new}$ can be derived directly from (11).

We use the predictive mean total utility for each τ as a basis for treatment selection. For τ = 0 corresponding to L and τ = 1 for L + B, the predictive mean total utility is

{\overline{u}}_{tot} (τ, x^{new}) = \sum_{z_{1} = 0}^{J - 1} \dots \sum_{z_{K} = 1}^{J - 1} \int_{R} U_{tot} (\tilde{t}, q, A g e^{new}) p (\tilde{t}, z | τ, 𝒟, x^{new}) d \tilde{t} .

(12)

One may choose the treatment τ having larger ${\overline{u}}_{tot} (τ, x^{new})$ for the new patient. In general, ${\overline{u}}_{tot} (τ, x^{new})$ is a function of the entire $x^{new}$ vector and τ through the utility function and/or the probability distribution, and this still would be the case if $U_{tot}$ did not depend on Age.

In general, another criterion for comparing treatments τ and $τ^{'}$ studied in a trial is to compare the PP distributions $p {U_{tot} (t, q, A g e^{new}) | τ, x^{new}, 𝒟}$ and $p {U_{tot} (t, q, A g e^{new}) | τ^{'}, x^{new}, 𝒟}$ of the total utilities. To do this, we define the posterior probability that treatment $τ^{'}$ has a larger total utility than treatment τ for a new patient with prognostic variables $x^{new}$ ⁠,

Δ (x^{new}, τ, τ^{'}) = \Pr {U_{tot} (t (τ), q (τ), A g e^{new}) < U_{tot} (t (τ^{'}), q (τ^{'}), A g e^{new}) | τ, τ^{'}, 𝒟, x^{new}} .

(13)

One may select $τ^{'}$ to treat a new patient with $x^{new}$ if $Δ (x^{new}, τ, τ^{'}) \geq 0.5,$ and otherwise select τ. This can be done by first computing the joint PP distribution of $(\tilde{t} (τ), z (τ), \tilde{t} (τ^{'}), z (τ^{'}))$ ⁠, that is, of two sets of outcomes with one for each treatment,

\begin{matrix} p (\tilde{t} (τ), z (τ), \tilde{t} (τ^{'}), z (τ^{'}) | τ, τ^{'}, x^{new}, 𝒟) & = \int \int p (\tilde{t} (τ), z (τ) | θ, s^{new}, x^{new}, τ) \\ \times p (\tilde{t} (τ^{'}), z (τ^{'}) | θ, s^{new}, x^{new}, τ^{'}) p (s^{new}, θ | 𝒟) d s^{new} d θ, \end{matrix}

(14)

which can be used in turn to compute $Δ (x^{new}, τ, τ^{'})$ ⁠. Although $U_{tot}$ depends on $x^{new}$ only through Age, the joint distribution in (14) depends on the entire $x^{new}$ and $(τ, τ^{'})$ ⁠, and Δ does as well. The decision criteria ${\overline{u}}_{tot} (τ, x^{new})$ and $Δ (x^{new}, τ, τ^{'})$ may be computed numerically using MCMC samples of θ simulated from $p (θ | 𝒟, \tilde{θ})$ ⁠. Computational details are given in Supplementary Section 1.2.

4 DECISION MAKING FOR THE BREAST CANCER DATA

In this section, we illustrate the proposed decision-making procedures by application to the breast cancer dataset. We fit the statistical model in Section 2 to the data and used the elicited utility in Section 3. To fit the Bayesian model, we specified values of the fixed hyperparameters $\tilde{θ}$ as follows. We let the DP concentration parameter be ξ = 1, $κ^{2} = 5$ for the prior of $β_{m, p}$ ⁠, $a_{v} = 5$ and $b_{v} = 1$ for the priors of $v_{p}^{2}, a_{e} = b_{e} = 3$ for the priors of $e_{k, j},$ and $a_{t} = 3$ and $b_{t} = 1$ for the prior of $σ_{t}^{2}$ ⁠. We fixed ${\overline{β}}_{p} = 0$ ⁠, p ≥ 1 and used the empirical average of the observed $\tilde{t}$ to specify ${\overline{β}}_{0}$ ⁠. Similarly, we let ${\overset{=}{α}}_{p} = 0$ ⁠, p = 1, …, P and used the empirical probabilities of $z_{i, k}$ being 0 to determine ${\overset{=}{α}}_{0}$ ⁠. We fixed $σ_{z}^{2} = 9$ ⁠. For Ω, we let $a_{Ω} = 50$ ⁠, and $Ω_{0} = 0.01 (a_{Ω} - K - 1) I_{K + 1}$ ⁠. We discarded the first 20,000 iterates for burn-in, and kept the next 5000 iterates for posterior inference. We examined mixing and convergence of the Markov chains using trace plots, and did not find evidence of poor mixing or bad convergence.

Estimates of the posterior predictive distributions of t, q and $U_{tot}$ are shown in Figure 3 for a future patient with $x^{new} = (A g e^{new}, 0, 0)$ ⁠. The top, middle and bottom rows of the figure correspond to $A g e^{new} =$ 55, 65 and 75 years, with treatment L (τ = 0) represented by red and L + B (τ = 1) by blue. PP estimates of the survival functions $S (t | τ, x^{new}, 𝒟)$ with 95% pointwise credible intervals are given in the left column, and the middle column gives estimates of the PP distributions, $p (q | τ, x^{new}, 𝒟),$ of TTB. Estimated $S (t | τ, x^{new}, 𝒟)$ for L + B are slightly above those for L at all ages, indicating a small overall improvement in PFS with L + B. In contrast, estimated $p (q | τ, x^{new}, 𝒟)$ for L + B have much longer and thicker right tails, indicating an increased risk of toxicity events with L + B compared to L. From the figures, effects of Age on PFS and TTB are small. Additional comparisons of the PP distributions of PFS (t) and TTB (q) for treatments L and L + B for more values of $x^{new}$ are given in panels (b) and (c) of Supplementary Figures 1 and 2. These give PP point estimates of t and q as functions of $x^{new}$ ⁠. Panel (a) of the figures shows estimated PP probabilities that L + B yields greater PFS than L, and that L + B yields greater TTB than L. The figures also indicate that L + B tends to yield better PFS than L, while L + B is more likely to be associated with higher TTB. Swanson and Lin (1994) also noted that older patients are more likely to respond to hormone therapies such as L + B or L. Our inferences for PFS and TTB considered individually agree with the findings reported by Dickler et al. (2016).

FIGURE 3

[Breast Cancer Trial Data] Estimated posterior predictive (PP) survival functions $S (t | 𝒟)$ where $𝒟$ denotes data, are given in panels (a), (d) and (g); estimates of PP cumulative distributions of total toxicity burden (TTB, Q) in panels (b), (e) and (h); and PP distribution estimates of total utility $U_{tot}$ in (c), (f) and (i). The top, middle and bottom rows are correspond to ages ( $x_{1}^{new}$ ⁠) 55, 65 and 75 years. Covariates $(x_{2}^{new}, x_{3}^{new}) = (0, 0)$ ⁠, which indicate the absence of measurable disease at baseline and the patient's disease free interval prior to trial entry ≤ 24 months, are fixed. In each panel, red and blue represent treatments L (τ = 0) and L + B (τ = 1), respectively. [Colour figure can be viewed at https://dbpia.nl.go.kr]

Open in new tab Download slide

As noted earlier, because treatment comparisons based on TTB and PFS considered separately lead in opposite directions, these results do not provide a clear basis for choosing one treatment over the other, either overall or for individual patients. Considering the outcomes together by using the utility function of treatment and Age provides a useful tool for resolving this problem. The plots in the rightmost column of Figure 3 compare PP distributions of $U_{tot}$ for the two treatments, as functions of Age, indicating that the utility benefit of L + B over L diminishes with increasing Age. In the plot, we assume that $x_{2}^{new} = 0$ and $x_{3}^{new} = 0 .$ Posterior predictive estimates $\hat{Δ} (x^{new}, L, L + B)$ of the probabilities that treatment L + B has greater utility than treatment L are 0.56, 0.54 and 0.46 for 55-, 65- and 75-year-old patients respectively. Thus, in terms of overall utility accounting for both PFS and TTB, decisions based on $\hat{Δ} (x^{new}, L, L + B)$ would be to give L + B to patients with $A g e^{new} < 70,$ but give L to patients with $A g e^{new} \geq$ 70.

Figure 4(a) illustrates $\hat{Δ} (x^{new}, L, L + B)$ on a grid of $A g e^{new} = x_{1}^{new}$ values for the combinations $(x_{2}^{new}, x_{3}^{new}) =$ (0, 0), (0, 1), (1, 0) and (1, 1). Overall, while L + B tends to yield a greater utility for younger patients, L is expected to have a greater utility for older patients. Although PFS is improved by L + B for patients of all ages, increases in TTB with both L + B and age make it less desirable for older patients. For example, $\hat{Δ} (x^{new}, L, L + B) < 0.5$ for a 70-year-old patient, with $x^{new} = (70, 0, 1)$ (dashed line), implying that L is a better treatment option than L + B for this patient. The differences in Δ for varying $(x_{2}^{new}, x_{3}^{new})$ values are small, and the values of $x_{2}$ and $x_{3}$ do not change any decisions significantly. Posterior expected utility estimates ${\hat{\overline{u}}}_{tot} (x^{new}, τ)$ are computed on the age grid for different treatment options, shown in panels (b) and (c) of the figure. The figure in panel (c) shows that the expected utility decreases rapidly with age for L + B, while it increases slightly for L. Thus, Age-specific recommendations in terms of ${\hat{\overline{u}}}_{tot} (x^{new}, τ)$ and $\hat{Δ} (x^{new}, L, L + B)$ are the same for these values of $(x_{2}^{new}, x_{3}^{new})$ ⁠.

[Breast Cancer Trial Data] (a) Estimated posterior predictive (PP) probability Δ^(xnew,L,L+B) that treatment letrozole plus bevacizumab (L + B) has greater utility than treatment letrozole plus placebo (L) for different cases of xnew=(x1new,x2new,x3new), where x1new = age, x2new = the presence of measurable disease at baseline, and x3new = disease free interval prior to trial entry > 24 months for an unobserved patient. Panels (b) and (c) give posterior predictive mean utility estimates u‾^tot(τ,xnew) for treatments τ = L + B and L, respectively, for different values of xnew. [Colour figure can be viewed at https://dbpia.nl.go.kr]

FIGURE 4

[Breast Cancer Trial Data] (a) Estimated posterior predictive (PP) probability $\hat{Δ} (x^{new}, L, L + B)$ that treatment letrozole plus bevacizumab (L + B) has greater utility than treatment letrozole plus placebo (L) for different cases of $x^{new} = (x_{1}^{new}, x_{2}^{new}, x_{3}^{new})$ ⁠, where $x_{1}^{new}$ = age, $x_{2}^{new}$ = the presence of measurable disease at baseline, and $x_{3}^{new}$ = disease free interval prior to trial entry > 24 months for an unobserved patient. Panels (b) and (c) give posterior predictive mean utility estimates ${\hat{\overline{u}}}_{tot} (τ, x^{new})$ for treatments τ = L + B and L, respectively, for different values of $x^{new}$ ⁠. [Colour figure can be viewed at https://dbpia.nl.go.kr]

Open in new tab Download slide

5 SIMULATION STUDY

In this section, we summarize a simulation study to illustrate the performance of the proposed utility-based decision-making procedure. To generate data similar to the breast cancer dataset, we set the number of patients to be n = 350, with three covariates and K = 20 toxicity types each having J = 6 grades. To mimic the covariate distribution in the breast cancer dataset, we randomly drew a sample of size 350 from $(τ_{i}, x_{i, 1}, x_{i, 2})$ with replacement, where τ, $x_{1}$ and $x_{2}$ are a binary treatment indicator, age, and a binary indicator of disease measurability, respectively, so our P = 2.

We simulated patient-specific frailty vectors $s_{i}^{TR}$ for correlation between $y_{i}^{TR} = ({\tilde{t}}_{i}^{TR}, {\tilde{z}}_{i}^{TR})$ within a patient. To illustrate how the BNP model flexibly accommodates complicated relationships between $τ_{i}$ ⁠, $x_{i}$ and $y_{i}$ ⁠, we generated $y_{i}^{TR}$ from a mixture of two regression functions, each having main effects for τ and $x_{p}$ ⁠, and their interaction effects. Specifically, we generated $s_{i}^{TR} \overset{iid}{\sim} N_{K + 1} (0, Ω^{TR})$ ⁠, where $Ω^{TR}$ assumes that variances of $s_{i, k}^{TR}$ are 0.05 and correlations between $s_{i, k}^{TR}$ and $s_{i, k^{'}}^{TR}$ are 0.5 if $k, k^{'} > 0$ and $k \neq k^{'}$ ⁠, and −0.5 if k or $k^{'} = 0$ ⁠. Given $s_{i}^{TR}$ ⁠, we then generated $y_{i}^{TR}$ from the following distribution with probability 0.4,

\begin{matrix} {\tilde{t}}_{i} | x_{i}, s_{i, 0}^{TR} & \overset{indep}{\sim} N (4.0 - 0.5 τ_{i} + 0.5 x_{i, 1} + 0.5 τ_{i} x_{i, 1} + s_{i, 0}^{TR}, 0.6), \\ {\tilde{z}}_{i, k} | x_{i}, s_{i, k}^{TR}, α^{TR} & \overset{indep}{\sim} N (α_{1, k, 0}^{TR} + α_{1, k, 1}^{TR} τ_{i} + α_{1, k, 2}^{TR} x_{i, 1} + α_{1, k, 3}^{TR} τ_{i} x_{i, 1} + s_{i, k}^{TR}, 4), \end{matrix}

where $α_{1, k, 0}^{TR} \overset{iid}{\sim} U (- 4.5, - 4.0)$ ⁠, $α_{1, k, 1}^{TR} \overset{iid}{\sim} U (0.5, 1.0)$ ⁠, $α_{1, k, 2}^{TR} \overset{iid}{\sim} U (0.3, 0.8)$ ⁠, and $α_{1, k, 3}^{TR} \overset{iid}{\sim} U (0, 0.5)$ were simulated for all k. With the remaining probability 0.6, we generated

\begin{matrix} {\tilde{t}}_{i} | x_{i}, s_{i, 0}^{TR} & \overset{indep}{\sim} N (1.5 + 0.8 τ_{i} - 0.5 x_{2, i} + s_{i, 0}^{TR}, 0.6) \\ {\tilde{z}}_{i, k} | x_{i}, s_{i, k}^{TR}, α^{TR} & \overset{indep}{\sim} N (α_{2, k, 0}^{TR} + α_{2, k, 1}^{TR} τ_{i} + α_{2, k, 2}^{TR} x_{i, 2} + s_{i, k}^{TR}, 4), \end{matrix}

where $α_{2, k, 0}^{TR} \overset{iid}{\sim} U (- 4.5, - 4.0)$ ⁠, $α_{2, k, 1}^{TR} \overset{iid}{\sim} U (1.0, 1.5)$ ⁠, and $α_{2, k, 2}^{TR} \overset{iid}{\sim} U (0.5, 1.0)$ were simulated for all k. We simulated censoring times $c_{i} \overset{iid}{\sim} N (4.63, 2)$ ⁠, and let $t_{i}^{o} = \min (t_{i}, c_{i})$ and $δ_{i} = 1 (c_{i} < t_{i})$ ⁠. In the simulated data, 22.86% of $t_{i}$ 's was censored. For ordinal outcomes $z_{i, k}$ ⁠, we simulated toxicity type specific cutoff points $u_{k, j}^{TR}$ ⁠, j = 2, …, J − 1 for each k, and let $z_{i, k} = j$ if $u_{k, j - 1}^{TR} < {\tilde{z}}_{i, k}^{TR} \leq u_{k, j}^{TR}$ ⁠. Details of the simulation setup are given in Supplementary Section 5. The utility function $U_{tot} (t, q, A g e)$ elicited in Section 3 is assumed.

Supplementary Figure 3(a), (b) and (e) illustrate the true total utility $p (U_{tot} | τ, x, θ^{TR})$ ⁠, TTB distribution $p (q | τ, x, θ^{TR}),$ and PFS survival function $S (t | τ, x, θ^{TR})$ (solid lines) in the simulation. The covariate vector x = (65, 0) of a 65-year-old patient with no measurable disease is used for illustration. In each plot, red represents τ = 0 and blue represents τ = 1. From panels (b) and (e), treatment τ = 1 has a greater TTB and greater expected PFS time than treatment τ = 0, which complicates treatment choice. This is resolved by panel (a), which shows that $U_{tot}$ with τ = 1 (blue) is stochastically greater than $U_{tot}$ with τ = 0 (red), implying that τ = 1 is better for a patient with x = (65, 0). Supplementary Figure 4 gives

Δ^{TR} (x, 0, 1) = \Pr (U_{tot} (t (0), q (0), A g e) < U_{tot} (t (1), q (1), A g e) | τ = 0, τ^{'} = 1, x, θ^{TR}),

with the true expected utilities ${\overline{U}}^{TR} (τ, x) = E (U_{tot} | τ, x, θ^{TR})$ shown in dark green symbols for varying Age. For both treatments, the expected utility decreases with Age, shown in panels (a) and (b). The difference in ${\overline{U}}^{TR} (τ, x)$ between the treatments and $Δ^{TR} (x, 0, 1)$ decrease with Age, indicating that the superiority of treatment τ = 1 compared to treatment $τ^{'} = 0$ diminishes with Age.

We specified values of the fixed hyperparameters similar to those in Section 4, and ran the MCMC simulation as described in Section 2.2. Posterior inferences are summarized in Supplementary Figure 3. The posterior predictive distributions, $p (q | τ, x^{new}, 𝒟)$ and $S (t | τ, x^{new}, 𝒟)$ with $x^{new} = (65, 0)$ ⁠, are shown in Supplementary Figure 3c–e, respectively, where τ = 0 and 1 are in red and blue respectively. In panel (e), the dashed lines represent the posterior mean estimates with 95% pointwise credible intervals in the shaded areas. Comparing the estimates in panels (d) and (e) to the truth under in panels (b), and (e) shows that the flexible BNP regression model captures the simulation truth reasonably well, which provides a good basis for accurate statistical decision making. For example, the posterior predictive distribution of $U_{tot}$ in panel (c) that provides a comprehensive criterion for treatment comparison, and is close to the truth in panel (a).

Supplementary Figure 4 illustrates posterior estimates $\hat{Δ} (x, 0, 1)$ of the probabilities that τ = 1 has greater utility than τ = 0, and compares posterior estimates of expected utilities $\hat{\overline{u}} (τ, x^{new})$ for each of τ = 0 and 1, the varying $x^{new} = (x_{1}^{new}, x_{2}^{new})$ ⁠. For $x_{1}^{new}$ = Age, a grid from 40 to 80 years in 5-year increments was used, and values of $x_{2}^{new} \in {0, 1}$ are indicated by the symbols, + and ×, respectively, with true values given in dark green. The model recovers the simulation truth reasonably well, and the decision-making procedure based on Δ(x, 0, 1) in Section 3, selects the truly superior treatment for all cases of $x^{new}$ ⁠. From panels (b) and (c), we observe discrepancies between ${\hat{\overline{u}}}_{tot} (τ, x^{new})$ and $U_{tot}^{TR} (τ, x^{new})$ for small $x_{1}$ (Age). However, the ranks of ${\hat{\overline{u}}}_{tot} (τ, x^{new})$ between the treatments are well estimated and the procedure of choosing τ to maximize ${\hat{\overline{u}}}_{tot} (τ, x^{new})$ reliably selects the treatments with truly greater utility.

We simulated 100 datasets to further examine the performance of the proposed decision-making procedures. The results are summarized in Supplementary Figure 5. Panels (a) and (d) of the figure show the distributions of $\hat{Δ} (x^{new}, 0, 1)$ over the 100 datasets, for $x_{2}^{new} = 0$ and 1, respectively, where the symbols + and × represent $Δ^{TR} (x^{new}, 0, 1)$ with $x_{2}^{new} = 0$ and 1 respectively. In most cases, the proposed decision-making procedure based on Δ(x, 0, 1) produces the correct decisions. Panels (b), (c), (e) and (f) illustrate the distributions of predictive mean utility estimates, ${\hat{\overline{u}}}_{tot} (x^{new}, τ)$ ⁠, using the 100 datasets for $(x_{1}, x_{3}) = (0, 0), (1, 0)$ ⁠, (0, 1) and (1, 1) respectively. The proposed model produces reasonably good estimates of the expected utilities, and the ordering of ${\overline{u}}_{tot} (x^{new}, τ)$ are also well estimated overall.

6 DISCUSSION

We have presented a formal decision-making framework based on utility functions to address the goal of statistical decision making based on data from a randomized clinical trial. The key elements of our methodology are a multivariate Bayesian regression model and a utility function of outcomes and covariates. We assumed a DDP, which is a general, flexible family of Bayesian regression models.

The methodology was illustrated with a breast cancer dataset from a randomized clinical trial. This required close collaboration with oncologists to elicit a utility function that reflected this clinical setting, and the resulting utility function varied with Age to reflect different risk-benefit trade-offs between PFS and TTB for older versus younger patients. Our application illustrates that, by establishing a utility function that quantifies the risk-benefit trade-off between two competing outcomes, one can derive a rational basis for making treatment choices in settings where simpler comparisons in terms of individual outcomes give different, contradictory choices.

The particular form of our utility function was tailored to the breast cancer setting, and may not be appropriate in other clinical settings. In general, the requirements are that U(y,x) must be consistent in the arguments of y so that it makes sense, and that it be tractable enough to facilitate application. Beyond that, a utility should be constructed so that it provides a sensible basis for quantifying trade-offs between different outcomes. It also should be kept in mind that, if U is constructed to be a function of y but not x, both the posterior mean utility $\overline{u} (τ, x)$ and $Δ (x, τ, τ^{'})$ still will vary with x due to the regression structure of f( y | τ, x, θ), and thus both criteria still will serve as a basis for making personalized treatment decisions.

Our framework may be generalized to accommodate more complex clinical settings, such as meta-analysis of multiple studies, optimization of a multi-stage treatment strategies or non-medical applications. Although the key elements will remain the same, such applications may be complex and will require tailoring to particular settings.

ACKNOWLEDGEMENTS

Juhee Lee's research was supported by NSF grant DMS-1662427. Pavlos Msaouel is supported by a Career Development Award by the American Society of Clinical Oncology, a Research Award by KCCure, the MD Anderson Khalifa Scholar Award and the MD Anderson Physician-Scientist Award. Peter Thall's research was supported by NIH/NCI grants P01 2P30CA016672 and R01 CA261978.

This manuscript was prepared using data from Datasets NCT00601900-D1 and NCT00601900-D2 from the NCTN Data Archive of the National Cancer Institute's (NCI's) National Clinical Trials Network (NCTN). Data were originally collected from clinical trial NCT number NCT00601900, Endocrine Therapy With or Without Anti-VEGF Therapy: A Randomized, Phase III Trial of Endocrine Therapy Alone or Endocrine Therapy Plus Bevacizumab (NSC 704865) for Women With Hormone Receptor-Positive Advanced Breast Cancer. All analyses and conclusions in this manuscript are the sole responsibility of the authors and do not necessarily reflect the opinions or views of the clinical trial investigators, the NCTN, or the NCI. The CALGB-40503 dataset was made available through an NCI-Genentech agreement under which Genentech supplied the drug to support the study.

REFERENCES

Bagočius

,

V.

,

Zavadskas

,

E.K.

&

Turskis

,

Z.

(

2014

)

Multi-person selection of the best wind turbine based on the multi-criteria integrated additive-multiplicative utility function

.

Journal of Civil Engineering and Management

,

20

(

4

),

590

–

599

.

Google Scholar

Crossref

WorldCat

Bekele

,

B.

&

Thall

,

P.

(

2004

)

Dose-finding based on multiple toxicities in a soft tissue sarcoma trial

.

Journal of the American Statistical Association

,

99

,

71

–

84

.

Google Scholar

Crossref

WorldCat

Chen

,

W.

,

Wiecek

,

M.M.

&

Zhang

,

J.

(

1998

)

Quality utility: a compromise programming approach to robust design

. In International design engineering technical conferences and computers and information in engineering conference, volume 80326, American Society of Mechanical Engineers. pp. V002T02A032.

Chib

,

S.

&

Greenberg

,

E.

(

1998

)

Analysis of multivariate probit models

.

Biometrika

,

85

,

347

–

361

.

Google Scholar

Crossref

WorldCat

De Iorio

,

M.

,

Müller

,

P.

,

Rosner

,

G.L.

&

MacEachern

,

S.N.

(

2004

)

An ANOVA model for dependent random measures

.

Journal of the American Statistical Association

,

99

(

465

),

205

–

215

.

Google Scholar

Crossref

WorldCat

De Iorio

,

M.

,

Johnson

,

W.O.

,

Müller

,

P.

&

Rosner

,

G.L.

(

2009

)

Bayesian nonparametric nonproportional hazards survival modeling

.

Biometrics

,

65

(

3

),

762

–

771

.

Detsky

,

A.S.

,

Naglie

,

G.

,

Krahn

,

M.D.

,

Naimark

,

D.

&

Redelmeier

,

D.A.

(

1997

)

Primer on medical decision analysis: part 1—Getting started

.

Medical Decision Making

,

17

(

2

),

123

–

125

.

Dickler

,

M.N.

,

Barry

,

W.T.

,

Cirrincione

,

C.T.

,

Ellis

,

M.J.

,

Moynahan

,

M.E.

,

Innocenti

,

F.

et al. (

2016

)

Phase III trial evaluating letrozole as first-line endocrine therapy with or without bevacizumab for the treatment of postmenopausal women with hormone receptor-positive advanced-stage breast cancer: CALGB 40503 (Alliance)

.

Journal of Clinical Oncology

,

34

(

22

),

2602

.

Ishwaran

,

H.

&

James

,

L.F.

(

2001

)

Gibbs sampling methods for stick-breaking priors

.

Journal of the American Statistical Association

,

96

(

453

),

161

–

173

.

Google Scholar

Crossref

WorldCat

Le-Rademacher

,

J.

,

Hillman

,

S.

,

Storrick

,

E.

,

Mahoney

,

M.

,

Thall

,

P.

,

Jatoi

,

A.

et al. (

2020

)

Adverse event burden score a versatile summary measure in cancer clinical trials

.

Cancers

,

12

,

3256

.

Loewenstein

,

G.F.

,

Thompson

,

L.

&

Bazerman

,

M.H.

(

1989

)

Social utility and decision making in interpersonal contexts

.

Journal of Personality and Social psychology

,

57

(

3

),

426

.

Google Scholar

Crossref

WorldCat

MacEachern

,

S.N.

(

1999

) Dependent nonparametric processes. In

ASA proceedings of the section on Bayesian statistical science

, volume 1.

Alexandria, VA

:

American Statistical Association

, pp.

50

–

55

.

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Mitra

,

R.

&

Müller

,

P.

(

2015

)

Nonparametric Bayesian inference in biostatistics

.

Berlin

:

Springer-Verlag

.

Müller

,

P.

&

Mitra

,

R.

(

2013

)

Bayesian nonparametric inference why and how

.

Bayesian Analysis

,

8

(

2

),

269

–

302

.

Google Scholar

OpenURL Placeholder Text

WorldCat

Müller

,

P.

,

Quintana

,

A.

,

Jara

,

A.

&

Hanson

,

T.

(

2015

)

Bayesian nonparametric data analysis

.

Berlin

:

Springer-Verlag

.

Naglie

,

G.

,

Krahn

,

M.D.

,

Naimark

,

D.

,

Redelmeier

,

D.A.

&

Detsky

,

A.S.

(

1997

)

Primer on medical decision analysis: part 3 estimating probabilities and utilities

.

Medical Decision Making

,

17

(

2

),

136

–

141

.

Pennings

,

J.M.

&

Smidts

,

A.

(

2003

)

The shape of utility functions and organizational behavior

.

Management Science

,

49

(

9

),

1251

–

1263

.

Google Scholar

Crossref

WorldCat

Rodriguez

,

A.

&

Dunson

,

D.B.

(

2011

)

Nonparametric Bayesian models through probit stick-breaking processes

.

Bayesian Analysis

,

6

(

1

),

145

–

177

.

Google Scholar

OpenURL Placeholder Text

WorldCat

Roy

,

S.K.

,

Maity

,

G.

&

Weber

,

G.-W.

(

2017

)

Multi-objective two-stage grey transportation problem using utility function with goals

.

Central European Journal of Operations Research

,

25

(

2

),

417

–

439

.

Google Scholar

Crossref

WorldCat

Sethuraman

,

J.

(

1994

)

A constructive definition of Dirichlet priors

.

Statistica Sinica

,

4

(

2

),

639

–

650

.

Google Scholar

OpenURL Placeholder Text

WorldCat

Swanson

,

G.M.

&

Lin

,

C.-S.

(

1994

)

Survival patterns among younger women with breast cancer: the effects of age, race, stage, and treatment

.

Journal of the National Cancer Institute. Monographs

, (

16

),

69

–

77

.

Google Scholar

OpenURL Placeholder Text

WorldCat

Thall

,

P.

,

Müller

,

P.

,

Xu

,

Y.

&

Guidani

,

M.

(

2017

)

Bayesian nonparametric statistics: a new toolkit for discovery in cancer research

.

Pharmaceutical Statistics

,

16

,

414

–

423

.

Walsh

,

W.E.

,

Tesauro

,

G.

,

Kephart

,

J.O.

&

Das

,

R.

(

2004

) Utility functions in autonomic systems. In

International conference on autonomic computing, 2004. Proceedings

.

IEEE

, pp.

70

–

77

. Available from: https://www.computer.org/csdl/proceedings-article/icac/2004/21140070/12OmNzahc1L [Accessed 5th September 2022].

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Download all slides

Month:	Total Views:
March 2023	21
April 2023	20
May 2023	9
June 2023	5
July 2023	9
August 2023	6
October 2023	2
November 2023	23
December 2023	29
January 2024	33
February 2024	8
March 2024	17
April 2024	34
May 2024	15
June 2024	10
July 2024	15
August 2024	23
September 2024	13
October 2024	11
November 2024	30
December 2024	8
January 2025	28
February 2025	4
March 2025	14
April 2025	8
May 2025	12

Article Contents

Utility-Based Bayesian Personalized Treatment Selection for Advanced Breast Cancer

Abstract

1 INTRODUCTION

2 A BNR MODEL

2.1 Sampling distribution and prior specification

2.2 Posterior inference

3 UTILITY FUNCTIONS FOR PFS AND TTB

4 DECISION MAKING FOR THE BREAST CANCER DATA

5 SIMULATION STUDY

6 DISCUSSION

ACKNOWLEDGEMENTS

REFERENCES

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Article Contents

Utility-Based Bayesian Personalized Treatment Selection for Advanced Breast Cancer

Abstract

1 INTRODUCTION

2 A BNR MODEL

2.1 Sampling distribution and prior specification

2.2 Posterior inference

3 UTILITY FUNCTIONS FOR PFS AND TTB

4 DECISION MAKING FOR THE BREAST CANCER DATA

5 SIMULATION STUDY

6 DISCUSSION

ACKNOWLEDGEMENTS

REFERENCES

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only