-
PDF
- Split View
-
Views
-
Cite
Cite
José L. Jiménez, Mourad Tighiouart, Combining Cytotoxic Agents with Continuous Dose Levels in Seamless Phase I-II Clinical Trials, Journal of the Royal Statistical Society Series C: Applied Statistics, Volume 71, Issue 5, November 2022, Pages 1996–2013, https://doi.org/10.1111/rssc.12598
- Share Icon Share
Abstract
Phase I-II cancer clinical trial designs are intended to accelerate drug development. In cases where efficacy cannot be ascertained in a short period of time, it is common to divide the study in two stages: (i) a first stage in which dose is escalated based only on toxicity data and we look for the maximum tolerated dose (MTD) set and (ii) a second stage in which we search for the most efficacious dose within the MTD set. Current available approaches in the area of continuous dose levels involve fixing the MTD after stage I and discarding all collected stage I efficacy data. However, this methodology is clearly inefficient when there is a unique patient population present across stages. In this article, we propose a two-stage design for the combination of two cytotoxic agents assuming a single patient population across the entire study. In stage I, conditional escalation with overdose control is used to allocate successive cohorts of patients. In stage II, we employ an adaptive randomisation approach to allocate patients to drug combinations along the estimated MTD curve, which is constantly updated. The proposed methodology is assessed with extensive simulations in the context of a real case study.
1 INTRODUCTION
Integrated phase I-II clinical trial designs are efficient approaches to accelerate drug development. This seamless process uses a single protocol to investigate two aspects of the development of a compound, namely safety and efficacy.
With the use of chemotherapy compounds, a dose limiting toxicity (DLT) is generally ascertained after one cycle of therapy for the purpose of estimating the maximum tolerated dose (MTD). If the efficacy outcome can be ascertained in a short period of time (e.g., after one or two cycles of therapy), it is common to employ one-stage designs where the joint probability of toxicity and efficacy is sequentially updated after each cohort of patients. Extensive methodology to carry out such designs can be found in (Braun, 2002; Chen et al., 2015; Ivanova, 2003; Sato et al., 2016; Thall & Cook, 2004; Thall & Russell, 1998; Yuan & Yin, 2009) for single agent trials and in (Huang et al., 2007; Yin et al., 2006) for drug combinations. In contrast, when efficacy cannot be ascertained in a short period of time, we may employ two-stage designs where a set of MTD combinations are first selected, and then tested for efficacy in a subsequent second stage (see e.g., Chen et al., 2012; Jiménez et al., 2020; Le Tourneau et al., 2009; Rogatko et al., 2008; Shimamura et al., 2018; Tighiouart, 2019; Yuan & Yin, 2011; Zhang & Yuan, 2016).
This work is motivated by the cisplatin-cabazitaxel clinical trial design proposed by Tighiouart (2019). In that phase I-II study design, stage I was inspired by a previous phase I trial published by Lockhart et al. (2014) in patients with advanced solid tumours, where a single MTD of cisplatin/cabazitaxel 15/75 mg/m was recommended. However, based on their results and other preliminary efficacy data, it was hypothesised by the clinical team that there could be a series of dose combinations which could be tolerable and efficacious in prostate cancer. In stage II of Tighiouart (2019), the design enrols 30 additional patients to identify dose combinations with high probability of efficacy, along the MTD curve estimated from the first stage data.
A limitation of the original cisplatin-cabazitaxel design by Tighiouart (2019) is that uncertainty about the estimated MTD curve is not taken into account in stage II. Moreover, efficacy data from stage I were not used to model the dose–efficacy relationship in stage II. The rational for this disconnection was that stage I and stage II populations were not exactly the same, and therefore it was hypothesised that the dose–efficacy profiles could also be different. Recently, Jiménez and Zheng (2021) proposed a method to robustly combine efficacy data across stages in this particular setting. In practice however, it is not uncommon to design phase I-II clinical trials with the same patient population across stages. In such a situation, it would be more sensible to use all the toxicity and efficacy data collected from stage I as well as update the MTD curve during stage II. To the best of our knowledge, such a design does not exist in the phase I-II clinical trial designs literature with combination of cytotoxic agents and continuous dose levels.
In this article, we extend the work of Tighiouart (2019) by allowing the use of all safety and efficacy data collected throughout the entire trial, which implies that the MTD curve is sequentially updated during stage II. Since we assume that efficacy cannot be ascertained in a short period, the design proceeds in two stages. The design's operating characteristics are evaluated by extensive simulations in the context of the cisplatin-cabazitaxel clinical trial.
The rest of the article is organised as follows. In Section 2 we describe the clinical trial that motivates this research. In Section 3 we introduce the dose-toxicity and dose–efficacy models as well as the study design. In Section 4 we present the operating characteristics under several scenarios for the true dose-toxicity and dose–efficacy relationships. Last, in Section 5, we discuss the major contributions of this manuscript together with potential areas where there may be room for improvement.
2 THE CISPLATIN-CABAZITAXEL TRIAL
The two-stage phase I-II cisplatin-cabazitaxel trial design proposed by Tighiouart (2019) investigates the combination of cisplatin and cabazitaxel, given every 3 weeks, in patients with metastatic, castration resistant prostate cancer. A continuous set of doses that range from 10 to 25 mg/m for cabazitaxel and from 50 to 100 mg/m for cisplatin were administered intravenously. In stage I, the study enrols 30 patients using conditional escalation with overdose control (EWOC) algorithm (see Tighiouart et al., 2017) to estimate the MTD curve. In stage II, the study enrols another 30 patients from the same population of patients but with visceral metastasis to identify dose combinations with high probability of efficacy along the MTD curve estimated at the end of stage I. These patients are allocated to dose combinations along the MTD curve using a Bayesian adaptive design after modeling the dose–efficacy curve with cubic splines.
The first stage of the cisplatin-cabazitaxel trial was motivated by the phase I trial published by Lockhart et al. (2014), where the combinations of cisplatin/cabazitaxel 15/75, 20/75 and 25/75 mg/m were explored in patients with advanced solid tumours, using a ‘3+3’ design. The recommended MTD was 15/75 mg/m on the basis of data from nine patients who can be evaluated in phase I and 15 patients in the expansion cohort, although only two patients out of 18 treated at the recommended MTD had DLT. Based on this low toxicity rate and preliminary efficacy data, the investigators hypothesised that a series of tolerable dose combinations which could be efficacious exist in prostate cancer.
The original cisplatin-cabazitaxel trial design was tailored for a situation with different patient populations across stages. As previously mentioned, it is not uncommon to have the same patient population across the entire study. Under this assumption, having two disconnected stages would not be sensible since the MTD could be constantly updated and the efficacy data collected in stage I could be used in stage II. Therefore, we propose an extension of the original cisplatin-cabazitaxel trial design to a situation where we have either the same patient population across the two stages or very similar populations with respect to their susceptibility to toxicities and clinical benefit, the primary response endpoint in the cisplatin-cabazitaxel trial.
3 METHOD
3.1 Formulation of the problem
Let be the lower and upper bound of the continuous dose levels , of the compounds and , respectively. The doses are standardised to fall within the interval [0,1] using the transformations and , so that and .
Let be the binary indicator of DLT where represents the presence of a DLT after a pre-defined number of treatment cycles, and otherwise. Let be the indicator of treatment response where represents a positive response after a pre-defined number of treatment cycles, and otherwise. Following Tighiouart (2019), let be the target probability of DLT and be the probability of efficacy of the standard of care treatment. The goal of this phase I-II trial is to identify dose combinations with probability of DLT and probability of efficacy . When employing synergistic cytotoxic agents, it is common to assume that both the dose-toxicity and dose–efficacy relationships are monotonically increasing functions. This implies that the optimal dose will lie in the MTD set, defined as .
3.2 Definition of the marginal dose-toxicity and dose–efficacy models
Following the work of (Cai et al., 2014; Ivanova et al., 2009; Lyu et al., 2019), we assume that the binary outcomes of toxicity and efficacy are independent. Let the marginal probability of DLT and marginal probability of efficacy models be defined as
and
respectively, with , and where represents the cumulative distribution function of the standard normal distribution. Note that in (1), a DLT is assumed to be attributed to either drug or , or both, that is, we do not take into account toxicity attributions (see Jimenez et al., 2019) because the number of attributable DLTs is expected to be very low given the cytotoxic nature of cisplatin and cabazitaxel.
In (2), we added quadratic terms to allow more flexibility in the marginal efficacy model . The rationale for this is that the dose–efficacy profiles along the MTD curve can become quite complex (see Figure 2) when having synergistic compounds (i.e., ) even if we only consider linear terms in both marginal models. If we also account for the uncertainty in the MTD curve estimation and the relatively small sample size available, in the development of this research we have found that a marginal efficacy model with only linear terms was, in certain situations, not able to capture the dose–efficacy profile along the MTD curve. In our simulation scenarios, the optimal dose combination is guaranteed to be on the MTD curve because the underlying true dose–efficacy relationship (i.e., the data-generating model) is a monotonically increasing function. However, in a setting with dose-toxicity profiles without drug–drug interactions (i.e., ), this additional flexibility would not be necessary and a marginal efficacy model with only linear terms would be sufficient to capture any dose–efficacy profile along the MTD curve.
Following Tighiouart et al. (2017), we re-parameterise the marginal probability of DLT defined in Equation (1) in terms of parameters that clinicians can easily interpret. Let denote the probability of DLT when the levels of agents and , with , and , so that , , and . It then follows that the MTD set takes the form
3.3 Likelihood and posterior distributions
We assume that and are independent a priori with , , and conditional on , . Also, let the interaction parameter . Let be the data collected after patients have been enroled. The posterior distribution of the dose–toxicity model parameters is
where
In the dose–efficacy model, we do not incorporate any prior knowledge. Thus, we assume that , and that are independent a priori. The posterior distribution of the dose–efficacy model parameters is
All Bayesian computations are done using R (see R Core Team, 2019) and JAGS (see Plummer, 2003).
3.4 Study design
Stage I will enrol a total of patients, where represents the total number of cohorts in phase I with equal number of patients . Stage II will enrol a total of patients. Let be the total number of patients that the entire study will enrol and be the estimated MTD set based on data from all patients. At the end of stage II, we test the following null and alternative hypotheses
and we reject the null hypothesis if
where is a pre-specified design parameter. Moreover, the dose combination
is recommended as the optimal dose combination and is selected for further phase II or III studies.
Stage I is based on the EWOC principle (see Babb et al., 1998; Shi & Yin, 2013; Tighiouart et al., 2005, 2017; Tighiouart & Rogatko, 2010; Tighiouart & Rogatko, 2012) where the posterior probability of overdosing the next cohort of patients is bounded by a feasibility bound . In a cohort with two patients, the first one would receive a new dose of compound given that the dose of compound that was previously assigned. The other patient would receive a new dose of compound given that dose of compound was previously assigned. These steps are described in Stage I of Algorithm 1. Using EWOC, these new doses are at the percentile of the conditional posterior distribution of the MTD combinations. The feasibility bound increases from 0.25 up to 0.5 in increments of 0.05 (see Wheeler et al., 2017). Accrual continues until the maximum sample size in stage I is reached or the trial is stopped early for safety.
Stage II follows the response-adaptive randomisation principle. This type of Monte Carlo algorithm uses the current parameter estimates to sample a cohort of dose combinations from the estimated dose–efficacy standardised density of along the estimated MTD curve. Note that uses the Bayes estimates of the dose–efficacy model parameters. Because stage II selects doses on the estimated MTD curve , and there is a one-to-one correspondence between and , where , we may write for . The standardised density of the estimated efficacy curve is . A rejection sampling algorithm is then used to sample dose combinations from this density.
These steps are described in Stage II of Algorithm 1. Note that in this stage, the MTD curve is updated after the DLT status of each cohort of patients is resolved and a new cohort of patients is randomly selected from the new MTD curve according to the standardised density .
Another contribution of this article with respect to (Jiménez et al., 2020; Jiménez & Zheng, 2021; Tighiouart, 2019) is that we do not require the first cohort of stage II to be homogeneously distributed along the estimated MTD curve. The justification is that, in this article, we do not have two disconnected stages and thus we use all the available stage I efficacy data to already have posterior estimates of at the end of stage I as displayed in line 13 of Algorithm 1. This allows to allocate the first cohort of patients in stage II to dose combination following the adaptive randomisation principle.
To facilitate the understanding of Algorithm 1, we define the variables that will be used. In stage I, denotes the current cohort indicator with each cohort containing two patients. For example, (i.e., the second cohort) contains the dose combinations from patients 3 and 4. In a generalised way, one cohort contains patients and and so, if , we know that the second cohort contains the patients and . Now let and denote the dose combinations to which the two patients from cohort were (or will be) allocated. Using the same example, if , we see that and correspond to the dose combinations from patients 3 and 4. Also, let represent the posterior distribution of the MTD of drug given that the level of drug is equal to (i.e., given that is fixed) and given the data (see equation (3) for the definition of the MTD). Last, let denote the percentile of .

Algorithm 1. Stage I and stage II algorithms
Stage II can also be viewed as an extension of a response-adaptive randomisation procedure with a finite number of doses (see Berry et al., 2010) to a response-adaptive randomisation procedure with an infinite number of dose combinations. Mind that one could easily adapt Algorithm 1 to have a single stage by starting directly with the adaptive randomisation part of the design (i.e., stage II). The dose finding algorithm contains the following stopping rules for safety and futility:
- •
Futility stopping rule
For ethical considerations and to avoid exposing patients to sub-therapeutic dose combinations, we would stop the trial for futility ifwhere is a pre-specified threshold. For the purpose of this article, we choose . Mind that this stopping rule applies only after the initial cohort of patients in stage II. - •
Safety stopping rule
The design contains two stopping rules for safety, one for stage I and a different one for stage II. During stage I, we would stop the trial ifwhere . In contrast, during stage II we would stop the trial if(7)where represents the rate of DLTs for both stages of the design regardless of dose and represents the confidence level (i.e., 70%) that a prospective trial results in an excessive DLT rate. A non-informative Jeffrey's prior Beta is placed on the parameter . Mind that Equation (8) does not depend on a pre-specified dose combination like Equation (7). The rational behind this choice is that it would be sub-optimal to condition the stage II safety stopping rule on a pre-specified dose combination, even if selected at the end of stage I, given that, during stage II, the MTD curve is updated after each new cohort of patients.(8)
4 SIMULATION STUDY
4.1 Operating characteristics
Stages I and II are evaluated through the following operating characteristics:
- (a)
Pointwise average bias of the estimated MTD curve:
Following Tighiouart et al. (2017), let be the true MTD curve of a pre-specified scenario and be the estimated MTD set at the jth simulated trial, . The pointwise average bias when estimating the MTD is defined as follows. For every point , letwhere is such that . This is the minimum relative distance of the point of to . The pointwise average bias is thus calculated as .(9) - (b)
Percentage of correct MTD curve recommendation:
Following Tighiouart et al. (2017), we define the percentage of correct MTD recommendation as the pointwise percentage of trial for which the minimum distance of the point on to is no more than of the true MTD. It is calculated using the definition of the minimum relative distance given by Equation (9) and it is defined aswhere , and represents the euclidean distance between the minimum dose combination and the point on the MTD curve. Intuitively, this metric is equivalent to drawing a circle with radius and calculating the percentage of trials with falling inside the circle, where is a tolerance parameter. - (c)
Average DLT rate and percentage of trials with DLT rate higher than .
- (d)
Distribution of recommended optimal dose combinations.
- (e)
Proportion of recommended optimal dose combinations with true probability of efficacy above .
- (f)Bayesian power (type-I error probability under ):where ‘’ represents an indicator function.
For scenarios favouring the null hypothesis as defined in (4), the above formula represents the Bayesian type-I error probability.
- (g)
Proportion of patients in stage II allocated to dose combinations with true probability of efficacy above .
- (h)
Average posterior probability of early stopping for futility and safety.
4.2 Dose-toxicity and dose–efficacy scenarios
Following Tighiouart (2019), we use the two dose–toxicity scenarios that were expected by the principal investigator of the cisplatin-cabazitaxel trial, which are displayed in Figure 1. To generate dose-toxicity scenarios 1 and 2, we have used the parameter values and , respectively. The target probability of DLT is .

True dose-toxicity profiles. The black dot in the dose combination of cisplatin and cabazitaxel 15/75 represents the maximum tolerated dose found by Lockhart et al. (2014) using a ‘3+3’ design.
Within each of these dose–toxicity scenarios, we study four different dose–efficacy profiles, both under the alternative () and the null () hypotheses, that place the most efficacious dose combination in a different region of the MTD curve. These profiles are obtained using model (2) for in the true MTD curve as defined by dose–toxicity scenarios 1 and 2. These are displayed in Figure 2. The parameter values used to generate the dose–efficacy profiles under and are displayed in Table 1. It is important to mention that all the dose–efficacy scenarios have been generated with the parameters (i.e., using monotonic dose–efficacy profiles) since we are combining two cytotoxic agents. These two parameters are incorporated for the sole purpose of adding some flexibility since, as mentioned in Section 3.2, in the development of this article we have found that, because of the assumed synergistic effect of the compounds in terms of the dose–toxicity profile, an efficacy model with only linear terms was not able to fully capture the dose–efficacy profile in one of the dose–efficacy scenarios we present (scenario 3).
![True dose–efficacy profiles under the null and alternative hypotheses varying with the dose of Cisplatin along the true maximum tolerated dose curves. The red dotted line represents θE and the green dotted line represent the highest probability of efficacy. [Colour figure can be viewed at https://dbpia.nl.go.kr]](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/jrsssc/71/5/10.1111_rssc.12598/1/m_rssc12598_fig-2.jpeg?Expires=1750376029&Signature=Q5yuRZMhRpRw1YNpF798FWUWSEk7Vzkf~FplUQbySBawFv8lRIxxO01SaRasEyQCXljFIGAIaijthiRMfEtW~Le9TsEPqG2RYdn0-24AKrpQ-HdgwmcP9GthktvjykrHD4yE78E5KYIpghidcuIhAYdNBgNWswcAMrgSupzguOKqMQBwUC3t8yUzfDQsLGO6tB5o-h~f6wWIE8aL3oHisK2KhhIWo5LmISHuZUfFqUIPrpdPD6ZTkOKOV-4xp7rgrgsfUaWb5cZ4FwonW3JxKgtQzNuRNu-AHV5jtRh9MDWgdxwcR6Ijhh3B7aMkQfSx4v9wCrG65DOpUELBBzPzXw__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
True dose–efficacy profiles under the null and alternative hypotheses varying with the dose of Cisplatin along the true maximum tolerated dose curves. The red dotted line represents and the green dotted line represent the highest probability of efficacy. [Colour figure can be viewed at https://dbpia.nl.go.kr]
Dose–efficacy parameters values used to generate the dose–efficacy profiles under and presented in Figure 2
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
2 | 4.3 | 6.17 | 1.25 | 0.05 | 1.55 | 4.63 | 0.2 | |
4.3 | 2 | 5.5 | 1.25 | 1.57 | 0.05 | 4.73 | 0.2 | |
10 | 10 | 0 | 12 | 1 | 1 | 0 | 26 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
2 | 4.3 | 6.17 | 1.25 | 0.05 | 1.55 | 4.63 | 0.2 | |
4.3 | 2 | 5.5 | 1.25 | 1.57 | 0.05 | 4.73 | 0.2 | |
10 | 10 | 0 | 12 | 1 | 1 | 0 | 26 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Notes: The parameters and belong to the null and alternative hypotheses, respectively. The parameters are the same under the null and alternative hypotheses.
Dose–efficacy parameters values used to generate the dose–efficacy profiles under and presented in Figure 2
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
2 | 4.3 | 6.17 | 1.25 | 0.05 | 1.55 | 4.63 | 0.2 | |
4.3 | 2 | 5.5 | 1.25 | 1.57 | 0.05 | 4.73 | 0.2 | |
10 | 10 | 0 | 12 | 1 | 1 | 0 | 26 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
2 | 4.3 | 6.17 | 1.25 | 0.05 | 1.55 | 4.63 | 0.2 | |
4.3 | 2 | 5.5 | 1.25 | 1.57 | 0.05 | 4.73 | 0.2 | |
10 | 10 | 0 | 12 | 1 | 1 | 0 | 26 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Notes: The parameters and belong to the null and alternative hypotheses, respectively. The parameters are the same under the null and alternative hypotheses.
The probability of response (clinical benefit) for the standard of care treatment is , and the cohort size in stage II is . Under , we assume an effect size of 0.25 (i.e., in all dose–efficacy profiles, the highest probability of efficacy is equal to ). Under , the highest probability of efficacy is equal to . The sample sizes for Stages I and II are and we simulated trials using Algorithm 1. The DLT and efficacy responses were generated from models (1) and (2), respectively.
4.2.1 Results
In Figure 3, we present the pointwise average bias of the estimated MTD curve at the end of the trial with respect to the true MTD curve under the null (dashed line) and alternative (solid line) hypotheses, respectively. In line with the results presented by (Jiménez et al., 2020; Tighiouart et al., 2017), the design yields a good estimation of the MTD curve, specially around its central region whereas this estimation is less precise at the extremes of the MTD curve. However, because of the use of all toxicity data from stages I and II, the average bias we observe in the MTD curve estimation is, overall, lower than the values reported in the supplementary material of Jiménez et al. (2020) with also vague prior distributions. In Figures S1 and S2-S3, we display the estimated MTD curves and the percentage of correct MTD recommendation under the null and alternative hypotheses, respectively. In most scenarios, the pointwise percent selection varies between and except near the edges of the true MTD curve using a threshold value of and between and using a tolerance threshold . These results are consistent with the conclusions we draw from the pointwise average bias.

Pointwise average bias of the estimated maximum tolerated dose (MTD )curve with respect to the true MTD curve under the null (dashed line) and alternative (solid line) hypotheses, respectively. Results are displayed from the point of view of the standardised Cisplatin.
In Table 2 we present the safety operating characteristics and we observe that, on average, the DLT rate is between 29% and 33% and the percentage of trials with proportion of DLT above ranges between 0% and 0.2%. Therefore, the design is considered safe.
Average dose limiting toxicity (DLT) rate and percentage of trials with DLT above
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
Average DLT rate | 0.333 | 0.337 | 0.333 | 0.336 | 0.289 | 0.304 | 0.299 | 0.330 |
Average DLT rate | 0.336 | 0.339 | 0.337 | 0.337 | 0.289 | 0.308 | 0.300 | 0.332 |
Percentage (%) of trial with DLT rate above | 0.0 | 0.1 | 0.2 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 |
Percentage (%) of trial with DLT rate above | 0.1 | 0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 |
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
Average DLT rate | 0.333 | 0.337 | 0.333 | 0.336 | 0.289 | 0.304 | 0.299 | 0.330 |
Average DLT rate | 0.336 | 0.339 | 0.337 | 0.337 | 0.289 | 0.308 | 0.300 | 0.332 |
Percentage (%) of trial with DLT rate above | 0.0 | 0.1 | 0.2 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 |
Percentage (%) of trial with DLT rate above | 0.1 | 0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 |
Average dose limiting toxicity (DLT) rate and percentage of trials with DLT above
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
Average DLT rate | 0.333 | 0.337 | 0.333 | 0.336 | 0.289 | 0.304 | 0.299 | 0.330 |
Average DLT rate | 0.336 | 0.339 | 0.337 | 0.337 | 0.289 | 0.308 | 0.300 | 0.332 |
Percentage (%) of trial with DLT rate above | 0.0 | 0.1 | 0.2 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 |
Percentage (%) of trial with DLT rate above | 0.1 | 0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 |
Dose–toxicity scenario . | 1 . | 2 . | ||||||
---|---|---|---|---|---|---|---|---|
Dose–efficacy scenario . | 1 . | 2 . | 3 . | 4 . | 1 . | 2 . | 3 . | 4 . |
Average DLT rate | 0.333 | 0.337 | 0.333 | 0.336 | 0.289 | 0.304 | 0.299 | 0.330 |
Average DLT rate | 0.336 | 0.339 | 0.337 | 0.337 | 0.289 | 0.308 | 0.300 | 0.332 |
Percentage (%) of trial with DLT rate above | 0.0 | 0.1 | 0.2 | 0.1 | 0.0 | 0.0 | 0.0 | 0.1 |
Percentage (%) of trial with DLT rate above | 0.1 | 0.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 |
In Figure 4, we show the probability of rejecting the null hypothesis, which corresponds to the (Bayesian) power under the alternative hypothesis and to the (Bayesian) type-I error probability under the null hypothesis. This probability ranges between 0.7 and 0.85 under the alternative hypothesis (i.e., power), and between 0.07 and 0.29 under the null hypothesis for all but one scenario (i.e., type-I error). These values are in line with the results presented by Tighiouart (2019) and Jiménez et al. (2020). We observe one interesting result under the null hypothesis in dose–toxicity scenario 2—dose–efficacy scenario 4, where the type-I error probability is close to 0.5. This value is a consequence of a slight overestimation of the MTD curve in its central region (see Figure 3). This behaviour is not present in dose–toxicity scenario 1—dose–efficacy scenario 4 where the MTD curve is practically unbiassed in its central region. By overestimating the MTD, we are no longer having a maximum probability of 0.4 and 0.15 under the alternative and null hypotheses, respectively. This causes an increase of both the power and the type-I error probability. In fact, dose–toxicity scenario 2, dose–efficacy scenario 4 is the scenario that achieves both the highest power and highest type-I error probability because of the overestimation of the MTD curve in the region of highest efficacy. By the same token, an underestimation of the MTD curve in the region of highest efficacy will lead to both lower power and lower type-I error probability.
![Bayesian power and type-I error probability for scenarios under the alternative and null hypotheses, respectively. [Colour figure can be viewed at https://dbpia.nl.go.kr]](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/jrsssc/71/5/10.1111_rssc.12598/1/m_rssc12598_fig-4.jpeg?Expires=1750376029&Signature=OVL9YW0~kDtkDtLNB9bktuO4rVhTpis-6LV226-8st84tLtvPtgshS039NiMJuYjOwPoyw0TNFS4ZUXtFXhUFDWWY8iP0fBJoCyt-gO7BkjTwhn-hds5PyxQSl0tnbkUwNW69JHh0GYeR6FlD4vz6oBGUsU8BYlxOZKQQ6kEkN7m9IYROdGnE3rYLxlY~s39YpZ6ae-IawGt7SU9CI2tb21jMhIS9swdk-s2zRt3hUVHqCEJoLCN5~2u75XSGItUXMGRBk7sjKtSVUqRBMyxOzKka4Xz0xk7NjxHiKm2ji0fv6NKY4R2~hHu0ZQyaZlv0MUVuw2CnmeinvE4h42ikw__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Bayesian power and type-I error probability for scenarios under the alternative and null hypotheses, respectively. [Colour figure can be viewed at https://dbpia.nl.go.kr]
In Figure 5, we present the density (i.e., the distribution) of the dose combinations recommended at the end of each simulated trial as the optimal dose combination. In this article, the MTD curve is never fixed which translates into recommended optimal dose combinations above and below the true MTD curve. However, we can see how the density of these recommended optimal dose combinations is, generally speaking, very close to the true MTD curve, with the region of higher mass (i.e., bright yellow) containing the true optimal dose combination. Mind that in dose–toxicity scenario 2—dose–efficacy scenario 3, the true probability of efficacy at the left extreme of the true MTD curve is intended to be very close to the one from the optimal dose combination located at the right extreme of the MTD curve (see Figure 2). The bi-modality observed in the density of the recommended optimal dose combinations in this scenario shows that the design is able to capture the characteristic of this dose–efficacy scenario.
![Density of the dose combinations recommended at the end of each simulated trial as the optimal dose combination under the alternative hypothesis. The solid black line represents the true maximum tolerated dose curve of each dose-toxicity scenario and the red dot represents the true optimal dose combination. [Colour figure can be viewed at https://dbpia.nl.go.kr]](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/jrsssc/71/5/10.1111_rssc.12598/1/m_rssc12598_fig-5.jpeg?Expires=1750376029&Signature=YiEhQ2izmhTwaP2rAiZJNHF1MMiTWprxRegG-3~kbjt~q9TxAkGy3JjiQ23M2g76SDeAkP3DgEV0KuJHsUQ5Sac~NdenSrUzdAQYc8I19AMkPK28mCDoLcB3-WuMPRAvzH2azvn641kgf0EWwCzLm7~Uo-VZLXsjvi2g77mr~oodUJO4KIZ16QPjMokht3OmJwBaaz6LBQd86XAlAIJAxGlfWll49yeUctQyUDpBcujOmEHp05qWy6YciHdAclE2pCX~vV708ylFU43ProkzcIw6Ns4pg89ESQig5M9ulx-RYXXLNwLf3M9DTc2gBShx8181mBGoeEMYNkVeC7zu5Q__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Density of the dose combinations recommended at the end of each simulated trial as the optimal dose combination under the alternative hypothesis. The solid black line represents the true maximum tolerated dose curve of each dose-toxicity scenario and the red dot represents the true optimal dose combination. [Colour figure can be viewed at https://dbpia.nl.go.kr]
In Figure S4, we present the distribution of dose combinations allocated to all patients during stage II, which works under the adaptive randomisation principle. We display all scenarios under the alternative hypothesis only. Results clearly show how the mode of the distribution corresponds to the dose combination with highest probability of efficacy under each scenario. Moreover, the proportion of patients during stage II that were allocated to dose combinations with a true probability of efficacy above ranges between 69% and 77% (see Figure S5). These results also holds under the null hypothesis (results not shown).
In Figure S6, we present the average posterior probability of early stopping for the stopping rules we pre-specified in Section 3.4. Under the alternative hypothesis, we see that the probability of early stopping for futility is always below 0.02. Under the null hypothesis, this probability ranges generally between 0.067 and 0.27, depending on the scenario. We see how the probability of early stopping for futility in dose-toxicity scenario 2—dose–efficacy scenario 4 is much lower (0.067) than in other scenarios because of the overestimation of the central region of the MTD curve and the allocation of patients with higher probability of efficacy. In terms of the probability of early stopping for safety, we see that under both the null and alternative hypotheses, this probability is practically equal to 0. The values presented here are in line with the operating characteristics from Tighiouart (2019).
4.2.2 Model robustness
In Figure S9, we present the power and type-I error obtained under model misspecification. In this assessment, the true underlying dose–toxicity and dose–efficacy relationships' link functions are different from the probit link function used in the design's working marginal models (see Section 3.2). In simple terms, toxicity and efficacy data will be generated using a common link function (e.g., logistic) and the design will proceed under the (wrong) assumption that such common link function is a probit link function. The purpose of this robustness assessment is to test whether the design is able to maintain the operating characteristics presented in previous sections under small deviations between the true underlying marginal models and the working marginal models. To limit the number of comparisons, the toxicity and efficacy marginal models employ the same link function.
For the assessment, we reproduced the first dose–toxicity scenario presented in Figure 1 and the first two dose–efficacy scenarios presented in Figure 2 using the logistic and the complementary log-log link functions. Because the link functions are different, the dose–toxicity and dose–efficacy surfaces have to be different, specially around the extremes of the the surfaces. We refer the reader to Tighiouart et al. (2014) to see how the dose–toxicity surfaces differ using different link functions. Under model misspecification, the true models' parameters are selected in a way that there is an overlap of the toxicity and efficacy profiles of all link functions in the region where the optimal dose combination is located. In Figures S7 and S8 we display the MTD curves as well as the efficacy profiles on the MTD curves generated with the logistic and complementary log-log link functions.
The results presented in Figure S9 show that our design is able to maintain its performance under mode misspecification, with power and type-I error values with minor differences between the three tested link functions. This is not unexpected since the quadratic terms included in the dose–efficacy model allow for higher flexibility.
Last, and for illustration purposes, in Figures S10 and S11 we present the contours of the marginal toxicity and efficacy models, respectively, with the logit, logistic and complementary log-log link functions. For this illustration we selected dose–toxicity scenario 1 and dose–efficacy scenario under the alternative hypothesis. In this robustness assessment, the true toxicity and efficacy model parameters are selected so that the MTD curves and the efficacy profiles within the MTD curves match across all the link functions (see Figures S7 and S8). However, outside of the MTD curves, we observe how each link function induces slightly different dose–toxicity and dose–efficacy surfaces.
5 DISCUSSION
We present a Bayesian two-stage design for cancer phase I-II clinical trials combining two cytotoxic agents with continuous dose levels. Motivated by the cisplatin-cabazitaxel trial, we extend the work of Tighiouart (2019) by allowing the continuous update of the MTD curve throughout the entire study as well as the use of stage I efficacy data under the assumption of a single patient population across stages. In stage I we aim to estimate the MTD curve following the EWOC principle and enroling up to 30 patients. In stage II we model treatment efficacy as a binary indicator of treatment response using a probit quadratic model, and we enrol 30 additional patients using response adaptive randomisation. The rational for quadratic terms in a setting of two cytotoxic agents is that, under the presence of (synergistic) interaction between the compounds, the dose–efficacy profile along the MTD can become quite complex, and we found that a linear model was not able to correctly identify the region with the most efficacious dose combinations in certain scenarios. However, under the presence of non-interactive compounds the dose–efficacy profiles cannot surpass a certain level of complexity and thus a linear dose–efficacy model would be sufficient to identify the most efficacious dose combinations.
We study the properties of our design under the two (synergistic) dose–toxicity scenarios that were expected by the principal investigator of the cisplatin-cabazitaxel trial introduced by Tighiouart (2019) with 30 patients in each stage and an effect size of 0.25 in dose–toxicity scenarios under the alternative hypothesis. We study four different dose–efficacy profiles that place the most efficacious dose combination in different regions of the MTD curve (i.e., left extreme, right extreme, central and both extremes). The estimated MTD curve tends to be slightly biassed around its extremes and less biassed around its central region as reported by (Jiménez et al., 2020; Tighiouart et al., 2017). In fact, because of the continuous update of the MTD curve during stage II, we have observed a slight reduction of the overall estimation bias with respect to the results presented by Jiménez et al. (2020) where the MTD curve was only updated in stage I. The proposed probit quadratic efficacy model has a good performance and is able to fully capture the different dose–efficacy profiles along the true MTD curve. This is fully reflected in Figure 5, dose-toxicity scenario 2—dose–efficacy scenario 3, where the probability of efficacy if high at both extremes of the true MTD curve. As a matter of fact, in the development of this research, we found that a marginal efficacy model with only linear terms was not able to capture this particular profile which is heavily conditioned by the synergistic nature of the MTD curve. Thus, adding quadratic terms allows to better capture any dose–efficacy profile along any MTD curve. Overall, the design is safe and yields good operating characteristics across scenarios with power and type-I error rates that are consistent with the values reported by (Jiménez et al., 2020; Tighiouart, 2019). Another contribution of this article is that we account for the uncertainty of the estimated MTD curve during stage II since we continuously update it after each cohort of patients. In fact, we have observed its behaviour in stage II, which was not explored in (Jiménez et al., 2020; Tighiouart, 2019). Results show that overall bias in the MTD curve is also driven by the adaptive randomisation part of the design where we allocate patients to dose combination that are likely to be efficacious along the estimated MTD curve. This type of patient allocation unbalances the amount of data that we have in different regions of the estimated MTD curve, which could potentially damage the precision of the estimation in the search of higher efficacy. We believe these findings are coherent with the fact that we jointly work with i) the uncertainty around the MTD curve estimation and ii) the uncertainty around the estimation of the dose–efficacy profile. With these conclusions in mind, we strongly recommend using informative prior distributions whenever possible, and/or a slightly more conservative EWOC criterion (i.e., a lower percentile) either throughout the entire study or in stage II only. An alternative approach to hypothesis testing is to report the percent selection of correct optimal dose combinations, as is traditionally done in phase I/II dose finding studies. Since we are dealing with continuous dose levels, a reasonable definition of this metric would be the percentage of recommended optimal dose combination with (true) and (true) , where represents the true maximum probability of efficacy of a given scenario (i.e., and in the dose–efficacy scenarios that we considered) and are pre-specified positive threshold values.
In this article, we have not used the informative prior distributions used by Tighiouart (2019) and therefore our estimation of the MTD curve is consequently less precise. The operating characteristics presented in this article can be considered as the baseline operating characteristics this design can have given the use of vague prior distributions and relatively low sample size. By increasing the sample size or using more informative prior distributions in the dose–toxicity model, the design's operating characteristics are expected to improve.
Finally, model robustness was explored by using logistic and complementary log-log link functions to generate toxicity and efficacy data. Results show that the proposed design is robust to model misspecification under small deviations of the true model from the working model.
On the application of the proposed design to settings with other type of compounds (i.e., immunotherapies or targeted therapies), we note that major changes would be needed to adapt the design to the characteristics of such compounds. The most relevant adaption would be the definition of the optimal dose combination combination. With cytotoxic agents, it is generally assumed that the optimal dose combination will be somewhere along the MTD curve. With immunotherapies or targeted therapies, this is not guaranteed and utility functions become necessary to find the right balance between toxicity and efficacy (see e.g., Liu et al. (2018)). Also, because we would no longer restrict the search to the estimated MTD, we would need to extend the stage II adaptive randomisation algorithm so that we are allowed to allocate patients to any dose combination with estimated probability of toxicity below the target probability of toxicity.
Also, the design could be implemented in a setting with discrete dose combinations either with cytotoxics or immunotherapies with minor modifications with respect to the version of the design with continuous dose levels. To do so, in stage I, the continuous dose combinations would be rounded to nearest discrete dose combinations whereas, in stage II, patients would only be allocated to the initial set of discrete combinations based on the updated probability of efficacy estimates (i.e., the higher estimated probability of efficacy of a dose combination the higher the probability of being allocated to that dose combination).
These two extensions are part of our research plan and we intent to evaluate the operating characteristics of the proposed design under these settings.
CONFLICT OF INTEREST
José L. Jiménez is employed by Novartis Pharma A.G. who provided support in the form of salary for the author, but did not have any additional role in the preparation of the manuscript. Also, the views expressed in this publication are those of the authors and should not be attributed to any of the funding institutions or organisations to which the authors are affiliated.
DATA AVAILABILITY STATEMENT
The R and JAGS scripts necessary to fully reproduce the results presented in this article are available at https://github.com/jjimenezm1989/Phase-I-II-design-combining-two-cytotoxics.
ACKNOWLEDGEMENTS
We thank the associate editor, two referees, and the editor for the constructive comments that led to significant improvements in the article. Mourad Tighiouart is funded by NIH the National Center for Advancing Translational Sciences (NCATS) UCLA CTSI (UL1 TR001881-01) and NCI P01 CA233452-02.
REFERENCES
Author notes
Funding information National Center for Advancing Translational Sciences, Grant/Award Numbers: NCI P01 CA233452-02; UL1 TR001881-01