Abstract

Covariate-adaptive randomization methods are widely used in clinical trials to balance baseline covariates. Recent studies have shown the validity of using regression-based estimators for treatment effects without imposing functional form requirements on the true data generation model. These studies have had limitations in certain scenarios; for example, in the case of multiple treatment groups, these studies did not consider additional covariates or assumed that the allocation ratios were the same across strata. To address these limitations, we develop a stratum-common estimator and a stratum-specific estimator under multiple treatments. We derive the asymptotic behaviors of these estimators and propose consistent nonparametric estimators for asymptotic variances. To determine their efficiency, we compare the estimators with the stratified difference-in-means estimator as the benchmark. We find that the stratum-specific estimator guarantees efficiency gains, regardless of whether the allocation ratios across strata are the same or different. Our conclusions were also validated by simulation studies and a real clinical trial example.

1 Introduction

Randomization is considered to be a gold standard in clinical trials and other intervention studies, such as online A/B tests and experiments in economics. As one of the basic randomization methods, simple randomization allocates experimental units into different treatment groups with a fixed probability. However, simple randomization may cause an imbalance in baseline covariates. In a regulatory guidance, European Medicines Agency (2015) recommended an adjustment for baseline covariates in randomized clinical trials. Covariate-adaptive randomization is arguably the most commonly used method to adjust covariate imbalance in the design stage of randomized clinical trials.

Covariate-adaptive randomization aims to balance treatment allocations among baseline covariates. For example, stratified block randomization (Zelen, 1974) defines sets of strata based on covariates and allocates units in each stratum by using block randomization. It is one of the most commonly used randomization methods in clinical trials, as according to recent surveys, it is implemented in nearly 70% of trials (Ciolino et al., 2019; Lin et al., 2015). Recently, stratified block randomization has also been used in COVID-19 vaccine or treatment trials (e.g., Baden et al., 2021; Wang et al., 2020). Minimization (Pocock & Simon, 1975; Taves, 1974) has been used for balancing covariates over their margins. This scheme has been generalized to control other types of imbalance measures, which may include overall and within-stratum imbalance measures (Hu & Hu, 2012; Hu et al., 2023). Details of above randomization methods and other methods such as stratified biased coin design (Efron, 1971; Shao et al., 2010) and model-based approaches (Atkinson, 1982; Begg & Iglewicz, 1980) are described in Rosenberger and Lachin (2015).

A critical challenge after randomization is to obtain a valid statistical inference for treatment effects. One of the practical strategies applied to address this challenge is the use of linear regression. The robustness of regression estimators under simple randomization has been discussed in several significant papers (Freedman, 2008; Lin, 2013; Yang & Tsiatis, 2001). However, under covariate-adaptive randomization, the challenge becomes complex. Various valid tests have been proposed for different allocation methods under model assumptions of the true data generation process (e.g., Ma et al., 2015, 2020; Wang & Ma, 2021; Shao et al., 2010). A model-assisted approach with regression adjustment for stratification covariates has recently been proposed in Bugni et al. (2018, 2019). This approach produces a valid inference by using a working model between responses and covariates, regardless of whether the working model is correct or not. To consider the regression adjustment for baseline covariates in addition to stratification covariates, stratum-common estimators and stratum-specific estimators have been developed, mainly for the case in which the allocation ratios are the same across strata (Liu et al., 2023; Ma et al., 2022; Ye et al., 2022a, 2022b). However, little attention has been paid to the case in which the allocation ratios are different across strata, especially when additional baseline covariates are included, although different allocation ratios are commonly used in practice and are more flexible (Angrist et al., 2014; Chong et al., 2016).

To fill this gap, we develop stratum-common and stratum-specific estimators under multiple treatments when allocation ratios vary across strata. We justify the asymptotic behaviors of these two estimators and propose nonparametric consistent estimators for the asymptotic variances. Moreover, we demonstrate that for any randomization scheme fulfilling our assumptions, the stratum-specific estimator can guarantee efficiency gains over the other estimators that are considered in this study. The results of our study are the most general obtained so far, with respect to regression adjustment for covariate-adaptive randomization. That is, our methods can handle multiple treatments, adjust for additional baseline covariates to improve efficiency, and allow different allocation ratios across strata.

The remainder of the paper is organized as follows. In Section 2, we introduce the framework and assumptions for covariate-adaptive randomization. In Section 3, after a brief review of the benchmark estimator, we describe the stratum-common estimator, the stratum-specific estimator, and their asymptotic properties. We present the simulation studies in Section 4. Section 5 provides a real example of a clinical trial and Section 6 consists of our recommendations and a discussion. All the estimators considered in this paper are implemented in the R package caratMULT, available at GitHub.

2 Framework and Assumptions

Consider a covariate-adaptive randomization procedure with n units. Let us suppose that treatments formula. Let formula be the control group and formula. Let formula be the indicators, such that formula indicates that the ith unit is assigned to the ath treatment. Let formula be the number of units in each treatment group. Let formula denote the stratum label, which takes value in formula. Here, K represents the total number of strata, which is fixed and finite. Let formula be the p-dimensional additional baseline covariates that are not used in the randomization procedure. Let formula be the target proportion of stratum k and formula be the target proportion of treatment a in stratum k, so that formula is the target proportion of treatment a. Let formula be the number of units in stratum k and formula be the number of units in stratum k and treatment group a. Let formula be the estimated proportion of stratum k.

We use the Neyman–Rubin model to define potential outcomes and treatment effects (Neyman, 1923; Rubin, 1974). Let {formula} be the potential outcomes and formula be the observed outcomes. Our goal is to estimate the treatment effect formula for all formula. That is, we will estimate formula based on the observed data formula. The sample means of the potential outcomes are defined as formula and formula. The population variance of a transformed outcome formula, such as formula, is denoted as formula. We define formula as the population variance of a transformed outcome in stratum k.

The following assumptions are made for the data generation and covariate-adaptive randomization procedures. We denote the sets of random variables with at least one positive stratum-specific variance as formula. Let formula be independent and identically distributed samples from the population distribution formula.

Assumption 1. formula and formula, for all formula.

Assumption 2. Conditional on formula, formula is independent with formula.

Assumption 3. formula as formula, for all formula and formula.

According to Assumption 2, given the strata, the potential outcomes and the additional covariates are conditionally independent of the assignment procedure. Assumption 3 is the same as Assumption 2.2(b) in Bugni et al. (2019). Several well-known randomization schemes satisfy this assumption, such as simple randomization, stratified permuted block randomization (Zelen, 1974) and stratified biased coin randomization (Kuznetsova & Johnson, 2017; Shao et al., 2010). In the special case in which formula are identical across all strata for each treatment group, Pocock and Simon's minimization (Pocock & Simon, 1975) also satisfies this assumption (Hu et al., 2023).

3 Regression-based Multiple Treatment Effect Estimation

3.1 Benchmark Estimator

First, we introduce the benchmark estimator—an estimator for formula without using additional covariates. This estimator was proposed in Bugni et al. (2019) and was obtained by considering the estimation of the linear regression

(1)

by the ordinary least squares (OLS) method. We denote formula as the OLS estimator of formula. From calculations,

Note that we use the estimated proportion formula rather than the target proportion formula in our estimators. This estimator can also be considered as a plug-in estimator (Liu & Yang, 2020). We define

as the treatment effect in stratum k. Therefore, the treatment effect can be obtained from

It was shown that formula is an unbiased and consistent estimator of formula under Assumptions 1–3, and the plug-in estimator formula is a consistent estimator of formula (Ma et al., 2022). Let formula be the estimator of τ. The asymptotic behavior of formula was also derived in Bugni et al. (2019), implying that

We provide the expanded form of formula later in the proof in the Web Appendix A. We seek to determine whether adjusting the imbalance of the additional covariates will improve the efficiency compared with formula.

Remark 1. Regression (1) is not the original regression used in Bugni et al. (2019); however, both regressions yield the same results when estimating formula. In our formulation, the estimator for formula can be directly obtained. Regression (1) is an extension of the third regression in Ma et al. (2022) to multiple treatments.

In addition to stratification indicators, the covariates formula may contain additional information that can improve estimation of the treatment effect. Regression is a common strategy for adjusting additional covariate imbalance. Below, we discuss two regression models with the inclusion of formula. Any covariates that can be linearly represented by stratification indicators should be removed from the additional covariates. For simplicity, we continue to use formula as the additional covariates without loss of generality.

Before formally specifying the models and the asymptotic results, we define several population-level regression coefficients. Let formula be the covariance matrix for any two random vectors R and Q, formula be the covariance matrix for R and Q given stratum k, and formula. We assume that X has a finite second moment and formula, formula, and formula are positive definite. Two types of coefficients are used for additional covariates: stratum-common coefficients formula and stratum-specific coefficients formula, which are defined as

Here, formula are the population-level weighted regression coefficients for regressing formula on formula with weights formula, and formula are the population-level regression coefficients for regressing formula on formula in stratum k. We also define some overall and stratum-specific sample-level means: formula, formula, formula; and some treatment-specific and stratum-specific sample-analog covariance matrices: formula, formula, formula, formula.

3.2 Stratum-common Estimator

We first consider the stratum-common estimator. The regression is

(2)

Let formula be the OLS estimator for τ and formula be the ath element of formula. As shown in our proof, regression (2) is equivalent to running formula regressions separately in the treatment and the control groups. Then, we obtain

where formula and formula are the estimators for the regression coefficients.

Theorem 1. Suppose that Assumptions 1–3 hold. Let formula. Then,

whereformula,

Here, 1A is an A-dimensional vector of ones and formula.

Remark 2. The asymptotic result is not restricted to the regression coefficients formula. For any population coefficients formula, if formula is a consistent estimator for formula, then formula retains this asymptotic behavior. In the special case in which formula for all formula, the benchmark estimator and its asymptotic behavior can be obtained.

An area of concern is whether the asymptotic variance will decrease when the additional covariates are added. Results from Ma et al. (2022) show that not all regression models can achieve efficiency gain due to the additional covariates. In general, the stratum-common estimator formula may have a more adverse impact on the efficiency than formula. In the special case in which formula for all formula, formula can improve the efficiency. Let formula denote that matrix V is negative semi-definite.

Corollary 1. When formula for all formula and formula, we obtain

Remark 3. When there is only one treatment group and one control group, our model reduces to the case in Ma et al. (2022), the regression model with interaction and additional covariates. Furthermore, our estimator and the asymptotic variance are identical to those derived in Ma et al. (2022) for this particular case.

Remark 4. The use of stratum-common coefficients formula was recommended by Ye et al. (2022a). In their work, the coefficients were achieved by the analysis of covariance using a heterogeneous working model (ANHECOVA) and the estimator derived by these coefficients for potential outcomes was referred to ANHECOVA estimator. With respect to the considerations in their study, the ANHECOVA estimator achieved a guaranteed efficiency gain over other estimators that they considered, not only under simple randomization but also under covariate-adaptive randomization. In general, when different allocation ratios are used across strata, the stratum-common estimator formula may adversely affect the efficiency. As shown in Corollary 1, when formula for all strata, the stratum-common estimator can improve the efficiency, as the ANHECOVA estimator does. To guarantee efficiency gains when using the stratum-common estimator, different weights can be assigned for each stratum; however, a more efficient estimator can be constructed, as described in the next section.

3.3 Stratum-specific Estimator

To develop a regression model for the stratum-specific estimator, we introduce the following linear model.

(3)

for all formula. Here, we use formula to indicate the units in stratum k. Let formula denote the OLS estimator for formula, formula denote the estimator for formula, and formula denote the estimator for τ. The regression can also be split into formula regression models, giving

where formula and formula are the estimators for the regression coefficients.

Remark 5. We do not provide an overall regression model because of the complicated expression arising from the three-way interaction between additional covariates formula. Such expression may not be welcomed for application reasons.

Theorem 2. Suppose that Assumptions 1–3 hold. Let formula; then

where formula with

and formula has the same definition as in Theorem 1.

Furthermore, formula is more efficient than formula and formula due to the following corollary.

Corollary 2. Suppose that Assumptions 1–3 hold.

In addition,

Remark 6. The use of a stratum-specific estimator under covariate-adaptive randomization has been discussed by Liu and Yang (2020) and Ye et al. (2022b). Liu and Yang (2020) showed that when there are two treatments, a stratum-specific estimator can achieve guaranteed efficiency gains over a stratum-common estimator. We generalize their results to multiple treatments. Ye et al. (2022b) also claimed efficiency gains for a stratum-specific estimator over other estimators when the same allocation proportion was applied across strata. Our results are more general, justifying that our stratum-specific estimator is more efficient than the other estimators considered in our study, regardless of whether the target allocation ratios across strata are the same or different.

3.4 Consistent Variance Estimators

For all of the estimators mentioned above, a crucial step when drawing a valid inference is to construct a consistent asymptotic variance estimator. Specifically, we consider the difference in the potential outcomes for any two treatments b and c. Let formula be an A-dimensional vector, such that all elements are zero, except for the bth element, which is 1, and the cth element, which is −1. Let formula, be the real difference in potential outcomes, and formula be the estimator for formula generated from the three estimators above, for formula. Based on simple calculation,

where

The matrix formula is the asymptotic variance for formula, and formula denotes the element in the ith row and jth column of formula, for formula. By using formula as the transformed outcomes again and formula as an estimator of formula, formula. Let formula. For the three estimators above, formula, where formula, respectively, for formula. We define

and

Let formula and formula denote the overall sample covariance matrix and stratum-specific sample covariance matrix, respectively. Let formula be the estimator of formula for formula.

Theorem 3. Suppose that Assumptions 1–3 hold. The variance estimators

are consistent estimators of formula for formula, respectively.

4 Simulation Studies

We ran simulations for several models to compare the finite-sample behavior of the proposed estimators. The potential outcomes are generated by the following model:

where formula is a discrete variable, takes a value from formula with probabilities (0.2,0.2,0.3,0.3), and is used for stratification such that formula. formula is a continuous variable serving as additional covariates. formula is a random error. We also ran additional simulations when formula was not normally distributed. The results are presented in Section B.1 in the Web Appendix. We considered formula, that is, two treatment groups and one control group. We estimated two treatment effects, τ1 and τ2. The number of units n was 600. Details of the outcome models are as follows:

For coefficients, we considered formula, and formula as an arithmetic sequence with 12 numbers, with 1 being the first number and 6 being the last number, first ordered by strata and then by treatments. Three types of randomization methods were considered: simple randomization, stratified block randomization, and Pocock and Simon's minimization. For Tables 1 and 2, the treatment allocations were the same across strata, with an equal ratio (1:1:1) and an unequal ratio (1:2:2), respectively. For Table 3, we assigned different treatment allocation ratios to different strata. When formula and formula, the allocation ratio was 1:2:2, and when formula and formula, the allocation ratio was 2:3:5. Note that we only considered simple randomization and stratified block randomization for Table 3, because minimization requires the same allocation ratios across strata. For stratified block randomization, the block size was 6 in Table 1 and 10 in Tables 2 and 3. For Pocock and Simon's minimization, range was used as the marginal imbalance metric, and no overall balance was considered. The allocation probabilities were determined by the procedure given in Han et al. (2009); in particular, when the treatment with the smallest allocation ratio was preferred, the probability for this treatment was 0.9. As the allocation ratios may vary step by step under unequal allocation ratio for the procedure in Han et al. (2009) in Table 2, we also considered the minimization procedure proposed by Kuznetsova and Tymofyeyev (2012), which preserves the allocation ratio at every step. The additional simulation results can be found in Section B.2 in the Web Appendix. The bias, standard deviation (SD) of the estimators, standard error (SE) estimators, and empirical coverage probabilities (CP) of 95% confidence intervals were evaluated based on 1,000 simulation runs. The results are summarized below.

TABLE 1

Simulated biases, standard deviations, standard errors, and coverage probabilities for different estimators and randomization methods under equal treatment allocation ratio 1:1:1 across strata.

τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0024.704.580.94−0.0015.876.000.96
formula0.0002.722.640.94−0.0023.233.170.95
formula0.0002.732.640.94−0.0033.243.170.94
SBRformula−0.0024.784.550.94−0.0036.045.970.95
formula0.0002.612.630.95−0.0013.133.170.95
formula−0.0012.632.620.95−0.0013.143.160.95
MINformula0.0064.574.550.950.0046.105.960.95
formula0.0002.592.640.960.0003.193.170.94
formula0.0002.612.630.950.0003.213.160.94
2SRformula−0.0037.577.460.95−0.00310.0110.230.96
formula0.0023.413.300.93−0.0044.674.600.95
formula0.0003.163.070.94−0.0034.504.430.94
SBRformula−0.0057.777.420.94−0.00710.3510.180.95
formula−0.0023.253.290.96−0.0044.514.590.94
formula−0.0023.023.050.95−0.0034.364.420.95
MINformula0.0107.487.410.950.00910.4410.170.95
formula−0.0013.283.300.960.0014.594.600.95
formula0.0003.053.060.950.0004.464.430.94
3SRformula0.0046.146.010.94−0.0028.138.120.94
formula−0.0016.165.940.94−0.0138.158.020.94
formula−0.0226.225.840.93−0.0558.267.880.93
SBRformula0.0046.035.960.94−0.0028.068.090.94
formula−0.0036.045.920.94−0.0158.098.030.94
formula−0.0246.025.800.94−0.0578.157.850.92
MINformula−0.0026.005.970.950.0038.018.080.95
formula−0.0085.975.930.95−0.0128.048.010.94
formula−0.0286.065.810.93−0.0518.107.840.93
τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0024.704.580.94−0.0015.876.000.96
formula0.0002.722.640.94−0.0023.233.170.95
formula0.0002.732.640.94−0.0033.243.170.94
SBRformula−0.0024.784.550.94−0.0036.045.970.95
formula0.0002.612.630.95−0.0013.133.170.95
formula−0.0012.632.620.95−0.0013.143.160.95
MINformula0.0064.574.550.950.0046.105.960.95
formula0.0002.592.640.960.0003.193.170.94
formula0.0002.612.630.950.0003.213.160.94
2SRformula−0.0037.577.460.95−0.00310.0110.230.96
formula0.0023.413.300.93−0.0044.674.600.95
formula0.0003.163.070.94−0.0034.504.430.94
SBRformula−0.0057.777.420.94−0.00710.3510.180.95
formula−0.0023.253.290.96−0.0044.514.590.94
formula−0.0023.023.050.95−0.0034.364.420.95
MINformula0.0107.487.410.950.00910.4410.170.95
formula−0.0013.283.300.960.0014.594.600.95
formula0.0003.053.060.950.0004.464.430.94
3SRformula0.0046.146.010.94−0.0028.138.120.94
formula−0.0016.165.940.94−0.0138.158.020.94
formula−0.0226.225.840.93−0.0558.267.880.93
SBRformula0.0046.035.960.94−0.0028.068.090.94
formula−0.0036.045.920.94−0.0158.098.030.94
formula−0.0246.025.800.94−0.0578.157.850.92
MINformula−0.0026.005.970.950.0038.018.080.95
formula−0.0085.975.930.95−0.0128.048.010.94
formula−0.0286.065.810.93−0.0518.107.840.93

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; MIN, minimization; SD, standard deviation; SE, standard error; CP, coverage probability.

TABLE 1

Simulated biases, standard deviations, standard errors, and coverage probabilities for different estimators and randomization methods under equal treatment allocation ratio 1:1:1 across strata.

τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0024.704.580.94−0.0015.876.000.96
formula0.0002.722.640.94−0.0023.233.170.95
formula0.0002.732.640.94−0.0033.243.170.94
SBRformula−0.0024.784.550.94−0.0036.045.970.95
formula0.0002.612.630.95−0.0013.133.170.95
formula−0.0012.632.620.95−0.0013.143.160.95
MINformula0.0064.574.550.950.0046.105.960.95
formula0.0002.592.640.960.0003.193.170.94
formula0.0002.612.630.950.0003.213.160.94
2SRformula−0.0037.577.460.95−0.00310.0110.230.96
formula0.0023.413.300.93−0.0044.674.600.95
formula0.0003.163.070.94−0.0034.504.430.94
SBRformula−0.0057.777.420.94−0.00710.3510.180.95
formula−0.0023.253.290.96−0.0044.514.590.94
formula−0.0023.023.050.95−0.0034.364.420.95
MINformula0.0107.487.410.950.00910.4410.170.95
formula−0.0013.283.300.960.0014.594.600.95
formula0.0003.053.060.950.0004.464.430.94
3SRformula0.0046.146.010.94−0.0028.138.120.94
formula−0.0016.165.940.94−0.0138.158.020.94
formula−0.0226.225.840.93−0.0558.267.880.93
SBRformula0.0046.035.960.94−0.0028.068.090.94
formula−0.0036.045.920.94−0.0158.098.030.94
formula−0.0246.025.800.94−0.0578.157.850.92
MINformula−0.0026.005.970.950.0038.018.080.95
formula−0.0085.975.930.95−0.0128.048.010.94
formula−0.0286.065.810.93−0.0518.107.840.93
τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0024.704.580.94−0.0015.876.000.96
formula0.0002.722.640.94−0.0023.233.170.95
formula0.0002.732.640.94−0.0033.243.170.94
SBRformula−0.0024.784.550.94−0.0036.045.970.95
formula0.0002.612.630.95−0.0013.133.170.95
formula−0.0012.632.620.95−0.0013.143.160.95
MINformula0.0064.574.550.950.0046.105.960.95
formula0.0002.592.640.960.0003.193.170.94
formula0.0002.612.630.950.0003.213.160.94
2SRformula−0.0037.577.460.95−0.00310.0110.230.96
formula0.0023.413.300.93−0.0044.674.600.95
formula0.0003.163.070.94−0.0034.504.430.94
SBRformula−0.0057.777.420.94−0.00710.3510.180.95
formula−0.0023.253.290.96−0.0044.514.590.94
formula−0.0023.023.050.95−0.0034.364.420.95
MINformula0.0107.487.410.950.00910.4410.170.95
formula−0.0013.283.300.960.0014.594.600.95
formula0.0003.053.060.950.0004.464.430.94
3SRformula0.0046.146.010.94−0.0028.138.120.94
formula−0.0016.165.940.94−0.0138.158.020.94
formula−0.0226.225.840.93−0.0558.267.880.93
SBRformula0.0046.035.960.94−0.0028.068.090.94
formula−0.0036.045.920.94−0.0158.098.030.94
formula−0.0246.025.800.94−0.0578.157.850.92
MINformula−0.0026.005.970.950.0038.018.080.95
formula−0.0085.975.930.95−0.0128.048.010.94
formula−0.0286.065.810.93−0.0518.107.840.93

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; MIN, minimization; SD, standard deviation; SE, standard error; CP, coverage probability.

TABLE 2

Simulated biases, standard deviations, standard errors, and coverage probabilities for different estimators and randomization methods under equal treatment allocation ratio 1:2:2 across strata.

τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0084.954.740.94−0.0075.915.920.95
formula−0.0022.972.900.94−0.0023.503.390.94
formula−0.0032.982.900.94−0.0033.503.390.94
SBRformula−0.0104.684.680.940.0025.935.870.94
formula−0.0092.902.870.95−0.0073.333.360.95
formula−0.0102.932.850.95−0.0083.353.350.95
MINformula0.0034.734.690.95−0.0085.805.870.95
formula0.0002.862.880.95−0.0013.453.380.95
formula0.0002.892.860.95−0.0013.463.360.94
2SRformula−0.0117.877.570.95−0.0149.909.920.95
formula−0.0013.653.560.94−0.0044.944.780.94
formula−0.0033.383.300.94−0.0034.704.590.94
SBRformula−0.0117.597.490.950.0099.999.860.95
formula−0.0103.643.530.94−0.0094.834.750.95
formula−0.0103.313.250.94−0.0094.554.550.95
MINformula0.0067.647.510.95−0.0159.669.850.96
formula0.0013.613.550.95−0.0014.874.770.94
formula0.0003.293.270.95−0.0014.684.570.95
3SRformula−0.0076.316.150.940.0047.857.930.94
formula−0.0016.336.060.940.0047.877.820.94
formula0.0126.315.940.930.0027.937.700.94
SBRformula−0.0065.906.060.950.0137.807.870.95
formula−0.0035.926.010.950.0117.877.810.94
formula0.0045.955.870.950.0037.927.650.94
MINformula0.0095.946.080.960.0097.747.890.96
formula0.0136.016.030.950.0097.757.830.96
formula0.0256.075.890.940.0027.747.670.95
τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0084.954.740.94−0.0075.915.920.95
formula−0.0022.972.900.94−0.0023.503.390.94
formula−0.0032.982.900.94−0.0033.503.390.94
SBRformula−0.0104.684.680.940.0025.935.870.94
formula−0.0092.902.870.95−0.0073.333.360.95
formula−0.0102.932.850.95−0.0083.353.350.95
MINformula0.0034.734.690.95−0.0085.805.870.95
formula0.0002.862.880.95−0.0013.453.380.95
formula0.0002.892.860.95−0.0013.463.360.94
2SRformula−0.0117.877.570.95−0.0149.909.920.95
formula−0.0013.653.560.94−0.0044.944.780.94
formula−0.0033.383.300.94−0.0034.704.590.94
SBRformula−0.0117.597.490.950.0099.999.860.95
formula−0.0103.643.530.94−0.0094.834.750.95
formula−0.0103.313.250.94−0.0094.554.550.95
MINformula0.0067.647.510.95−0.0159.669.850.96
formula0.0013.613.550.95−0.0014.874.770.94
formula0.0003.293.270.95−0.0014.684.570.95
3SRformula−0.0076.316.150.940.0047.857.930.94
formula−0.0016.336.060.940.0047.877.820.94
formula0.0126.315.940.930.0027.937.700.94
SBRformula−0.0065.906.060.950.0137.807.870.95
formula−0.0035.926.010.950.0117.877.810.94
formula0.0045.955.870.950.0037.927.650.94
MINformula0.0095.946.080.960.0097.747.890.96
formula0.0136.016.030.950.0097.757.830.96
formula0.0256.075.890.940.0027.747.670.95

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; MIN, minimization; SD, standard deviation; SE, standard error; CP, coverage probability.

TABLE 2

Simulated biases, standard deviations, standard errors, and coverage probabilities for different estimators and randomization methods under equal treatment allocation ratio 1:2:2 across strata.

τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0084.954.740.94−0.0075.915.920.95
formula−0.0022.972.900.94−0.0023.503.390.94
formula−0.0032.982.900.94−0.0033.503.390.94
SBRformula−0.0104.684.680.940.0025.935.870.94
formula−0.0092.902.870.95−0.0073.333.360.95
formula−0.0102.932.850.95−0.0083.353.350.95
MINformula0.0034.734.690.95−0.0085.805.870.95
formula0.0002.862.880.95−0.0013.453.380.95
formula0.0002.892.860.95−0.0013.463.360.94
2SRformula−0.0117.877.570.95−0.0149.909.920.95
formula−0.0013.653.560.94−0.0044.944.780.94
formula−0.0033.383.300.94−0.0034.704.590.94
SBRformula−0.0117.597.490.950.0099.999.860.95
formula−0.0103.643.530.94−0.0094.834.750.95
formula−0.0103.313.250.94−0.0094.554.550.95
MINformula0.0067.647.510.95−0.0159.669.850.96
formula0.0013.613.550.95−0.0014.874.770.94
formula0.0003.293.270.95−0.0014.684.570.95
3SRformula−0.0076.316.150.940.0047.857.930.94
formula−0.0016.336.060.940.0047.877.820.94
formula0.0126.315.940.930.0027.937.700.94
SBRformula−0.0065.906.060.950.0137.807.870.95
formula−0.0035.926.010.950.0117.877.810.94
formula0.0045.955.870.950.0037.927.650.94
MINformula0.0095.946.080.960.0097.747.890.96
formula0.0136.016.030.950.0097.757.830.96
formula0.0256.075.890.940.0027.747.670.95
τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0084.954.740.94−0.0075.915.920.95
formula−0.0022.972.900.94−0.0023.503.390.94
formula−0.0032.982.900.94−0.0033.503.390.94
SBRformula−0.0104.684.680.940.0025.935.870.94
formula−0.0092.902.870.95−0.0073.333.360.95
formula−0.0102.932.850.95−0.0083.353.350.95
MINformula0.0034.734.690.95−0.0085.805.870.95
formula0.0002.862.880.95−0.0013.453.380.95
formula0.0002.892.860.95−0.0013.463.360.94
2SRformula−0.0117.877.570.95−0.0149.909.920.95
formula−0.0013.653.560.94−0.0044.944.780.94
formula−0.0033.383.300.94−0.0034.704.590.94
SBRformula−0.0117.597.490.950.0099.999.860.95
formula−0.0103.643.530.94−0.0094.834.750.95
formula−0.0103.313.250.94−0.0094.554.550.95
MINformula0.0067.647.510.95−0.0159.669.850.96
formula0.0013.613.550.95−0.0014.874.770.94
formula0.0003.293.270.95−0.0014.684.570.95
3SRformula−0.0076.316.150.940.0047.857.930.94
formula−0.0016.336.060.940.0047.877.820.94
formula0.0126.315.940.930.0027.937.700.94
SBRformula−0.0065.906.060.950.0137.807.870.95
formula−0.0035.926.010.950.0117.877.810.94
formula0.0045.955.870.950.0037.927.650.94
MINformula0.0095.946.080.960.0097.747.890.96
formula0.0136.016.030.950.0097.757.830.96
formula0.0256.075.890.940.0027.747.670.95

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; MIN, minimization; SD, standard deviation; SE, standard error; CP, coverage probability.

TABLE 3

Simulated biases, standard deviations, standard errors, and coverage probabilities for different estimators and randomization methods under different treatment allocation ratios across strata.

τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0045.034.960.940.0005.725.700.95
formula0.0013.052.990.94−0.0013.483.370.94
formula0.0013.062.970.94−0.0013.483.350.94
SBRformula−0.0055.004.900.940.0105.785.650.94
formula−0.0043.012.940.94−0.0013.303.330.95
formula−0.0053.042.920.93−0.0023.323.310.95
2SRformula−0.0088.058.000.94−0.0019.499.490.95
formula0.0003.673.670.95−0.0024.874.770.94
formula0.0003.443.360.94−0.0024.674.550.94
SBRformula−0.0048.097.910.940.0219.659.430.94
formula−0.0023.743.610.940.0004.724.730.95
formula−0.0043.413.320.94−0.0014.494.520.95
3SRformula0.0026.616.430.950.0077.637.610.95
formula0.0036.606.370.940.0097.647.560.94
formula0.0066.676.190.930.0147.717.400.94
SBRformula−0.0036.416.360.940.0077.647.550.95
formula−0.0016.426.310.950.0087.677.500.94
formula−0.0026.516.150.930.0077.697.360.94
τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0045.034.960.940.0005.725.700.95
formula0.0013.052.990.94−0.0013.483.370.94
formula0.0013.062.970.94−0.0013.483.350.94
SBRformula−0.0055.004.900.940.0105.785.650.94
formula−0.0043.012.940.94−0.0013.303.330.95
formula−0.0053.042.920.93−0.0023.323.310.95
2SRformula−0.0088.058.000.94−0.0019.499.490.95
formula0.0003.673.670.95−0.0024.874.770.94
formula0.0003.443.360.94−0.0024.674.550.94
SBRformula−0.0048.097.910.940.0219.659.430.94
formula−0.0023.743.610.940.0004.724.730.95
formula−0.0043.413.320.94−0.0014.494.520.95
3SRformula0.0026.616.430.950.0077.637.610.95
formula0.0036.606.370.940.0097.647.560.94
formula0.0066.676.190.930.0147.717.400.94
SBRformula−0.0036.416.360.940.0077.647.550.95
formula−0.0016.426.310.950.0087.677.500.94
formula−0.0026.516.150.930.0077.697.360.94

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; SD, standard deviation; SE, standard error; CP, coverage probability.

TABLE 3

Simulated biases, standard deviations, standard errors, and coverage probabilities for different estimators and randomization methods under different treatment allocation ratios across strata.

τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0045.034.960.940.0005.725.700.95
formula0.0013.052.990.94−0.0013.483.370.94
formula0.0013.062.970.94−0.0013.483.350.94
SBRformula−0.0055.004.900.940.0105.785.650.94
formula−0.0043.012.940.94−0.0013.303.330.95
formula−0.0053.042.920.93−0.0023.323.310.95
2SRformula−0.0088.058.000.94−0.0019.499.490.95
formula0.0003.673.670.95−0.0024.874.770.94
formula0.0003.443.360.94−0.0024.674.550.94
SBRformula−0.0048.097.910.940.0219.659.430.94
formula−0.0023.743.610.940.0004.724.730.95
formula−0.0043.413.320.94−0.0014.494.520.95
3SRformula0.0026.616.430.950.0077.637.610.95
formula0.0036.606.370.940.0097.647.560.94
formula0.0066.676.190.930.0147.717.400.94
SBRformula−0.0036.416.360.940.0077.647.550.95
formula−0.0016.426.310.950.0087.677.500.94
formula−0.0026.516.150.930.0077.697.360.94
τ1τ2
ModelRand.EstimatorBiasSDSECPBiasSDSECP
1SRformula−0.0045.034.960.940.0005.725.700.95
formula0.0013.052.990.94−0.0013.483.370.94
formula0.0013.062.970.94−0.0013.483.350.94
SBRformula−0.0055.004.900.940.0105.785.650.94
formula−0.0043.012.940.94−0.0013.303.330.95
formula−0.0053.042.920.93−0.0023.323.310.95
2SRformula−0.0088.058.000.94−0.0019.499.490.95
formula0.0003.673.670.95−0.0024.874.770.94
formula0.0003.443.360.94−0.0024.674.550.94
SBRformula−0.0048.097.910.940.0219.659.430.94
formula−0.0023.743.610.940.0004.724.730.95
formula−0.0043.413.320.94−0.0014.494.520.95
3SRformula0.0026.616.430.950.0077.637.610.95
formula0.0036.606.370.940.0097.647.560.94
formula0.0066.676.190.930.0147.717.400.94
SBRformula−0.0036.416.360.940.0077.647.550.95
formula−0.0016.426.310.950.0087.677.500.94
formula−0.0026.516.150.930.0077.697.360.94

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; SD, standard deviation; SE, standard error; CP, coverage probability.

  • (1)

    Almost all estimators had negligible bias (formula). The results of Model 3 are an exception, with a maximum bias of approximately 3%.

  • (2)

    Most of the proposed variance estimators matched the corresponding simulated variance, and the expected CP of 95% was reached.

  • (3)

    In Tables 1 and 2, for all models, formula was more efficient than formula, which supports Corollary 1. Specifically, in Models 1 and 2, formula reduced the standard errors by approximately 40%–50%.

  • (4)

    For all models we considered, formula was more efficient than formula, as we showed in Corollary 2. The efficiency gain was greater when formula was used than when formula was used. As expected, in Model 1, the standard errors were similar for formula and formula. In Model 2, formula led to an approximately 10% reduction in standard errors compared with formula. These results also confirm that the efficiency gain is guaranteed, regardless of whether the target proportions across strata are the same or different.

  • (5)

    For two linear models (Models 1 and 2), estimators with additional covariates considerably improved the efficiency compared with formula. However, the improvement is not apparent in Model 3, which is nonlinear and may not benefit most from the regression-based adjustment.

5 Clinical Trial Example

In this section, we first analyze real data from a clinical trial that compared nefazodone, the cognitive behavioral analysis system of psychotherapy (CBASP), and their combination for the treatment of chronic depression (Keller et al., 2000). We consider the combination as the control group indexed as 0, nefazodone as the treatment indexed as 1, and the CBASP as the treatment indexed as 2. The trial randomly assigned 681 patients to three treatments with an allocation ratio of 1:1:1, and the outcome of interest was the final score of the 24-item Hamilton rating scale for depression. To evaluate the performance of our proposed estimators, we consider five covariates in the estimation: AGE, HAMD17, HAMD24, HAMD_COGNID, and GENDER, which are detailed in Table 4. The data are stratified using GENDER and HAMD17, where HAMD17 is discretized into two levels: less than 18 and greater than or equal to 18. AGE, formula, HAMD17, HAMD24, and HAMD_COGNID are treated as additional covariates. Table 5 presents the estimates along with the 95% confidence intervals and the variance reductions. The results show that our proposed estimators reduce the variances by 5.83% to 10.16%. Note that the proposed methods lead to slightly increased estimates. The increase is mainly due to the adjustment of the imbalance in HAMD17, which is highly predictive of the outcomes.

TABLE 4

Description of selected variables from the trial of nefazodone and CBASP.

VariableDescription
AGEAge in years at screening
HAMD17Total HAMD-17 score
HAMD24Total HAMD-24 score
HAMD_COGNIDHAMD cognitive disturbance score
GENDER1 female and 0 male
VariableDescription
AGEAge in years at screening
HAMD17Total HAMD-17 score
HAMD24Total HAMD-24 score
HAMD_COGNIDHAMD cognitive disturbance score
GENDER1 female and 0 male
TABLE 4

Description of selected variables from the trial of nefazodone and CBASP.

VariableDescription
AGEAge in years at screening
HAMD17Total HAMD-17 score
HAMD24Total HAMD-24 score
HAMD_COGNIDHAMD cognitive disturbance score
GENDER1 female and 0 male
VariableDescription
AGEAge in years at screening
HAMD17Total HAMD-17 score
HAMD24Total HAMD-24 score
HAMD_COGNIDHAMD cognitive disturbance score
GENDER1 female and 0 male
TABLE 5

Estimates, 95% CI and variance reduction for real data from the trial of nefazodone and CBSAP.

τ1τ2
EstimatorEstimate95% CIVR(%)Estimate95% CIVR(%)
formula4.61(2.90, 6.32)4.99(3.27, 6.70)
formula4.74(3.08, 6.40)5.835.11(3.46, 6.76)7.38
formula4.67(3.05, 6.30)9.785.22(3.60, 6.85)10.16
τ1τ2
EstimatorEstimate95% CIVR(%)Estimate95% CIVR(%)
formula4.61(2.90, 6.32)4.99(3.27, 6.70)
formula4.74(3.08, 6.40)5.835.11(3.46, 6.76)7.38
formula4.67(3.05, 6.30)9.785.22(3.60, 6.85)10.16

Note: CI, confidence interval; VR, variance reduction.

TABLE 5

Estimates, 95% CI and variance reduction for real data from the trial of nefazodone and CBSAP.

τ1τ2
EstimatorEstimate95% CIVR(%)Estimate95% CIVR(%)
formula4.61(2.90, 6.32)4.99(3.27, 6.70)
formula4.74(3.08, 6.40)5.835.11(3.46, 6.76)7.38
formula4.67(3.05, 6.30)9.785.22(3.60, 6.85)10.16
τ1τ2
EstimatorEstimate95% CIVR(%)Estimate95% CIVR(%)
formula4.61(2.90, 6.32)4.99(3.27, 6.70)
formula4.74(3.08, 6.40)5.835.11(3.46, 6.76)7.38
formula4.67(3.05, 6.30)9.785.22(3.60, 6.85)10.16

Note: CI, confidence interval; VR, variance reduction.

Furthermore, to illustrate the effectiveness of proposed estimators under more general settings of allocation ratio, we also analyze the synthetic data generated based on the trial. To be specific, we use simple randomization or stratified block randomization to allocate the patients with different allocation ratios. The unobserved potential outcomes are imputed by fitting the real trial data with a nonparametric spline using the function bigssa in the R package bigspline. The covariates we choose to fit the data are the same with the covariates in Table 4. The treatment effects and variances are estimated based on 500 replicates. The results are presented in Table 6 and show that our proposed estimators have a variance reduction compared to the benchmark estimator. The stratum-specific estimator reduces variance by 9.07% to 13.26%, and the stratum-common estimator reduces variance by 4.96% to 7.97%. The efficiency gain of the proposed estimators is observed under both simple randomization and stratified block randomization.

TABLE 6

Estimates, 95% CI and variance reduction for synthetic data from the trial of nefazodone and CBSAP under different randomization methods and allocation ratios.

τ1τ2
Rand.EstimatorEstimate95% CIVR (%)Estimate95% CIVR (%)
Allocation Ratio: 1:1:1
SRformula4.54(3.51, 5.57)5.01(3.96, 6.06)
formula4.53(3.53, 5.54)4.965.02(4.01, 6.04)6.67
formula4.53(3.55, 5.51)9.215.04(4.05, 6.03)10.85
SBRformula4.56(3.54, 5.58)5.05(4.01, 6.09)
formula4.55(3.56, 5.54)5.025.05(4.05, 6.06)6.83
formula4.53(3.56, 5.50)9.075.07(4.09, 6.06)10.66
Allocation Ratio: 1:2:2
SRformula4.51(3.38, 5.64)5.00(3.86, 6.15)
formula4.51(3.42, 5.60)6.505.01(3.91, 6.11)7.55
formula4.50(3.44, 5.56)12.415.03(3.96, 6.09)13.26
SBRformula4.52(3.41, 5.63)5.01(3.88, 6.14)
formula4.51(3.44, 5.59)6.625.01(3.92, 6.10)7.72
formula4.50(3.46, 5.55)11.815.03(3.97, 6.08)12.65
Allocation Ratio: 1:2:2 for 2 strata and 2:3:5 for the other 2 strata
SRformula4.51(3.36, 5.67)5.02(3.89, 6.16)
formula4.51(3.39, 5.63)6.405.03(3.94, 6.11)7.76
formula4.50(3.42, 5.58)13.105.04(3.98, 6.09)13.14
SBRformula4.53(3.39, 5.67)5.02(3.91, 6.13)
formula4.52(3.42, 5.62)6.555.02(3.95, 6.09)7.97
formula4.51(3.44, 5.57)12.495.04(3.99, 6.08)12.41
τ1τ2
Rand.EstimatorEstimate95% CIVR (%)Estimate95% CIVR (%)
Allocation Ratio: 1:1:1
SRformula4.54(3.51, 5.57)5.01(3.96, 6.06)
formula4.53(3.53, 5.54)4.965.02(4.01, 6.04)6.67
formula4.53(3.55, 5.51)9.215.04(4.05, 6.03)10.85
SBRformula4.56(3.54, 5.58)5.05(4.01, 6.09)
formula4.55(3.56, 5.54)5.025.05(4.05, 6.06)6.83
formula4.53(3.56, 5.50)9.075.07(4.09, 6.06)10.66
Allocation Ratio: 1:2:2
SRformula4.51(3.38, 5.64)5.00(3.86, 6.15)
formula4.51(3.42, 5.60)6.505.01(3.91, 6.11)7.55
formula4.50(3.44, 5.56)12.415.03(3.96, 6.09)13.26
SBRformula4.52(3.41, 5.63)5.01(3.88, 6.14)
formula4.51(3.44, 5.59)6.625.01(3.92, 6.10)7.72
formula4.50(3.46, 5.55)11.815.03(3.97, 6.08)12.65
Allocation Ratio: 1:2:2 for 2 strata and 2:3:5 for the other 2 strata
SRformula4.51(3.36, 5.67)5.02(3.89, 6.16)
formula4.51(3.39, 5.63)6.405.03(3.94, 6.11)7.76
formula4.50(3.42, 5.58)13.105.04(3.98, 6.09)13.14
SBRformula4.53(3.39, 5.67)5.02(3.91, 6.13)
formula4.52(3.42, 5.62)6.555.02(3.95, 6.09)7.97
formula4.51(3.44, 5.57)12.495.04(3.99, 6.08)12.41

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; CI, confidence interval; VR, variance reduction.

TABLE 6

Estimates, 95% CI and variance reduction for synthetic data from the trial of nefazodone and CBSAP under different randomization methods and allocation ratios.

τ1τ2
Rand.EstimatorEstimate95% CIVR (%)Estimate95% CIVR (%)
Allocation Ratio: 1:1:1
SRformula4.54(3.51, 5.57)5.01(3.96, 6.06)
formula4.53(3.53, 5.54)4.965.02(4.01, 6.04)6.67
formula4.53(3.55, 5.51)9.215.04(4.05, 6.03)10.85
SBRformula4.56(3.54, 5.58)5.05(4.01, 6.09)
formula4.55(3.56, 5.54)5.025.05(4.05, 6.06)6.83
formula4.53(3.56, 5.50)9.075.07(4.09, 6.06)10.66
Allocation Ratio: 1:2:2
SRformula4.51(3.38, 5.64)5.00(3.86, 6.15)
formula4.51(3.42, 5.60)6.505.01(3.91, 6.11)7.55
formula4.50(3.44, 5.56)12.415.03(3.96, 6.09)13.26
SBRformula4.52(3.41, 5.63)5.01(3.88, 6.14)
formula4.51(3.44, 5.59)6.625.01(3.92, 6.10)7.72
formula4.50(3.46, 5.55)11.815.03(3.97, 6.08)12.65
Allocation Ratio: 1:2:2 for 2 strata and 2:3:5 for the other 2 strata
SRformula4.51(3.36, 5.67)5.02(3.89, 6.16)
formula4.51(3.39, 5.63)6.405.03(3.94, 6.11)7.76
formula4.50(3.42, 5.58)13.105.04(3.98, 6.09)13.14
SBRformula4.53(3.39, 5.67)5.02(3.91, 6.13)
formula4.52(3.42, 5.62)6.555.02(3.95, 6.09)7.97
formula4.51(3.44, 5.57)12.495.04(3.99, 6.08)12.41
τ1τ2
Rand.EstimatorEstimate95% CIVR (%)Estimate95% CIVR (%)
Allocation Ratio: 1:1:1
SRformula4.54(3.51, 5.57)5.01(3.96, 6.06)
formula4.53(3.53, 5.54)4.965.02(4.01, 6.04)6.67
formula4.53(3.55, 5.51)9.215.04(4.05, 6.03)10.85
SBRformula4.56(3.54, 5.58)5.05(4.01, 6.09)
formula4.55(3.56, 5.54)5.025.05(4.05, 6.06)6.83
formula4.53(3.56, 5.50)9.075.07(4.09, 6.06)10.66
Allocation Ratio: 1:2:2
SRformula4.51(3.38, 5.64)5.00(3.86, 6.15)
formula4.51(3.42, 5.60)6.505.01(3.91, 6.11)7.55
formula4.50(3.44, 5.56)12.415.03(3.96, 6.09)13.26
SBRformula4.52(3.41, 5.63)5.01(3.88, 6.14)
formula4.51(3.44, 5.59)6.625.01(3.92, 6.10)7.72
formula4.50(3.46, 5.55)11.815.03(3.97, 6.08)12.65
Allocation Ratio: 1:2:2 for 2 strata and 2:3:5 for the other 2 strata
SRformula4.51(3.36, 5.67)5.02(3.89, 6.16)
formula4.51(3.39, 5.63)6.405.03(3.94, 6.11)7.76
formula4.50(3.42, 5.58)13.105.04(3.98, 6.09)13.14
SBRformula4.53(3.39, 5.67)5.02(3.91, 6.13)
formula4.52(3.42, 5.62)6.555.02(3.95, 6.09)7.97
formula4.51(3.44, 5.57)12.495.04(3.99, 6.08)12.41

Note: Rand., randomization; SR, simple randomization; SBR, stratified block randomization; CI, confidence interval; VR, variance reduction.

6 Discussion

In covariate-adaptive randomization, efficiency could be improved by adjusting baseline covariates. In this study, we consider stratum-common and stratum-specific treatment effect estimators for general scenarios, including multiple treatments and different allocation ratios across strata. We also develop consistent estimators for asymptotic variance. We found that when compared with the benchmark estimator and the stratum-common estimator, the stratum-specific estimator can ensure efficiency gains for all cases under our settings. Therefore, we recommend the use of the stratum-specific estimator when certain conditions hold.

A vital condition for the validity of the stratum-specific estimator is that the coefficient estimators obtained from regression must be consistent, which requires a considerable number of units in each stratum. This condition may not be satisfied in clinical trials that have many small strata. Future work could focus on addressing such issues. In addition, future studies could explore the extension of our results to generalized linear models, such as logistic regression for binary outcomes or Poisson regression for count data (Guo & Basse, 2023). Finally, another potential area of interest is a generalization to the high-dimensional settings under multiple treatments.

Data Availability Statement

The data that support the findings in this paper are available in the Supporting Information of this paper.

Supporting Information

Web Appendices A and B, referenced separately in Sections 3 and 4, and R codes to reproduce the simulation results are available with this paper at the Biometrics website on Wiley Online Library. An R package caratMULT, which implements all the estimators in this paper, is available at GitHub (https://github.com/guyujia98/caratMULT).

Acknowledgments

This work was supported by National Natural Science Foundation of China (Grant Nos. 12171476, 12071242).

References

Atkinson
,
A.C.
(
1982
)
Optimum biased coin designs for sequential clinical trials with prognostic factors
.
Biometrika
,
69
,
61
67
.

Angrist
,
J.
,
Oreopoulos
,
P.
&
Williams
,
T.
(
2014
)
When opportunity knocks, who answers? New evidence on college achievement awards
.
Journal of Human Resources
,
49
,
572
610
.

Baden
,
L.R.
,
El Sahly
,
H.M.
,
Essink
,
B.
,
Kotloff
,
K.
,
Frey
,
S.
,
Novak
,
R.
et al. (
2021
)
Efficacy and safety of the mRNA-1273 SARS-CoV-2 vaccine
.
New England Journal of Medicine
,
384
,
403
416
.

Bai
,
Y.
,
Romano
,
J.P.
&
Shaikh
,
A.M.
(
2022
)
Inference in experiments with matched pairs
.
Journal of the American Statistical Association
,
117
,
1726
1737
.

Begg
,
C.B.
&
Iglewicz
,
B.
(
1980
)
A treatment allocation procedure for sequential clinical trials
.
Biometrics
,
36
,
81
90
.

Bugni
,
F.A.
,
Canay
,
I.A.
&
Shaikh
,
A.M.
(
2018
)
Inference under covariate-adaptive randomization
.
Journal of the American Statistical Association
,
113
,
1784
1796
.

Bugni
,
F.A.
,
Canay
,
I.A.
&
Shaikh
,
A.M.
(
2019
)
Inference under covariate-adaptive randomization with multiple treatments
.
Quantitative Economics
,
10
,
1747
1785
.

Chong
,
A.
,
Cohen
,
I.
,
Field
,
E.
,
Nakasone
,
E.
&
Torero
,
M.
(
2016
)
Iron deficiency and schooling attainment in Peru
.
American Economic Journal: Applied Economics
,
8
,
222
255
.

Ciolino
,
J.D.
,
Palac
,
H.L.
,
Yang
,
A.
,
Vaca
,
M.
&
Belli
,
H.M.
(
2019
)
Ideal vs. real: a systematic review on handling covariates in randomized controlled trials
.
BMC Medical Research Methodology
,
19
,
1
11
.

Efron
,
B.
(
1971
)
Forcing a sequential experiment to be balanced
.
Biometrika
58
,
403
417
.

European Medicines Agency
(
2015
).
Guideline on adjustment for baseline covariates in clinical trials
. EMA/CHMP/295050/2013.

Freedman
,
D.A.
(
2008
)
On regression adjustments in experiments with several treatments
.
The Annals of Applied Statistics
,
2
,
176
196
.

Guo
,
K.
&
Basse
,
G.
(
2023
)
The generalized oaxaca-blinder estimator
.
Journal of the American Statistical Association
,
118
,
524
536
.

Han
,
B.
,
Enas
,
N.H.
,
McEntegart
D.
, (
2009
)
Randomization by minimization for unbalanced treatment allocation
.
Statistics in Medicine
,
28
,
3329
3346
.

Hu
,
F.
,
Ye
,
X.
&
Zhang
,
L.-X.
(
2023
)
Multi-arm covariate-adaptive randomization
.
Science China Mathematics
,
66
,
163
190
.

Hu
,
Y.
&
Hu
,
F.
(
2012
)
Asymptotic properties of covariate-adaptive randomization
.
The Annals of Statistics
,
40
,
1794
1815
.

Keller
,
M.B.
,
McCullough
,
J.P.
,
Klein
,
D.N.
,
Arnow
,
B.
,
Dunner
,
D.L.
,
Gelenberg
,
A.J.
et al. (
2000
)
A comparison of nefazodone, the cognitive behavioral-analysis system of psychotherapy, and their combination for the treatment of chronic depression
.
New England Journal of Medicine
,
342
,
1462
1470
.

Kuznetsova
,
O.M.
&
Johnson
,
V.P.
(
2017
)
Approaches to expanding the two-arm biased coin randomization to unequal allocation while preserving the unconditional allocation ratio
.
Statistics in Medicine
,
36
,
2483
2498
.

Kuznetsova
,
O.M.
&
Tymofyeyev
,
Y.
(
2012
)
Preserving the allocation ratio at every allocation with biased coin randomization and minimization in studies with unequal allocation
.
Statistics in Medicine
,
31
,
701
723
.

Lin
,
W.
(
2013
)
Agnostic notes on regression adjustments to experimental data: reexamining freedman's critique
.
The Annals of Applied Statistics
,
7
,
295
318
.

Lin
,
Y.
,
Zhu
,
M.
&
Su
,
Z.
(
2015
)
The pursuit of balance: an overview of covariate-adaptive randomization techniques in clinical trials
.
Contemporary Clinical Trials
,
45
,
21
25
.

Liu
,
H.
,
Tu
,
F.
&
Ma
,
W.
(
2023
)
Lasso-adjusted treatment effect estimation under covariate-adaptive randomization
.
Biometrika
,
110
,
431
447
.

Liu
,
H.
&
Yang
,
Y.
(
2020
)
Regression-adjusted average treatment effect estimates in stratified randomized experiments
.
Biometrika
,
107
,
935
948
.

Ma
,
W.
,
Hu
,
F.
&
Zhang
,
L.
(
2015
)
Testing hypotheses of covariate-adaptive randomized clinical trials
.
Journal of the American Statistical Association
,
110
,
669
680
.

Ma
,
W.
,
Qin
,
Y.
,
Li
,
Y.
&
Hu
,
F.
(
2020
)
Statistical inference for covariate-adaptive randomization procedures
.
Journal of the American Statistical Association
,
115
,
1488
1497
.

Ma
,
W.
,
Tu
,
F.
&
Liu
,
H.
(
2022
)
Regression analysis for covariate-adaptive randomization: A robust and efficient inference perspective
.
Statistics in Medicine
,
41
,
5645
5661
.

Morgan
,
K.L.
&
Rubin
,
D.B.
(
2012
)
Rerandomization to improve covariate balance in experiments
.
The Annals of Statistics
,
40
,
1263
1282
.

Neyman
,
J.S.
(
1923
)
On the application of probability theory to agricultural experiments. essay on principles Section 9
.
Annals of Agricultural Sciences
,
10
,
1
51
.

Pocock
,
S.J.
&
Simon
,
R.
(
1975
)
Sequential treatment assignment with balancing for prognostic factors in the controlled clinical trial
.
Biometrics
,
31
,
103
115
.

Rosenberger
,
W.F.
&
Lachin
,
J.M.
(
2015
)
Randomization in Clinical Trials: Theory and Practice
.
John Wiley & Sons
.

Rubin
,
D.B.
(
1974
)
Estimating causal effects of treatments in randomized and nonrandomized studies
.
Journal of Educational Psychology
,
66
,
688
701
.

Shao
,
J.
,
Yu
,
X.
&
Zhong
,
B.
(
2010
)
A theory for testing hypotheses under covariate-adaptive randomization
.
Biometrika
,
97
,
347
360
.

Taves
,
D.R.
(
1974
)
Minimization: a new method of assigning patients to treatment and control groups
.
Clinical Pharmacology & Therapeutics
,
15
,
443
453
.

Wang
,
T.
&
Ma
,
W.
(
2021
)
The impact of misclassification on covariate-adaptive randomized clinical trials
.
Biometrics
,
77
,
451
464
.

Wang
,
Y.
,
Zhang
,
D.
,
Du
,
G.
,
Du
,
R.
,
Zhao
,
J.
,
Jin
,
Y.
et al. (
2020
).
Remdesivir in adults with severe COVID-19: a randomised, double-blind, placebo-controlled, multicentre trial
.
The Lancet
,
395
,
1569
1578
.

Yang
,
L.
&
Tsiatis
,
A.A.
(
2001
)
Efficiency study of estimators for a treatment effect in a pretest–posttest trial
.
The American Statistician
,
55
,
314
321
.

Ye
,
T.
,
Shao
,
J.
,
Yi
,
Y.
&
Zhao
,
Q.
(
2022a
)
Toward better practice of covariate adjustment in analyzing randomized clinical trials
.
Journal of the American Statistical Association
, in press.

Ye
,
T.
,
Yi
,
Y.
&
Shao
,
J.
(
2022b
)
Inference on the average treatment effect under minimization and other covariate-adaptive randomization methods
.
Biometrika
,
109
,
33
47
.

Zelen
,
M.
(
1974
)
The randomization and stratification of patients to clinical trials
.
Journal of Chronic Diseases
,
27
,
365
375
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Supplementary data