Abstract

Constructing a confidence interval for the ratio of bivariate normal means is a classical problem in statistics. Several methods have been proposed in the literature. The Fieller method is known as an exact method, but can produce an unbounded confidence interval if the denominator of the ratio is not significantly deviated from 0; while the delta and some numeric methods are all bounded, they are only first-order correct. Motivated by a real-world problem, we propose the penalized Fieller method, which employs the same principle as the Fieller method, but adopts a penalized likelihood approach to estimate the denominator. The proposed method has a simple closed form, and can always produce a bounded confidence interval by selecting a suitable penalty parameter. Moreover, the new method is shown to be second-order correct under the bivariate normality assumption, that is, its coverage probability will converge to the nominal level faster than other bounded methods. Simulation results show that our proposed method generally outperforms the existing methods in terms of controlling the coverage probability and the confidence width and is particularly useful when the denominator does not have adequate power to reject being 0. Finally, we apply the proposed approach to the interval estimation of the median response dose in pharmacology studies to show its practical usefulness.

1 Introduction

The ratio estimate, defined as the ratio of means of two random variables, is often encountered in biomedical studies (Tin, 1965). Many problems reduce to forming a confidence interval (CI) for the ratio of two (asymptotically) normal random variables, such as noninferiority and bioequivalence assessment, and dose-response analysis (Vuorinen and Tuominen, 1994; Faraggi et al., 2003; Pigeot et al., 2003). Hence, several statistical methods have been proposed to address this problem (Fieller, 1954; Hayya et al., 1975; Choquet et al., 1999; Wang et al., 2015). Among them, the Fieller method and the delta method are two commonly used analytic approaches (Herson, 1975; Hirschberg and Lye, 2010; Sherman et al., 2011). The Fieller method, which gives an exact CI for achieving the required coverage probability (CP), is based on the inverting of the t statistic. This type of CI is asymmetric around the ratio estimate, which is a good property, as it reflects the skewness of the small-sample distribution of the ratio. However, when the denominator of the ratio is not significantly deviated from 0, the Fieller CI will be unbounded, being either the entire real line or the union of two disconnected infinite intervals (Fieller, 1954). Moreover, the Fieller method always has a positive unbounded probability (UP) even in large samples, and its UP increases when the significance level becomes smaller. That is, when the confidence level is set to be sufficiently close to 1, the Fieller CI can always be unbounded. On the other hand, the delta method directly adopts the first-order Taylor expansion by treating the ratio as a nonlinear function of the numerator and the denominator. By assuming asymptotic normality in large samples, this method produces a symmetric and bounded CI in contrast to the Fieller method. However, the delta method often has an inaccurate CP and unbalanced tail errors even in a moderate sample size. Furthermore, previous studies have shown that the delta method has even poorer performance if the true value of the ratio has opposite sign to the correlation coefficient between the numerator and denominator (Hirschberg and Lye, 2010). Because the exact CI for a ratio estimate is naturally asymmetric, this method has been regarded having the “most serious error” in CI construction due to its enforced symmetry (Efron and Tibshirani, 1994). Although a method based on second-order Taylor expansion (Hayya et al., 1975) has also been proposed, it is rarely used because this method has the same disadvantages as the delta method, and its calculation is more involved. Because of the main drawbacks of the preceding methods, some numerical procedures have been proposed, such as the direct integral method, the parametric bootstrap (PB) method, and the nonparametric bootstrap method (Choquet et al., 1999; Wang et al., 2015). Although the CIs obtained from those iterative procedures are bounded, and generally have a better CP than the delta method, they usually result in a much wider confidence width (CW) and are more computationally intensive (Wang et al., 2015). It should be noted that all the preceding methods, other than the Fieller CI, are first-order correct (Wang et al., 2015). This fact indicates the relatively limited improvement of the numeric methods over the delta method, because they are of the same order correctness.

In this article, we propose a novel analytic approach for constructing the CI for the ratio estimate. Our method, the penalized Fieller (PF) method, uses the same principle as the Fieller method, but adopts a penalized likelihood approach to estimate the denominator. The PF method always produces a bounded CI with an appropriately chosen penalty parameter. Moreover, the proposed method is second-order correct under the bivariate normality assumption. That is, its coverage probability will converge to the nominal level faster than other bounded methods. Simulation results show that the PF method generally outperforms the existing methods in terms of controlling the CP and the CW and is particularly useful when the denominator does not have adequate power to reject being 0. Finally, we apply the new approach to the interval estimation of the median response dose, which is commonly used in pharmacology, to show its practical usefulness.

2 Method

2.1 Definitions

Suppose that a pair of random variables formula jointly follow a bivariate normal distribution with marginal distributions, respectively, being formula and formula, and their covariance being σ12. Denote formula to be the correlation coefficient, and let formula, formula, ⋅⋅⋅, formula be an independent and identically distributed sample from the above distribution. Then, the sample means, variances, covariance, and correlation coefficient can be estimated by formula, formula, formula, formula, formula, and formula, respectively. Further, the variance and covariance of formula and formula, respectively, are formula, formula, and formula with their estimates, respectively, being formula, formula, and formula. We assume that μ2 is a nonzero constant so that formula can be well defined. The commonly used point estimate for r is the ratio estimate formula.

2.2 Overview of the Fieller Method

Because the PF method employs the same principle as the Fieller method, we first briefly introduce the latter. The Fieller CI is calculated by inverting the t statistic for testing the null hypothesis formula. It is shown that formula follows a t distribution with formula degrees of freedom under the null hypothesis. Hence, the confidence region of the Fieller CI for r with the confidence level formula is identical to the set of r that leads to the acceptance of the null hypothesis at the significance level α, and is determined by the inequality formula, where formula denotes the formula quantile of the t distribution with formula degrees of freedom. Rearranging this inequality results in the following quadratic inequality with respect to r: formula, where formula, formula, and formula. The solution of this inequality depends on the sign of A and formula. Through simple calculation, Δ can be further expressed as formula. Hence, formula also implies formula. Let formula and formula (formula) be the two roots of the equation formula if formula. In the situation of formula, the Fieller CI is formula. Notably, formula indicates that the denominator is significantly different from 0. However, if formula, the Fieller CI will be unbounded. For instance, if formula, the Fieller CI will be formula; otherwise, it will be formula. Meanwhile, it is clear to see that A will always be less than 0 (ie, unbounded) for a sufficiently small α. It should be emphasized that the null hypothesis formula is equivalent to formula only when formula. However, the Fieller method does not take this information into consideration. As such, the Fieller CI has the potential to overestimate the CW.

2.3 Penalized Estimate for the Denominator

The Fieller method can be unbounded when the denominator is not significantly different from 0. This scenario will occur if the denominator has a large variance. A natural idea to address this problem is to penalize the parameter μ2, so that we can reduce the variance of the denominator by introducing some bias. Specifically, by utilizing the information that μ2 cannot be 0, we define a penalized log-likelihood function of formula as

where formula is the penalty parameter. It is shown in Appendix  A that the preceding penalized log-likelihood function attains its maximum value when formula. Plugging formula into the equation, we obtain the penalized estimate for μ2 as formula. Without loss of generality, we assume formula. It can be seen that formula reduces to be formula when formula, and for formula, formula is a biased estimator for μ2, and the bias is formula. Through the penalty, formula always shrinks formula away from 0. Further, formula can be rewritten as formula, where formula is the sample coefficient of variation of Y. As such, formula tends to penalize the large coefficient of variation of the denominator. Once formula is obtained, a penalized ratio estimate for r can be proposed as formula.

Note that formula can be viewed as a bivariate function of formula and formula, denoted by formula. By making a Taylor expansion for formula around μ2 and v2, it is not difficult to show that formula and formula, where formula and formula. It is interesting that the error terms are only formula instead of the regular formula. Note that the magnitude of formula measures the extent of penalty with its range being (0.5,1). Specifically, formula represents the case formula (ie, no penalty), and formula corresponds to the scenario formula. As such, we asymptotically have formula, which means that the variance of the denominator is reduced through the penalty. However, for the fixed penalty parameter λ, we have formula. Notably, formula can also be written as formula, where formula is the coefficient of variation of Y. Hence, the value of formula only depends on λ and formula. Since formula and v2 are unknown, we estimate formula as formula, where formula. Similarly, we have formula and formula.

2.4 Adjustment of the Numerator

Recall that the Fieller method is based on the inverting of the t statistic where the null hypothesis is formula. To construct the test statistic, an unbiased estimator formula is used to estimate formula for the Fieller's method. However, if we simply replace formula by formula, then the latter is a biased estimator for any formula. Further, it is shown that formula. Let formula be any of the root-n consistent estimates for r, and define formula. If we adjust formula by formula, it can be seen that formula. Hence, formula is expected to have a smaller bias than formula. Note that formula is an exception, since formula is unbiased in this situation. If formula, the bias of formula is formula, while the bias of formula is generally formula, which would be asymptotically smaller.

On the other hand, we need to select an appropriate point estimate formula so as to make the bias of formula as small as possible. There are many kinds of point estimates for r that we can choose for such an adjustment (Tin, 1965). For simplicity, we only consider the commonly used ratio estimate formula and the proposed penalized ratio estimate formula as the plug-in estimates in this article. It is shown that formula and formula. By the facts formula and formula, we know that the difference between formula and formula should be small in large samples, but the latter provides a more conservative adjustment. In order to comprehensively compare the bias-corrected estimates formula and formula, Web Tables 1 and 2 provide the estimated biases of formula, formula, and formula under various simulation configurations. From those tables, we can see that formula usually has the smallest bias when formula, and formula generally has less bias than formula when λ is set at the recommended values (see Section 2.5), especially in the case of small n and large formula. Based on these results, we recommend adjusting formula by formula (denoted by formula). Moreover, formula has a dramatically simple expression, which also makes it easy to further evaluate its variance. Since formula and formula, we, respectively, estimate formula and formula as formula and formula. Note that formula is only a term of formula, but we still keep it in formula so that both formula and formula can always be guaranteed, where formula is the estimated correlation coefficient for formula and formula.

TABLE 1

Estimated UP (%), CP (%), CW, and ML/(ML+MR) of the two-sided 95% CI when formula, formula, formula, and formula and 50 for the PF, Fieller, delta, and PB methods based on 10 000 replicates Note. formula denotes the estimated UP of the Fieller method and the UP for all the remaining methods always stays at 0

CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
n=20
(a) Bivariate normal distribution
formula095.2695.2395.1195.590.550.550.540.550.470.470.210.19
formula095.0194.9394.8095.470.670.670.630.660.460.480.110.07
0.990.8895.7394.9093.6294.931.151.220.971.230.330.4400
0.909.8896.6395.1292.9194.961.641.931.252.320.140.1400
0.8020.5497.0695.2392.0995.072.032.801.445.220.030.0500
0.6040.3096.1795.2590.8295.202.918.271.7919.860.01000.01
0.4060.5092.9094.6688.4495.074.60formula2.3544.710000.02
0.2079.9783.7195.0484.1195.859.00formula3.4161.970000.07
(b) Bivariate Laplace distribution
formula094.7794.7194.9995.280.540.540.530.540.520.530.250.22
formula095.2595.1894.9895.690.640.640.610.640.450.470.100.07
0.992.0295.9295.1793.9595.141.081.130.931.140.340.440.010.01
0.9010.9697.0195.6493.0395.151.531.771.191.990.150.2100
0.8019.3996.3094.9991.0994.361.862.391.363.770.110.1400
0.6036.9795.9695.1790.6095.212.655.801.6914.900.020.0300.01
0.4056.3693.3495.4288.6395.344.16formula2.2138.7600.0100.02
0.2077.7884.6595.3583.9696.218.12formula3.1859.740000.10
(c) Bivariate skew normal distribution with shape parameter 10
formula095.2495.2494.6595.120.550.550.530.550.500.500.260.25
formula095.2495.1994.5995.410.660.660.630.660.600.610.230.22
0.990.0295.2094.6894.8896.501.171.230.981.250.680.740.020
0.904.3795.9594.4193.6396.451.692.021.282.500.600.6500
0.8013.6196.9993.9993.2096.562.072.891.465.440.400.3700
0.6039.0797.7694.4092.0497.093.049.411.8521.510.030.0200
0.4063.8195.4594.3790.2897.434.75formula2.4047.960000.04
0.2084.8785.2193.9985.7797.579.33formula3.5368.230000.16
n = 50
(a) Bivariate normal distribution
formula095.2095.1494.8395.480.470.470.450.470.500.510.140.11
formula095.0694.9494.7495.460.590.600.560.600.440.460.040
0.991.0496.1494.8493.3194.761.121.180.941.240.310.4200
0.909.4297.1795.1491.6694.661.651.961.242.810.010.0300
0.8020.1896.7194.7591.0294.652.032.781.426.7400.0100
0.6040.2795.7694.8989.8595.152.938.441.7923.110000.01
0.4059.9092.5594.9488.1195.614.43formula2.2947.060000.06
0.2079.8182.9394.8483.1996.188.88formula3.3462.330000.11
(b) Bivariate Laplace distribution
formula095.0594.9995.6295.900.460.460.450.460.540.550.130.09
formula095.1094.9494.8395.430.580.590.550.590.490.510.030.02
0.991.5296.2095.0293.1694.991.091.140.921.190.310.4400
0.9010.3396.8795.1691.7694.631.581.851.202.530.070.1000
0.8019.7296.6494.8890.5694.221.932.581.375.550.010.0200
0.6039.0695.2894.7789.8294.722.857.521.7520.990000.02
0.4057.8792.3295.4487.9295.584.25formula2.2143.860000.03
0.2078.7783.5795.3783.6496.388.31formula3.2160.100000.12
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula094.8194.7694.7995.210.470.470.450.470.660.660.290.29
formula094.8594.8394.7295.860.600.600.570.600.660.680.190.11
0.990.1695.7694.9393.9395.961.131.190.951.250.590.6800
0.906.5797.3894.6792.7695.981.641.951.242.790.290.3100
0.8016.8497.8894.8092.5696.232.052.861.447.230.030.0100
0.6038.2996.7894.4790.8796.202.938.561.7923.380000
0.4062.3493.9494.8989.0496.694.66formula2.3649.740000.04
0.2083.0983.8594.5884.5797.249.64formula3.5763.900000.20
CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
n=20
(a) Bivariate normal distribution
formula095.2695.2395.1195.590.550.550.540.550.470.470.210.19
formula095.0194.9394.8095.470.670.670.630.660.460.480.110.07
0.990.8895.7394.9093.6294.931.151.220.971.230.330.4400
0.909.8896.6395.1292.9194.961.641.931.252.320.140.1400
0.8020.5497.0695.2392.0995.072.032.801.445.220.030.0500
0.6040.3096.1795.2590.8295.202.918.271.7919.860.01000.01
0.4060.5092.9094.6688.4495.074.60formula2.3544.710000.02
0.2079.9783.7195.0484.1195.859.00formula3.4161.970000.07
(b) Bivariate Laplace distribution
formula094.7794.7194.9995.280.540.540.530.540.520.530.250.22
formula095.2595.1894.9895.690.640.640.610.640.450.470.100.07
0.992.0295.9295.1793.9595.141.081.130.931.140.340.440.010.01
0.9010.9697.0195.6493.0395.151.531.771.191.990.150.2100
0.8019.3996.3094.9991.0994.361.862.391.363.770.110.1400
0.6036.9795.9695.1790.6095.212.655.801.6914.900.020.0300.01
0.4056.3693.3495.4288.6395.344.16formula2.2138.7600.0100.02
0.2077.7884.6595.3583.9696.218.12formula3.1859.740000.10
(c) Bivariate skew normal distribution with shape parameter 10
formula095.2495.2494.6595.120.550.550.530.550.500.500.260.25
formula095.2495.1994.5995.410.660.660.630.660.600.610.230.22
0.990.0295.2094.6894.8896.501.171.230.981.250.680.740.020
0.904.3795.9594.4193.6396.451.692.021.282.500.600.6500
0.8013.6196.9993.9993.2096.562.072.891.465.440.400.3700
0.6039.0797.7694.4092.0497.093.049.411.8521.510.030.0200
0.4063.8195.4594.3790.2897.434.75formula2.4047.960000.04
0.2084.8785.2193.9985.7797.579.33formula3.5368.230000.16
n = 50
(a) Bivariate normal distribution
formula095.2095.1494.8395.480.470.470.450.470.500.510.140.11
formula095.0694.9494.7495.460.590.600.560.600.440.460.040
0.991.0496.1494.8493.3194.761.121.180.941.240.310.4200
0.909.4297.1795.1491.6694.661.651.961.242.810.010.0300
0.8020.1896.7194.7591.0294.652.032.781.426.7400.0100
0.6040.2795.7694.8989.8595.152.938.441.7923.110000.01
0.4059.9092.5594.9488.1195.614.43formula2.2947.060000.06
0.2079.8182.9394.8483.1996.188.88formula3.3462.330000.11
(b) Bivariate Laplace distribution
formula095.0594.9995.6295.900.460.460.450.460.540.550.130.09
formula095.1094.9494.8395.430.580.590.550.590.490.510.030.02
0.991.5296.2095.0293.1694.991.091.140.921.190.310.4400
0.9010.3396.8795.1691.7694.631.581.851.202.530.070.1000
0.8019.7296.6494.8890.5694.221.932.581.375.550.010.0200
0.6039.0695.2894.7789.8294.722.857.521.7520.990000.02
0.4057.8792.3295.4487.9295.584.25formula2.2143.860000.03
0.2078.7783.5795.3783.6496.388.31formula3.2160.100000.12
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula094.8194.7694.7995.210.470.470.450.470.660.660.290.29
formula094.8594.8394.7295.860.600.600.570.600.660.680.190.11
0.990.1695.7694.9393.9395.961.131.190.951.250.590.6800
0.906.5797.3894.6792.7695.981.641.951.242.790.290.3100
0.8016.8497.8894.8092.5696.232.052.861.447.230.030.0100
0.6038.2996.7894.4790.8796.202.938.561.7923.380000
0.4062.3493.9494.8989.0496.694.66formula2.3649.740000.04
0.2083.0983.8594.5884.5797.249.64formula3.5763.900000.20
TABLE 1

Estimated UP (%), CP (%), CW, and ML/(ML+MR) of the two-sided 95% CI when formula, formula, formula, and formula and 50 for the PF, Fieller, delta, and PB methods based on 10 000 replicates Note. formula denotes the estimated UP of the Fieller method and the UP for all the remaining methods always stays at 0

CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
n=20
(a) Bivariate normal distribution
formula095.2695.2395.1195.590.550.550.540.550.470.470.210.19
formula095.0194.9394.8095.470.670.670.630.660.460.480.110.07
0.990.8895.7394.9093.6294.931.151.220.971.230.330.4400
0.909.8896.6395.1292.9194.961.641.931.252.320.140.1400
0.8020.5497.0695.2392.0995.072.032.801.445.220.030.0500
0.6040.3096.1795.2590.8295.202.918.271.7919.860.01000.01
0.4060.5092.9094.6688.4495.074.60formula2.3544.710000.02
0.2079.9783.7195.0484.1195.859.00formula3.4161.970000.07
(b) Bivariate Laplace distribution
formula094.7794.7194.9995.280.540.540.530.540.520.530.250.22
formula095.2595.1894.9895.690.640.640.610.640.450.470.100.07
0.992.0295.9295.1793.9595.141.081.130.931.140.340.440.010.01
0.9010.9697.0195.6493.0395.151.531.771.191.990.150.2100
0.8019.3996.3094.9991.0994.361.862.391.363.770.110.1400
0.6036.9795.9695.1790.6095.212.655.801.6914.900.020.0300.01
0.4056.3693.3495.4288.6395.344.16formula2.2138.7600.0100.02
0.2077.7884.6595.3583.9696.218.12formula3.1859.740000.10
(c) Bivariate skew normal distribution with shape parameter 10
formula095.2495.2494.6595.120.550.550.530.550.500.500.260.25
formula095.2495.1994.5995.410.660.660.630.660.600.610.230.22
0.990.0295.2094.6894.8896.501.171.230.981.250.680.740.020
0.904.3795.9594.4193.6396.451.692.021.282.500.600.6500
0.8013.6196.9993.9993.2096.562.072.891.465.440.400.3700
0.6039.0797.7694.4092.0497.093.049.411.8521.510.030.0200
0.4063.8195.4594.3790.2897.434.75formula2.4047.960000.04
0.2084.8785.2193.9985.7797.579.33formula3.5368.230000.16
n = 50
(a) Bivariate normal distribution
formula095.2095.1494.8395.480.470.470.450.470.500.510.140.11
formula095.0694.9494.7495.460.590.600.560.600.440.460.040
0.991.0496.1494.8493.3194.761.121.180.941.240.310.4200
0.909.4297.1795.1491.6694.661.651.961.242.810.010.0300
0.8020.1896.7194.7591.0294.652.032.781.426.7400.0100
0.6040.2795.7694.8989.8595.152.938.441.7923.110000.01
0.4059.9092.5594.9488.1195.614.43formula2.2947.060000.06
0.2079.8182.9394.8483.1996.188.88formula3.3462.330000.11
(b) Bivariate Laplace distribution
formula095.0594.9995.6295.900.460.460.450.460.540.550.130.09
formula095.1094.9494.8395.430.580.590.550.590.490.510.030.02
0.991.5296.2095.0293.1694.991.091.140.921.190.310.4400
0.9010.3396.8795.1691.7694.631.581.851.202.530.070.1000
0.8019.7296.6494.8890.5694.221.932.581.375.550.010.0200
0.6039.0695.2894.7789.8294.722.857.521.7520.990000.02
0.4057.8792.3295.4487.9295.584.25formula2.2143.860000.03
0.2078.7783.5795.3783.6496.388.31formula3.2160.100000.12
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula094.8194.7694.7995.210.470.470.450.470.660.660.290.29
formula094.8594.8394.7295.860.600.600.570.600.660.680.190.11
0.990.1695.7694.9393.9395.961.131.190.951.250.590.6800
0.906.5797.3894.6792.7695.981.641.951.242.790.290.3100
0.8016.8497.8894.8092.5696.232.052.861.447.230.030.0100
0.6038.2996.7894.4790.8796.202.938.561.7923.380000
0.4062.3493.9494.8989.0496.694.66formula2.3649.740000.04
0.2083.0983.8594.5884.5797.249.64formula3.5763.900000.20
CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
n=20
(a) Bivariate normal distribution
formula095.2695.2395.1195.590.550.550.540.550.470.470.210.19
formula095.0194.9394.8095.470.670.670.630.660.460.480.110.07
0.990.8895.7394.9093.6294.931.151.220.971.230.330.4400
0.909.8896.6395.1292.9194.961.641.931.252.320.140.1400
0.8020.5497.0695.2392.0995.072.032.801.445.220.030.0500
0.6040.3096.1795.2590.8295.202.918.271.7919.860.01000.01
0.4060.5092.9094.6688.4495.074.60formula2.3544.710000.02
0.2079.9783.7195.0484.1195.859.00formula3.4161.970000.07
(b) Bivariate Laplace distribution
formula094.7794.7194.9995.280.540.540.530.540.520.530.250.22
formula095.2595.1894.9895.690.640.640.610.640.450.470.100.07
0.992.0295.9295.1793.9595.141.081.130.931.140.340.440.010.01
0.9010.9697.0195.6493.0395.151.531.771.191.990.150.2100
0.8019.3996.3094.9991.0994.361.862.391.363.770.110.1400
0.6036.9795.9695.1790.6095.212.655.801.6914.900.020.0300.01
0.4056.3693.3495.4288.6395.344.16formula2.2138.7600.0100.02
0.2077.7884.6595.3583.9696.218.12formula3.1859.740000.10
(c) Bivariate skew normal distribution with shape parameter 10
formula095.2495.2494.6595.120.550.550.530.550.500.500.260.25
formula095.2495.1994.5995.410.660.660.630.660.600.610.230.22
0.990.0295.2094.6894.8896.501.171.230.981.250.680.740.020
0.904.3795.9594.4193.6396.451.692.021.282.500.600.6500
0.8013.6196.9993.9993.2096.562.072.891.465.440.400.3700
0.6039.0797.7694.4092.0497.093.049.411.8521.510.030.0200
0.4063.8195.4594.3790.2897.434.75formula2.4047.960000.04
0.2084.8785.2193.9985.7797.579.33formula3.5368.230000.16
n = 50
(a) Bivariate normal distribution
formula095.2095.1494.8395.480.470.470.450.470.500.510.140.11
formula095.0694.9494.7495.460.590.600.560.600.440.460.040
0.991.0496.1494.8493.3194.761.121.180.941.240.310.4200
0.909.4297.1795.1491.6694.661.651.961.242.810.010.0300
0.8020.1896.7194.7591.0294.652.032.781.426.7400.0100
0.6040.2795.7694.8989.8595.152.938.441.7923.110000.01
0.4059.9092.5594.9488.1195.614.43formula2.2947.060000.06
0.2079.8182.9394.8483.1996.188.88formula3.3462.330000.11
(b) Bivariate Laplace distribution
formula095.0594.9995.6295.900.460.460.450.460.540.550.130.09
formula095.1094.9494.8395.430.580.590.550.590.490.510.030.02
0.991.5296.2095.0293.1694.991.091.140.921.190.310.4400
0.9010.3396.8795.1691.7694.631.581.851.202.530.070.1000
0.8019.7296.6494.8890.5694.221.932.581.375.550.010.0200
0.6039.0695.2894.7789.8294.722.857.521.7520.990000.02
0.4057.8792.3295.4487.9295.584.25formula2.2143.860000.03
0.2078.7783.5795.3783.6496.388.31formula3.2160.100000.12
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula094.8194.7694.7995.210.470.470.450.470.660.660.290.29
formula094.8594.8394.7295.860.600.600.570.600.660.680.190.11
0.990.1695.7694.9393.9395.961.131.190.951.250.590.6800
0.906.5797.3894.6792.7695.981.641.951.242.790.290.3100
0.8016.8497.8894.8092.5696.232.052.861.447.230.030.0100
0.6038.2996.7894.4790.8796.202.938.561.7923.380000
0.4062.3493.9494.8989.0496.694.66formula2.3649.740000.04
0.2083.0983.8594.5884.5797.249.64formula3.5763.900000.20
TABLE 2

Estimated UP (%), CP (%), CW, and ML/(ML+MR) of the two-sided 99% CI when formula, formula, formula, and formula and 50 for the PF, Fieller, delta, and PB methods based on 10 000 replicates formula denotes the estimated UP of the Fieller method and the UP for all the remaining methods always stays at 0

CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
formula
(a) Bivariate normal distribution
formula099.1999.1799.1599.140.730.730.700.710.430.450.080.06
formula098.9098.8898.6098.680.860.870.800.830.460.480.050.05
0.990.9299.3999.1198.1998.541.431.561.161.340.340.550.010
0.909.9199.4699.1097.5498.141.932.491.411.880.110.1100
0.8020.4299.2598.8696.9397.942.333.611.592.460.050.0900
0.6040.6499.2199.0296.2897.693.1211.181.905.370000
0.4060.4898.5098.8895.5497.444.28formula2.2517.690000
0.2079.8596.4799.0293.7596.876.83formula2.9252.340000.04
(b) Bivariate Laplace distribution
formula099.3299.3099.0699.150.710.710.680.700.470.500.090.08
formula099.3499.3198.8598.970.840.840.780.810.420.460.030.02
0.992.5199.2699.1497.9098.321.341.451.111.260.300.4100
0.9012.0099.4599.2797.5398.231.812.231.361.740.240.3500
0.8020.1799.4799.1997.2597.912.143.041.512.170.080.1400
0.6036.5699.2599.1496.4497.652.837.151.783.960.010.0200.01
0.4055.1098.7299.2995.2897.463.86formula2.1212.510000.02
0.2076.2396.0799.2094.1297.336.18formula2.7743.090000.04
(c) Bivariate skew normal distribution with shape parameter 10
formula099.0199.0298.5898.730.720.720.690.700.510.530.180.16
formula098.9298.9198.4598.550.850.860.790.820.590.620.140.15
0.990.0198.8998.7498.2698.771.421.551.151.330.700.780.030.02
0.903.2299.1498.5197.8098.581.972.541.441.920.720.8300
0.8012.4899.5298.7897.6298.512.363.701.612.510.440.5500
0.6037.8899.6698.4897.1098.603.1812.131.935.580.260.1200
0.4064.0899.3998.3596.0898.304.34formula2.2917.970.02000.01
0.2086.1697.2298.0694.7498.417.39formula3.1159.030000.06
formula
(a) Bivariate normal distribution
formula099.1999.1898.8098.920.600.610.570.590.480.500.030.04
formula099.1299.0698.3798.550.750.760.690.730.500.5300
0.990.9899.4299.1197.5198.271.351.491.091.300.090.3000
0.9010.2999.3298.8696.5197.631.902.461.371.9900.0200
0.8019.8299.2098.8296.0297.412.283.571.542.760000
0.6039.5299.0599.0995.3897.183.0310.241.837.400000
0.4060.3798.3699.0394.3597.154.22formula2.2323.130000.02
0.2079.1495.6599.0492.6596.996.76formula2.8957.080000.04
(b) Bivariate Laplace distribution
formula098.9798.9898.7798.850.590.590.560.580.450.460.060.04
formula099.1699.1198.5098.670.740.750.690.720.540.5900
0.991.7499.4099.1197.5898.231.321.441.071.270.120.2900
0.9010.2399.3498.9896.6497.701.802.271.321.850.030.0900
0.8020.6499.3799.1696.2897.632.213.321.512.580000
0.6038.8099.1399.1195.6897.502.959.041.806.740000
0.4058.1798.3699.1794.4197.044.16formula2.2021.160000.01
0.2078.7695.5699.1893.0097.156.84formula2.9155.410000.07
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula098.8498.8398.6898.780.600.600.570.590.660.680.170.15
formula098.9598.9598.7398.910.760.770.700.730.630.660.040.03
0.990.0599.2798.7997.6898.411.361.491.091.310.580.7900
0.905.5299.6598.8697.5398.541.922.481.381.990.230.2200
0.8015.2299.5898.7596.9398.082.323.671.562.910000
0.6038.8399.5698.6495.6997.873.1211.491.877.980000
0.4062.6599.0198.7695.7698.154.28formula2.2524.710000.01
0.2084.1196.0398.5393.3597.947.42formula3.1165.070000.05
CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
formula
(a) Bivariate normal distribution
formula099.1999.1799.1599.140.730.730.700.710.430.450.080.06
formula098.9098.8898.6098.680.860.870.800.830.460.480.050.05
0.990.9299.3999.1198.1998.541.431.561.161.340.340.550.010
0.909.9199.4699.1097.5498.141.932.491.411.880.110.1100
0.8020.4299.2598.8696.9397.942.333.611.592.460.050.0900
0.6040.6499.2199.0296.2897.693.1211.181.905.370000
0.4060.4898.5098.8895.5497.444.28formula2.2517.690000
0.2079.8596.4799.0293.7596.876.83formula2.9252.340000.04
(b) Bivariate Laplace distribution
formula099.3299.3099.0699.150.710.710.680.700.470.500.090.08
formula099.3499.3198.8598.970.840.840.780.810.420.460.030.02
0.992.5199.2699.1497.9098.321.341.451.111.260.300.4100
0.9012.0099.4599.2797.5398.231.812.231.361.740.240.3500
0.8020.1799.4799.1997.2597.912.143.041.512.170.080.1400
0.6036.5699.2599.1496.4497.652.837.151.783.960.010.0200.01
0.4055.1098.7299.2995.2897.463.86formula2.1212.510000.02
0.2076.2396.0799.2094.1297.336.18formula2.7743.090000.04
(c) Bivariate skew normal distribution with shape parameter 10
formula099.0199.0298.5898.730.720.720.690.700.510.530.180.16
formula098.9298.9198.4598.550.850.860.790.820.590.620.140.15
0.990.0198.8998.7498.2698.771.421.551.151.330.700.780.030.02
0.903.2299.1498.5197.8098.581.972.541.441.920.720.8300
0.8012.4899.5298.7897.6298.512.363.701.612.510.440.5500
0.6037.8899.6698.4897.1098.603.1812.131.935.580.260.1200
0.4064.0899.3998.3596.0898.304.34formula2.2917.970.02000.01
0.2086.1697.2298.0694.7498.417.39formula3.1159.030000.06
formula
(a) Bivariate normal distribution
formula099.1999.1898.8098.920.600.610.570.590.480.500.030.04
formula099.1299.0698.3798.550.750.760.690.730.500.5300
0.990.9899.4299.1197.5198.271.351.491.091.300.090.3000
0.9010.2999.3298.8696.5197.631.902.461.371.9900.0200
0.8019.8299.2098.8296.0297.412.283.571.542.760000
0.6039.5299.0599.0995.3897.183.0310.241.837.400000
0.4060.3798.3699.0394.3597.154.22formula2.2323.130000.02
0.2079.1495.6599.0492.6596.996.76formula2.8957.080000.04
(b) Bivariate Laplace distribution
formula098.9798.9898.7798.850.590.590.560.580.450.460.060.04
formula099.1699.1198.5098.670.740.750.690.720.540.5900
0.991.7499.4099.1197.5898.231.321.441.071.270.120.2900
0.9010.2399.3498.9896.6497.701.802.271.321.850.030.0900
0.8020.6499.3799.1696.2897.632.213.321.512.580000
0.6038.8099.1399.1195.6897.502.959.041.806.740000
0.4058.1798.3699.1794.4197.044.16formula2.2021.160000.01
0.2078.7695.5699.1893.0097.156.84formula2.9155.410000.07
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula098.8498.8398.6898.780.600.600.570.590.660.680.170.15
formula098.9598.9598.7398.910.760.770.700.730.630.660.040.03
0.990.0599.2798.7997.6898.411.361.491.091.310.580.7900
0.905.5299.6598.8697.5398.541.922.481.381.990.230.2200
0.8015.2299.5898.7596.9398.082.323.671.562.910000
0.6038.8399.5698.6495.6997.873.1211.491.877.980000
0.4062.6599.0198.7695.7698.154.28formula2.2524.710000.01
0.2084.1196.0398.5393.3597.947.42formula3.1165.070000.05
TABLE 2

Estimated UP (%), CP (%), CW, and ML/(ML+MR) of the two-sided 99% CI when formula, formula, formula, and formula and 50 for the PF, Fieller, delta, and PB methods based on 10 000 replicates formula denotes the estimated UP of the Fieller method and the UP for all the remaining methods always stays at 0

CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
formula
(a) Bivariate normal distribution
formula099.1999.1799.1599.140.730.730.700.710.430.450.080.06
formula098.9098.8898.6098.680.860.870.800.830.460.480.050.05
0.990.9299.3999.1198.1998.541.431.561.161.340.340.550.010
0.909.9199.4699.1097.5498.141.932.491.411.880.110.1100
0.8020.4299.2598.8696.9397.942.333.611.592.460.050.0900
0.6040.6499.2199.0296.2897.693.1211.181.905.370000
0.4060.4898.5098.8895.5497.444.28formula2.2517.690000
0.2079.8596.4799.0293.7596.876.83formula2.9252.340000.04
(b) Bivariate Laplace distribution
formula099.3299.3099.0699.150.710.710.680.700.470.500.090.08
formula099.3499.3198.8598.970.840.840.780.810.420.460.030.02
0.992.5199.2699.1497.9098.321.341.451.111.260.300.4100
0.9012.0099.4599.2797.5398.231.812.231.361.740.240.3500
0.8020.1799.4799.1997.2597.912.143.041.512.170.080.1400
0.6036.5699.2599.1496.4497.652.837.151.783.960.010.0200.01
0.4055.1098.7299.2995.2897.463.86formula2.1212.510000.02
0.2076.2396.0799.2094.1297.336.18formula2.7743.090000.04
(c) Bivariate skew normal distribution with shape parameter 10
formula099.0199.0298.5898.730.720.720.690.700.510.530.180.16
formula098.9298.9198.4598.550.850.860.790.820.590.620.140.15
0.990.0198.8998.7498.2698.771.421.551.151.330.700.780.030.02
0.903.2299.1498.5197.8098.581.972.541.441.920.720.8300
0.8012.4899.5298.7897.6298.512.363.701.612.510.440.5500
0.6037.8899.6698.4897.1098.603.1812.131.935.580.260.1200
0.4064.0899.3998.3596.0898.304.34formula2.2917.970.02000.01
0.2086.1697.2298.0694.7498.417.39formula3.1159.030000.06
formula
(a) Bivariate normal distribution
formula099.1999.1898.8098.920.600.610.570.590.480.500.030.04
formula099.1299.0698.3798.550.750.760.690.730.500.5300
0.990.9899.4299.1197.5198.271.351.491.091.300.090.3000
0.9010.2999.3298.8696.5197.631.902.461.371.9900.0200
0.8019.8299.2098.8296.0297.412.283.571.542.760000
0.6039.5299.0599.0995.3897.183.0310.241.837.400000
0.4060.3798.3699.0394.3597.154.22formula2.2323.130000.02
0.2079.1495.6599.0492.6596.996.76formula2.8957.080000.04
(b) Bivariate Laplace distribution
formula098.9798.9898.7798.850.590.590.560.580.450.460.060.04
formula099.1699.1198.5098.670.740.750.690.720.540.5900
0.991.7499.4099.1197.5898.231.321.441.071.270.120.2900
0.9010.2399.3498.9896.6497.701.802.271.321.850.030.0900
0.8020.6499.3799.1696.2897.632.213.321.512.580000
0.6038.8099.1399.1195.6897.502.959.041.806.740000
0.4058.1798.3699.1794.4197.044.16formula2.2021.160000.01
0.2078.7695.5699.1893.0097.156.84formula2.9155.410000.07
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula098.8498.8398.6898.780.600.600.570.590.660.680.170.15
formula098.9598.9598.7398.910.760.770.700.730.630.660.040.03
0.990.0599.2798.7997.6898.411.361.491.091.310.580.7900
0.905.5299.6598.8697.5398.541.922.481.381.990.230.2200
0.8015.2299.5898.7596.9398.082.323.671.562.910000
0.6038.8399.5698.6495.6997.873.1211.491.877.980000
0.4062.6599.0198.7695.7698.154.28formula2.2524.710000.01
0.2084.1196.0398.5393.3597.947.42formula3.1165.070000.05
CPCWformula
formulaformulaPFFiellerDeltaPBPFFiellerDeltaPBPFFiellerDeltaPB
formula
(a) Bivariate normal distribution
formula099.1999.1799.1599.140.730.730.700.710.430.450.080.06
formula098.9098.8898.6098.680.860.870.800.830.460.480.050.05
0.990.9299.3999.1198.1998.541.431.561.161.340.340.550.010
0.909.9199.4699.1097.5498.141.932.491.411.880.110.1100
0.8020.4299.2598.8696.9397.942.333.611.592.460.050.0900
0.6040.6499.2199.0296.2897.693.1211.181.905.370000
0.4060.4898.5098.8895.5497.444.28formula2.2517.690000
0.2079.8596.4799.0293.7596.876.83formula2.9252.340000.04
(b) Bivariate Laplace distribution
formula099.3299.3099.0699.150.710.710.680.700.470.500.090.08
formula099.3499.3198.8598.970.840.840.780.810.420.460.030.02
0.992.5199.2699.1497.9098.321.341.451.111.260.300.4100
0.9012.0099.4599.2797.5398.231.812.231.361.740.240.3500
0.8020.1799.4799.1997.2597.912.143.041.512.170.080.1400
0.6036.5699.2599.1496.4497.652.837.151.783.960.010.0200.01
0.4055.1098.7299.2995.2897.463.86formula2.1212.510000.02
0.2076.2396.0799.2094.1297.336.18formula2.7743.090000.04
(c) Bivariate skew normal distribution with shape parameter 10
formula099.0199.0298.5898.730.720.720.690.700.510.530.180.16
formula098.9298.9198.4598.550.850.860.790.820.590.620.140.15
0.990.0198.8998.7498.2698.771.421.551.151.330.700.780.030.02
0.903.2299.1498.5197.8098.581.972.541.441.920.720.8300
0.8012.4899.5298.7897.6298.512.363.701.612.510.440.5500
0.6037.8899.6698.4897.1098.603.1812.131.935.580.260.1200
0.4064.0899.3998.3596.0898.304.34formula2.2917.970.02000.01
0.2086.1697.2298.0694.7498.417.39formula3.1159.030000.06
formula
(a) Bivariate normal distribution
formula099.1999.1898.8098.920.600.610.570.590.480.500.030.04
formula099.1299.0698.3798.550.750.760.690.730.500.5300
0.990.9899.4299.1197.5198.271.351.491.091.300.090.3000
0.9010.2999.3298.8696.5197.631.902.461.371.9900.0200
0.8019.8299.2098.8296.0297.412.283.571.542.760000
0.6039.5299.0599.0995.3897.183.0310.241.837.400000
0.4060.3798.3699.0394.3597.154.22formula2.2323.130000.02
0.2079.1495.6599.0492.6596.996.76formula2.8957.080000.04
(b) Bivariate Laplace distribution
formula098.9798.9898.7798.850.590.590.560.580.450.460.060.04
formula099.1699.1198.5098.670.740.750.690.720.540.5900
0.991.7499.4099.1197.5898.231.321.441.071.270.120.2900
0.9010.2399.3498.9896.6497.701.802.271.321.850.030.0900
0.8020.6499.3799.1696.2897.632.213.321.512.580000
0.6038.8099.1399.1195.6897.502.959.041.806.740000
0.4058.1798.3699.1794.4197.044.16formula2.2021.160000.01
0.2078.7695.5699.1893.0097.156.84formula2.9155.410000.07
(c) Bivariate Skew Normal Distribution with Shape Parameter 10
formula098.8498.8398.6898.780.600.600.570.590.660.680.170.15
formula098.9598.9598.7398.910.760.770.700.730.630.660.040.03
0.990.0599.2798.7997.6898.411.361.491.091.310.580.7900
0.905.5299.6598.8697.5398.541.922.481.381.990.230.2200
0.8015.2299.5898.7596.9398.082.323.671.562.910000
0.6038.8399.5698.6495.6997.873.1211.491.877.980000
0.4062.6599.0198.7695.7698.154.28formula2.2524.710000.01
0.2084.1196.0398.5393.3597.947.42formula3.1165.070000.05

2.5 Selecting a Suitable Penalty Parameter

Before computing the PF CI, we need to select an appropriate penalty parameter λ. When formula, the PF method reduces to the Fieller method. However, if formula, it is shown in Appendix  B that the CW of the PF CI will tend to be 0, and the rate of convergence is of order formula. This fact indicates that, when formula, the CP of the PF method converges to 0 as well. Therefore, when λ is too small, the PF CI will still have a positive UP. However, if we select a very large λ, then the PF CI may risk underestimating the CP. Selecting λ is a trade-off between the UP and the CP, we have the following theorem.

Theorem 1 The PF CI has a lower UP than the Fieller CI, and specially, the former is always bounded (ie, UP=0) if and only if the penalty parameter formula.

The proof of Theorem 1 is provided in Appendix C. From the theorem, formula is the minimum value that guarantees the bounded property of the PF CI. As such, it is reasonable to believe that selecting formula is a good choice for the trade-off between the UP and the CP. To verify this, Web Tables 3-6 provide the empirical results of the performance of the PF CI against various λ. As shown in these tables, the PF CI tends to have a narrower CW with a larger λ, but may have more chance to underestimate the CP as λ increases. For instance, despite being slightly conservative, the PF CI with formula can control the CP well even when the UP of the Fieller CI is as large as 40%. We also find that the PF CI with formula has nearly the same performance in terms of controlling the CP as the PF CI with formula in our simulation study. However, when the UP of Fieller CI is larger than or equal to 10%, we have observed that the CP of the former is uniformly smaller than that of the latter. This observation indicates that the latter may maintain the valid CP in more situations. Further, when λ increases to formula, the PF CI may have an inflated CP even when the UP of the Fieller CI is 20% as shown in Web Tables 5 and 6, despite the fact that its CW is much narrower in some scenarios. In the case of formula, selecting λ becomes a trade-off between the CP and the CW. Because CP might be the most crucial criterion to evaluate a CI, we would rather select formula, which is shown to be the most conservative choice. Note that formula with the pre-set confidence level. Hence, our previous conclusions are not affected by selecting formula. Further, formula does not depend on the effect size, and can be pre-determined as long as we know the sample size n. In some circumstances, even when n is unknown, we can select formula as an approximation, where formula is the formula quantile of the standard normal distribution. For example, when formula and formula, λ should be 1.10. However, if we do not know the sample size, we can simply use formula instead.

TABLE 3

Summary statistics estimated from the Hewlett data, the Puerperants data, and the Beetles data Note. Listed in the parentheses are the standard errors. formula is the estimated correlation coefficient between formula and formula

Datasetformulaformulaformulaformula
Hewlett data−0.017328.2422 (3.3554)0.4892 (0.2495)−0.5195
Puerperants data0.147216.0936 (4.5516)−2.3687 (0.9458)0.8524
Beetles data1.23553.8930 (1.3151)−4.8098 (1.6210)0.9974
Datasetformulaformulaformulaformula
Hewlett data−0.017328.2422 (3.3554)0.4892 (0.2495)−0.5195
Puerperants data0.147216.0936 (4.5516)−2.3687 (0.9458)0.8524
Beetles data1.23553.8930 (1.3151)−4.8098 (1.6210)0.9974
TABLE 3

Summary statistics estimated from the Hewlett data, the Puerperants data, and the Beetles data Note. Listed in the parentheses are the standard errors. formula is the estimated correlation coefficient between formula and formula

Datasetformulaformulaformulaformula
Hewlett data−0.017328.2422 (3.3554)0.4892 (0.2495)−0.5195
Puerperants data0.147216.0936 (4.5516)−2.3687 (0.9458)0.8524
Beetles data1.23553.8930 (1.3151)−4.8098 (1.6210)0.9974
Datasetformulaformulaformulaformula
Hewlett data−0.017328.2422 (3.3554)0.4892 (0.2495)−0.5195
Puerperants data0.147216.0936 (4.5516)−2.3687 (0.9458)0.8524
Beetles data1.23553.8930 (1.3151)−4.8098 (1.6210)0.9974

2.6 Penalized Fieller's Confidence Interval

Once λ is determined, similar to the Fieller method, the confidence region of the PF method can be constructed by solving the following quadratic inequality: formula, where formula, formula, and formula. Note that both a and formula will be greater than 0 with formula. As such, the PF CI is directly given by formula.

Like other bounded methods, the PF method provides an asymptotic CI. It is known that the delta and existing numeric methods are all first-order correct. However, for the PF method, we have the following theorem.

Theorem 2 Under the bivariate normality assumption, the proposed PF method is second-order correct, that is, the differences of confidence limits between the PF method and the exact method are formula.

The proof of Theorem 2 is in Appendix  D. It should be emphasized that second-order correct is stronger than second-order accurate, because the former implies the latter (DiCiccio and Efron, 1996). That is, the actual CP of the PF method should be the nominal level formula. To further elaborate on the second-order correctness of the PF method, Web Figures 1-3 plot the estimated CP of the PF, Fieller, delta, and PB methods under the setting of formula (very large variation for the denominator). It is clearly shown from these figures that the CP of the PF method generally converges to the nominal level faster than those of the delta and PB methods except when formula and formula for the PB method, where the actual CP of the latter are only the nominal CP formula. Note that the Fieller method generally keeps an accurate CP regardless of the values of n, formula, and formula, because it is an exact method in the bivariate normality case. However, this method can have a large UP in our settings. For example, when formula, the UP for the Fieller method are 30.60% and 56.09% for formula and 0.99, respectively, which are large proportions. Hence, the PF method has an excellent performance in terms of controlling the CP and the UP.

It should be noted that the second-order correctness of the PF method depends on the assumption of bivariate normality. Under this condition, the sample means of the numerator and denominator also exactly jointly follow the bivariate normal distribution. However, if the bivariate normality assumption is violated, the sample means are only asymptotically normal. In this situation, all the methods, including the PF and Fieller methods, are first-order correct. However, it is well known that the t test is robust against nonnormality, especially when the underlying distribution is symmetric. Because the PF method is based on the inverting of the t statistic, it is reasonable to believe that the PF CI can remain a valid CP under a wide range of scenarios. We evaluate how sensitive the PF CI is to the violation of the bivariate normality assumption in Section 3. In real applications, if the numerator and the denominator are known to be independent, we simply set formula for computing the confidence limits of the PF method. Further, when the numerator and the denominator have different degrees of freedom, the Welch-Satterthwaite equation can be used to approximate the combined degree of freedom associated with the t statistic.

3 Simulation Study

3.1 Simulation Settings

To evaluate the proposed PF method, an extensive simulation study has been conducted for comparison with the Fieller, delta, and PB methods. For the PB method, we use a similar procedure to that in Wang et al. (2015) for calculating its confidence limits. The details of the delta and PB methods can be found in Web Appendix  A. In our simulation study, we fix the penalty parameter to be formula. The data are simulated from the bivariate normal distribution, the bivariate Laplace distribution, and the bivariate skew normal distribution. Notably, both the normal distribution and the Laplace distribution are symmetric, while the skew normal distribution is asymmetric. For the latter, we fix the shape parameter to be 10, so that the skewness of such distribution is about 0.96. From simulation study (data not shown), we found that the value of r has little impact on the simulation results except when formula. Hence, r is fixed to be 1 and 0. We set the mean of the denominator μ2 to be 1; otherwise, we can simultaneously divide the denominator and numerator by μ2 for any formula, so that the transformed denominator always has a mean of 1. For the numerator, its coefficient of variation formula is set to be 0.4 for formula, and when formula, we fix its variance formula to be 1. The sample size n is selected to be 20 and 50, and the confidence level formula is set to be 0.95 and 0.99. Furthermore, we have a similar finding to that of Wang et al. (2015), that is, the power of the t test (denoted by formula) for testing the denominator being 0 plays a prominent role in the performance of all the methods. We set formula to be formula, formula, 0.99, 0.90, 0.80, 0.60, 0.40, and 0.20, where such power ranges from perfect to low. Note that formula denotes the probability that the denominator is not statistically different from 0, which is also the theoretical UP for the Fieller method. Once n, formula, and formula are fixed, we can calculate formula by solving the equation formula, where formula is the distribution function of the noncentral t distribution with the noncentral parameter being formula. Web Table 7 gives the corresponding values of formula for various combinations of n, formula, and formula. It can be seen from Web Table 7 that the values of formula for formula are uniformly larger than those for formula when formula is fixed. For the choice of the correlation coefficient ρ, we emphasize the scenario in which ρ has the opposite sign to r. Therefore, ρ is assigned to be 0, −0.4, and −0.8. We also conduct a simulation study (data not shown) considering ρ being 0.4 and 0.8, but we find that the performances of all the methods are similar to that of ρ being 0. Note that when generating the data from the bivariate skew normal distribution, we first need to perform a parameterization, and the details are given in Web Appendix  B (Azzalini, 2005). Finally, the number of simulations k is set at 10 000.

We compare the performance of four types of CI based on CP, ML/(ML+MR), UP, and CW, where ML and MR, respectively, are the left and right tail errors missing the true value of r. CP is estimated by the proportion that the CI contains the true value of r among k replicates. ML and MR are calculated by formula and formula, respectively, where formula denotes the counting measure, and formula and formula are the confidence limits of the estimated CI. Note that formula means that the CI is bounded. As such, we only consider the bounded CIs when estimating the ML and the MR, as it is impossible to distinguish between the left side and the right side if the CI is the union of two disconnected infinite intervals. Further, UP is computed as formula. Note that the UPs of the PF, delta, and PB methods always stay at 0. On the other hand, as the Fieller CI may be unbounded (ie, has infinite length), we use the median length of CIs among k replicates to estimate the CW. It is believed that a good CI should control the CP well, have a narrow CW, and have balanced ML and MR. We deem that underestimating the CP is undesirable because of the inflated size, while slightly overestimating the CP can be acceptable. Further, if a balance between ML and MR is achieved, then ML/(ML+MR) should be close to 0.50.

3.2 Simulation Results

Tables 1 and 2, respectively, display the estimated UP, CP, CW, and ML/(ML+MR) of the two-sided 95% and 99% CIs for the PF, Fieller, delta, and PB methods when formula and formula. From these tables, it can be seen that the estimated UP of the Fieller CI is generally around the theoretical UP in all the scenarios. The Fieller method has a very accurate CP in scenarios of symmetric distributions regardless of the values of n, formula, and formula. However, if the underlying distribution is skew normal, we have observed that the Fieller method can slightly underestimate the CP when formula. Unlike the Fieller method, the performances of the PF and delta methods seem to be less affected by the underlying distribution. The PF CI can control the CP well as long as formula, regardless of the sample size, the confidence level, and the underlying distribution. However, for the delta method, we find that its CP may be underestimated when formula. This fact indicates that the delta method can only control the CP when the denominator has the perfect power to reject being 0 (eg, formula). We also observe that the CP of the PB method depends on the confidence level. When formula, the PB methods can control the CP, except for being a little conservative when the underlying distribution is skew normal. However, if the confidence level increases to 0.99, the PB method may underestimate the CP when formula, but its CP is still closer to the nominal level than that of the delta method. On the other hand, it can be seen that the delta method has the narrowest CW, while the CW of the PB method generally is the widest among all the bounded methods. The PF method typically has a width in between the delta method and PB method, and its CW is narrower than that of the Fieller CI when formula. In the case where the UP of the Fieller CI is larger than 50%, its median length becomes infinite as shown in Tables 1 and 2. From the values of ML/(ML+MR), we find that the delta and PB methods demonstrate the unbalanced tail errors on the left and right, because almost all the values of ML/(ML+MR) are far away from 0.50, while the PF and Fieller methods have much more balanced tail errors. This is not surprising because both the delta and PB methods are based on the normality assumption of the ratio estimate, which may not always be appropriate as it generally has a skewed distribution even in moderate samples. When the UP of Fieller's CI ranges from small to large, the tail errors of all the methods tend to be more skewed.

Notably, when formula, all the methods have nearly the same CP and the same CW in all the scenarios; but if formula, all the methods uniformly show poor performance. The interesting scenarios lie on the parameter space that formula is between 0.60 and 0.99. For simplicity, we restrict our following discussion to this parameter space. It can be seen that the PF method can overestimate the CP in some cases, but its CW still grows much slower than those of the Fieller and PB methods as the formula decreases. For the PB method, in the situation of formula, where it can control the CP, its CW is even wider than the Fieller CI. Hence, the PF method is superior to the Fieller and PB methods because its CI is narrower while still remaining a valid CP. Note that the delta method has a narrower CW than the PF method, but its CP is underestimated at the nominal level. When looking at 95% CI, we find that when formula decreases 0.10, the under-coverage of the delta CI will increase about 0.005-0.01 (10-20% inflation in type 1 error), and the CW of the PF CI is nearly 15% wider than the delta CI. When formula reduces from 0.99 to 0.60, the under-coverage of the delta method can range from 0.01 to 0.05 (20-100% inflation in type 1 error), despite the fact that its CW is about 15-60% narrower than that of the PF method. For the 99% CI, the delta method has an even worse CP (100-300% inflation in type 1 error), but the PF method still maintains the correct level. Recall the fact that the delta method typically has unbalanced tail errors. Hence, this method seems to narrow its width by sacrificing the accuracy of CP, and, to some extent, the accuracy of the left and right tail errors. Moreover, from Web Tables 3-6, we find that the PF method with penalty parameter formula demonstrates a similar CW to the delta method, but the former has a much better CP and more balanced tail errors than the latter.

3.3 Additional Simulation Results and Conclusions

All the other results of UP, CP, CW, and ML/(ML+MR) with formula and −0.8, or formula are provided in Web Tables 8-17. From Web Tables 8-11, it can be seen that ρ has little impact on the performance of the PF and Fieller methods. But when ρ changes from 0 to −0.4, and −0.8, we find that the delta and PB methods tend to have more skewed tail errors, although their CPs seem to be less affected. Web Tables 12-17 display the results for formula. From these tables, we find that the PF method performs even better in terms of controlling the CP when formula in the symmetric distribution scenarios, because it can still have an accurate CP even when formula. However, both the PF and Fieller methods can slightly underestimate the CP in the case of skew normal distribution. It can also be seen that the delta and PB methods tend to seriously overestimate the CP. However, the CW of the delta method is still the narrowest. The simulation results suggest that the delta method seems to be preferred only when formula, which, however, is just a special case.

In summary, in nearly all the simulation settings that we considered, the Fieller method generally keeps a good CP, but its CW can be overlong and even infinite; the PF method can generally control the CP well as long as the Fieller CI has a finite median CW (ie, formula%), while providing a competitive confidence length; although the delta method has the narrowest CW among all the methods, this may be due to its inaccurate CP and unbalanced tail errors; and the PB method generally has the worst performance among all the methods, because its CW is nearly as long as that of the Fieller method, but it can still underestimate the CP in some scenarios like the delta method. Further, the PB method is a resampling-based approach, and is more computationally intensive than other methods. Based on all the results, the PF method generally outperforms the existing methods in terms of controlling the CP and the CW and is particularly useful when the denominator does not have adequate power to reject being 0.

4 Application to the Interval Estimation of the Median Response Dose

4.1 Background

The median response dose is frequently used in pharmacology studies, especially in toxicity experiments, such as the median effective dose (formula, the dose of a drug that produces a desired effect in 50% of a tested population) or the median lethal dose (formula, the dose required to kill half the tested population after a specified duration). In such experiments, a small group of animals is exposed to a measurable stimulus at each of a small number of given doses, usually evenly distributed on a logarithmic scale. After that, the outcome of interest, usually a binary trait, such as death or affected, is recorded for each animal. For simplicity, we assume that the experiment comprises q dose levels with the base-10 log-doses l1, l2, …, formula, and for each dose level, we respectively have the replications m1, m2, …, formula. The number of the response animals at the ith level is denoted by formula. Consequently, s1, s2, …, formula are mutually independent binomial random variables with formula. We further assume that the probability formula is related to the log-dose formula through the logistic model

where formula represents the median response dose. For fixed γ, the slope β determines the shape of the response curve. When β ranges from 0 to formula, the response curve will become steeper and steeper. In the extreme cases of formula 0 and formula, formula will uniformly tend to be 0.5 and 1 for all the dose levels, respectively. In both situations, the data contain little information about γ. By fitting the standard logistic model, it is easy to obtain formula, formula, and their covariance matrix. The point estimate of γ is given by formula. It can be seen that both the numerator and denominator of formula come from the logistic regression coefficients. Because both formula and formula asymptotically follow normal distributions, we will show how the proposed PF method can be applied to the interval estimation of γ to three real-world datasets—the Hewlett data, the Puerperants data, and the Beetles data. It should be noted that the covariance matrix of formula and formula is based on large sample theory. Hence, formula is replaced by formula when calculating the CIs for all the methods.

4.2 Real-World Datasets

The Hewlett data are a classical data example with nine dose levels, a relatively moderate sample size, and a steep response curve. This dataset was first used in a paper by Abdelbasit and Plackett (1983) where the authors stated that it was obtained from a personal communication with Hewlett without any more details. Subsequently, it was further studied in several related papers (Sitter and Wu, 1993; Faraggi et al., 2003; Paige et al., 2011). The Puerperants data were collected to estimate the formula of levobupivacaine for labor analgesia (Sia et al., 2005). This dataset has five dose levels, and a steep response curve, but a relatively small sample size. In these data, 50 parturients in early labor were randomly assigned to receive one of the five doses studied. Effective analgesia is defined as a pain score (0-100 visual analog scale) of less than 10 within 15 min of injection, lasting for 45 min or more after the induction of analgesia. The Beetles data studied the toxicity of beetles to insecticide. This dataset has six dose levels, and a moderate sample size, but a relatively flat response curve. It is taken from Hewlett and Plackett (1950), and has been further discussed by Zelterman (1999) and Faraggi et al. (2003). In this study, groups of beetles were exposed to various concentrations of the insecticide. After a short period, the number of deaths within each group was reported. Web Tables 18-20, respectively, provide the details of all three datasets. Further, we fit the standard logistic regression to each of the three datasets, and the summary statistics are displayed in Table 3.

4.3 Further Simulation Study Based on the Hewlett Data

For the real data described in Section 4.2, both formula and formula are estimated from binary data, but such case has not been studied in the simulation study in Section 3. It is known that the mean and the variance of the logistic regression coefficient are correlated especially in small samples. Therefore, based on the Hewlett data, we further conduct an extensive simulation study, with various slopes and sample sizes, to investigate the performance of the PF method in this scenario, and the details are provided in Web Appendix  C. Based on the new simulation results in Web Tables 21 and 22, we conclude that the PF method has the overall best performance in terms of coverage, width, and location, and thus is still recommended in the binary data case (see description from Web Appendix  C).

4.4 Data Analysis

Table 4 displays the CIs estimated from all four methods based on the Hewlett data, the Puerperants data, and the Beetles data. In addition, we also evaluate the performance of all the methods in the scenario where the parameters formula, formula, γ, and β are taken or estimated from each of the datasets. From Table 4, we observe that both the PF and Fieller methods generally have a valid CP, but the former can provide a narrower CW than the latter, when the sample size is small or the response curve is flat. Meanwhile, it is not surprising to see that the delta method has the narrowest CW among all the methods. However, this method should be used with caution, because it may underestimate the CP even in moderate samples. For the Beetles data, which has a relatively flat response curve, although the delta method seems to have the best performance, this method can seriously overestimate the CP and may not be reliable as discussed in Web Appendix  C. Further, it can be seen that the PB method has the worst performance and thus is also not recommended for practical use.

TABLE 4

95% and 99% CIs of the median response dose γ for the PF, Fieller, delta, and PB methods for the Hewlett data, the Puerperants data, and the Beetles data Note. We also simulate the estimated UP (%), CP (%), CW, and ML/(ML+MR) for all four methods in the scenarios where the parameters formula, formula, γ, and β are taken or estimated from each of the datasets. The simulation is based on 10 000 replicates

formulaformula
MethodCIUPCPCWformulaCIUPCPCWformula
Hewlett data
PF(−0.0322, −0.0001)095.520.03210.50(−0.0368, 0.0063)099.190.04310.57
Fieller(−0.0322, 0)095.610.03220.51(−0.0368, 0.0065)099.210.04340.58
Delta(−0.0329, −0.0017)093.930.03120.31(−0.0378, 0.0032)098.330.04090.25
PB(−0.0333, −0.0013)094.680.03190.32(−0.0384, 0.0037)098.680.04190.26
Puerperants data
PF(0.0628, 0.2076)096.970.14730.39(0.0182, 0.2270)099.810.21220.16
Fieller(0.0577, 0.2101)14.4997.370.15540.44(−0.0112, 0.2379)14.5099.940.26280
Delta(0.0847, 0.2097)090.800.12710.74(0.0651, 0.2293)095.380.16700.90
PB(0.0810, 0.2134)084.810.15870.98(0.0602, 0.2342)086.050.20850.99
Beetles data
PF(1.1356, 1.2860)095.120.15520.18(1.0001, 1.2880)099.000.29030.05
Fieller(1.1610, 1.3197)14.3694.880.16960.49(1.0953, 1.4144)33.7398.910.36100.52
Delta(1.1761, 1.2949)098.460.12360.35(1.1575, 1.3135)099.930.16240.14
PB(1.1125, 1.3585)099.650.22630.23(1.0738, 1.3972)01000.2974-
formulaformula
MethodCIUPCPCWformulaCIUPCPCWformula
Hewlett data
PF(−0.0322, −0.0001)095.520.03210.50(−0.0368, 0.0063)099.190.04310.57
Fieller(−0.0322, 0)095.610.03220.51(−0.0368, 0.0065)099.210.04340.58
Delta(−0.0329, −0.0017)093.930.03120.31(−0.0378, 0.0032)098.330.04090.25
PB(−0.0333, −0.0013)094.680.03190.32(−0.0384, 0.0037)098.680.04190.26
Puerperants data
PF(0.0628, 0.2076)096.970.14730.39(0.0182, 0.2270)099.810.21220.16
Fieller(0.0577, 0.2101)14.4997.370.15540.44(−0.0112, 0.2379)14.5099.940.26280
Delta(0.0847, 0.2097)090.800.12710.74(0.0651, 0.2293)095.380.16700.90
PB(0.0810, 0.2134)084.810.15870.98(0.0602, 0.2342)086.050.20850.99
Beetles data
PF(1.1356, 1.2860)095.120.15520.18(1.0001, 1.2880)099.000.29030.05
Fieller(1.1610, 1.3197)14.3694.880.16960.49(1.0953, 1.4144)33.7398.910.36100.52
Delta(1.1761, 1.2949)098.460.12360.35(1.1575, 1.3135)099.930.16240.14
PB(1.1125, 1.3585)099.650.22630.23(1.0738, 1.3972)01000.2974-
TABLE 4

95% and 99% CIs of the median response dose γ for the PF, Fieller, delta, and PB methods for the Hewlett data, the Puerperants data, and the Beetles data Note. We also simulate the estimated UP (%), CP (%), CW, and ML/(ML+MR) for all four methods in the scenarios where the parameters formula, formula, γ, and β are taken or estimated from each of the datasets. The simulation is based on 10 000 replicates

formulaformula
MethodCIUPCPCWformulaCIUPCPCWformula
Hewlett data
PF(−0.0322, −0.0001)095.520.03210.50(−0.0368, 0.0063)099.190.04310.57
Fieller(−0.0322, 0)095.610.03220.51(−0.0368, 0.0065)099.210.04340.58
Delta(−0.0329, −0.0017)093.930.03120.31(−0.0378, 0.0032)098.330.04090.25
PB(−0.0333, −0.0013)094.680.03190.32(−0.0384, 0.0037)098.680.04190.26
Puerperants data
PF(0.0628, 0.2076)096.970.14730.39(0.0182, 0.2270)099.810.21220.16
Fieller(0.0577, 0.2101)14.4997.370.15540.44(−0.0112, 0.2379)14.5099.940.26280
Delta(0.0847, 0.2097)090.800.12710.74(0.0651, 0.2293)095.380.16700.90
PB(0.0810, 0.2134)084.810.15870.98(0.0602, 0.2342)086.050.20850.99
Beetles data
PF(1.1356, 1.2860)095.120.15520.18(1.0001, 1.2880)099.000.29030.05
Fieller(1.1610, 1.3197)14.3694.880.16960.49(1.0953, 1.4144)33.7398.910.36100.52
Delta(1.1761, 1.2949)098.460.12360.35(1.1575, 1.3135)099.930.16240.14
PB(1.1125, 1.3585)099.650.22630.23(1.0738, 1.3972)01000.2974-
formulaformula
MethodCIUPCPCWformulaCIUPCPCWformula
Hewlett data
PF(−0.0322, −0.0001)095.520.03210.50(−0.0368, 0.0063)099.190.04310.57
Fieller(−0.0322, 0)095.610.03220.51(−0.0368, 0.0065)099.210.04340.58
Delta(−0.0329, −0.0017)093.930.03120.31(−0.0378, 0.0032)098.330.04090.25
PB(−0.0333, −0.0013)094.680.03190.32(−0.0384, 0.0037)098.680.04190.26
Puerperants data
PF(0.0628, 0.2076)096.970.14730.39(0.0182, 0.2270)099.810.21220.16
Fieller(0.0577, 0.2101)14.4997.370.15540.44(−0.0112, 0.2379)14.5099.940.26280
Delta(0.0847, 0.2097)090.800.12710.74(0.0651, 0.2293)095.380.16700.90
PB(0.0810, 0.2134)084.810.15870.98(0.0602, 0.2342)086.050.20850.99
Beetles data
PF(1.1356, 1.2860)095.120.15520.18(1.0001, 1.2880)099.000.29030.05
Fieller(1.1610, 1.3197)14.3694.880.16960.49(1.0953, 1.4144)33.7398.910.36100.52
Delta(1.1761, 1.2949)098.460.12360.35(1.1575, 1.3135)099.930.16240.14
PB(1.1125, 1.3585)099.650.22630.23(1.0738, 1.3972)01000.2974-

In conclusion, the PF method performs well as long as the sample size at each dose is not very small and the response curve is neither extremely flat nor extremely steep. However, if such conditions are violated, all four methods may not work well. This is probably because the covariance matrix for the numerator and the denominator cannot be estimated accurately. Under this circumstance, the likelihood-based CI may be more reliable (Williams, 1986). It is therefore of interest to generalize the PF method to the penalized likelihood method. We leave this for future research.

5 Discussion

In this article, we have developed the penalized Fieller method to construct the CI for the ratio estimate. Like the Fieller method, the proposed approach is based on the inverting of the t statistic, and thus it is an analytic approach and naturally allows an asymmetric CI. Further, to overcome the unbounded issue of the Fieller CI, we adopt a penalized likelihood approach to estimate the denominator while adjusting the numerator accordingly to reduce the bias of the t statistic. By using a suitable penalty parameter, the PF method always produces a bounded CI. Moreover, we show theoretically that the PF method is second-order correct under the bivariate normality assumption, better than other bounded methods, which are all first-order correct. Simulation results demonstrate that the PF method has good performance in terms of controlling the CP and the CW. Hence, we recommend using the PF method for its robustness and analytic advantage.

Note that we obtain the penalized estimator formula by directly plugging formula into it. By using this strategy, we can get a much simpler expression of formula, and therefore simplify the subsequent statistical inferences as well. We can also obtain the joint estimates formula and formula by maximizing the full penalized likelihood, and the details are provided in Web Appendix  D. It is seen that formula is much more complicated but will converge to formula as formula. Note that the forms of formula and formula only differ at the variance estimate. For the variance estimate formula, it is related to the penalty parameter λ only at the third order. This fact indicates that it makes little difference to consider a penalty when estimating v2. Therefore, we directly plug formula into formula for mathematical convenience.

It should be emphasized that the PF method is not an exact method. That is, if the sample size is extremely small, and the denominator has a very large variation (the case in which the Fieller method has a very large UP), then the PF method still has the possibility of underestimating the CP as shown in the simulation results. Although the Fieller method is an exact method in the bivariate normality case, and is generally guaranteed to achieve the nominal level, this is at the potential cost of having intervals of infinite length. Further, the Fieller method ignores the information that the denominator cannot be 0. By excluding the case of formula, it is not surprising that the PF method has a narrower CW than the Fieller method. Notably, the denominator being not statistically different from 0 also means that the CI of the denominator covers 0. Because 0 is a singular point for the denominator of a ratio, it is expected that the Fieller CI will be unbounded in this case. However, the true value of the ratio must be a real number. Therefore, the unbounded interval is not as informative as the bounded interval because the latter always provides finite length and is much more common in practice. On the other hand, the PF method obtains the bounded CI by shrinking the CI of the denominator away from 0 through penalty, while adjusting the CI of the numerator simultaneously.

Note that the PF method is second-order correct only in the bivariate normality case. Otherwise, all the existing methods, including the PF and Fieller methods, are first-order correct. Like the Fieller method, despite the fact that the PF method is proposed based on the bivariate normality assumption, we have demonstrated its usefulness in a wide range of scenarios through simulation study and real data application. These scenarios include the ratio of means from both the bivariate normal and nonnormal distributions as well as the ratio of the nonlinear regression coefficients. However, in certain cases, when the underlying distribution is very skewed, the PF method may not work well, just like almost all the existing methods. This is probably due to two reasons. First, the mean estimator may need a much larger sample size in order to converge to a normal distribution. Second, using the mean to estimate the location parameter is no longer appropriate. In the case of very skewed distributions, the CI based on inverting of the nonparametric statistic may be more useful, but this needs further research. On the other hand, it is known that the MOVER-R (method of variance estimate recovery for a ratio) is a generalization of the Fieller method to the nonnormal case (Donner and Zou, 2012; Newcombe, 2016). So, developing the penalized MOVER-R is another possible direction for future research.

Recall the fact that we only use the penalized estimate for the denominator by shrinking it away from 0. Because the numerator generally has the possibility to be 0, we do not penalize both the denominator and the numerator simultaneously. For example, considering the median response dose formula, it is possible that the numerator β0 can equal 0. This corresponds to the scenario of formula. Because γ is generally on a logarithmic scale, the original median response dose will be 1. In some cases, if we know the information that the numerator cannot be 0, we can use the penalized estimate for both the numerator and denominator. By using more information, it is possible to get a narrower CW. However, how to obtain a second-order correct CI in this situation is not at all clear.

Finally, we use the penalized ratio estimate formula to estimate r in formula. Note that there exists other point estimates for r that we can select to plug into formula (Tin, 1965), which might be better than formula. However, further studies are needed to investigate their performance, especially when the denominator does not have adequate power to reject being 0.

Data Availability Statement

The data that support the findings in this paper are available in the Supporting Information of this article.

Acknowledgments

The authors would like to thank the associate editor, the two anonymous reviewers, and the co-editor for their constructive comments which greatly improved the presentation of this article. The authors also appreciate the help of Raymond J. Carroll, PhD for his critical reading of the original version of the manuscript. This work is partially supported by National Institutes of Health grants R03-DE024198 and R03-DE025646.

References

Abdelbasit
,
K.M.
and
Plackett
,
R.L.
(
1983
)
Experimental design for binary data
.
Journal of the American Statistical Association
,
78
,
90
98
.

Azzalini
,
A.
(
2005
)
The skew-normal distribution and related multivariate families
.
Scandinavian Journal of Statistics
,
32
,
159
188
.

Choquet
,
D.
,
L'ecuyer
,
P.
and
Léger
,
C.
(
1999
)
Bootstrap confidence intervals for ratios of expectations
.
ACM Transactions on Modeling and Computer Simulation
,
9
,
326
348
.

DiCiccio
,
T.J.
and
Efron
,
B.
(
1996
)
Bootstrap confidence intervals
.
Statistical Science
,
11
,
189
212
.

Donner
,
A.
and
Zou
,
G.Y.
(
2012
)
Closed-form confidence intervals for functions of the normal mean and standard deviation
.
Statistical Methods in Medical Research
,
21
,
347
359
.

Efron
,
B.
and
Tibshirani
,
R.J.
(
1994
)
An Introduction to the Bootstrap
.
Boca Raton, FL
:
Chapman and Hall/CRC
.

Faraggi
,
D.
,
Izikson
,
P.
and
Reiser
,
B.
(
2003
)
Confidence intervals for the 50 per cent response dose
.
Statistics in Medicine
,
22
,
1977
1988
.

Fieller
,
E. C.
(
1954
)
Some problems in interval estimation
.
Journal of the Royal Statistical Society Series B
,
16
,
175
185
.

Hayya
,
J.
,
Armstrong
,
D.
, and
Gressis
,
N.
(
1975
)
A note on the ratio of two normally distributed variables
.
Management Science
,
21
,
1338
1341
.

Herson
,
J.
(
1975
)
Fieller's theorem vs. the delta method for significance intervals for ratios
.
Journal of Statistical Computation and Simulation
,
3
,
265
274
.

Hewlett
,
P.S.
(
1950
)
Statistical aspects of the independent joint action of poisons, particularly insecticides: II. Examination of data for agreement with the hypothesis
.
Annals of Applied Biology
,
37
,
527
552
.

Hirschberg
,
J.
and
Lye
,
J.
(
2010
)
A geometric comparison of the delta and Fieller confidence intervals
.
The American Statistician
,
64
,
234
241
.

Koschat
,
M.A.
(
1981
)
A characterization of the Fieller solution
.
Annals of Statistics
,
15
,
462
468
.

Newcombe
,
R.G.
(
2016
)
MOVER-R confidence intervals for ratios and products of two independently estimated quantities
.
Statistical Methods in Medical Research
,
25
,
1774
1778
.

Paige
,
R.L.
,
Chapman
,
P.L.
and
Butler
,
R.W.
(
2011
)
Small sample LD50 confidence intervals using saddlepoint approximations
.
Journal of the American Statistical Association
,
493
,
334
344
.

Pigeot
,
I.
,
Schäfer
,
J.
,
Röhmel
,
J.
and
Hauschke
,
D.
(
2003
)
Assessing non-inferiority of a new treatment in a three-arm clinical trial including a placebo
.
Statistics in Medicine
,
22
,
883
899
.

Sia
,
A.T.
,
Goy
,
R.W.
,
Lim
,
Y.
and
Ocampo
,
C.E.
(
2005
)
A comparison of median effective doses of intrathecal levobupivacaine and ropivacaine for labor analgesia
.
Anesthesiology
,
102
,
651
656
.

Sitter
,
R.R.
and
Wu
,
C.F.J.
(
1993
)
On the accuracy of Fieller intervals for binary response data
.
Journal of the American Statistical Association
,
88
,
1021
1025
.

Sherman
,
M.
,
Maity
,
A.
and
Wang
,
S.
(
2011
)
Inferences for the ratio: Fieller's interval, log ratio, and large sample based confidence intervals
.
AStA Advances in Statistical Analysis
,
95
,
313
323
.

Tin
,
M.
(
1965
)
Comparison of some ratio estimators
.
Journal of the American Statistical Association
,
60
,
294
307
.

Vuorinen
,
J.
and
Tuominen
,
J.
(
1994
)
Fieller's confidence intervals for the ratio of two means in the assessment of average bioequivalence from crossover data
.
Statistics in Medicine
,
13
,
2531
2545
.

Wang
,
Y.
,
Wang
,
S.
and
Carroll
,
R.J.
(
2015
)
The direct integral method for confidence intervals for the ratio of two location parameters
.
Biometrics
,
71
,
704
713
.

Williams
,
D.A.
, (
1986
)
Interval estimation of the median lethal dose
.
Biometrics
,
42
,
641
645
.

Zelterman
,
D.
(
1999
)
Models for Discrete Data
.
Oxford
:
Oxford University Press
.

Appendix A: Maximum Point of the Penalized Log-Likelihood

Treating v2 as known and taking the first derivative with respect to μ2, we obtain

where formula and formula. It is easy to see that formula regardless of the sign of formula. Therefore, the penalized log-likelihood will increase when formula. That is, the penalized log-likelihood can only attain its maximum value at the point x1 or x2. Further,

If formula, we have formula, which implies formula. However, if formula, we obtain formula. Hence, the penalized log-likelihood attains its maximum value at the point formula.

APPENDIX B: Confidence Width of the penalized Fieller Method When λ → + ∞

Let formula denote the CW of the PF CI. According to Section 2.6, it is seen that formula when λ is greater than or equal to formula, where formula, formula, and formula. In order to avoid confusion, we use the notation formula to represent the order with respect to λ. It is easy to see that formula, formula, and formula. Hence, formula. That is, when formula, we have formula, and the rate of convergence is of order formula.

Appendix C: Proof of Theorem 1

Let formula and formula denote the UP of the PF and Fieller methods, respectively. According to Fieller's theorem, the PF CI is unbounded if and only if

Substituting formula into the above inequality, we have

Therefore,

Specially, formula can always be guaranteed if and only if the penalty parameter formula. This completes the proof.

Appendix D: Proof of Theorem 2

Under the bivariate normality assumption, the Fieller method is known as the only exact method (Koschat, 1981). Without loss of generality, let formula and formula, respectively, denote the confidence limit for the PF and Fieller methods that satisfy

(A1)
(A2)

When formula, both formula and formula are lower limits; otherwise, they are upper limits. To prove the theorem, we need to show that formula.

Because μ2 is a nonzero constant, we have formula and formula. From Equation (A1), we know that formula. Therefore,

Using the above equation, we rewrite the Equation (A1) as

(A3)

Note that

The first part of the above equation is formula and the second part can be expressed as formula, where

Equation (A2) minus Equation (A3) becomes

By the fact formula, we can see formula, which completes the proof.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Supplementary data