Response to Guo et al.’s Letter to the Editor

Fong, Youyi; Huang, Ying; Lemos, Maria P; Mcelrath, M Juliana

doi:10.1093/biostatistics/kxy061

Issue Section:

Letter to the Editors

We thank Drs Guo, Gao, Niu, and Zhang for their comments on the article by Fong and others (2018). They pointed out that the more classical methods, MW-MW $_{2}$ and SR-MW $_{2}$ ⁠, which only make comparisons between $X$ and $Y$ (paired observations) and between $X^{'}$ and $Y^{'}$ (unpaired observations) were useful alternatives to the proposed tests, MW-MW $_{0}^{l}$ and SR-MW $_{0}^{l}$ ⁠, which made comparisons between all $x$ ’s and all $y$ ’s. Dr Guo et al.’s recommendation was “to use MW-MW $_{0}^{l}$ and SR-MW $_{0}^{l}$ for ${X, Y, Y^{'}}$ ⁠, while use MW-MW $_{2}$ and SR-MW $_{2}$ for ${X, Y, X^{'}, Y^{'}}$ ⁠, especially when the correlation between the samples is high.” We agree that MW-MW $_{2}$ and SR-MW $_{2}$ are important to study as alternative approaches, and aim to refine the recommendations in this response so that practitioners may find it easier to choose the appropriate methods.

Before discussing power comparison, we would like to propose a variant of the MW-MW $_{2}$ test. Since MW-MW $_{2}$ only makes comparisons within the paired subset and the unpaired subset, it is possible to perform permutation tests to obtain p-values to avoid inflated Type 1 error rates under small sample sizes (Tables A.1–A.4 of the supplementary material available at Biostatistics online). We will refer to this test as MW-MW $_{2}^{p e r m}$ ⁠.

We study power comparison under four different distributional assumptions: normal (Table 1), logistic (Table B.1 of the supplementary material available at Biostatistics online), gamma (Table B.2 of the supplementary material available at Biostatistics online), and lognormal (Table B.3 of the supplementary material available at Biostatistics online). We also plot the results in Figure 1 and Figures B.1, B.2, and B.3 of the supplementary material available at Biostatistics online to help visualize these results. All estimates are based on $10^{4}$ Monte Carlo replicates. $m, l, n$ refer to the number of pairs, the number of independent $x$ ’s and the number of independent $y$ ’s, respectively. Three levels of correlation between the two samples are examined: 0, 0.5, and 0.8.

Table 1.

Open in new tab

Estimated power, normal distribution, $m = 20$

$(l, n)$	MW-MW $_{0}^{l}$			MW-MW $_{2}^{p e r m}$			SR			SR-MW $_{2}$			SR-MW $_{0}^{l}$
	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8
(10,5)	19	26	46	17	26	49	14	23	51	17	27	52	19	26	44
(10,10)	20	28	47	18	27	51	14	23	51	19	29	53	20	28	46
(40,5)	23	31	52	19	28	51	14	23	51	19	29	53	23	32	51

$(l, n)$	MW-MW $_{0}^{l}$			MW-MW $_{2}^{p e r m}$			SR			SR-MW $_{2}$			SR-MW $_{0}^{l}$
	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8
(10,5)	19	26	46	17	26	49	14	23	51	17	27	52	19	26	44
(10,10)	20	28	47	18	27	51	14	23	51	19	29	53	20	28	46
(40,5)	23	31	52	19	28	51	14	23	51	19	29	53	23	32	51

Table 1.

Open in new tab

Estimated power, normal distribution, $m = 20$

$(l, n)$	MW-MW $_{0}^{l}$			MW-MW $_{2}^{p e r m}$			SR			SR-MW $_{2}$			SR-MW $_{0}^{l}$
	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8
(10,5)	19	26	46	17	26	49	14	23	51	17	27	52	19	26	44
(10,10)	20	28	47	18	27	51	14	23	51	19	29	53	20	28	46
(40,5)	23	31	52	19	28	51	14	23	51	19	29	53	23	32	51

$(l, n)$	MW-MW $_{0}^{l}$			MW-MW $_{2}^{p e r m}$			SR			SR-MW $_{2}$			SR-MW $_{0}^{l}$
	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8	0	0.5	0.8
(10,5)	19	26	46	17	26	49	14	23	51	17	27	52	19	26	44
(10,10)	20	28	47	18	27	51	14	23	51	19	29	53	20	28	46
(40,5)	23	31	52	19	28	51	14	23	51	19	29	53	23	32	51

Fig. 1.

Power comparison when the marginal distribution is normal. Sample sizes: $m = 20$ and $(l, n)$ are given in the titles.

Open in new tab Download slide

First, focusing on lines 2 and 3 in the figures, we see that SR-MW $_{2}$ and MW-MW $_{2}^{p e r m}$ either outperform or closely match the performance of SR at all times. These empirical results are worth noting, because theoretically a test that combines two independent test statistics using weights proportional to the inverse of their variances is not always more powerful than each component test. Based on these results, we can narrow the choice down to be between SR-MW $_{0}^{l}$ /MW-MW $_{0}^{l}$ and SR-MW $_{2}$ /MW-MW $_{2}^{p e r m}$ when there are unpaired observations from both samples.

Now, focusing on lines 1 and 2 in the figures, we see that there is a clear trade-off between SR-MW $_{0}^{l}$ /MW-MW $_{0}^{l}$ and SR-MW $_{2}$ /MW-MW $_{2}^{p e r m}$ depending on $ρ$ and sample sizes. This is true for normal, logistic, and gamma distributions (Figure 1 and Figures B1, B2 of the supplementary material available at Biostatistics online); for lognormal distributions, there is also a trade-off between MW-MW $_{0}^{l}$ and MW-MW $_{2}^{p e r m}$ (Figure B3(b) of the supplementary material available at Biostatistics online), but SR-MW $_{0}^{l}$ appears mostly preferable over SR-MW $_{2}$ (Figure B3(a) of the supplementary material available at Biostatistics online). The cause of the latter result can be attributed to the interesting fact that the SR test is not an efficient test for lognormal data (Table C.1 of the supplementary material available at Biostatistics online). When the SR test does not fully take advantage of the information in the paired data (⁠ $X, Y$ ⁠), comparing $X$ with $Y^{'}$ and $X^{'}$ with $Y$ ⁠, as SR-MW $_{0}^{l}$ does, improves the efficiency of the overall test. The practical implication of this observation is that we should preprocess the data by applying proper transformation if the distributions appear highly skewed.

Our recommendation for the case when there are unpaired observations from both samples has two parts. If a simple rule of thumb is desirable, our recommendation is to choose SR-MW $_{0}^{l}$ /MW-MW $_{0}^{l}$ when $ρ < 0.5$ and SR-MW $_{2}$ /MW-MW $_{2}^{p e r m}$ when $ρ > 0.5$ ⁠. On the other hand, if an optimal choice is important, we recommend doing a simulation study to find the most powerful approach. To make this a feasible option for practitioners, we provide an easy-to-use function, choose.test, in the R package chngpt. The only information the function needs is the sample sizes and the estimated first and second moments from the data, and it is fast, for example, it takes only 2 s to run on an Intel i7 processor clocked at 2.6GHz when $m = 20, l = 40, n = 5$ ⁠.

For the case when there are only unpaired observations from one sample (thus SR-MW $_{2}$ /MW-MW $_{2}^{p e r m}$ are not applicable), we recommend choosing between SR and SR-MW $_{0}^{l}$ /MW-MW $_{0}^{l}$ through the choose.test function, since there is a trade-off in power between the two tests depending on $ρ$ and sample sizes (Tables D.1–D.3 of the supplementary material available at Biostatistics online).

Lastly, given the choice between SR-MW $_{0}^{l}$ and MW-MW $_{0}^{l}$ ⁠, we recommend SR-MW $_{0}^{l}$ if a monotone transformation can be performed on both samples so that the distributions from both samples are not too skewed. If that is not possible or desirable, for example, when one sample has a highly skewed distribution while the other does not, MW-MW $_{0}^{l}$ is preferred because it is a more robust test and invariant to monotone transformations applied to both samples. When using MW-MW $_{0}^{l}$ ⁠, one should proceed with caution as Type 1 error rates may be inflated when sample sizes are small (Tables D.4–D.6 of the supplementary material available at Biostatistics online). Similar arguments can be applied to the choice between SR-MW $_{2}$ and MW-MW $_{2}^{p e r m}$ ⁠, except that there is no concern of inflated Type 1 error rates here.

The chngpt package is available from the Comprehensive R Archive Network, and the Monte Carlo study code can be downloaded at https://github.com/youyifong/response_to_letter_on_rank.

Acknowledgments

The authors are grateful to Lindsay N. Carpp for help with editing. Conflict of Interest: None declared.

Funding

This work was supported by R01-AI122991, R01-GM106177, UM1-AI068635, UM1-AI068618, and OPP1099507.

References

Fong,

Y.

,

Huang,

Y.

,

Lemos,

M. P.

and

Mcelrath,

M. J.

(

2018

).

Rank-based two-sample tests for paired data with missing values

.

Biostatistics

,

19

,

281

–

294

.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Download all slides

Month:	Total Views:
December 2018	10
January 2019	15
February 2019	4
March 2019	9
April 2019	13
May 2019	6
June 2019	9
July 2019	13
August 2019	2
September 2019	12
October 2019	3
November 2019	2
December 2019	1
January 2020	3
February 2020	2
March 2020	1
May 2020	1
July 2020	2
January 2021	2
February 2021	3
April 2021	1
May 2021	1
June 2021	2
July 2021	2
November 2021	7
December 2021	2
January 2022	11
February 2022	12
March 2022	4
April 2022	10
May 2022	12
June 2022	2
July 2022	15
August 2022	20
September 2022	23
October 2022	17
November 2022	10
December 2022	15
January 2023	3
February 2023	7
March 2023	10
April 2023	11
May 2023	5
June 2023	13
July 2023	4
August 2023	8
September 2023	5
October 2023	2
November 2023	9
December 2023	5
January 2024	10
February 2024	6
March 2024	8
April 2024	4
May 2024	8
June 2024	4
July 2024	11
August 2024	8
September 2024	2
October 2024	9
November 2024	7
December 2024	3
January 2025	2
February 2025	10
March 2025	19
April 2025	5

Article Contents

Response to Guo et al.’s Letter to the Editor

Acknowledgments

Funding

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Article Contents

Response to Guo et al.’s Letter to the Editor

Acknowledgments

Funding

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only