Testing for Alpha in Linear Factor Pricing Models with a Large Number of Securities*

Table 1.

Descriptive statistics of Fama–French three factor regression results

	Average β estimates for FF3 factors			Average skewness and excess kurtosis of the residuals
	${\hat{β}}_{MKT}$	${\hat{β}}_{HML}$	${\hat{β}}_{SMB}$	Skewness	Excess kurtosis
Mean	1.05	0.07	0.18	0.32	2.76
SD	0.43	0.57	0.45	0.87	5.61
Median	1.02	0.00	0.17	0.14	1.19
Min	0.19	−1.46	−1.95	−1.53	−0.53
Max	2.92	2.91	1.99	6.34	57.57

We generate the factor loadings as $IIDU (0.3, 1.8)$ for the market factor, $IIDU (- 1.0, 1.0)$ for the HML factor, and $IIDU (- 0.6, 0.9)$ for the SMB factor. In this way, we ensure that the means and standard deviations of the betas match their empirical counterparts, and that the experiments cover a sufficient range of the $β$ estimates reported in Table 1 for the FF3 model.
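The loading design can be sketched in a few lines of NumPy. This is an illustrative sketch only: the function name `draw_ff3_loadings`, the column order (MKT, HML, SMB), the seed, and the sample size used for the moment check are assumptions made here, not part of the original experiments.

```python
import numpy as np

def draw_ff3_loadings(n_assets, rng):
    """Draw an (N, 3) array of loadings on MKT, HML, SMB from the uniform ranges in the text."""
    lows = np.array([0.3, -1.0, -0.6])   # lower bounds: MKT, HML, SMB
    highs = np.array([1.8, 1.0, 0.9])    # upper bounds: MKT, HML, SMB
    # low/high broadcast across the last axis, so each column gets its own range
    return rng.uniform(lows, highs, size=(n_assets, 3))

rng = np.random.default_rng(0)            # arbitrary seed, for reproducibility only
betas = draw_ff3_loadings(100_000, rng)   # large N only to inspect the implied moments
print(betas.mean(axis=0))  # population means implied by the ranges: 1.05, 0.00, 0.15
print(betas.std(axis=0))   # population SDs: 1.5/sqrt(12), 2/sqrt(12), 1.5/sqrt(12)
```

The printed sample moments can be compared with the Mean and SD rows of Table 1.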

The latent factor $v_{t}$ is generated as $IID (0, 1)$ and its loadings $γ_{i}$ are generated to ensure a given factor strength denoted by the exponent $δ_{γ}$. We generate $γ_{i}$ as

$$\begin{array}{l} γ_{i} \sim IIDU (0.7, 0.9), \text{ for } i = 1, 2, \dots, ⌊ N^{δ_{γ}} ⌋, \\ γ_{i} = 0, \text{ for } i = ⌊ N^{δ_{γ}} ⌋ + 1, ⌊ N^{δ_{γ}} ⌋ + 2, \dots, N, \end{array}$$

and to avoid systematic errors we then randomly reshuffle $γ_{i}$ over $i$ before assigning them to the individual returns, $r_{i t}$.
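A minimal sketch of this loading scheme is given below, assuming NumPy; the function name `draw_latent_loadings` and the seed are illustrative choices. Note that floating-point evaluation of $N^{δ_{γ}}$ may need care when the power is an exact integer.

```python
import numpy as np

def draw_latent_loadings(n_assets, delta_gamma, rng):
    """Loadings on the latent factor: floor(N^delta) nonzero entries, then shuffled."""
    n_nonzero = int(np.floor(n_assets ** delta_gamma))
    gamma = np.zeros(n_assets)
    gamma[:n_nonzero] = rng.uniform(0.7, 0.9, size=n_nonzero)
    # random reshuffle over i, so the nonzero loadings are not tied to the first units
    return rng.permutation(gamma)

rng = np.random.default_rng(1)  # arbitrary seed
for delta in (0.0, 0.25, 0.5):
    gamma = draw_latent_loadings(200, delta, rng)
    print(delta, np.count_nonzero(gamma))
```

With $δ_{γ} = 0$ exactly one unit loads on the latent factor, which is why that case still delivers cross-sectionally independent errors when there is no spatial dependence.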

Our theoretical derivations suggest that the size of our proposed ${\hat{J}}_{α}$ test should be under control so long as $δ_{γ} < 1 / 2$. Accordingly, we consider the values $δ_{γ} = 0$, $1 / 4$, and $1 / 2$. Allowing for latent factors is important since in practice researchers cannot be sure that they have included all relevant risk factors in their models. The problem of missing (or latent) factors continues to apply even if we extend the list of observed factors, as is done in the recent literature. See, for example, Giglio and Xiu (2021) and the recent paper by Bailey, Kapetanios, and Pesaran (2021), who consider the estimation of factor strength.

In addition to allowing for latent factors, we also consider network (or spatial) type cross-sectional error dependence by generating the idiosyncratic errors, $u_{i t}$, with a spatially autoregressive component, $η_{i t}$, driven by the innovations $ε_{η, i t}$:

$$\begin{matrix} u_{i t} = γ_{i} v_{t} + η_{i t}, \\ Var (u_{i t}) = γ_{i}^{2} Var (v_{t}) + Var (η_{i t}), \\ η_{i t} = ψ \sum_{j = 1}^{N} w_{i j} η_{j t} + σ_{η i} ε_{η, i t}, \text{ for } i = 1, 2, \dots, N, \end{matrix} \qquad (73)$$

which can be solved for $η_{t} = {(η_{1 t}, η_{2 t}, \dots, η_{N t})}^{'}$ as

$$η_{t} = {(I_{N} - ψ W)}^{- 1} D_{η} ε_{η, t},$$

where $ε_{η, t} = {(ε_{η, 1 t}, ε_{η, 2 t}, \dots, ε_{η, N t})}^{'}$, $ψ \in \{0, 0.25\}$, and $D_{η} = diag (σ_{η 1}, σ_{η 2}, \dots, σ_{η N})$. We adopt a rook form of $W = (w_{i j})$, where all elements in $W$ are zero except $w_{i + 1, i} = w_{j - 1, j} = 0.5$ for $i = 1, 2, \dots, N - 2$ and $j = 3, 4, \dots, N$, with $w_{1, 2} = w_{N, N - 1} = 1$, so that $W$ is standardized with $w_{i i} = 0$ and $\sum_{j = 1}^{N} w_{i j} = 1$. The case of error cross-sectional independence arises for the parameter values $ψ = 0$ and $δ_{γ} = 0$. We allow for error cross-sectional heteroskedasticity by generating $σ_{η i}^{2} \sim IID (1 + χ_{2, i}^{2}) / 3$, and consider (1) Gaussian errors, $ε_{η, i t} \sim IIDN (0, 1)$, as well as (2) non-Gaussian errors, $ε_{η, i t} \sim IID \; t_{ν, i t} / {[ν / (ν - 2)]}^{1 / 2}$, where $t_{ν, i t}$ are independent draws from a t-distribution with $ν$ degrees of freedom. In light of the properties of the empirical distribution of the FF3 regression residuals, for the t-distributed errors we choose $ν = 8$, so that the value of excess kurtosis, 1.5, falls between the sample mean and sample median shown in Table 1.
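The error design can be sketched as follows, assuming NumPy. The function names `rook_weights` and `draw_errors` are hypothetical, and $v_{t}$ is drawn as standard normal here, following the notes to Table 2. The spatial component is obtained by solving $(I_{N} - ψ W) η_{t} = D_{η} ε_{η, t}$ rather than forming the inverse explicitly, which is a numerical convenience and not something the design requires.

```python
import numpy as np

def rook_weights(n):
    """Row-standardized rook matrix: interior units weight each neighbour by 0.5."""
    W = np.zeros((n, n))
    interior = np.arange(1, n - 1)
    W[interior, interior - 1] = 0.5   # w_{i+1,i} in the 1-based notation of the text
    W[interior, interior + 1] = 0.5   # w_{j-1,j}
    W[0, 1] = 1.0                     # w_{1,2}
    W[n - 1, n - 2] = 1.0             # w_{N,N-1}
    return W

def draw_errors(gamma, t_obs, psi, rng, nu=None):
    """Return an (N, T) array u_it = gamma_i v_t + eta_it with spatial AR(1) eta."""
    n = gamma.size
    sigma = np.sqrt((1.0 + rng.chisquare(2, size=n)) / 3.0)  # heteroskedastic scales
    if nu is None:
        eps = rng.standard_normal((n, t_obs))                # design (1): Gaussian
    else:
        # design (2): t draws rescaled to unit variance
        eps = rng.standard_t(nu, size=(n, t_obs)) / np.sqrt(nu / (nu - 2.0))
    A = np.eye(n) - psi * rook_weights(n)
    eta = np.linalg.solve(A, sigma[:, None] * eps)           # eta_t for every t at once
    v = rng.standard_normal(t_obs)                           # latent factor
    return gamma[:, None] * v[None, :] + eta

rng = np.random.default_rng(0)  # arbitrary seed
u = draw_errors(np.zeros(50), t_obs=60, psi=0.25, rng=rng, nu=8)
print(u.shape)
```

Setting `psi=0` and passing loadings generated with $δ_{γ} = 0$ gives the cross-sectionally independent benchmark.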

All $N$ return series are generated for $t = - 49, - 48, \dots, 0, 1, 2, \dots, T$, with $f_{ℓ, - 50} = 0$ and $h_{ℓ, - 50} = 1$ for $ℓ = 1, 2, 3$. The first 50 observations are dropped to minimize the effects of the initial values, and the observations $r_{i t}$ and $f_{t} = {(f_{1 t}, f_{2 t}, f_{3 t})}^{'}$, for $t = 1, 2, \dots, T$, are used in the MC experiments. Further details are provided in the Supplementary Material.
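The burn-in scheme can be illustrated with a single factor that follows the AR(1) process with GARCH(1,1) innovations stated in the notes to Table 2. The sketch below assumes NumPy; the function name `simulate_factor`, the initial value $e_{ℓ, - 50} = 0$, and all parameter values in the usage lines are placeholders chosen for illustration, since the values actually used are given in the Supplementary Material and not reproduced here.

```python
import numpy as np

def simulate_factor(t_obs, mu_f, rho_f, mu_h, rho_1h, rho_2h, rng, burn_in=50):
    """AR(1) factor with GARCH(1,1) innovations; the first `burn_in` draws are discarded."""
    f_prev, h_prev = 0.0, 1.0   # f_{-50} = 0 and h_{-50} = 1, as in the text
    e_prev = 0.0                # assumption: the initial innovation is not stated
    path = np.empty(burn_in + t_obs)
    for s in range(burn_in + t_obs):
        h = mu_h + rho_1h * h_prev + rho_2h * e_prev ** 2   # conditional variance
        e = np.sqrt(h) * rng.standard_normal()              # innovation e_t
        f = mu_f + rho_f * f_prev + e                       # factor f_t
        path[s] = f
        f_prev, h_prev, e_prev = f, h, e
    return path[burn_in:]       # keep t = 1, ..., T only

rng = np.random.default_rng(0)  # arbitrary seed
# Placeholder parameter values, NOT those of the paper
f_mkt = simulate_factor(120, mu_f=0.5, rho_f=0.05, mu_h=0.9,
                        rho_1h=0.8, rho_2h=0.1, rng=rng)
print(f_mkt.shape)
```

Because the first 50 draws are discarded, the retained sample depends only weakly on the initial values.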

To estimate the size of the tests, we set $α_{i} = 0$ for all $i$. To investigate power, we consider alternatives based on Equation (5), setting $λ_{0} = 0$, namely

$$α_{i} = β_{i}^{'} (λ - μ) + ϖ_{i} .$$

For the scenario called “Power 1,” we set $λ = μ$ and generate $α_{i}$ as $α_{i} = ϖ_{i} \sim IIDN (0, 1)$ for $i = 1, 2, \dots, N_{α}$ with $N_{α} = ⌊ N^{δ_{α}} ⌋$, and $α_{i} = 0$ for $i = N_{α} + 1, N_{α} + 2, \dots, N$. We consider the value $δ_{α} = 0.7$. In another scenario called “Power 2,” we assume there are no pricing errors and set $ϖ_{i} = 0$ for all $i$, but consider the case where $λ - μ = c {(2.92, - 0.63, - 9.96)}^{'}$, which matches the estimates reported in Table 1 of GOS (p. 1011) for $c = 1$. To make the power of the tests under “Power 2” comparable with that under “Power 1,” we set $c = 0.1$. We do not consider the case where both $λ \neq μ$ and $ϖ_{i} \neq 0$, as it is clear that higher power will be achieved in this case.
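The two alternatives can be sketched as follows, assuming NumPy. The function names are hypothetical, and the sketch assumes that the elements of $λ - μ$ are ordered in the same way as the columns of the loading matrix, which the text does not state explicitly.

```python
import numpy as np

def alphas_power1(n_assets, delta_alpha, rng):
    """Power 1: the first floor(N^delta_alpha) alphas are N(0, 1), the rest are zero."""
    n_alpha = int(np.floor(n_assets ** delta_alpha))
    alpha = np.zeros(n_assets)
    alpha[:n_alpha] = rng.standard_normal(n_alpha)
    return alpha

def alphas_power2(betas, c=0.1):
    """Power 2: alpha_i = beta_i'(lambda - mu), with no idiosyncratic pricing errors."""
    lam_minus_mu = c * np.array([2.92, -0.63, -9.96])  # assumed to match the beta column order
    return betas @ lam_minus_mu

rng = np.random.default_rng(0)  # arbitrary seed
alpha1 = alphas_power1(100, 0.7, rng)
betas = rng.uniform([0.3, -1.0, -0.6], [1.8, 1.0, 0.9], size=(100, 3))
alpha2 = alphas_power2(betas)
print(np.count_nonzero(alpha1), alpha2.shape)
```

Under “Power 1” only a fraction of the securities is mispriced, whereas under “Power 2” every security with nonzero $β_{i}^{'} (λ - μ)$ has a nonzero alpha.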

All combinations of T = 60, 120, 240 and N = 50, 100, 200 (and 500, 1000, 2000, 5000 for the ${\hat{J}}_{α}$ test) are considered. All tests are conducted at the 5% significance level and all experiments are based on R = 2000 replications. To compute ${\tilde{ρ}}_{N, T}^{2}$, which enters the denominator of the ${\hat{J}}_{α}$ statistic given by Equation (46), we consider $p \in \{0.05, 0.1\}$ and $δ \in \{1, 2\}$. The results are largely insensitive to the choice of $(p, δ)$, and the case $(p, δ) = (0.05, 1)$ is reported. It is worth noting that the choice of $p$ when computing ${\tilde{ρ}}_{N, T}^{2}$ is not governed or affected by the choice of the nominal size of the ${\hat{J}}_{α}$ test.
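To help read the reported rejection frequencies, the sketch below computes an empirical rejection rate for a one-sided test against the standard normal critical value, together with the binomial Monte Carlo standard error implied by $R = 2000$ replications. It assumes NumPy and the standard library; the function names are hypothetical, and the simulated statistics are standard normal stand-ins rather than draws of any of the test statistics considered here.

```python
import numpy as np
from statistics import NormalDist

def rejection_rate(test_stats, level=0.05):
    """Share of statistics exceeding the one-sided standard normal critical value."""
    crit = NormalDist().inv_cdf(1.0 - level)
    return float(np.mean(np.asarray(test_stats) > crit))

def mc_standard_error(rate, n_reps):
    """Binomial standard error of a rejection frequency estimated from n_reps replications."""
    return float(np.sqrt(rate * (1.0 - rate) / n_reps))

rng = np.random.default_rng(0)       # arbitrary seed
stats = rng.standard_normal(2000)    # stand-in for R = 2000 statistics under the null
print(rejection_rate(stats))
print(mc_standard_error(0.05, 2000))
```

With a true size of 5% and 2000 replications, the standard error is roughly 0.49 percentage points, so empirical sizes between about 4% and 6% are within two standard errors of the nominal level.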

5.2 Size and Power

Table 2 reports the size and power of the ${\hat{J}}_{α}$, GRS, GOS, SW, $F_{max}$, BS, and SD tests in the case of normal errors, under various degrees of cross-sectional error correlation, as measured by the exponent $δ_{γ}$.

Table 2.

Size and power of the ${\hat{J}}_{α}$ and other tests with normal errors

Panel A: Size ($α_{i} = 0$ for all $i$)
		$δ_{γ} = 0$			$δ_{γ} = 1 / 4$			$δ_{γ} = 1 / 2$
	(T, N)	50	100	200	50	100	200	50	100	200
${\hat{J}}_{α}$
	60	6.4	5.6	4.7	6.1	6.1	6.1	5.5	6.8	5.9
	120	6.5	5.6	4.7	5.9	5.9	5.3	5.8	6.1	6.1
	240	4.9	5.8	5.2	5.7	5.8	4.7	6.0	6.2	6.4
GRS
	60	5.0	–	–	4.1	–	–	5.3	–	–
	120	5.8	4.3	–	4.9	4.3	–	4.9	3.7	–
	240	4.3	4.9	4.5	4.8	5.4	4.9	5.9	4.6	5.1
GOS
	60	17.4	23.5	30.3	17.3	22.5	31.5	16.9	23.8	29.9
	120	11.3	12.3	13.9	9.8	12.2	14.4	9.6	11.7	14.7
	240	7.2	8.9	9.3	7.4	8.4	8.6	7.7	8.4	9.6
SW
	60	17.4	23.5	30.3	17.4	22.6	31.5	17.8	24.3	30.2
	120	11.3	12.3	13.9	10.0	12.2	14.4	22.9	19.6	16.0
	240	7.2	8.9	9.3	7.4	8.7	8.6	10.8	14.3	20.9
F_max
	60	0.4	0.2	0.1	0.1	0.0	0.2	0.4	0.2	0.0
	120	0.2	0.1	0.1	0.1	0.2	0.0	0.1	0.1	0.0
	240	0.1	0.2	0.2	0.1	0.1	0.2	0.1	0.1	0.1
BS
	60	4.2	4.0	4.6	3.4	4.4	3.9	3.9	4.4	4.3
	120	3.4	2.9	2.7	2.7	2.9	2.4	2.9	3.5	3.5
	240	2.0	2.4	2.0	2.6	2.5	2.0	3.2	2.9	3.0
SD
	60	10.9	12.0	13.2	10.2	12.1	13.5	9.3	11.2	11.9
	120	7.9	7.7	8.3	7.1	7.9	8.5	6.4	8.1	8.6
	240	5.0	6.7	6.7	5.7	6.3	5.8	5.9	6.7	7.3

Panel B: Power 1 ($α_{i} = ϖ_{i} \sim N (0, 1)$ for $i = 1, \dots, ⌊ N^{0.7} ⌋$ and $α_{i} = 0$ for other $i$)
			$δ_{γ} = 0$			$δ_{γ} = 1 / 4$			$δ_{γ} = 1 / 2$
	(T, N)		50	100	200	50	100	200	50	100	200
${\hat{J}}_{α}$
	60		70.3	81.7	90.8	64.6	78.1	86.9	53.4	66.0	77.0
	120		93.6	98.5	99.7	91.7	98.2	99.8	84.7	95.5	98.6
	240		99.5	99.9	100.0	99.4	100.0	100.0	98.8	99.9	100.0
GRS
	60		14.7	–	–	13.4	–	–	14.5	–	–
	120		82.8	48.9	–	80.1	49.3	–	79.6	48.5	–
	240		99.0	99.8	95.5	99.0	99.8	95.6	99.0	99.7	95.4
GOS
	60		83.1	93.0	98.6	80.3	91.7	97.9	71.7	86.0	96.0
	120		95.1	99.2	99.9	94.5	99.1	100.0	89.2	97.6	99.5
	240		99.6	100.0	100.0	99.4	100.0	100.0	99.1	99.9	100.0
SW
	60		83.1	93.0	98.6	80.4	91.7	97.9	72.7	86.5	96.1
	120		95.1	99.2	99.9	94.5	99.1	100.0	94.6	98.6	99.7
	240		99.6	100.0	100.0	99.4	100.0	100.0	99.6	100.0	100.0
F_max
	60		17.6	20.3	25.3	16.0	18.8	20.5	11.2	16.1	16.5
	120		53.2	65.8	76.0	50.0	63.6	72.7	38.2	50.3	65.0
	240		87.9	95.7	99.2	87.0	94.8	98.8	77.8	90.4	96.6
BS
	60		39.8	49.4	63.1	38.0	49.4	58.8	28.9	39.7	48.9
	120		73.2	86.2	95.0	71.0	85.7	94.1	63.2	79.7	90.1
	240		96.3	99.4	100.0	95.5	99.6	100.0	92.8	98.6	99.9
SD
	60		76.7	87.9	95.6	72.7	85.5	93.5	60.9	75.4	87.5
	120		94.4	98.8	99.8	93.0	98.7	99.9	86.3	96.4	99.1
	240		99.5	99.9	100.0	99.4	100.0	100.0	98.7	99.9	100.0

Panel C: Power 2 ($α_{i} = β_{i}^{'} (λ - μ)$ with $(λ - μ) = 0.1 {(2.92, - 0.63, - 9.96)}^{'}$)
		$δ_{γ} = 0$			$δ_{γ} = 1 / 4$			$δ_{γ} = 1 / 2$
	(T, N)	50	100	200	50	100	200	50	100	200
${\hat{J}}_{α}$
	60	58.4	81.5	96.3	56.2	79.4	96.5	49.0	75.6	94.9
	120	94.4	99.7	100.0	93.0	99.6	100.0	90.0	99.4	100.0
	240	100.0	100.0	100.0	100.0	100.0	100.0	99.9	100.0	100.0
GRS
	60	11.8	–	–	12.3	–	–	12.4	–	–
	120	78.1	47.7	–	75.5	46.5	–	76.9	45.1	–
	240	99.9	100.0	99.3	99.8	100.0	99.0	99.8	100.0	99.1
GOS
	60	76.2	94.8	99.7	75.0	94.0	99.9	72.0	93.2	99.7
	120	96.5	100.0	100.0	96.1	99.7	100.0	94.0	99.9	100.0
	240	100.0	100.0	100.0	100.0	100.0	100.0	100.0	100.0	100.0
SW
	60	77.4	93.8	99.9	78.1	93.7	99.8	75.6	92.5	99.8
	120	97.1	99.8	100.0	95.7	100.0	100.0	95.7	99.7	100.0
	240	100.0	100.0	100.0	99.9	100.0	100.0	99.9	100.0	100.0
F_max
	60	1.6	1.9	1.7	1.5	1.5	1.5	1.3	1.4	1.5
	120	7.6	9.0	10.3	6.6	7.6	9.1	7.5	7.7	8.9
	240	35.2	43.7	55.4	31.4	44.5	56.7	29.3	42.3	54.9
BS
	60	25.8	44.6	70.9	23.4	42.1	69.4	18.7	33.8	57.7
	120	60.6	88.0	99.0	57.8	85.4	99.4	47.5	77.7	97.6
	240	96.3	100.0	100.0	95.2	100.0	100.0	91.9	99.6	100.0
SD
	60	67.6	89.0	99.0	65.9	87.4	98.9	59.4	83.8	97.9
	120	95.1	99.8	100.0	94.5	99.7	100.0	90.8	99.8	100.0
	240	100.0	100.0	100.0	100.0	100.0	100.0	99.9	100.0	100.0

Notes: This table summarizes the size and power of the ${\hat{J}}_{α}$, GRS, GOS, SW, $F_{max}$, BS, and SD tests of $α_{i} = 0$ for $i = 1, 2, \dots, N$, in the case of three-factor models. The observations are generated as $y_{i t} = α_{i} + \sum_{ℓ = 1}^{3} β_{ℓ i} f_{ℓ t} + u_{i t}$, $i = 1, 2, \dots, N$; $t = 1, 2, \dots, T$, with $f_{ℓ t} = μ_{f ℓ} + ρ_{f ℓ} f_{ℓ, t - 1} + e_{ℓ t}$, where $e_{ℓ t} = \sqrt{h_{ℓ t}} ξ_{ℓ t}$, $h_{ℓ t} = μ_{h ℓ} + ρ_{1 h ℓ} h_{ℓ, t - 1} + ρ_{2 h ℓ} e_{ℓ, t - 1}^{2}$, $ξ_{ℓ t} \sim IIDN (0, 1)$, $t = - 49, \dots, T$, with $f_{ℓ, - 50} = 0$ and $h_{ℓ, - 50} = 0$, $ℓ = 1, 2, 3$. The idiosyncratic errors are generated as $u_{i t} = γ_{i} v_{t} + σ_{η i} ε_{η, i t}$, where $ε_{η, i t} \sim IIDN (0, 1)$, $v_{t} \sim IIDN (0, 1)$, and $σ_{η i}^{2} \sim IID (1 + χ_{2, i}^{2}) / 3$. The first $⌊ N^{δ_{γ}} ⌋$ $(< N)$ elements of $γ_{i}$ are generated as $Uniform (0.7, 0.9)$, and the remaining elements are set to 0. We consider the values $δ_{γ} = 0$, $1 / 4$, and $1 / 2$. ${\hat{J}}_{α}$ is the proposed test. GRS is the F-test due to Gibbons et al. (1989), which is distributed as $F_{N, T - N - m}$ and is applicable when $T > N + 4$; “–” signifies that the GRS statistic cannot be computed. GOS is the test proposed by Gagliardini et al. (2016), defined in Equation (47). SW is the test based on the POET estimator of Fan et al. (2013). $F_{max}$ is the test proposed by GL, and BS and SD are the tests of He et al. (2021), which are defined in the Supplementary Material. Values of ${\hat{J}}_{α}$, GOS, SW, BS, and SD are compared with a positive one-sided critical value of the standard normal distribution. All tests are conducted at the 5% significance level. Experiments are based on 2000 replications.

First, consider Panel A of Table 2, which reports the size of the tests. The GRS test, when applicable (namely when T > N), is an exact test and has the correct size. The empirical size of the ${\hat{J}}_{α}$ test is also very close to the 5% nominal level for all combinations of N and T. Even when N = 200 and $δ_{γ} = 0.5$, the size of the ${\hat{J}}_{α}$ test lies in the range 5.9–6.4% for different values of T. In contrast, both the GOS and SW tests grossly over-reject the null hypothesis, and the degree of over-rejection becomes more serious as N increases for a given T. In line with the discussion in Section 3.4, the size distortion of these tests is mitigated when T increases. The $F_{max}$ test severely under-rejects the null hypothesis, with its size ranging between 0.0% and 0.4%. Although less extreme than the $F_{max}$ test, the BS test is also very conservative, and its size steadily drops as T (and N) rises. Similarly, although less extreme than the GOS and SW tests, the SD test tends to over-reject the null hypothesis, and the degree of over-rejection becomes more serious as N increases for a given T.

The power of the tests based on the “Power 1” design is reported in Panel B of Table 2. The power of the ${\hat{J}}_{α}$ test is substantially higher than that of the GRS test. This is in line with our discussion at the end of Section 1, and reflects the fact that GRS allows for an arbitrary degree of cross-sectional error correlation and thus relies on a large time dimension to achieve reasonably high power. In contrast, the power of the ${\hat{J}}_{α}$ test is driven largely by the cross-sectional dimension. A power comparison of the GOS, SW, and SD tests with the ${\hat{J}}_{α}$ test seems inappropriate, given their large size distortions. Having said this, it is perhaps remarkable that the power of the ${\hat{J}}_{α}$ test is comparable to the unadjusted power of the GOS and SW tests. The power of the $F_{max}$ and BS tests does not exceed, and is typically well below, the power of the ${\hat{J}}_{α}$ test, likely due to the conservative nature of these tests. The power of the tests based on the “Power 2” design is reported in Panel C of Table 2. The properties of the tests under this design are qualitatively very similar to those under the “Power 1” design, and a detailed discussion of Panel C is therefore omitted.

We now consider the case in which the errors are non-normal. The size results are summarized in Table 3. The results show that the size of the ${\hat{J}}_{α}$ test and the GRS test, as well as the $F_{max}$, BS, and SD tests, is hardly affected by non-normality. The over-rejection of the GOS and SW tests tends to be somewhat magnified by non-normality.

Table 3.

Size of the ${\hat{J}}_{α}$ and other tests with non-normal errors

Size: $α_{i} = 0$ for all i
		$δ_{γ} = 0$			$δ_{γ} = 1 / 4$			$δ_{γ} = 1 / 2$
	(T, N)	50	100	200	50	100	200	50	100	200
${\hat{J}}_{α}$
	60	5.9	4.6	5.6	5.0	6.2	5.0	5.5	6.6	7.0
	120	5.7	4.8	5.2	4.3	6.2	6.0	5.8	5.7	5.1
	240	5.8	5.7	5.4	4.7	5.6	5.4	6.5	6.8	5.8
GRS
	60	5.0	–	–	4.5	–	–	5.4	–	–
	120	4.9	5.1	–	4.8	4.7	–	3.6	5.1	–
	240	5.5	4.7	4.2	3.7	5.0	4.7	5.4	5.6	5.0
GOS
	60	17.1	22.2	30.0	15.5	21.7	29.2	17.0	22.9	32.6
	120	9.5	10.8	14.0	9.5	11.9	14.3	8.9	12.4	14.4
	240	8.1	8.3	8.9	6.6	7.9	9.0	8.1	9.2	9.1
SW
	60	17.1	22.1	30.1	15.5	21.7	29.2	18.5	23.5	32.8
	120	9.5	10.8	14.0	9.5	11.8	14.4	19.7	19.9	15.5
	240	8.1	8.3	8.9	6.6	8.0	9.0	11.1	17.7	24.6
F_max
	60	0.0	0.2	0.1	0.2	0.1	0.1	0.1	0.2	0.2
	120	0.1	0.1	0.1	0.0	0.1	0.1	0.0	0.1	0.1
	240	0.2	0.1	0.2	0.2	0.2	0.1	0.1	0.3	0.1
BS
	60	3.9	3.6	4.6	2.9	4.4	3.5	3.5	4.5	4.7
	120	3.2	2.0	3.3	2.5	3.2	2.1	2.9	2.5	3.4
	240	2.2	1.8	2.2	2.1	2.6	2.1	3.0	2.6	3.0
SD
	60	10.8	11.3	13.0	9.4	12.2	12.7	9.6	12.1	13.3
	120	6.7	6.3	8.5	5.7	8.3	8.7	6.6	7.4	7.8
	240	5.9	6.1	6.4	4.8	6.0	6.6	6.3	7.1	6.8

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that $u_{i t} = γ_{i} v_{t} + σ_{η i} ε_{η, i t}$, where $ε_{η, i t}$ is independently drawn from a standardized Student's t-distribution with eight degrees of freedom.
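As a check on the standardization, the following worked calculation uses the textbook variance and excess kurtosis of a Student's t variable with $ν > 4$ degrees of freedom; these are standard properties of the t-distribution rather than results of this paper.

$$\begin{aligned} Var (t_{ν}) &= \frac{ν}{ν - 2} = \frac{8}{6} = \frac{4}{3}, \qquad Var \left( \frac{t_{8}}{\sqrt{4 / 3}} \right) = 1, \\ \text{excess kurtosis} &= \frac{6}{ν - 4} = \frac{6}{4} = 1.5 . \end{aligned}$$

Since rescaling leaves kurtosis unchanged, the standardized errors have unit variance and excess kurtosis 1.5, the value that lies between the sample median (1.19) and the sample mean (2.76) of the residual excess kurtosis in Table 1.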

Furthermore, we examine the behavior of the test statistics under the same DGP as in Table 2, except that a spatial autoregressive component is incorporated into the error generation process. The results with such mixed factor-spatial errors are reported in Table 4. As can be seen, the size of the ${\hat{J}}_{α}$ and GRS tests is well controlled, with a slight over-rejection for T = 60, which disappears when T is increased to 120. In contrast, the size distortion of the GOS and SW tests seems to be amplified with this design. The size properties of the $F_{max}$, BS, and SD tests remain similar to those in Table 2.

Table 4.

Size of the ${\hat{J}}_{α}$ and other tests, spatially correlated errors

Size: $α_{i} = 0$ for all i
			$δ_{γ} = 0$			$δ_{γ} = 1 / 4$			$δ_{γ} = 1 / 2$
	(T, N)		50	100	200	50	100	200	50	100	200
${\hat{J}}_{α}$
	60		7.3	7.1	7.8	5.8	7.0	6.1	6.7	6.8	6.4
	120		6.1	6.5	6.1	6.0	5.2	5.7	6.5	6.2	6.6
	240		6.5	6.1	5.6	5.8	4.9	5.9	6.9	7.0	5.9
GRS
	60		4.4	–	–	4.1	–	–	4.9	–	–
	120		5.5	5.4	–	4.4	5.2	–	5.4	5.5	–
	240		5.7	5.0	4.3	5.0	5.0	5.3	5.6	4.5	4.1
GOS
	60		17.4	23.9	32.3	17.7	24.0	31.1	19.3	24.5	30.9
	120		11.4	13.8	16.5	11.0	12.6	15.2	10.9	11.5	16.9
	240		8.9	10.2	9.8	8.6	8.6	10.8	8.5	9.8	9.4
SW
	60		17.5	23.9	32.2	17.8	24.1	31.2	20.5	25.5	31.0
	120		11.9	13.8	16.5	12.6	13.0	15.4	44.8	15.7	18.9
	240		17.7	12.8	11.3	15.8	14.3	12.9	20.3	44.9	26.5
F_max
	60		0.2	0.2	0.0	0.3	0.1	0.1	0.3	0.1	0.1
	120		0.1	0.1	0.1	0.2	0.1	0.0	0.0	0.2	0.1
	240		0.1	0.0	0.1	0.1	0.0	0.2	0.1	0.1	0.2
BS
	60		4.0	4.2	3.8	3.8	3.6	3.5	4.0	4.4	3.6
	120		3.1	3.2	3.4	2.8	3.0	2.6	3.0	3.2	3.6
	240		2.7	3.0	2.4	2.9	2.4	2.4	3.0	3.4	2.5
SD
	60		9.8	12.0	13.4	9.4	11.3	12.3	9.5	10.6	11.6
	120		6.8	7.7	7.9	6.4	6.9	7.7	7.4	7.0	8.0
	240		6.4	6.7	6.4	5.6	5.2	6.8	6.4	7.1	6.3

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that $u_{i t} = γ_{i} v_{t} + η_{i t}$ with $η_{i t} = ψ \sum_{j = 1}^{N} w_{i j} η_{j t} + σ_{η i} ε_{η, i t}$. We set $ψ = 1 / 4$ and use a rook form for $W = (w_{i j})$, namely, all elements of W are zero except $w_{i + 1, i} = w_{j - 1, j} = 0.5$ for $i = 1, 2, \dots, N - 2$ and $j = 3, 4, \dots, N$, with $w_{1, 2} = w_{N, N - 1} = 1$.
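
The spatial error design in the note can be sketched as follows. This is a minimal illustration under stated assumptions, not the code used for the experiments: the function names `rook_weights` and `spatial_errors` are hypothetical, and the innovations $σ_{η i} ε_{η, i t}$ are supplied by the caller because their distribution is specified in the note to Table 2 rather than here. The sketch builds W as defined in the note and solves $(I_{N} - ψ W) η_{t} = e_{t}$ for each t.

```python
import numpy as np

def rook_weights(n):
    """Rook-form W: interior rows put 0.5 on each neighbor, end rows put 1 on their single neighbor."""
    W = np.zeros((n, n))
    for r in range(1, n - 1):      # interior rows (0-based indexing)
        W[r, r - 1] = 0.5          # w_{i+1,i} = 0.5
        W[r, r + 1] = 0.5          # w_{j-1,j} = 0.5
    W[0, 1] = 1.0                  # w_{1,2} = 1
    W[n - 1, n - 2] = 1.0          # w_{N,N-1} = 1
    return W

def spatial_errors(innovations, psi=0.25):
    """Solve eta_t = psi * W eta_t + e_t for every row e_t of a T x N innovation array."""
    e = np.asarray(innovations, dtype=float)
    n = e.shape[1]
    A = np.eye(n) - psi * rook_weights(n)
    # Each column of e.T is one e_t; solve A eta_t = e_t for all t at once.
    return np.linalg.solve(A, e.T).T
```

Since every row of W sums to one, $I_{N} - ψ W$ is strictly diagonally dominant for $ψ = 1 / 4$, so the linear system always has a unique solution.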

Since autoregressive conditional heteroskedasticity is commonly found in security returns, we also investigate the effect of cross-sectionally correlated errors that follow GARCH(1,1) processes. The size properties of the tests are summarized in Table 5. The results are almost identical to those reported in Table 2 for errors that are homoskedastic over time (but cross-sectionally heteroskedastic). This is to be expected, as the LFPM is a static model and unconditionally homoskedastic GARCH errors do not affect our theoretical results.

Table 5.

Size of the ${\hat{J}}_{α}$ and other tests, GARCH(1,1) errors

Size: $α_{i} = 0$ for all i
		$δ_{γ} = 0$			$δ_{γ} = 1 / 4$			$δ_{γ} = 1 / 2$
	(T, N)	50	100	200	50	100	200	50	100	200
${\hat{J}}_{α}$
	60	5.6	4.9	5.6	5.9	5.6	5.0	6.0	6.1	5.5
	120	5.2	5.8	5.9	5.8	5.0	4.6	5.4	5.6	5.3
	240	6.1	4.9	6.0	5.8	5.6	4.9	5.6	6.8	4.8
GRS
	60	3.9	–	–	4.8	–	–	4.6	–	–
	120	3.7	4.8	–	5.3	5.6	–	4.9	4.9	–
	240	4.5	5.0	5.8	4.8	5.4	5.5	5.0	5.4	5.3
GOS
	60	15.3	21.5	29.9	17.9	20.6	32.5	18.5	22.7	29.8
	120	9.5	11.9	14.0	10.1	10.5	13.7	10.5	12.4	14.9
	240	8.2	7.3	9.3	8.2	8.9	8.9	7.8	10.1	9.8
SW
	60	16.1	22.5	29.6	16.1	22.1	29.4	19.0	23.5	31.5
	120	9.8	11.2	15.1	9.7	11.4	15.1	21.2	23.3	16.8
	240	7.7	8.8	8.1	7.8	8.7	8.5	11.1	16.9	27.4
F_max
	60	0.1	0.0	0.0	0.1	0.1	0.0	0.1	0.0	0.1
	120	0.1	0.2	0.1	0.1	0.1	0.1	0.1	0.0	0.1
	240	0.0	0.0	0.0	0.1	0.1	0.0	0.0	0.1	0.1
BS
	60	4.0	4.0	3.8	3.7	3.3	4.3	4.0	3.8	4.0
	120	2.9	3.3	3.9	2.8	2.8	2.9	3.0	3.1	2.3
	240	2.6	1.6	2.0	2.7	2.6	2.3	2.7	2.6	2.4
SD
	60	8.7	10.8	12.5	9.9	11.2	13.4	10.3	10.8	11.3
	120	6.6	8.2	9.1	7.2	7.2	7.7	6.4	7.4	7.3
	240	6.3	5.5	6.8	5.9	6.6	6.6	5.6	7.3	6.1

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that $u_{i t} = γ_{i} v_{t} + ε_{η, i t}$ with $ε_{η, i t} = \sqrt{ω_{i t}} ζ_{i t}$ and $ζ_{i t} \sim IIDN (0, 1)$, where $ω_{i t} = σ_{η i}^{2} (1 - ϱ - φ) + ϱ ω_{i, t - 1} + φ ε_{η, i, t - 1}^{2}$. We set $ϱ = 0.2$ and $φ = 0.6$. The first 50 time-series observations of $ε_{η, i t}$ are discarded.
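
The GARCH(1,1) recursion in the note can be simulated along the following lines. This is a sketch only: the function name `garch_errors` is hypothetical, the variances $σ_{η i}^{2}$ are supplied by the caller, and starting the recursion at the unconditional variance is an assumption, since the note does not state the initial condition (the burn-in of 50 observations is intended to make that choice immaterial).

```python
import numpy as np

def garch_errors(T, sigma2, rho=0.2, phi=0.6, burn=50, rng=None):
    """Simulate a T x N array of GARCH(1,1) errors with unconditional variances sigma2."""
    rng = np.random.default_rng(rng)
    sigma2 = np.asarray(sigma2, dtype=float)
    n = sigma2.size
    omega = sigma2.copy()                       # assumed start: unconditional variance
    eps_prev = np.sqrt(omega) * rng.standard_normal(n)
    out = np.empty((T, n))
    for t in range(T + burn):
        # omega_t = sigma2 (1 - rho - phi) + rho omega_{t-1} + phi eps_{t-1}^2
        omega = sigma2 * (1.0 - rho - phi) + rho * omega + phi * eps_prev**2
        eps_prev = np.sqrt(omega) * rng.standard_normal(n)
        if t >= burn:                           # discard the first `burn` observations
            out[t - burn] = eps_prev
    return out
```

Taking expectations of the recursion shows that, whenever $ϱ + φ < 1$, the unconditional variance of $ε_{η, i t}$ equals $σ_{η i}^{2}$, which is the sense in which these errors are unconditionally homoskedastic.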

The experimental results so far confirm that the finite sample performance of the ${\hat{J}}_{α}$ test is superior to that of the other tests we have considered. In light of these promising results, we further investigate the properties of the ${\hat{J}}_{α}$ test, in particular its sensitivity to the choice of values for $\{δ, p\}$ and the effectiveness of the standardization employed by the ${\hat{J}}_{α}$ statistic.

First, we examine the sensitivity of the test to the choice of $\{δ, p\}$. As mentioned, the ${\hat{J}}_{α}$ test we have considered employs δ = 1 and p = 0.1. To check whether this choice is appropriate, in the next experiment we consider the four combinations of $\{δ, p\}$ formed from $δ = 1, 2$ and $p = 0.1, 0.05$. Table 6 summarizes the size and power results. As can be seen, the choice of p has little effect on the size and power characteristics. The performance of the test is slightly sensitive to the choice of δ, but this effect quickly disappears as T increases. These experimental results support the use of the ${\hat{J}}_{α}$ test with δ = 1 and p = 0.1.

Table 6.

Size and power of the ${\hat{J}}_{α}$ tests for $p = \{0.1, 0.05\}$ and $δ = \{1, 2\}$ with normal errors

		$δ_{γ} = 0$			$δ_{γ} = 1 / 4$			$δ_{γ} = 1 / 2$
	(T, N)	50	100	200	50	100	200	50	100	200
Size ($α_{i} = 0$ for all i)
${\hat{J}}_{α} (p = 0.1, δ = 1)$
	60	6.4	5.6	4.7	6.1	6.1	6.1	5.5	6.8	5.9
	120	6.5	5.6	4.7	5.9	5.9	5.3	5.8	6.1	6.1
	240	4.9	5.8	5.2	5.7	5.8	4.7	6.0	6.2	6.4
${\hat{J}}_{α} (p = 0.1, δ = 2)$
	60	6.6	5.7	5.0	6.2	6.1	6.2	6.0	7.6	6.8
	120	6.6	5.6	4.7	6.0	5.9	5.3	6.0	6.5	6.5
	240	5.0	5.9	5.3	5.7	5.8	4.8	6.0	6.3	6.4
${\hat{J}}_{α} (p = 0.05, δ = 1)$
	60	6.4	5.6	4.8	6.1	6.1	6.1	5.6	6.9	5.9
	120	6.5	5.6	4.7	5.9	5.9	5.3	5.9	6.2	6.2
	240	4.9	5.9	5.3	5.7	5.8	4.8	6.0	6.2	6.4
${\hat{J}}_{α} (p = 0.05, δ = 2)$
	60	6.6	5.7	5.0	6.2	6.1	6.2	6.1	7.6	6.9
	120	6.6	5.6	4.7	6.0	5.9	5.3	6.0	6.6	6.5
	240	5.0	5.9	5.3	5.7	5.8	4.8	6.0	6.3	6.4

Power 1 ($α_{i} = ϖ_{i} \sim N (0, 1)$ for $i = 1, \dots, ⌊ N^{0.7} ⌋$ and $α_{i} = 0$ for other i)
${\hat{J}}_{α} (p = 0.1, δ = 1)$
	60	70.3	81.7	90.8	64.6	78.1	86.9	53.4	66.0	77.0
	120	93.6	98.5	99.7	91.7	98.2	99.8	84.7	95.5	98.6
	240	99.5	99.9	100.0	99.4	100.0	100.0	98.8	99.9	100.0
${\hat{J}}_{α} (p = 0.1, δ = 2)$
	60	70.7	82.0	91.0	64.9	78.4	87.1	55.0	67.9	78.7
	120	93.6	98.5	99.7	91.9	98.3	99.8	84.9	95.5	98.6
	240	99.5	99.9	100.0	99.4	100.0	100.0	98.8	99.9	100.0
${\hat{J}}_{α} (p = 0.05, δ = 1)$
	60	70.5	81.9	90.9	64.8	78.3	87.0	53.8	66.2	77.7
	120	93.6	98.5	99.7	91.8	98.3	99.8	84.7	95.5	98.6
	240	99.5	99.9	100.0	99.4	100.0	100.0	98.8	99.9	100.0
${\hat{J}}_{α} (p = 0.05, δ = 2)$
	60	70.7	82.0	91.0	65.0	78.4	87.1	55.2	68.0	78.8
	120	93.6	98.5	99.7	91.9	98.3	99.8	85.0	95.5	98.6
	240	99.5	99.9	100.0	99.4	100.0	100.0	98.8	99.9	100.0

Notes: See the note to Table 2. The DGP is the same as in Table 2. Here p and δ are the parameters of the MT estimator ${\tilde{ρ}}_{i j} = {\hat{ρ}}_{i j} I [| \sqrt{v} {\hat{ρ}}_{i j} | > c_{p} (N)]$, where $c_{p} (N) = Φ^{- 1} (1 - \frac{p}{2 N^{δ}})$.
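
To make the roles of p and δ concrete, the thresholding rule in the note can be sketched as below. The function names `mt_critical_value` and `mt_threshold` are hypothetical, and the sketch applies the rule elementwise to a given matrix of estimated correlations ${\hat{ρ}}_{i j}$ with v supplied by the caller; it is an illustration of the formula, not the authors' implementation.

```python
from statistics import NormalDist
import numpy as np

def mt_critical_value(n, p=0.1, delta=1.0):
    """c_p(N) = Phi^{-1}(1 - p / (2 N^delta))."""
    return NormalDist().inv_cdf(1.0 - p / (2.0 * n**delta))

def mt_threshold(rho_hat, v, p=0.1, delta=1.0):
    """Keep rho_hat[i, j] only where |sqrt(v) * rho_hat[i, j]| exceeds c_p(N); set the rest to zero."""
    rho_hat = np.asarray(rho_hat, dtype=float)
    n = rho_hat.shape[0]
    c = mt_critical_value(n, p, delta)
    keep = np.abs(np.sqrt(v) * rho_hat) > c
    return np.where(keep, rho_hat, 0.0)
```

Because N enters the critical value through $N^{δ}$, raising δ from 1 to 2 tightens the threshold much more than halving p does, which is one way to read the pattern in Table 6, where the results respond slightly to δ and hardly at all to p.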

Finally, we conduct an experiment to check the effectiveness of the standardization employed in the ${\hat{J}}_{α}$ statistic. In particular, we check the effectiveness of the centering $t_{i}^{2} - v / (v - 2)$ employed by the ${\hat{J}}_{α}$ test compared with the centering $t_{i}^{2} - 1$ employed by GOS, and the usefulness of estimating the cross-correlation of $t_{i}^{2}$ with the MT estimator ${\tilde{ρ}}_{N}$. For this purpose, two variants of the test, ${\tilde{J}}_{α}$ and $J_{α} (0)$, are considered in addition to the ${\hat{J}}_{α}$ statistic. The first, ${\tilde{J}}_{α}$, is identical to ${\hat{J}}_{α}$ but replaces $t_{i}^{2} - v / (v - 2)$ by $t_{i}^{2} - 1$. The second, $J_{α} (0)$, sets ${\tilde{ρ}}_{N}$ equal to zero (i.e., does not control for cross-correlation). To investigate the behavior of the ${\hat{J}}_{α}$ test in more challenging environments, we consider larger values of N, namely $N = 500, 1000, 2000$, and 5000, while T is set to 60, 120, and 240 as before. The results, reported in Table 7, reveal that the centering using $v / (v - 2)$ and the control of error cross-correlations by the MT estimator both play a very significant role in controlling the size of the test for large N (and large T as shown in Panel A of Table 2).

Table 7.

Size of the ${\hat{J}}_{α}$ tests, for very large N with normal and non-normal errors

		$δ_{γ} = 0$				$δ_{γ} = 1 / 4$				$δ_{γ} = 1 / 2$
	(T, N)	500	1000	2000	5000	500	1000	2000	5000	500	1000	2000	5000
Panel A: Normal errors
${\tilde{J}}_{α}$
	60	14.5	19.4	29.4	52.4	13.0	19.3	29.5	53.3	14.3	18.7	28.2	51.5
	120	8.6	9.2	12.5	21.7	8.9	8.9	11.7	21.6	8.7	9.1	11.1	19.1
	240	6.6	7.4	7.1	11.3	6.9	7.1	7.7	11.7	6.7	7.1	7.0	10.8
$J_{α} (0)$
	60	6.9	5.3	4.3	5.2	5.5	5.7	5.2	5.1	7.5	7.4	6.9	7.8
	120	5.1	4.4	4.9	5.0	5.7	4.5	5.3	4.6	7.1	6.1	5.8	7.2
	240	5.0	5.0	4.2	5.2	5.1	5.1	4.1	5.0	6.9	6.6	6.0	7.1
${\hat{J}}_{α}$
	60	6.8	5.3	4.2	5.1	5.5	5.6	5.1	5.1	6.5	6.3	6.1	7.2
	120	5.1	4.2	4.8	5.0	5.6	4.4	5.2	4.5	5.6	4.5	4.4	5.8
	240	5.0	5.0	4.1	5.1	5.0	5.1	4.1	5.0	5.7	5.2	4.3	5.6

Panel B: Non-normal errors
${\tilde{J}}_{α}$
	60	13.7	18.5	28.1	52.0	13.1	17.7	28.6	51.3	12.6	18.5	25.7	49.7
	120	9.0	10.1	12.2	21.2	9.4	9.5	12.4	21.7	8.7	9.6	11.7	19.9
	240	6.3	7.3	7.9	12.2	6.7	7.4	7.5	12.2	7.7	7.7	8.1	10.0
$J_{α} (0)$
	60	5.6	5.0	4.1	4.1	4.9	4.6	4.0	4.6	7.3	5.9	6.1	5.8
	120	5.7	5.4	4.8	4.7	5.3	4.8	5.1	4.9	7.7	6.2	5.7	6.0
	240	5.2	5.4	4.7	5.4	5.3	4.8	4.5	5.3	7.7	7.2	6.0	6.5
${\hat{J}}_{α}$
	60	5.5	5.0	4.0	4.0	4.9	4.5	4.0	4.6	6.4	5.3	5.4	5.4
	120	5.6	5.4	4.6	4.7	5.2	4.7	5.0	4.9	6.4	4.7	4.5	4.9
	240	5.2	5.4	4.6	5.4	5.1	4.8	4.4	5.2	6.2	5.7	4.7	5.0

Notes: See the note to Table 2. The DGPs are the same as in Table 2 for normal errors and in Table 3 for non-normal errors. For comparison with ${\hat{J}}_{α}$, we also provide results for the ${\tilde{J}}_{α}$ test, which controls for error cross-correlations in the same way as the ${\hat{J}}_{α}$ test but demeans $t_{i}^{2}$ by 1 rather than by $v / (v - 2)$. The $J_{α} (0)$ test is defined by Equation (61) with $ρ_{N}^{2} = 0$, and so does not control for error cross-correlations.
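
A back-of-the-envelope calculation helps explain why centering at 1 fails for large N. The sketch below rests on simplifying assumptions that are not part of the experiments: the $t_{i}$ are treated as independent Student t variates with v degrees of freedom, and v is taken to be T - m - 1 with m = 3 factors. Under these assumptions $E (t_{i}^{2}) = v / (v - 2)$, so the sum $N^{- 1 / 2} \sum_{i} (t_{i}^{2} - 1)$ has a mean that grows with $\sqrt{N}$. The function names are hypothetical.

```python
import math

def mean_t_squared(v):
    """E[t^2] for a Student t variate with v > 2 degrees of freedom."""
    return v / (v - 2.0)

def var_t_squared(v):
    """Var[t^2] for v > 4, using E[t^4] = 3 v^2 / ((v - 2)(v - 4))."""
    m2 = mean_t_squared(v)
    m4 = 3.0 * v**2 / ((v - 2.0) * (v - 4.0))
    return m4 - m2**2

def standardized_drift(n, v):
    """Mean of N^{-1/2} * sum_i (t_i^2 - 1), in units of the standard deviation of t_i^2."""
    return math.sqrt(n) * (mean_t_squared(v) - 1.0) / math.sqrt(var_t_squared(v))
```

With N = 5000, the drift is about 1.7 standard deviations for v = 56 (T = 60) and about 0.4 for v = 236 (T = 240). This is qualitatively in line with the ${\tilde{J}}_{α}$ rows of Table 7, where the size distortion increases with N and falls with T.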

6 Empirical Application

6.1 Data Description

We consider the application of our proposed ${\hat{J}}_{α}$ test to the securities in the S&P 500 index of large-cap U.S. equities. Since the index is primarily intended as a leading indicator of U.S. equities, the composition of the index is monitored by S&P to ensure the widest possible overall market representation while keeping index turnover to a minimum. Changes to the composition of the index are governed by published guidelines. In particular, a company's security is included if the company's market capitalization currently exceeds US$5.3 billion, the company is financially viable, and at least 50% of its equity is publicly floated. Companies that substantially violate one or more of the criteria for index inclusion, or that are involved in a merger, acquisition, or significant restructuring, are replaced by other companies.

To account for changes in the composition of the index over time, we compiled returns on all the 500 securities that constitute the S&P 500 index each month over the period January 1984 to April 2018. The monthly return of security i for month t is computed as $r_{i t} = 100 (P_{i t} - P_{i, t - 1}) / P_{i, t - 1} + {DY}_{i t} / 12$, where $P_{i t}$ is the end-of-month price of the security and ${DY}_{i t}$ is the percent per annum dividend yield on the security. Note that the index i depends on the month, τ say, in which security i is a constituent of the S&P 500; this dependence is suppressed for notational simplicity.
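
The return definition can be illustrated with a short sketch. The function name `monthly_returns` and the input layout (two equal-length lists of end-of-month prices and annualized dividend yields in percent) are assumptions made for the example.

```python
def monthly_returns(prices, dividend_yields):
    """Percent monthly returns: 100 * (P_t - P_{t-1}) / P_{t-1} + DY_t / 12."""
    if len(prices) != len(dividend_yields):
        raise ValueError("prices and dividend_yields must have the same length")
    returns = []
    for t in range(1, len(prices)):
        price_change = 100.0 * (prices[t] - prices[t - 1]) / prices[t - 1]
        # DY_t is quoted in percent per annum, so one twelfth accrues per month.
        returns.append(price_change + dividend_yields[t] / 12.0)
    return returns
```

For example, a price move from 100 to 110 with a dividend yield of 2.4% per annum gives a monthly return of 10 + 0.2 = 10.2 percent.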

The time-series data on the safe rate of return and the market factors are obtained from Ken French’s data library web page. The one-month U.S. treasury bill rate is chosen as the risk-free rate ($r_{f t}$), and the value-weighted return on all NYSE, AMEX, and NASDAQ stocks (from CRSP) is used as a proxy for the market return ($r_{m t}$). The remaining factors are ${SMB}_{t}$, the average return on the three small portfolios minus the average return on the three big portfolios; ${HML}_{t}$, the average return on two value portfolios minus the average return on two growth portfolios; ${RMW}_{t}$, the difference between the returns on diversified portfolios of stocks with robust and weak profitability; and ${CMA}_{t}$, the difference between the returns on diversified portfolios of the stocks of low and high investment firms. SMB, HML, RMW, and CMA are based on the stocks listed on the NYSE, AMEX, and NASDAQ. All data are measured in percent per month. See Section M1.3 in the Supplementary Material for further details.

6.2 Month End Test Results (September 1989–April 2018)

Encouraged by the satisfactory performance of the ${\hat{J}}_{α}$ test, even in cases where N is much larger than T, we apply the ${\hat{J}}_{α}$ test that allows for non-Gaussian and cross-correlated errors to all securities in the S&P 500 index at the end of each month spanning the period September 1989–April 2018.²² In this way, we minimize the possibility of survivorship bias since the sample of securities considered at the end of each month is decided in real time. As far as the choice of T is concerned, to reduce the impact of possible time variations in betas, we select a relatively short time period of T = 60 months. Accordingly, we estimated the CAPM, Fama and French (1993) three factor (FF3), and Fama and French (2015) five factor (FF5) regressions. The estimated FF5 regression is

$$\begin{array}{l} r_{i, τ t} - r_{f, τ t} = {\hat{α}}_{i τ} + {\hat{β}}_{1, i τ} (r_{m, τ t} - r_{f, τ t}) + {\hat{β}}_{2, i τ} {SMB}_{τ t} + {\hat{β}}_{3, i τ} {HML}_{τ t} \\ \quad + {\hat{β}}_{4, i τ} {RMW}_{τ t} + {\hat{β}}_{5, i τ} {CMA}_{τ t} + {\hat{u}}_{i, τ t}, \end{array} \qquad (74)$$

for $t = 1, 2, \dots, 60$, $i = 1, 2, \dots, N_{τ}$, and the month ends, τ, from September 1989 to April 2018. The CAPM regression includes the first factor and the FF3 regression uses the first three factors in Equation (74) as regressors, respectively. All securities in the S&P 500 index are included except those with less than 60 months of observations and/or with five consecutive zeros in the middle of sample periods. See the Supplementary Material for discussions on the statistical properties of the regression residuals.
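
For a single month end τ, the first-pass regressions in Equation (74) amount to N separate time-series OLS regressions on a common set of factors. The sketch below computes the estimated alphas and their conventional OLS t-ratios with v = T - m - 1 degrees of freedom. It is an illustrative sketch: the function name `alpha_t_ratios` and the array layout are assumptions, and the construction of the ${\hat{J}}_{α}$ statistic from these t-ratios is not reproduced here.

```python
import numpy as np

def alpha_t_ratios(excess_returns, factors):
    """OLS intercepts and t-ratios from regressing each column of a T x N array on T x m factors."""
    y = np.asarray(excess_returns, dtype=float)     # T x N excess returns
    f = np.asarray(factors, dtype=float)            # T x m factors
    T, m = f.shape
    X = np.column_stack([np.ones(T), f])            # intercept plus factors
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)    # (m + 1) x N coefficients
    resid = y - X @ coef
    v = T - m - 1                                   # degrees of freedom
    s2 = (resid**2).sum(axis=0) / v                 # residual variance per security
    xtx_inv_00 = np.linalg.inv(X.T @ X)[0, 0]       # intercept element of (X'X)^{-1}
    alpha = coef[0]
    t_ratios = alpha / np.sqrt(s2 * xtx_inv_00)
    return alpha, t_ratios, v
```

With T = 60, the same function covers the CAPM (m = 1), FF3 (m = 3), and FF5 (m = 5) cases by passing the corresponding factor columns.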

Table 8 reports the rejection frequencies of the ${\hat{J}}_{α}$ and GOS tests based on the CAPM, FF3, and FF5 models over the month ends, for the full sample period and for three market disruption periods: (1) the Asian financial crisis (1997M07–1998M12), (2) the Dot-com bubble burst (2000M03–2002M10), and (3) the Great Recession (2007M12–2009M06). Depending on the factor model (CAPM, FF3, or FF5) and nominal size (5% or 1%) considered, over the full sample the ${\hat{J}}_{α}$ test rejects the null hypothesis $H_{0} : α_{i} = 0$ in 24% to 30% of the tests carried out, which is much lower than the rejection rates of the GOS test, which lie between 39% and 72%. The high rejection rates of the GOS test, and their wide range, may be due to the tendency of this test to over-reject when T is relatively small, as documented by the Monte Carlo experiments in Section 5.

Table 8.

Empirical application: rejection frequencies of the ${\hat{J}}_{α}$ and GOS tests

Test	${\hat{J}}_{α}$ test			GOS test
Factor models	CAPM	FF3	FF5	CAPM	FF3	FF5
Significance level of 0.05
Full sample period (1989M09–2018M04)	0.28	0.27	0.30	0.42	0.57	0.72
Three market disruption periods:
(1) Asian financial crisis (1997M07–1998M12)	0.06	0.22	0.39	0.33	0.83	1.00
(2) The Dot-com Bubble Burst (2000M03–2002M10)	0.00	0.50	0.66	0.09	0.72	1.00
(3) The Great Recession (2007M12–2009M06)	0.84	0.95	0.74	1.00	1.00	0.95
Significance level of 0.01
Full sample period (1989M09–2018M04)	0.24	0.27	0.24	0.39	0.49	0.62
Three market disruption periods:
(1) Asian financial crisis (1997M07–1998M12)	0.00	0.11	0.28	0.28	0.83	0.67
(2) The Dot-com Bubble Burst (2000M03–2002M10)	0.00	0.25	0.56	0.03	0.59	1.00
(3) The Great Recession (2007M12–2009M06)	0.79	0.84	0.68	0.95	1.00	0.89

Notes: This table provides rejection frequencies of the ${\hat{J}}_{α}$ and GOS tests with the significance levels of 0.05 and 0.01, applied to CAPM, FF3, and FF5 regressions of securities in the S&P 500 index using rolling T = 60 monthly estimation windows over the month ends during the full sample period and during the three market disruption periods.
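
The entries of Table 8 are shares of month ends at which the null is rejected. As a minimal sketch, assuming the test outcome for each month end is summarized by a p-value stored under a label of the form `YYYYMmm` (the function name, the use of p-values, and the label format are all assumptions for illustration), the frequencies for the full sample or a sub-period could be computed as:

```python
def rejection_frequency(p_values, level, start=None, end=None):
    """Share of month ends in [start, end] whose p-value is below `level`.

    p_values maps labels such as '1997M07' to p-values; zero-padded labels
    of this form sort chronologically under plain string comparison.
    """
    selected = [
        p for month, p in p_values.items()
        if (start is None or month >= start) and (end is None or month <= end)
    ]
    if not selected:
        raise ValueError("no month ends in the requested period")
    return sum(p < level for p in selected) / len(selected)
```

The Asian financial crisis window (1997M07–1998M12) contains 18 month ends, so the frequencies reported for that period move in steps of 1/18.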

Table 8.

Empirical application: rejection frequencies of the ${\hat{J}}_{α}$ and GOS tests

Test	${\hat{J}}_{α}$ test			GOS test
Factor models	CAPM	FF3	FF5	CAPM	FF3	FF5
Significance level of 0.05
Full sample period (1989M09–2018M04)	0.28	0.27	0.30	0.42	0.57	0.72
Three market disruption periods:
(1) Asian financial crisis (1997M07–1998M12)	0.06	0.22	0.39	0.33	0.83	1.00
(2) The Dot-com Bubble Burst (2000M03–2002M10)	0.00	0.50	0.66	0.09	0.72	1.00
(3) The Great Recession (2007M12–2009M06)	0.84	0.95	0.74	1.00	1.00	0.95
Significance level of 0.01
Full sample period (1989M09–2018M04)	0.24	0.27	0.24	0.39	0.49	0.62
Three market disruption periods:
(1) Asian financial crisis (1997M07–1998M12)	0.00	0.11	0.28	0.28	0.83	0.67
(2) The Dot-com Bubble Burst (2000M03–2002M10)	0.00	0.25	0.56	0.03	0.59	1.00
(3) The Great Recession (2007M12–2009M06)	0.79	0.84	0.68	0.95	1.00	0.89

As expected, rejection rates in the top panel of Table 8 (based on the 5% level) are larger than those in the bottom panel (based on the 1% level), but the differences are of second-order importance, particularly when compared with the choice of the underlying asset pricing model. Focusing on the test results based on the 5% level, we note wide variations in the test outcomes across models (CAPM, FF3, and FF5), particularly in the case of the sub-samples representing the Asian Financial Crisis and the Dot-com Bubble. The test outcomes for these two sub-samples critically depend on the choice of the asset pricing model, although, as with the full sample results, the GOS test gives much larger rejection rates. Given the sensitivity of the test outcomes to the choice of the asset pricing model, no firm conclusions can be drawn in relation to these two financial crises. The ${\hat{J}}_{α}$ test leads to substantial rejections only in the case of the Dot-com Bubble period and when the test is based on the FF5 model.

The situation is very different when we consider the Great Recession period, where we find substantial rejection of the null of market efficiency irrespective of the model choice. Using the ${\hat{J}}_{α}$ test, there is no clear pattern to the rejection rates across the models, with CAPM giving a rejection rate of 84%, compared with 95% for FF3 and 74% for FF5. The GOS rejection rates are much higher (100% for CAPM and FF3, and 95% for FF5). Due to its over-rejection tendency, the GOS test seems to be less discriminatory when we compare the GOS rejection rates across the different sample periods. This is particularly so in the case of the GOS tests based on the FF5 model. Overall, both tests provide strong evidence of pricing errors during the Great Recession, but the ${\hat{J}}_{α}$ test appears to provide more sensible results than the GOS test in this application.

7 Conclusion

In this article, we propose a simple test of LFPMs, the ${\hat{J}}_{α}$ test, when the number of securities, N, is large relative to the time dimension, T, of the return series. It is shown that the ${\hat{J}}_{α}$ test is more robust against error cross-sectional correlation than the SW tests based on an adaptive thresholding estimator of V, which is considered by Fan, Liao, and Yao (2015). It allows N to be much larger than T, when compared with alternative tests proposed in the literature. The proposed test also allows for a wide class of error dependencies including mixed weak-factor spatial autoregressive processes, and is shown to be robust to random time-variations in betas.

Using Monte Carlo experiments, designed specifically to match the distributional features of the residuals of Fama–French three factor regressions of individual securities in the S&P 500 index, we show that the proposed ${\hat{J}}_{α}$ test performs well even when N is much larger than T, and outperforms other existing tests such as the tests of GOS et al. (2015) and GL. Also, in cases where N < T and the standard F-test due to GRS can be computed, we still find that the ${\hat{J}}_{α}$ test has much higher power, especially when T is relatively small.

Application of the ${\hat{J}}_{α}$ test to all securities in the S&P 500 index with 60 months of return data at the end of each month over the period September 1989–April 2018 clearly illustrates the utility of the proposed test. Statistically significant evidence against Sharpe–Lintner CAPM and Fama–French three and five factor models is found mainly during periods of financial crises and market disruptions.

Supplemental Data

Supplemental data are available at https://www.datahostingsite.com.

Appendix: Proofs of the Theorems

In this Appendix, we provide proofs of the theorems set out in Section 4 of the article. These proofs make use of lemmas which are provided, together with their proofs, in the Supplementary Material.

For further clarity and convenience, we summarize some repeatedly used notations below:

M_{G} = (m_{t t^{'}}) = I_{T} - P_{G}, P_{G} = G {(G^{'} G)}^{- 1} G^{'}, G = (τ_{T}, F), v = Tr (M_{G}) = T - m - 1,

(A.1)

\begin{array}{l} M_{F} = (m_{F, t t^{'}}) = I_{T} - F {(F^{'} F)}^{- 1} F^{'}, H_{F} = h h^{'} = (h_{t} h_{t^{'}}) \\ with h = (h_{t}) = M_{F} τ_{T}, w_{T} = Tr (H_{F}) = h^{'} h = τ_{T}^{'} M_{F} τ_{T}, \end{array}

(A.2)

where F is a T × m matrix and $τ_{T} = {(1, 1, \dots, 1)}^{'}$ is a $T \times 1$ vector of ones. Also, before providing a proof of Theorem 1, we state a theorem due to Kelejian and Prucha (2001, KP) which is used to establish it.
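As a purely illustrative numerical check of the notation in (A.1) and (A.2), the following Python sketch builds the projection matrices for simulated factors and confirms the trace identities $Tr (M_{G}) = T - m - 1$ and $Tr (H_{F}) = h^{'} h = τ_{T}^{'} M_{F} τ_{T}$. The factor matrix is random placeholder data, and the helper name `annihilator`, the seed, and the choice T = 60, m = 3 are assumptions made for the example only.

```python
import numpy as np

def annihilator(Z):
    """Return I - Z (Z'Z)^{-1} Z' for a full-column-rank T x q matrix Z."""
    T = Z.shape[0]
    return np.eye(T) - Z @ np.linalg.solve(Z.T @ Z, Z.T)

rng = np.random.default_rng(0)    # arbitrary seed, illustration only
T, m = 60, 3                      # assumed window length and number of factors
F = rng.standard_normal((T, m))   # placeholder factors, not real data
tau = np.ones(T)                  # T x 1 vector of ones
G = np.column_stack([tau, F])     # G = (tau_T, F)

M_G = annihilator(G)              # M_G = I_T - P_G
M_F = annihilator(F)              # M_F = I_T - F (F'F)^{-1} F'
h = M_F @ tau                     # h = M_F tau_T
H_F = np.outer(h, h)              # H_F = h h'
v = np.trace(M_G)                 # should equal T - m - 1
w_T = h @ h                       # should equal tau' M_F tau
```

Because $M_{F}$ is symmetric and idempotent, `h @ h` and `tau @ M_F @ tau` agree up to rounding error.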

Lemma 1 (Central Limit Theorem for Linear Quadratic Forms):

Consider the following linear quadratic form

Q_{N} = ε^{'} A ε + b^{'} ε = \sum_{i = 1}^{N} \sum_{j = 1}^{N} a_{i j} ε_{i} ε_{j} + \sum_{i = 1}^{N} b_{i} ε_{i},

where $\{ε_{i}, i = 1, 2, \dots, N\}$ are real-valued random variables, and $a_{i j}$ and $b_{i}$ denote real-valued coefficients of the quadratic and linear forms. Suppose the following assumptions hold:

Assumption KP1: $ε_{i}$, for $i = 1, 2, \dots, N$, have zero means and are independently distributed across i.

Assumption KP2: A is symmetric and $\sup_{i} \sum_{j = 1}^{N} | a_{i j} | < K$. Also, $N^{- 1} \sum_{i = 1}^{N} | b_{i} |^{2 + ε_{0}} < K$ for some $ε_{0} > 0$.

Assumption KP3: $\sup_{i} E | ε_{i} |^{4 + ε_{0}} < K$ for some $ε_{0} > 0$.

Then, assuming that $N^{- 1} Var (Q_{N}) \geq c$ for some c > 0,

\frac{Q_{N} - E (Q_{N})}{\sqrt{Var (Q_{N})}} \to_{d} N (0, 1) .

Proof: See KP (Theorem 1, p. 227). ▪
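The content of Lemma 1 can be illustrated by a small simulation. The sketch below is not part of KP's proof; it assumes Gaussian $ε_{i}$ with unit variance, a tridiagonal symmetric A, and constant $b_{i}$, all chosen only for convenience. Under these assumptions $E (Q_{N}) = Tr (A)$ and $Var (Q_{N}) = 2 Tr (A^{2}) + b^{'} b$, so the standardized draws should have mean close to zero and variance close to one.

```python
import math
import random

def simulate_standardized_q(N, reps, seed=0):
    """Draw `reps` values of (Q_N - E Q_N) / sqrt(Var Q_N) with Gaussian eps."""
    rng = random.Random(seed)
    # Assumed design: a_ii = 1, a_{i,i+1} = a_{i+1,i} = 0.25, b_i = 0.5.
    diag, off, b = 1.0, 0.25, 0.5
    mean_q = N * diag                             # E(Q) = Tr(A) when Var(eps_i) = 1
    tr_a2 = N * diag**2 + 2 * (N - 1) * off**2    # Tr(A^2) for the tridiagonal A
    var_q = 2.0 * tr_a2 + N * b**2                # Gaussian case: 2 Tr(A^2) + b'b
    draws = []
    for _ in range(reps):
        eps = [rng.gauss(0.0, 1.0) for _ in range(N)]
        q = sum(diag * e * e + b * e for e in eps)
        q += sum(2.0 * off * eps[i] * eps[i + 1] for i in range(N - 1))
        draws.append((q - mean_q) / math.sqrt(var_q))
    return draws

draws = simulate_standardized_q(N=200, reps=2000)
sample_mean = sum(draws) / len(draws)
sample_var = sum((d - sample_mean) ** 2 for d in draws) / (len(draws) - 1)
```

The row sums of |a_ij| are bounded (at most 1.5 here), so Assumption KP2 holds for this design by construction.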

Proof of Theorem 1:

Noting that $H_{F} = h h^{'}$, where $h = {(h_{1}, h_{2}, \dots, h_{T})}^{'} = M_{F} τ_{T}$, we can write $z_{i}^{2} = w_{T}^{- 1} ξ_{i}^{'} H_{F} ξ_{i}$ with $w_{T} = τ_{T}^{'} M_{F} τ_{T}$. Then,

\sum_{i = 1}^{N} z_{i}^{2} = w_{T}^{- 1} \sum_{i = 1}^{N} ξ_{i}^{'} H_{F} ξ_{i} = w_{T}^{- 1} {(\sum_{t = 1}^{T} u_{t} h_{t})}^{'} D_{σ}^{- 1} (\sum_{t = 1}^{T} u_{t} h_{t}),

where $D_{σ} = diag (σ_{11}, σ_{22}, \dots, σ_{N N})$. Using Equation (54),

\begin{array}{l} N^{- 1 / 2} \sum_{i = 1}^{N} z_{i}^{2} = w_{T}^{- 1} \sum_{i = 1}^{N} N^{- 1 / 2} ξ_{i}^{'} H_{F} ξ_{i} \\ = w_{T}^{- 1} {[N^{- 1 / 2} \sum_{t = 1}^{T} (Γ v_{t} + η_{t}) h_{t}]}^{'} D_{σ}^{- 1} [\sum_{t = 1}^{T} (Γ v_{t} + η_{t}) h_{t}] \\ = a_{N T} + 2 b_{N T} + c_{N T}, \end{array}

(A.3)

where

\begin{array}{l} a_{N T} = w_{T}^{- 1} N^{- 1 / 2} (\sum_{t = 1}^{T} h_{t} v_{t}^{'} Γ^{'}) D_{σ}^{- 1} (\sum_{t = 1}^{T} h_{t} Γ v_{t}), \\ b_{N T} = w_{T}^{- 1} N^{- 1 / 2} (\sum_{t = 1}^{T} h_{t} v_{t}^{'} Γ^{'}) D_{σ}^{- 1} (\sum_{t = 1}^{T} h_{t} η_{t}), and \\ c_{N T} = w_{T}^{- 1} N^{- 1 / 2} (\sum_{t = 1}^{T} h_{t} η_{t}^{'}) D_{σ}^{- 1} (\sum_{t = 1}^{T} h_{t} η_{t}) . \end{array}

(A.4)

Consider the first term, a_NT, and note that

\begin{array}{l} a_{N T} = w_{T}^{- 1} N^{- 1 / 2} \sum_{t = 1}^{T} \sum_{r = 1}^{T} h_{t} h_{r} v_{t}^{'} Γ^{'} D_{σ}^{- 1} Γ v_{r} \\ = w_{T}^{- 1} N^{- 1 / 2} \sum_{t = 1}^{T} \sum_{r = 1}^{T} h_{t} h_{r} (\sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} v_{t} v_{r}^{'} {\tilde{γ}}_{i}), \end{array}

where

{\tilde{γ}}_{i} = \frac{γ_{i}}{\sqrt{σ_{i i}}} = \frac{γ_{i}}{\sqrt{γ_{i}^{'} γ_{i} + σ_{η, i i}}} .

(A.5)

Equivalently, letting $d_{T} = w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} v_{t}$, and noting that for any conformable real symmetric positive semi-definite matrices A and B, $Tr (A B) \leq Tr (A) λ_{max} (B)$ (this result is repeatedly used below), we have

\begin{array}{l} a_{N T} = N^{- 1 / 2} \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} [(w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} v_{t}) {(w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} v_{t})}^{'}] {\tilde{γ}}_{i} = N^{- 1 / 2} \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} d_{T} d_{T}^{'} {\tilde{γ}}_{i} \\ \leq (N^{- 1 / 2} \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{i}) λ_{max} (d_{T} d_{T}^{'}) \leq (N^{- 1 / 2} \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{i}) (d_{T}^{'} d_{T}) . \end{array}
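The trace inequality is applied here with a rank-one matrix $d d^{'}$, for which $λ_{max} (d d^{'}) = d^{'} d$. A minimal numerical sketch of this special case follows; the loadings and the vector `d` are random placeholders, and the dimensions are assumptions chosen for illustration.

```python
import random

rng = random.Random(1)   # arbitrary seed
N, k = 50, 2             # assumed numbers of securities and latent factors

# Rows play the role of the scaled loadings gamma-tilde_i; d plays the role of d_T.
gam = [[rng.uniform(-1.0, 1.0) for _ in range(k)] for _ in range(N)]
d = [rng.gauss(0.0, 1.0) for _ in range(k)]

# Left-hand side: sum_i gamma_i' d d' gamma_i = sum_i (gamma_i' d)^2.
lhs = sum(sum(g[s] * d[s] for s in range(k)) ** 2 for g in gam)

# Right-hand side: (sum_i gamma_i' gamma_i) * (d'd), with d'd = lambda_max(d d').
tr_gg = sum(g[s] ** 2 for g in gam for s in range(k))
dd = sum(x * x for x in d)
rhs = tr_gg * dd
```

In this rank-one case the bound reduces to the Cauchy–Schwarz inequality applied term by term.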

But since $h_{t}$ are given constants such that $\sum_{t = 1}^{T} h_{t}^{2} = w_{T}$, and by assumption $v_{t} \sim IID (0, I_{k})$, we have $E (d_{T}^{'} d_{T}) = k$, so that $d_{T}^{'} d_{T} = O_{p} (1)$, and hence

a_{N T} = O_{p} (N^{- 1 / 2} \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{i}) .

Also, it is clear from Equation (A.5) that $| {\tilde{γ}}_{i s} | \leq 1$ and $| {\tilde{γ}}_{i s} | \leq | γ_{i s} |$, and

\begin{array}{l} N^{- 1 / 2} \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{i} = N^{- 1 / 2} \sum_{i = 1}^{N} \sum_{s = 1}^{k} {\tilde{γ}}_{i s}^{2} \leq N^{- 1 / 2} \sum_{s = 1}^{k} (\sum_{i = 1}^{N} | {\tilde{γ}}_{i s} |) \\ \leq N^{- 1 / 2} \sum_{s = 1}^{k} (\sum_{i = 1}^{N} | γ_{i s} |) \leq k N^{- 1 / 2} \sup_{s} \sum_{i = 1}^{N} | γ_{i s} |, \end{array}

and hence by Assumption 2, $N^{- 1 / 2} \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{i} = O (N^{δ_{γ} - 1 / 2})$, and overall $a_{N T} = O_{p} (N^{δ_{γ} - 1 / 2})$. Similarly,

\begin{array}{l} b_{N T} = w_{T}^{- 1} N^{- 1 / 2} (\sum_{t = 1}^{T} h_{t} v_{t}^{'} Γ^{'}) D_{σ}^{- 1} (\sum_{t = 1}^{T} h_{t} η_{t}) \\ = w_{T}^{- 1} N^{- 1 / 2} \sum_{t = 1}^{T} \sum_{r = 1}^{T} h_{t} h_{r} v_{t}^{'} Γ^{'} D_{σ}^{- 1} η_{r} \\ = w_{T}^{- 1} N^{- 1 / 2} \sum_{t = 1}^{T} \sum_{r = 1}^{T} h_{t} h_{r} \sum_{i = 1}^{N} (\frac{η_{i r}}{σ_{i i}^{1 / 2}}) {\tilde{γ}}_{i}^{'} v_{t} \\ = N^{- 1 / 2} (w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} v_{t}^{'}) [w_{T}^{- 1 / 2} \sum_{i = 1}^{N} \sum_{t = 1}^{T} h_{t} {\tilde{γ}}_{i} (\frac{η_{i t}}{σ_{i i}^{1 / 2}})] \\ = N^{- 1 / 2} [w_{T}^{- 1 / 2} \sum_{t = 1}^{T} \sum_{i = 1}^{N} h_{t} (d_{T}^{'} {\tilde{γ}}_{i}) (\frac{η_{i t}}{σ_{i i}^{1 / 2}})] . \end{array}

Since by assumption $η_{i t}$ and $v_{t}$ (and hence $d_{T}$) are independently distributed, it follows that $E (b_{N T}) = 0$. Consider now $Var (b_{N T})$, and note that for given values of $γ_{i}$ we have (recall that $η_{i t}$ is independent over t and $\sum_{t = 1}^{T} h_{t}^{2} = w_{T}$)

\begin{array}{l} Var (b_{N T}) = N^{- 1} w_{T}^{- 1} \sum_{t = 1}^{T} \sum_{r = 1}^{T} \sum_{i = 1}^{N} \sum_{j = 1}^{N} h_{t} h_{r} [{\tilde{γ}}_{i}^{'} E (d_{T} d_{T}^{'}) {\tilde{γ}}_{j}] E (\frac{η_{i t} η_{j r}}{σ_{i i}^{1 / 2} σ_{j j}^{1 / 2}}) \\ = N^{- 1} w_{T}^{- 1} \sum_{t = 1}^{T} \sum_{i = 1}^{N} \sum_{j = 1}^{N} h_{t}^{2} ({\tilde{γ}}_{i}^{'} E (d_{T} d_{T}^{'}) {\tilde{γ}}_{j}) (\frac{σ_{η, i j}}{σ_{i i}^{1 / 2} σ_{j j}^{1 / 2}}) \\ = N^{- 1} \sum_{i = 1}^{N} \sum_{j = 1}^{N} ({\tilde{γ}}_{i}^{'} E (d_{T} d_{T}^{'}) {\tilde{γ}}_{j}) (\frac{σ_{η, i j}}{σ_{i i}^{1 / 2} σ_{j j}^{1 / 2}}) . \end{array}

Also,

E (d_{T} d_{T}^{'}) = E [(w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} v_{t}) (w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} v_{t}^{'})] = I_{k}

and

Var (b_{N T}) = N^{- 1} \sum_{i = 1}^{N} \sum_{j = 1}^{N} ({\tilde{γ}}_{i}^{'} {\tilde{γ}}_{j}) (\frac{σ_{η, i j}}{σ_{i i}^{1 / 2} σ_{j j}^{1 / 2}}) .

Further,

| \frac{σ_{η, i j}}{σ_{i i}^{1 / 2} σ_{j j}^{1 / 2}} | = \frac{| σ_{η, i j} |}{\sqrt{(γ_{i}^{'} γ_{i} + σ_{η, i i}) (γ_{j}^{'} γ_{j} + σ_{η, j j})}} = \frac{| ρ_{η, i j} |}{\sqrt{(\frac{γ_{i}^{'} γ_{i}}{σ_{η, i i}} + 1) (\frac{γ_{j}^{'} γ_{j}}{σ_{η, j j}} + 1)}} \leq | ρ_{η, i j} | .

Therefore (recalling that $\sup_{j, s} | {\tilde{γ}}_{j s} | < K$ and $| {\tilde{γ}}_{i s} | \leq | γ_{i s} |$),

\begin{array}{l} Var (b_{N T}) \leq N^{- 1} \sum_{i = 1}^{N} \sum_{j = 1}^{N} | {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{j} | | ρ_{η, i j} | \leq N^{- 1} \sum_{i = 1}^{N} \sum_{j = 1}^{N} \sum_{s = 1}^{k} | {\tilde{γ}}_{i s} | | {\tilde{γ}}_{j s} | | ρ_{η, i j} | \\ \leq \sup_{j, s} | {\tilde{γ}}_{j s} | [N^{- 1} \sum_{s = 1}^{k} \sum_{i = 1}^{N} | {\tilde{γ}}_{i s} | (\sum_{j = 1}^{N} | ρ_{η, i j} |)] \\ \leq K N^{- 1} \sum_{s = 1}^{k} \sum_{i = 1}^{N} | γ_{i s} | (\sum_{j = 1}^{N} | ρ_{η, i j} |) . \end{array}

But Condition (57) in Assumption 3 and $σ_{η, i i} > c > 0$ imply $\sup_{j} \sum_{i = 1}^{N} | ρ_{η, i j} | < K$ (also see Equation (58)), and by Equation (53) we have $\sup_{s} \sum_{i = 1}^{N} | γ_{i s} | = O (N^{δ_{γ}})$. Then, it follows that $Var (b_{N T}) = O (N^{δ_{γ} - 1})$ and $b_{N T} = O_{p} (N^{δ_{γ} / 2 - 1 / 2})$. Therefore, $b_{N T}$ is dominated by $a_{N T}$, and using these results in Equation (A.3) we have

N^{- 1 / 2} \sum_{i = 1}^{N} z_{i}^{2} = w_{T}^{- 1} N^{- 1 / 2} (\sum_{t = 1}^{T} h_{t} η_{t}^{'}) D_{σ}^{- 1} (\sum_{t = 1}^{T} h_{t} η_{t}) + O_{p} (N^{δ_{γ} - 1 / 2}) .

(A.6)

Now using Equation (56), we can express the above as

N^{- 1 / 2} \sum_{i = 1}^{N} z_{i}^{2} = w_{T}^{- 1} N^{- 1 / 2} (\sum_{t = 1}^{T} h_{t} ε_{η, t}^{'} Q_{η}^{'}) D_{σ}^{- 1} (\sum_{t = 1}^{T} h_{t} Q_{η} ε_{η, t}) + O_{p} (N^{δ_{γ} - 1 / 2}),

where $ε_{η, t} \sim IID (0, I_{N})$. After subtracting $N^{1 / 2}$ from both sides and some re-arrangement of the terms, we obtain

\begin{array}{l} q_{N T} = N^{- 1 / 2} \sum_{i = 1}^{N} (z_{i}^{2} - 1) = N^{- 1 / 2} w_{T}^{- 1} (\sum_{t = 1}^{T} h_{t} ε_{η, t}^{'}) (Q_{η}^{'} D_{σ}^{- 1} Q_{η}) (\sum_{t = 1}^{T} h_{t} ε_{η, t}) - N^{1 / 2} + O_{p} (N^{δ_{γ} - 1 / 2}) \\ = N^{- 1 / 2} [x_{T}^{'} A x_{T} - Tr (A)] + N^{- 1 / 2} [Tr (A) - N] + O_{p} (N^{δ_{γ} - 1 / 2}), \end{array}

(A.7)

where

x_{T} = w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} ε_{η, t} and A = Q_{η}^{'} D_{σ}^{- 1} Q_{η} .

(A.8)

First consider the deterministic component of q_NT, and using Equation (55) and under Assumption 3, we have

R = \tilde{Γ} {\tilde{Γ}}^{'} + D_{σ}^{- 1 / 2} Q_{η} Q_{η}^{'} D_{σ}^{- 1 / 2},

(A.9)

where $\tilde{Γ} = {({\tilde{γ}}_{1}, {\tilde{γ}}_{2}, \dots, {\tilde{γ}}_{N})}^{'}$. Then,

Tr (R) = N = \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{i} + Tr (A) .

But, as before,

\begin{array}{l} Tr (\tilde{Γ} {\tilde{Γ}}^{'}) = \sum_{i = 1}^{N} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{i} = \sum_{i = 1}^{N} \sum_{s = 1}^{k} {\tilde{γ}}_{i s}^{2} \\ \leq \sum_{s = 1}^{k} \sum_{i = 1}^{N} | γ_{i s} | \leq k \sup_{s} \sum_{i = 1}^{N} | γ_{i s} | = O (N^{δ_{γ}}) . \end{array}

(A.10)

Hence,

N^{- 1 / 2} [Tr (A) - N] = O (N^{δ_{γ} - 1 / 2}),

and Equation (A.7) can be written as

q_{N T} = z_{N T} + O (N^{δ_{γ} - 1 / 2}) + O_{p} (N^{δ_{γ} - 1 / 2}),

(A.11)

where

z_{N T} = N^{- 1 / 2} x_{T}^{'} \tilde{A} x_{T}, with \tilde{A} = A - N^{- 1} Tr (A) I_{N} .

(A.12)

We now apply the central limit theorem for linear quadratic forms due to KP, reproduced for convenience as Lemma 1, to $z_{N T}$. We first establish the conditions required by Lemma 1. To this end, we note that $E (x_{T}) = 0$, and

\begin{array}{l} Var (x_{T}) = w_{T}^{- 1} E [(\sum_{t = 1}^{T} h_{t} ε_{η, t}) {(\sum_{t = 1}^{T} h_{t} ε_{η, t})}^{'}] \\ = w_{T}^{- 1} \sum_{t = 1}^{T} h_{t}^{2} E (ε_{η, t} ε_{η, t}^{'}) = I_{N} . \end{array}

Denote the ith element of $x_{T}$ by $x_{i, T}$ and note that it is given by $x_{i, T} = w_{T}^{- 1 / 2} \sum_{t = 1}^{T} h_{t} ε_{η, i t} = w_{T}^{- 1 / 2} h^{'} ε_{η, i}$, where $ε_{η, i} = {(ε_{η, i 1}, ε_{η, i 2}, \dots, ε_{η, i T})}^{'}$, with an abuse of the notation. Then, $x_{i, T} = w_{T}^{- 1 / 2} ε_{η, i}^{'} M_{F} τ_{T}$ and $x_{i, T}^{2} = w_{T}^{- 1} ε_{η, i}^{'} H_{F} ε_{η, i}$; hence, for a given T, the elements of $x_{T}$ have zero means, a unit variance, and are independently distributed as required by KP’s theorem. Using results on the moments of quadratic forms, it is also easily established that

E (x_{i, T}^{6}) = w_{T}^{- 3} E {(ε_{η, i}^{'} H_{F} ε_{η, i})}^{3} = 15 + O (v^{- 1}) \leq K

uniformly over i (see Lemma 11), and hence condition KP1 of the KP theorem (Lemma 1) is met. Consider now the matrix $\tilde{A}$ defined by Equation (A.12) and note that it is symmetric and we have

{‖ \tilde{A} ‖}_{\infty} = {‖ A - N^{- 1} Tr (A) I_{N} ‖}_{\infty} \leq {‖ A ‖}_{\infty} + N^{- 1} Tr (A)

and using Equation (A.8)

\begin{array}{l} {‖ \tilde{A} ‖}_{\infty} \leq {‖ Q_{η}^{'} D_{σ}^{- 1} Q_{η} ‖}_{\infty} + N^{- 1} Tr (Q_{η}^{'} D_{σ}^{- 1} Q_{η}) \\ \leq (\frac{1}{\min_{i} (σ_{i i})}) {‖ Q_{η} ‖}_{1} {‖ Q_{η} ‖}_{\infty} + N^{- 1} Tr (Q_{η}^{'} Q_{η}) λ_{max} (D_{σ}^{- 1}) \\ \leq (\frac{1}{\min_{i} (σ_{i i})}) [{‖ Q_{η} ‖}_{1} {‖ Q_{η} ‖}_{\infty} + N^{- 1} Tr (Q_{η}^{'} Q_{η})] . \end{array}

But under Condition (57), and noting that $σ_{i i} > c > 0$, we have

{‖ \tilde{A} ‖}_{\infty} = \sup_{i} \sum_{j = 1}^{N} | {\tilde{a}}_{i j} | < K,

and condition KP2 of Lemma 1 is met. To establish condition KP3, we note that

Tr (\tilde{A}) = 0, Tr ({\tilde{A}}^{2}) = Tr (A^{2}) - N^{- 1} {[Tr (A)]}^{2} .
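The two trace identities for the centred matrix $\tilde{A} = A - N^{- 1} Tr (A) I_{N}$ hold for any symmetric A and are easy to confirm numerically. The sketch below uses a random positive semi-definite matrix of the form Q'Q purely as a stand-in; the function name and the dimension are assumptions of the example.

```python
import random

def centered_trace_check(N, seed=0):
    """Return (Tr(A~), Tr(A~^2), Tr(A^2) - Tr(A)^2 / N) for a random A = Q'Q."""
    rng = random.Random(seed)
    Q = [[rng.gauss(0.0, 1.0) for _ in range(N)] for _ in range(N)]
    # A = Q'Q is symmetric and positive semi-definite.
    A = [[sum(Q[r][i] * Q[r][j] for r in range(N)) for j in range(N)]
         for i in range(N)]
    tr_a = sum(A[i][i] for i in range(N))
    tr_a2 = sum(A[i][j] * A[j][i] for i in range(N) for j in range(N))
    # A~ = A - N^{-1} Tr(A) I_N: only the diagonal changes.
    A_t = [[A[i][j] - (tr_a / N if i == j else 0.0) for j in range(N)]
           for i in range(N)]
    tr_at = sum(A_t[i][i] for i in range(N))
    tr_at2 = sum(A_t[i][j] * A_t[j][i] for i in range(N) for j in range(N))
    return tr_at, tr_at2, tr_a2 - tr_a**2 / N

tr_centered, tr_centered_sq, identity_rhs = centered_trace_check(N=6)
```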

Using Equation (A.9), let $B = D_{σ}^{- 1 / 2} Q_{η} Q_{η}^{'} D_{σ}^{- 1 / 2}$, and note that

Tr (R^{2}) = Tr (B^{2}) + Tr [{({\tilde{Γ}}^{'} \tilde{Γ})}^{2}] + 2 Tr ({\tilde{Γ}}^{'} B \tilde{Γ}) .

(A.13)

Also,

Tr ({\tilde{Γ}}^{'} B \tilde{Γ}) \leq Tr ({\tilde{Γ}}^{'} \tilde{Γ}) λ_{max} (B),

and in view of Equation (57), we have

λ_{max} (B) = λ_{max} (Q_{η}^{'} D_{σ}^{- 1} Q_{η}) \leq {‖ (Q_{η}^{'} D_{σ}^{- 1} Q_{η}) ‖}_{1} \leq (\frac{1}{\min_{i} (σ_{i i})}) {‖ Q_{η} ‖}_{1} {‖ Q_{η} ‖}_{\infty} < K,

and hence (using Equation (A.10)):

Tr ({\tilde{Γ}}^{'} B \tilde{Γ}) = O (N^{δ_{γ}}) .

(A.14)

Also (recalling that $| {\tilde{γ}}_{i s} | \leq | γ_{i s} |$),

\begin{array}{l} Tr {({\tilde{Γ}}^{'} \tilde{Γ})}^{2} = Tr {(\sum_{i = 1}^{N} {\tilde{γ}}_{i} {\tilde{γ}}_{i}^{'})}^{2} = \sum_{i = 1}^{N} \sum_{j = 1}^{N} Tr ({\tilde{γ}}_{i} {\tilde{γ}}_{i}^{'} {\tilde{γ}}_{j} {\tilde{γ}}_{j}^{'}) \\ = \sum_{i = 1}^{N} \sum_{j = 1}^{N} {({\tilde{γ}}_{i}^{'} {\tilde{γ}}_{j})}^{2} \leq \sum_{s = 1}^{k} \sum_{s^{'} = 1}^{k} \sum_{i = 1}^{N} \sum_{j = 1}^{N} | {\tilde{γ}}_{i s} {\tilde{γ}}_{j s} {\tilde{γ}}_{i s^{'}} {\tilde{γ}}_{j s^{'}} | \\ \leq \sum_{s = 1}^{k} \sum_{s^{'} = 1}^{k} \sum_{i = 1}^{N} \sum_{j = 1}^{N} | γ_{i s} | | γ_{j s} | | γ_{i s^{'}} | | γ_{j s^{'}} | \\ \leq k^{2} {(\sup_{s} \sum_{i = 1}^{N} | γ_{i s} |)}^{2} = O (N^{2 δ_{γ}}) . \end{array}

(A.15)

Hence, using Equations (A.14) and (A.15) in Equation (A.13), we have

Tr (B^{2}) = Tr (R^{2}) + O (N^{2 δ_{γ}}) .

Also, in view of Equation (A.8)

Tr (B^{2}) = Tr [D_{σ}^{- 1 / 2} Q_{η} Q_{η}^{'} D_{σ}^{- 1 / 2} D_{σ}^{- 1 / 2} Q_{η} Q_{η}^{'} D_{σ}^{- 1 / 2}] = Tr [{(Q_{η}^{'} D_{σ}^{- 1} Q_{η})}^{2}] = Tr (A^{2}) .

To summarize

Tr (A) = \sqrt{N} + O (N^{δ_{γ}}) and  Tr (A^{2}) = Tr (R^{2}) + O (N^{2 δ_{γ}}),

which also yield (recall that $δ_{γ} < 1 / 2$)

\begin{array}{l} Tr ({\tilde{A}}^{2}) = Tr (A^{2}) - N^{- 1} {[Tr (A)]}^{2} \\ = Tr (R^{2}) + O (N^{2 δ_{γ}}) - N^{- 1} {[\sqrt{N} + O (N^{δ_{γ}})]}^{2} \\ = Tr (R^{2}) + O (N^{2 δ_{γ}}) + O (N^{2 δ_{γ} - 1}) - 1. \end{array}

Therefore,

N^{- 1} Tr ({\tilde{A}}^{2}) = N^{- 1} Tr (R^{2}) + O (N^{2 δ_{γ} - 1}),

(A.16)

which is bounded in N under the assumptions that $N^{- 1} Tr (R^{2})$ is bounded in N and $0 \leq δ_{γ} < 1 / 2$. Furthermore, it is readily seen that

N^{- 1} Tr (R^{2}) = N^{- 1} \sum_{i = 1}^{N} \sum_{j = 1}^{N} ρ_{i j}^{2} = 1 + (N - 1) ρ_{N}^{2} .

Finally, using Equation (A.12)

Var (z_{N T}) = N^{- 1} Var (x_{T}^{'} \tilde{A} x_{T}) = N^{- 1} E [{(x_{T}^{'} \tilde{A} x_{T})}^{2}] .

Consider

\begin{array}{l} {(x_{T}^{'} \tilde{A} x_{T})}^{2} = w_{T}^{- 2} {(\sum_{t = 1}^{T} \sum_{t^{'} = 1}^{T} h_{t} h_{t^{'}} ε_{η, t}^{'} \tilde{A} ε_{η, t^{'}})}^{2} \\ = w_{T}^{- 2} \sum_{t = 1}^{T} \sum_{t^{'} = 1}^{T} \sum_{r = 1}^{T} \sum_{r^{'} = 1}^{T} h_{t} h_{t^{'}} h_{r} h_{r^{'}} (ε_{η, t}^{'} \tilde{A} ε_{η, t^{'}}) (ε_{η, r}^{'} \tilde{A} ε_{η, r^{'}}) . \end{array}

Since, by assumption, $ε_{η, t}$ are serially independent, then using the results on moments of the quadratic forms, we have

\begin{array}{l} E [{(ε_{η, t}^{'} \tilde{A} ε_{η, t})}^{2}] = \sum_{i = 1}^{N} \sum_{j = 1}^{N} \sum_{i^{'} = 1}^{N} \sum_{j^{'} = 1}^{N} {\tilde{a}}_{i j} {\tilde{a}}_{i^{'} j^{'}} E (ε_{η, i t} ε_{η, j t} ε_{η, i^{'} t} ε_{η, j^{'} t}) \\ = γ_{2, ε_{η}} \sum_{i = 1}^{N} {\tilde{a}}_{i i}^{2} + {(\sum_{i = 1}^{N} {\tilde{a}}_{i i})}^{2} + 2 \sum_{i = 1}^{N} \sum_{j = 1}^{N} {\tilde{a}}_{i j} {\tilde{a}}_{j i}, \end{array}

where $γ_{2, ε_{η}} = E (ε_{η, i t}^{4}) - 3$ and by assumption $| γ_{2, ε_{η}} | < K$. Also,

E [(ε_{η, t}^{'} \tilde{A} ε_{η, t}) (ε_{η, r}^{'} \tilde{A} ε_{η, r})] = {[Tr (\tilde{A})]}^{2} for t \neq r .
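The second-moment formula for the quadratic form can be checked exactly in a small case. The sketch below assumes i.i.d. Rademacher errors (values +1 and −1 with equal probability), for which the excess kurtosis is 1 − 3 = −2, and enumerates all sign patterns for a 4 × 4 symmetric matrix with zero trace. The matrix entries and function names are illustrative assumptions.

```python
import itertools

def exact_second_moment(A):
    """E[(e'Ae)^2] for i.i.d. Rademacher e, by enumerating all 2^n sign vectors."""
    n = len(A)
    total = 0.0
    for signs in itertools.product((-1.0, 1.0), repeat=n):
        q = sum(A[i][j] * signs[i] * signs[j] for i in range(n) for j in range(n))
        total += q * q
    return total / 2**n

def formula_second_moment(A, gamma2):
    """gamma2 * sum_i a_ii^2 + (Tr A)^2 + 2 * sum_ij a_ij a_ji."""
    n = len(A)
    sum_diag_sq = sum(A[i][i] ** 2 for i in range(n))
    trace = sum(A[i][i] for i in range(n))
    trace_sq = sum(A[i][j] * A[j][i] for i in range(n) for j in range(n))
    return gamma2 * sum_diag_sq + trace**2 + 2.0 * trace_sq

# Symmetric, zero-trace example matrix standing in for A-tilde (made-up entries).
A_example = [[0.5, 0.2, -0.1, 0.0],
             [0.2, -0.3, 0.4, 0.1],
             [-0.1, 0.4, 0.1, -0.2],
             [0.0, 0.1, -0.2, -0.3]]
gamma2_rademacher = 1.0 - 3.0   # E(e^4) - 3 when e = +/-1

exact_value = exact_second_moment(A_example)
formula_value = formula_second_moment(A_example, gamma2_rademacher)
```

With Gaussian errors the first term vanishes, since the excess kurtosis is then zero.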

For $r = t \neq t^{'} = r^{'}$,

\begin{array}{l} E [(ε_{η, t}^{'} \tilde{A} ε_{η, t^{'}}) (ε_{η, t}^{'} \tilde{A} ε_{η, t^{'}})] = E [(ε_{η, t^{'}}^{'} \tilde{A} ε_{η, t}) (ε_{η, t}^{'} \tilde{A} ε_{η, t^{'}})] \\ = E (ε_{η, t^{'}}^{'} \tilde{A} \tilde{A} ε_{η, t^{'}}) = Tr ({\tilde{A}}^{2}) . \end{array}

Similarly, for $r^{'} = t \neq t^{'} = r$, we have $E [(ε_{η, t}^{'} \tilde{A} ε_{η, t^{'}}) (ε_{η, t^{'}}^{'} \tilde{A} ε_{η, t})] = Tr ({\tilde{A}}^{2})$. Using these results

\begin{array}{l} w_{T}^{2} E [{(x_{T}^{'} \tilde{A} x_{T})}^{2}] = (\sum_{t = 1}^{T} h_{t}^{4}) [γ_{2, ε_{η}} \sum_{i = 1}^{N} {\tilde{a}}_{i i}^{2} + {(\sum_{i = 1}^{N} {\tilde{a}}_{i i})}^{2} + 2 \sum_{i = 1}^{N} \sum_{j = 1}^{N} {\tilde{a}}_{i j} {\tilde{a}}_{j i}] \\ + [\sum_{t = 1}^{T} \sum_{r = 1}^{T} h_{t}^{2} h_{r}^{2} - (\sum_{t = 1}^{T} h_{t}^{4})] {[Tr (\tilde{A})]}^{2} + 2 [\sum_{t = 1}^{T} \sum_{r = 1}^{T} h_{t}^{2} h_{r}^{2} - (\sum_{t = 1}^{T} h_{t}^{4})] Tr ({\tilde{A}}^{2}) . \end{array}

But $\sum_{t = 1}^{T} \sum_{r = 1}^{T} h_{t}^{2} h_{r}^{2} = {(\sum_{t = 1}^{T} h_{t}^{2})}^{2}$, $\sum_{i = 1}^{N} {\tilde{a}}_{i i} = Tr (\tilde{A}) = 0$, and $\sum_{i = 1}^{N} \sum_{j = 1}^{N} {\tilde{a}}_{i j} {\tilde{a}}_{j i} = Tr ({\tilde{A}}^{2})$, and we have

Var (z_{N T}) = N^{- 1} E [{(x_{T}^{'} \tilde{A} x_{T})}^{2}] = γ_{2, ε_{η}} w_{T}^{- 2} (N^{- 1} \sum_{i = 1}^{N} {\tilde{a}}_{i i}^{2}) (\sum_{t = 1}^{T} h_{t}^{4}) + 2 w_{T}^{- 2} {(\sum_{t = 1}^{T} h_{t}^{2})}^{2} N^{- 1} Tr ({\tilde{A}}^{2}),

and, further noting that $\sum_{t = 1}^{T} h_{t}^{2} = w_{T}$, then

Var (z_{N T}) = 2 N^{- 1} Tr ({\tilde{A}}^{2}) + \frac{γ_{2, ε_{η}} (\sum_{t = 1}^{T} h_{t}^{4})}{w_{T}^{2}} (N^{- 1} \sum_{i = 1}^{N} {\tilde{a}}_{i i}^{2}),

and using Equation (A.16)

Var (z_{N T}) = 2 N^{- 1} Tr (R^{2}) + \frac{γ_{2, ε_{η}} (\sum_{t = 1}^{T} h_{t}^{4})}{w_{T}^{2}} (N^{- 1} \sum_{i = 1}^{N} {\tilde{a}}_{i i}^{2}) + O (N^{2 δ_{γ} - 1}),

where by assumption $N^{- 1} Tr (R^{2})$ is bounded in N. Also, using Equation (S.15) in Lemma 8, $\sum_{t = 1}^{T} h_{t}^{4} = O (T)$, and

\begin{array}{l} \frac{| γ_{2, ε_{η}} | (\sum_{t = 1}^{T} h_{t}^{4})}{w_{T}^{2}} (N^{- 1} \sum_{i = 1}^{N} {\tilde{a}}_{i i}^{2}) \leq K \frac{(\sum_{t = 1}^{T} h_{t}^{4})}{w_{T}^{2}} (N^{- 1} Tr ({\tilde{A}}^{2})) \\ \leq \frac{K}{T} [N^{- 1} Tr (R^{2})] + O (T^{- 1} N^{2 δ_{γ} - 1}) = O (T^{- 1}) + O (T^{- 1} N^{2 δ_{γ} - 1}) . \end{array}

Therefore,

Var (z_{N T}) = 2 N^{- 1} Tr (R^{2}) + O (T^{- 1}) + O (N^{2 δ_{γ} - 1}),

(A.17)

which is bounded for any N and T, so long as $N^{- 1} Tr (R^{2})$ is bounded in N and $0 \leq δ_{γ} < 1 / 2$. Also, using Equation (A.11), under the same conditions, as N and $T \to \infty$, in any order,

\lim_{N, T \to \infty} Var (q_{N T}) = 2 ω^{2} > 0,

as required. This result also ensures that condition KP3 of Lemma 1 is satisfied and therefore, we also have

q_{N T} \to_{d} N (0, 2 ω^{2}),

as N and $T \to \infty$, in any order. ▪

Proof of Theorem 2:

We have

S_{N T} = N^{- 1 / 2} \sum_{i = 1}^{N} [z_{i}^{2} (1 - \frac{1}{σ_{i i}^{- 1} {\hat{σ}}_{i i}})],

(A.18)

where $z_{i}^{2} = ξ_{i}^{'} H_{F} ξ_{i} / w_{T}$, with $ξ_{i} = u_{i .} / σ_{i i}^{1 / 2}$ being the standardized error of the return equation (6), $w_{T} = τ_{T}^{'} M_{F} τ_{T}$, and ${\hat{σ}}_{i i} = {\hat{u}}_{i .}^{'} {\hat{u}}_{i .} / v$. Write $X_{i} = σ_{i i}^{- 1} {\hat{σ}}_{i i}$ and note that by assumption $σ_{i i} > 0$, and by construction only securities with ${\hat{σ}}_{i i} > c > 0$ are included in the ${\hat{J}}_{α}$ test, so that

S_{N T} = N^{- 1 / 2} \sum_{i = 1}^{N} [z_{i}^{2} (1 - \frac{1}{X_{i}})],

(A.19)

where $X_{i} = ξ_{i}^{'} M_{G} ξ_{i} / v$, with $v = T - m - 1$ and $M_{G} = (m_{t t^{'}})$ defined by Equation (A.1). Also, by Equation (37), $E (t_{i}^{2}) = E (z_{i}^{2} / X_{i}) = v / (v - 2) + O (T^{- 3 / 2})$ for each i, and by Lemma 11, $E (z_{i}^{2}) = E (ξ_{i}^{'} H_{F} ξ_{i} / w_{T}) = w_{T}^{- 1} Tr (H_{F}) = 1$ for all i. Thus, we have

E (S_{N T}) = O (\sqrt{N / T^{2}}) .

(A.20)

Next, for all $i = 1, 2, \dots, N$, we have $X_{i} > 0$, and since $1 - 1 / X_{i} = - [(1 - X_{i}) + {(1 - X_{i})}^{2} / X_{i}]$, Equation (A.19) can be written as

\begin{array}{l} S_{N T} = - N^{- 1 / 2} \sum_{i = 1}^{N} z_{i}^{2} [(1 - X_{i}) + \frac{{(1 - X_{i})}^{2}}{X_{i}}] \\ = - (S_{1, N T} + S_{2, N T}), \end{array}

where

S_{1, N T} = N^{- 1 / 2} \sum_{i = 1}^{N} z_{i}^{2} (1 - X_{i}),

(A.21)

and

S_{2, N T} = N^{- 1 / 2} \sum_{i = 1}^{N} \frac{z_{i}^{2} {(1 - X_{i})}^{2}}{X_{i}} .

(A.22)
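The split of $S_{N T}$ into a first-order and a second-order term rests on the elementary identity $1 - 1 / X = (X - 1) - {(X - 1)}^{2} / X$ for $X > 0$; the two pieces correspond, up to sign, to the summands of $S_{1, N T}$ and $S_{2, N T}$. The short sketch below checks the identity on a few arbitrary values of X and shows that the second-order piece is small when X is close to one. The function name and the grid of values are assumptions of the example.

```python
def split_terms(x):
    """Return (first_order, second_order) such that 1 - 1/x = first_order - second_order."""
    first_order = x - 1.0
    second_order = (x - 1.0) ** 2 / x
    return first_order, second_order

grid = [0.5, 0.9, 1.0, 1.1, 2.0]   # arbitrary positive values of X

# Largest discrepancy between 1 - 1/x and the two-term split over the grid.
max_gap = max(
    abs((1.0 - 1.0 / x) - (split_terms(x)[0] - split_terms(x)[1])) for x in grid
)

# Near x = 1 the second-order piece is of order (x - 1)^2.
second_order_near_one = split_terms(1.1)[1]
```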

But since $X_{i} > c > 0$ and $z_{i}^{2} {(1 - X_{i})}^{2} \geq 0$, then

| S_{2, N T} | \leq c^{- 1} N^{- 1 / 2} \sum_{i = 1}^{N} z_{i}^{2} {(1 - X_{i})}^{2}

and

E | S_{2, N T} | \leq c^{- 1} N^{1 / 2} \sup_{i} E [z_{i}^{2} {(1 - X_{i})}^{2}] .

(A.23)

But

\begin{array}{l} E [z_{i}^{2} {(1 - X_{i})}^{2}] = E (z_{i}^{2} X_{i}^{2}) - 2 E (z_{i}^{2} X_{i}) + E (z_{i}^{2}) \\ = v^{- 2} w_{T}^{- 1} E [(ξ_{i}^{'} H_{F} ξ_{i}) {(ξ_{i}^{'} M_{G} ξ_{i})}^{2}] - 2 v^{- 1} w_{T}^{- 1} E [(ξ_{i}^{'} H_{F} ξ_{i}) (ξ_{i}^{'} M_{G} ξ_{i})] + 1. \end{array}

Now using results from Lemma 11, we have

\begin{array}{l} E [(ξ_{i}^{'} H_{F} ξ_{i}) (ξ_{i}^{'} M_{G} ξ_{i})] = v w_{T} + O (v), \\ E [(ξ_{i}^{'} H_{F} ξ_{i}) {(ξ_{i}^{'} M_{G} ξ_{i})}^{2}] = v^{2} w_{T} + O (v w_{T}), \end{array}

which yields

E [z_{i}^{2} {(1 - X_{i})}^{2}] = O (T^{- 1}), uniformly across i .

(A.24)

Using this result in Equation (A.23), we obtain

E | S_{2, N T} | \leq c^{- 1} N^{1 / 2} \sup_{i} E [z_{i}^{2} {(1 - X_{i})}^{2}] = O (\frac{\sqrt{N}}{T}),

and by the Markov inequality we have $S_{2, N T} \to_{p} 0$, so long as $N / T^{2} \to 0$. Therefore, to establish $S_{N T} \to_{p} 0$, it is sufficient to show that $S_{1, N T} \to_{p} 0$. By Lemma 17, we have

N^{- 1 / 2} \sum_{i = 1}^{N} z_{i}^{2} (X_{i} - 1) = N^{- 1 / 2} \sum_{i = 1}^{N} z_{η, i}^{2} (X_{η, i} - 1) + O_{p} (N^{δ_{γ} - 1 / 2}),

where $z_{η, i}^{2} = η_{i}^{'} H_{F} η_{i} / (w_{T} σ_{η, i i}) > 0$ and $X_{η, i} = η_{i}^{'} M_{G} η_{i} / (v σ_{η, i i}) > 0$. Using results on the moments of quadratic forms, by Lemma 15, we have

N^{- 1 / 2} \sum_{i = 1}^{N} E [z_{η, i}^{2} (X_{η, i} - 1)] = \frac{\sum_{t} h_{t}^{2} m_{t t}}{v w_{T}} γ_{2, ε_{η}} N^{- 1 / 2} \sum_{i = 1}^{N} \sum_{ℓ = 1}^{N} {\tilde{q}}_{η, i ℓ}^{4},

where $γ_{2, ε_{η}} = E (ε_{η, i t}^{4}) - 3$ (and $| γ_{2, ε_{η}} | < K$ by assumption), ${\tilde{q}}_{η, i ℓ} = q_{η, i ℓ} / σ_{η, i i}^{1 / 2}$ with $q_{η, i ℓ}$ being such that $Q_{η} = (q_{η, i ℓ})$, and $Q_{η}$ is defined by Equation (56). But as $0 \leq m_{t t} \leq 1$ (with $M_{G} = (m_{t t^{'}})$) by Lemma 8, $v^{- 1} w_{T}^{- 1} \sum_{t = 1}^{T} h_{t}^{2} m_{t t} \leq v^{- 1} w_{T}^{- 1} \sum_{t = 1}^{T} h_{t}^{2} = v^{- 1}$ (since $\sum_{t = 1}^{T} h_{t}^{2} = w_{T}$), and also $0 \leq \sum_{ℓ = 1}^{N} {\tilde{q}}_{η, i ℓ}^{4} \leq 1$, as $\sum_{ℓ = 1}^{N} {\tilde{q}}_{η, i ℓ}^{2} = 1$ (since $\sum_{ℓ = 1}^{N} q_{η, i ℓ}^{2} = σ_{η, i i}$), and $| γ_{2, ε_{η}} | \leq K$, we have

N^{- 1 / 2} \sum_{i = 1}^{N} E [z_{η, i}^{2} (X_{η, i} - 1)] = O (\sqrt{N} / T) .

Furthermore,

\begin{array}{l} Var [N^{- 1 / 2} \sum_{i = 1}^{N} z_{η, i}^{2} (X_{η, i} - 1)] = \frac{1}{N} \sum_{i} V ar [z_{η, i}^{2} (X_{η, i} - 1)] \\ + \frac{1}{N} \sum_{i \neq j} C ov [z_{η, i}^{2} (X_{η, i} - 1), z_{η, j}^{2} (X_{η, j} - 1)] . \end{array}

We first note that

Var [z_{η, i}^{2} (X_{η, i} - 1)] = E [z_{η, i}^{4} {(X_{η, i} - 1)}^{2}] - {E [z_{η, i}^{2} (X_{η, i} - 1)]}^{2} .

As shown above, $E [z_{η, i}^{2} (X_{η, i} - 1)] = O (T^{- 1})$ uniformly over i. Next, consider

E [z_{η, i}^{4} {(X_{η, i} - 1)}^{2}] = E (z_{η, i}^{4} X_{η, i}^{2}) - 2 E (z_{η, i}^{4} X_{η, i}) + E (z_{η, i}^{4}) .

(A.25)

But, using results on the moments of quadratic forms, by Lemma 11, we have

E (z_{η, i}^{4}) = 3 + O (T^{- 1}), E (z_{η, i}^{4} X_{η, i}) = 3 + O (T^{- 1}) and E (z_{η, i}^{4} X_{η, i}^{2}) = 3 + O (T^{- 1}),

(A.26)

uniformly over i. Substituting Equation (A.26) into Equation (A.25), we have

E [z_{η, i}^{4} {(X_{η, i} - 1)}^{2}] = O (T^{- 1}),

therefore, $Var [z_{η, i}^{2} (X_{η, i} - 1)] = O (T^{- 1})$ uniformly over i. We conclude that

\frac{1}{N} \sum_{i} V ar [z_{η, i}^{2} (X_{η, i} - 1)] = O (T^{- 1}) .

Secondly, by Lemma 16,

\frac{1}{N} \sum_{i \neq j} C ov [z_{η, i}^{2} (X_{η, i} - 1), z_{η, j}^{2} (X_{η, j} - 1)] = O (T^{- 1}) + O (N / T^{2}) .

In sum, under Assumptions 1–3, $S_{N T} \to_{p} 0$, so long as $0 \leq δ_{γ} < 1 / 2$ and $N / T^{2} \to 0$ as N and $T \to \infty$, jointly. ▪

Proof of Theorem 3:

Under Assumptions 1–3, using Theorem 2 we have

N^{- 1 / 2} \sum_{i = 1}^{N} (z_{i}^{2} - t_{i}^{2}) / {[2 (1 + (N - 1) ρ_{N}^{2})]}^{1 / 2} \to_{p} 0,

where $z_{i}^{2}$ is defined by Equation (22), so long as $(N - 1) ρ_{N}^{2} = O (1)$, $N / T^{2} \to 0$, and $0 \leq δ_{γ} < 1 / 2$, as N and $T \to \infty$, jointly. Under these conditions (by Lemma 4), it follows that

N^{- 1 / 2} \sum_{i = 1}^{N} (t_{i}^{2} - \frac{v}{v - 2}) / {[2 (1 + (N - 1) ρ_{N}^{2})]}^{1 / 2}

has the same limit distribution as $N^{- 1 / 2} \sum_{i = 1}^{N} (z_{i}^{2} - 1) / {[2 (1 + (N - 1) ρ_{N}^{2})]}^{1 / 2}$, which is shown to be standard normal by Theorem 1, and the desired result now follows, observing that $\lim_{T \to \infty} {(\frac{v}{v - 2})}^{2} \frac{2 (v - 1)}{v - 4} = 2$. ▪
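The centring constant $v / (v - 2)$ and the factor ${(\frac{v}{v - 2})}^{2} \frac{2 (v - 1)}{v - 4}$ coincide with the mean and variance of the square of a Student-t variate with v > 4 degrees of freedom. As a sketch under that Gaussian-error benchmark (an assumption made here for illustration; the article allows non-Gaussian errors), the function below evaluates both moments and shows how quickly the variance factor approaches 2. The function name and the example values of v are hypothetical.

```python
def t_squared_moments(v):
    """Mean and variance of t^2 when t is Student-t with v > 4 degrees of freedom."""
    mean = v / (v - 2.0)
    var = (v / (v - 2.0)) ** 2 * 2.0 * (v - 1.0) / (v - 4.0)
    return mean, var

# T = 60 and m = 3 give v = T - m - 1 = 56 (window length taken from the application).
mean_56, var_56 = t_squared_moments(56.0)

# For very large v the variance factor is close to its limit of 2.
mean_large, var_large = t_squared_moments(1.0e6)
```

For v = 56 the variance factor is noticeably above 2, which is why the finite-sample factor is used in place of its limit.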

Proof of Theorem 4:

Let $ψ_{N T} = \frac{1}{N} \sum_{i, j = 1}^{N} ({\tilde{ρ}}_{i j}^{2} - ρ_{i j}^{2})$, and note that

ψ_{N T} = \frac{1}{N} \sum_{i, j = 1}^{N} ({\tilde{ρ}}_{i j} + ρ_{i j}) ({\tilde{ρ}}_{i j} - ρ_{i j}),

and since $| {\tilde{ρ}}_{i j} | < 1$ and $| ρ_{i j} | < 1$, it also follows that

| ψ_{N T} | \leq \frac{2}{N} \sum_{i, j = 1}^{N} | {\tilde{ρ}}_{i j} - ρ_{i j} | .

(A.27)

Further, letting $I_{i j} = I [| {\hat{ρ}}_{i j} | > v^{- 1 / 2} c_{p} (N)]$, we have

{\tilde{ρ}}_{i j} - ρ_{i j} = {\hat{ρ}}_{i j} I_{i j} - ρ_{i j} = [{\hat{ρ}}_{i j} - E ({\hat{ρ}}_{i j})] \times I_{i j} + [E ({\hat{ρ}}_{i j}) - ρ_{i j}] \times I_{i j} - ρ_{i j} (1 - I_{i j}),

and hence

\begin{array}{l} \frac{1}{2} E | ψ_{N T} | \leq \frac{1}{N} \sum_{i, j = 1}^{N} E (| {\hat{ρ}}_{i j} - E ({\hat{ρ}}_{i j}) | \times I_{i j}) + \frac{1}{N} \sum_{i, j = 1}^{N} | E ({\hat{ρ}}_{i j}) - ρ_{i j} | E (I_{i j}) \\ + \frac{1}{N} \sum_{i, j = 1}^{N} | ρ_{i j} | [1 - E (I_{i j})] = A_{1} + A_{2} + A_{3} . \end{array}

(A.28)
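The thresholding step and the bound in (A.27) can be sketched as follows. The critical value $c_{p} (N)$ is defined in the main text and not in this appendix, so it is passed in as a plain number `c`; the value 3.0 used below is a hypothetical placeholder, as are the 3 × 3 correlation matrices and the function names.

```python
def threshold_corr(rho_hat, v, c):
    """Keep rho_hat[i][j] only when |rho_hat[i][j]| > c / sqrt(v); otherwise set it to 0."""
    cut = c / v ** 0.5
    n = len(rho_hat)
    return [[rho_hat[i][j] if abs(rho_hat[i][j]) > cut else 0.0
             for j in range(n)] for i in range(n)]

def psi(rho_tilde, rho):
    """psi_NT = N^{-1} * sum_ij (rho_tilde_ij^2 - rho_ij^2)."""
    n = len(rho)
    return sum(rho_tilde[i][j] ** 2 - rho[i][j] ** 2
               for i in range(n) for j in range(n)) / n

def psi_bound(rho_tilde, rho):
    """Right-hand side of (A.27): (2/N) * sum_ij |rho_tilde_ij - rho_ij|."""
    n = len(rho)
    return 2.0 * sum(abs(rho_tilde[i][j] - rho[i][j])
                     for i in range(n) for j in range(n)) / n

# Made-up true and estimated correlations for three securities.
rho_true = [[1.0, 0.6, 0.0], [0.6, 1.0, 0.0], [0.0, 0.0, 1.0]]
rho_est = [[1.0, 0.62, 0.05], [0.62, 1.0, -0.12], [0.05, -0.12, 1.0]]

rho_thr = threshold_corr(rho_est, v=56, c=3.0)   # c = 3.0 is a placeholder for c_p(N)
psi_value = psi(rho_thr, rho_true)
bound_value = psi_bound(rho_thr, rho_true)
```

In this toy example the small off-diagonal estimates are set to zero while the larger one survives, and the absolute value of `psi_value` stays below `bound_value`.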

Now using Equation (41), we note that

{\hat{ρ}}_{i j} = \frac{u_{i .}^{'} M_{G} u_{j .}}{{(u_{i .}^{'} M_{G} u_{i .})}^{1 / 2} {(u_{j .}^{'} M_{G} u_{j .})}^{1 / 2}},

where ${\hat{u}}_{i .} = M_{G} u_{i .}$. Also, since $M_{G}$ is a T × T idempotent matrix of rank $v = T - m - 1$, there exists an orthogonal T × T transformation matrix L (with $L L^{'} = I_{T}$), defined by

L M_{G} L^{'} = (\begin{array}{cc} I_{v} & 0 \\ 0 & 0 \end{array}) .

(A.29)
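One way to obtain a matrix L satisfying (A.29) numerically is through the eigendecomposition of $M_{G}$, whose eigenvalues are v ones and m + 1 zeros. The sketch below uses `numpy.linalg.eigh` on simulated regressors; the data, the seed, and the dimensions are assumptions of the example, and the construction is only one of many valid choices of L.

```python
import numpy as np

rng = np.random.default_rng(0)   # arbitrary seed
T, m = 20, 2                     # assumed sample size and number of factors
G = np.column_stack([np.ones(T), rng.standard_normal((T, m))])
M_G = np.eye(T) - G @ np.linalg.solve(G.T @ G, G.T)
v = T - m - 1

# eigh returns eigenvalues in ascending order; reverse the columns so that the
# eigenvectors belonging to the unit eigenvalues come first.
eigval, eigvec = np.linalg.eigh(M_G)
L = eigvec[:, ::-1].T            # rows of L are orthonormal eigenvectors of M_G

target = np.zeros((T, T))
target[:v, :v] = np.eye(v)       # block matrix diag(I_v, 0)

# A quadratic form in M_G equals the sum of squares of the first v rotated elements.
u = rng.standard_normal(T)
quad_direct = u @ M_G @ u
quad_rotated = np.sum((L @ u)[:v] ** 2)
```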

Hence, setting

ζ_{i .} = σ_{i i}^{- 1 / 2} L u_{i .},

(A.30)

{\hat{ρ}}_{i j}

can be written equivalently in terms of the first v elements of

ζ_{i .} = {(ζ_{i 1}, ζ_{i 2}, \dots, ζ_{i T})}^{'}

as (see Lemma 19)

{\hat{ρ}}_{i j} = \frac{\sum_{t = 1}^{v} ζ_{i t} ζ_{j t}}{{(\sum_{t = 1}^{v} ζ_{i t}^{2})}^{1 / 2} {(\sum_{t = 1}^{v} ζ_{j t}^{2})}^{1 / 2}},

where

ζ_{i t} = \sum_{t^{'} = 1}^{T} l_{t t^{'}} ξ_{i t^{'}}

and

l_{t t^{'}}

is the

(t, t^{'})

element of L. Also, as shown in Lemma 19, for each

i, ζ_{i t}

’s are independently distributed over t, and

\begin{array}{l} E (ζ_{i t}) = 0, E (ζ_{i t}^{2}) = 1, E (ζ_{i t} ζ_{j t}) = ρ_{i j}, \\ κ_{i j} (4, 0) = E (ζ_{i t}^{4}) - 3, κ_{i j} (0, 4) = E (ζ_{j t}^{4}) - 3, \\ κ_{i j} (3, 1) = E (ζ_{i t}^{3} ζ_{j t}) - 3 ρ_{i j}, κ_{i j} (1, 3) = E (ζ_{i t} ζ_{j t}^{3}) - 3 ρ_{i j}, \\ κ_{i j} (2, 2) = E (ζ_{i t}^{2} ζ_{j t}^{2}) - 2 ρ_{i j}^{2} - 1. \end{array}

Furthermore, by Lemma 19

E ({\hat{ρ}}_{i j}) = ρ_{i j} + \frac{a_{i j}}{v} + O (v^{- 2}),

(A.31)

Var ({\hat{ρ}}_{i j}) = \frac{b_{i j}}{v} + O (v^{- 2}),

(A.32)

where

a_{i j} = - \frac{1}{2} ρ_{i j} (1 - ρ_{i j}^{2}) + \frac{3}{8} ρ_{i j} [κ_{i j} (4, 0) + κ_{i j} (0, 4)] - \frac{1}{2} [κ_{i j} (3, 1) + κ_{i j} (1, 3)] + \frac{1}{4} ρ_{i j} κ_{i j} (2, 2),

and

b_{i j} = {(1 - ρ_{i j}^{2})}^{2} + \frac{1}{4} ρ_{i j}^{2} [κ_{i j} (4, 0) + κ_{i j} (0, 4)] - ρ_{i j} [κ_{i j} (3, 1) + κ_{i j} (1, 3)] + \frac{1}{2} (2 + ρ_{i j}^{2}) κ_{i j} (2, 2) .
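The coefficients $a_{i j}$ and $b_{i j}$ are straightforward to evaluate once the cumulants are given. The following Python sketch transcribes the two expressions above and the leading terms of (A.31) and (A.32); the function and argument names are hypothetical, and the default of zero for every cumulant corresponds to the Gaussian case.

```python
def bias_variance_coeffs(rho, k40=0.0, k04=0.0, k31=0.0, k13=0.0, k22=0.0):
    """Return (a_ij, b_ij) for a given rho_ij and fourth-order cumulants kappa_ij(., .)."""
    a = (-0.5 * rho * (1.0 - rho**2)
         + 0.375 * rho * (k40 + k04)
         - 0.5 * (k31 + k13)
         + 0.25 * rho * k22)
    b = ((1.0 - rho**2) ** 2
         + 0.25 * rho**2 * (k40 + k04)
         - rho * (k31 + k13)
         + 0.5 * (2.0 + rho**2) * k22)
    return a, b


def approx_moments(rho, v, **cumulants):
    """Leading terms of (A.31) and (A.32): mean rho + a/v and variance b/v, ignoring O(v^-2)."""
    a, b = bias_variance_coeffs(rho, **cumulants)
    return rho + a / v, b / v


# Gaussian case with rho = 0.5 and v = 56 degrees of freedom (values chosen for illustration).
print(bias_variance_coeffs(0.5))
print(approx_moments(0.5, 56))
```

In the Gaussian case the expressions reduce to $a_{i j} = - \frac{1}{2} ρ_{i j} (1 - ρ_{i j}^{2})$ and $b_{i j} = {(1 - ρ_{i j}^{2})}^{2}$, so the sample correlation is biased towards zero at rate $v^{- 1}$.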

Hence, using Equation (A.31),

| E ({\hat{ρ}}_{i j}) - ρ_{i j} | \leq \frac{1}{v} | a_{i j} | + O (T^{- 2})

⁠, and we have the following bound on the second term of Equation (A.28):

A_{2} = \frac{1}{N} \sum_{i, j = 1}^{N} | E ({\hat{ρ}}_{i j}) - ρ_{i j} | E (I_{i j}) \leq \frac{1}{v N} \sum_{i, j = 1}^{N} | a_{i j} | + O (N T^{- 2}) .

Furthermore, since κ_ij are bounded, and by assumption

\sum_{i, j = 1}^{N} | ρ_{i j} | = O (N)

⁠, we have

\begin{array}{l} \frac{1}{N v} \sum_{i, j = 1}^{N} | a_{i j} | \\ \leq \frac{1}{2} \frac{1}{N v} \sum_{i, j = 1}^{N} | ρ_{i j} | | 1 - ρ_{i j}^{2} | + \frac{3}{8} \frac{1}{N v} \sum_{i, j = 1}^{N} | ρ_{i j} | | κ_{i j} (4, 0) + κ_{i j} (0, 4) | \\ + \frac{1}{2} \frac{1}{N v} \sum_{i, j = 1}^{N} | κ_{i j} (3, 1) + κ_{i j} (1, 3) | + \frac{1}{4 N v} \sum_{i, j = 1}^{N} | ρ_{i j} | | κ_{i j} (2, 2) | . \end{array}

But

\frac{1}{N v} \sum_{i, j = 1}^{N} | ρ_{i j} | | κ_{i j} (2, 2) | \leq \sup_{i j} | κ_{i j} (2, 2) | \frac{1}{N v} \sum_{i, j = 1}^{N} | ρ_{i j} | = O (v^{- 1}),

and hence

\frac{1}{N v} \sum_{i, j = 1}^{N} | a_{i j} | \leq \frac{1}{2} \frac{1}{N v} \sum_{i, j = 1}^{N} | κ_{i j} (3, 1) + κ_{i j} (1, 3) | + O (v^{- 1}) .

(A.33)

Also,

\begin{array}{l} \frac{1}{N v} \sum_{i, j = 1}^{N} | κ_{i j} (3, 1) + κ_{i j} (1, 3) | \\ \leq \frac{1}{N v} \sum_{i, j = 1}^{N} | E (ζ_{i t}^{3} ζ_{j t}) + E (ζ_{i t} ζ_{j t}^{3}) | + \frac{6}{N v} \sum_{i, j = 1}^{N} | ρ_{i j} | \\ = \frac{1}{N v} \sum_{i, j = 1}^{N} | E (ζ_{i t}^{3} ζ_{j t}) + E (ζ_{i t} ζ_{j t}^{3}) | + O (v^{- 1}), \end{array}

and as established in Lemma 20 (see (S.80) in the Supplementary Material), we have

\frac{1}{N v} \sum_{i, j = 1}^{N} | E (ζ_{i t}^{3} ζ_{j t}) + E (ζ_{i t} ζ_{j t}^{3}) | = O (T^{- 1} N^{2 δ_{γ} - 1}) + O (T^{- 1}),

which if used in Equation (A.33) yields

\frac{1}{N v} \sum_{i, j = 1}^{N} | a_{i j} | = O (v^{- 1} N^{2 δ_{γ} - 1}) + O (v^{- 1}) .

Overall, for the second term of Equation (A.28), we have

A_{2} = \frac{1}{N} \sum_{i, j = 1}^{N} | E ({\hat{ρ}}_{i j}) - ρ_{i j} | E (I_{i j}) = O (T^{- 1} N^{2 δ_{γ} - 1}) + O (v^{- 1}) + O (N v^{- 2}),

and since by assumption

δ_{γ} \leq 1 / 2

⁠, and

N / T^{2} \to 0

⁠, as N and

T \to \infty,

then

A_{2} \to 0.

(A.34)

To deal with the first and the third terms of Equation (A.28), we need to distinguish between values of

| ρ_{i j} |

that are strictly away from zero, namely those values that satisfy the condition

| ρ_{i j} | > ρ_{min} > 0

⁠, and those values that are zero or very close to zero. Note that for values of

| ρ_{i j} |

sufficiently close to zero, in the sense that

| ρ_{i j} | \leq κ N^{- ϕ_{ρ}}

⁠, for some

κ > 0

and

ϕ_{ρ} > 1

⁠, we have²³

A_{3} \leq \frac{1}{N} \sum_{i, j = 1}^{N} | ρ_{i j} | \leq κ N^{1 - ϕ_{ρ}} \to 0, if ϕ_{ρ} > 1.

Therefore, without loss of generality, we only consider the case where

| ρ_{i j} | > ρ_{min} > 0

⁠, for all i and j. In this case, we have

A_{3} = \frac{1}{N} \sum_{i, j = 1, | ρ_{i j} | > ρ_{min}}^{N} | ρ_{i j} | E (1 - I_{i j}) \leq \frac{1}{N} \sum_{i, j = 1, | ρ_{i j} | > ρ_{min}}^{N} E (1 - I_{i j}) .

(A.35)

Further, since

E (1 - I_{i j}) = \Pr [| {\hat{ρ}}_{i j} | \leq v^{- 1 / 2} c_{p} (N)]

⁠, then using result (A.7) in Lemma 4 of BPS (2017, supplement) we have (for some small

ϵ > 0

⁠)

\Pr [| {\hat{ρ}}_{i j} | \leq v^{- 1 / 2} c_{p} (N) | ρ_{i j} \neq 0] \leq K e^{\frac{- (1 - ϵ)}{2} \frac{v {(| ρ_{i j} | - \frac{c_{p} (N)}{\sqrt{v}})}^{2}}{b_{i j}}} [1 + o (1)] .

Using this result in Equation (A.35) now yields

A_{3} \leq K N e^{\frac{- (1 - ϵ)}{2} \frac{v {(ρ_{min} - \frac{c_{p} (N)}{\sqrt{v}})}^{2}}{b_{max}}} [1 + o (1)],

where

b_{max} = \sup_{i j} b_{i j} < K

⁠, which can be written equivalently as

A_{3} \leq K e^{\frac{- v (1 - ϵ)}{2} \frac{[{(ρ_{min} - \frac{c_{p} (N)}{\sqrt{v}})}^{2} - \frac{2 b_{max} \ln (N)}{v (1 - ϵ)}]}{b_{max}}} [1 + o (1)] .

Noting that

c_{p}^{2} (N) / v

and

\ln (N) / v

have the same rate of convergence and both

\to 0

⁠, as N and

T \to \infty

⁠, it then follows that²⁴

A_{3} \to 0, for some ρ_{min} > 0.

(A.36)
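The speed at which the bound on $A_{3}$ vanishes can be seen numerically. The Python sketch below evaluates $N e^{- \frac{(1 - ϵ)}{2} v {(ρ_{min} - c_{p} (N) / \sqrt{v})}^{2} / b_{max}}$ with K = 1 and the $[1 + o (1)]$ factor dropped. It assumes $T = N^{d}$ (that is, $c_{d} = 1$), the form $c_{p} (N) = Φ^{- 1} (1 - p / (2 N^{δ}))$ for the critical value, and illustrative values for $ρ_{min}$, $b_{max}$, ϵ, m, p, and δ; none of these settings is taken from the article.

```python
import math
from statistics import NormalDist


def critical_value(N, p=0.05, delta=1.0):
    """Assumed form c_p(N) = Phi^{-1}(1 - p / (2 N^delta)), computed from the lower tail."""
    return -NormalDist().inv_cdf(p / (2.0 * N**delta))


def a3_bound(N, d=1.0, rho_min=0.5, b_max=1.0, eps=0.01, m=3, p=0.05, delta=1.0):
    """N * exp(-(1 - eps)/2 * v * (rho_min - c_p(N)/sqrt(v))^2 / b_max) with K = 1."""
    v = N**d - m - 1  # v = T - m - 1 with T = N^d (c_d = 1 assumed)
    gap = rho_min - critical_value(N, p, delta) / math.sqrt(v)
    if gap <= 0:
        raise ValueError("rho_min must exceed the threshold c_p(N) / sqrt(v)")
    return N * math.exp(-0.5 * (1.0 - eps) * v * gap**2 / b_max)


for N in (200, 400, 800):
    print(N, a3_bound(N))
```

The bound falls very quickly once $ρ_{min}$ clears the threshold $c_{p} (N) / \sqrt{v}$, because the exponent grows linearly in v while the prefactor grows only linearly in N.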

Finally, consider the first term of Equation (A.28) and write it as

A_{1} = \frac{1}{N} \sum_{i, j = 1}^{N} E [| {\hat{ρ}}_{i j} - E ({\hat{ρ}}_{i j}) | \times I_{i j}] = \frac{1}{N} \sum_{i, j = 1}^{N} \sqrt{Var ({\hat{ρ}}_{i j})} E (| z_{i j} | \times I_{i j}),

(A.37)

where

z_{i j} = [{\hat{ρ}}_{i j} - E ({\hat{ρ}}_{i j})] / \sqrt{Var ({\hat{ρ}}_{i j})}

⁠, and

Var ({\hat{ρ}}_{i j})

is given by Equation (A.32). Also, by the Cauchy–Schwarz inequality (noting that

E (z_{i j}^{2}) = 1

⁠)

\begin{array}{l} E (| z_{i j} | \times I_{i j}) = E (| z_{i j} | I [| {\hat{ρ}}_{i j} | > v^{- 1 / 2} c_{p} (N)]) \leq {[E ({| z_{i j} |}^{2})]}^{1 / 2} {(E {I [| {\hat{ρ}}_{i j} | > v^{- 1 / 2} c_{p} (N)]})}^{1 / 2} \\ \leq {\Pr [| {\hat{ρ}}_{i j} | > v^{- 1 / 2} c_{p} (N)]}^{1 / 2} \leq 1. \end{array}

Using this result and

Var ({\hat{ρ}}_{i j})

from Equation (A.32) in Equation (A.37) and distinguishing between non-zero and near zero values of

ρ_{i j},

we have

\begin{array}{l} A_{1} = N^{- 1} \sum_{i, j = 1}^{N} E [| {\hat{ρ}}_{i j} - E ({\hat{ρ}}_{i j}) | \times I_{i j}] \leq \\ N^{- 1} (\sqrt{\frac{b_{max}}{v}} + O (v^{- 1})) \sum_{i, j = 1}^{N} {\Pr [| {\hat{ρ}}_{i j} | > v^{- 1 / 2} c_{p} (N) | | ρ_{i j} | = 0]}^{1 / 2} \\ + N^{- 1} (\sqrt{\frac{b_{max}}{v}} + O (v^{- 1})) \sum_{i, j = 1}^{N} {\Pr [| {\hat{ρ}}_{i j} | > v^{- 1 / 2} c_{p} (N) | | ρ_{i j} | > ρ_{min}]}^{1 / 2} \\ = A_{11} + A_{12} . \end{array}

Under the sparsity conditions, Equations (32) and (33), the maximum number of non-zero

| ρ_{i j} |

is given by

m_{N}^{2}

⁠, and we have

A_{12} \leq \frac{1}{N} [\frac{\sqrt{b_{max}}}{\sqrt{v}} + O (v^{- 1})] m_{N}^{2} = O (\frac{m_{N}^{2}}{N \sqrt{v}}),

(A.38)

where

m_{N} = O (N^{δ_{ρ}})

⁠. Hence, since by assumption

δ_{ρ} < 1 / 2

⁠, then it follows that

A_{12} \to 0

⁠, as N and

v \to \infty

⁠. For

A_{11}

⁠, which relates to the near zero values of

| ρ_{i j} |

⁠, making use of result (A.5) in Lemma 4 of BPS (2017, supplement) we have

A_{11} \leq K \frac{(N^{2} - m_{N}^{2})}{N} [\frac{\sqrt{b_{max}}}{\sqrt{v}} + O (v^{- 1})] exp (\frac{- (1 - ϵ)}{4} \frac{c_{p}^{2} (N)}{φ_{max}}) [1 + o (1)],

where

φ_{max} = \max_{i j} φ_{i j} < K

⁠. Then for

A_{1}

to tend to zero it is sufficient that (note that

N^{- 1} m_{N}^{2} \to 0

⁠, since

δ_{ρ} < 1 / 2

⁠)

\frac{N}{\sqrt{v}} exp (\frac{- (1 - ϵ)}{4} \frac{c_{p}^{2} (N)}{φ_{max}}) \to 0, as N and v \to \infty .

(A.39)

To obtain a sufficient condition for Equation (A.39) to hold, set

T = c_{d} N^{d}

and note that (recall that

v = T - m - 1

and

T / (T - m - 1) < K

⁠, since m is fixed as

T \to \infty

⁠)

\begin{array}{l} \frac{N}{\sqrt{v}} exp (\frac{- (1 - ϵ)}{4} \frac{c_{p}^{2} (N)}{φ_{max}}) \leq \sqrt{\frac{T}{T - m - 1}} exp (\frac{- (1 - ϵ)}{4} \frac{c_{p}^{2} (N)}{φ_{max}} + (1 - d / 2) log (N)) \\ = \sqrt{\frac{T}{T - m - 1}} exp (- log (N) [\frac{\frac{(1 - ϵ)}{4} \frac{c_{p}^{2} (N)}{φ_{max}} - (1 - d / 2) log (N)}{log (N)}]) . \end{array}

But by result (b) of Lemma 2 of BPS (2017, supplement),

\lim_{N \to \infty} c_{p}^{2} (N) / log (N) = 2 δ,

and Condition (A.39) is met if

\frac{δ (1 - ϵ)}{2 φ_{max}} - (1 - d / 2) > 0,

or equivalently if

δ > \frac{(2 - d)}{(1 - ϵ)} φ_{max}

⁠. Therefore, under this condition,

A_{11} \to 0

⁠, and together with Equation (A.38) establishes that

A_{1} \to 0

⁠. Therefore, using this result, Equations (A.34) and (A.36) in Equation (A.28) we have

E | ψ_{N T} | \to 0,

as required, which in turn implies

ψ_{N T} \to_{p} 0

, by Markov's inequality. Finally, using (S.79) in the Supplementary Material, established in Lemma 20, and setting

γ_{i} = 0

⁠, for all i, and

σ_{η, i j} = 0

⁠, for all

i \neq j

⁠, to ensure that

ρ_{i j} = 0

⁠, for all

i \neq j

⁠, we have

φ_{i j} = E (ζ_{i t}^{2} ζ_{j t}^{2} | ρ_{i j} = 0) = γ_{2, ε_{η}} (\sum_{r = 1}^{T} l_{t r}^{4}) (\sum_{ℓ = 1}^{N} σ_{i i}^{- 1} σ_{j j}^{- 1} q_{η, i ℓ}^{2} q_{η, j ℓ}^{2}) + σ_{i i}^{- 1} σ_{j j}^{- 1} σ_{η, i i} σ_{η, j j} .

where l_tr is the (t, r) element of the T × T orthonormal matrix L defined by Equation (A.29),

q_{η, i ℓ}

is such that

Q_{η} = (q_{η, i ℓ}), Q_{η}

defined by Equation (56). Also,

| σ_{η, i i} / σ_{i i} | \leq 1, \sum_{r = 1}^{T} l_{t r}^{4} \leq {(\sum_{r = 1}^{T} l_{t r}^{2})}^{2} \leq 1, \sum_{ℓ = 1}^{N} {\tilde{q}}_{η, i ℓ}^{2} = \sum_{ℓ = 1}^{N} q_{η, i ℓ}^{2} / σ_{η, i i} = 1,

and

(\sum_{ℓ = 1}^{N} σ_{i i}^{- 1} σ_{j j}^{- 1} q_{η, i ℓ}^{2} q_{η, j ℓ}^{2}) = | \sum_{ℓ = 1}^{N} {\tilde{q}}_{η, i ℓ}^{2} {\tilde{q}}_{η, j ℓ}^{2} | \leq {(\sum_{ℓ = 1}^{N} {\tilde{q}}_{η, i ℓ}^{4})}^{1 / 2} {(\sum_{ℓ = 1}^{N} {\tilde{q}}_{η, j ℓ}^{4})}^{1 / 2} \leq 1.

Hence, $\sup_{i j} φ_{i j} \leq 1 + | γ_{2, ε_{η}} |$ ⁠, as required. ▪
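The condition on δ and the rate in (A.39) can be explored with a short Python sketch. It is illustrative only: it assumes $T = N^{d}$ (so $c_{d} = 1$), the form $c_{p} (N) = Φ^{- 1} (1 - p / (2 N^{δ}))$ for the critical value, and sets $φ_{max} = 1$, the value of the bound $1 + | γ_{2, ε_{η}} |$ when $γ_{2, ε_{η}} = 0$. The function names and the settings for ϵ, m, and p are hypothetical.

```python
import math
from statistics import NormalDist


def critical_value(N, p=0.05, delta=1.0):
    """Assumed form c_p(N) = Phi^{-1}(1 - p / (2 N^delta)), computed from the lower tail."""
    return -NormalDist().inv_cdf(p / (2.0 * N**delta))


def delta_lower_bound(d, phi_max, eps=0.01):
    """Right-hand side of the condition delta > (2 - d) * phi_max / (1 - eps)."""
    return (2.0 - d) * phi_max / (1.0 - eps)


def a39_term(N, delta, d=1.0, phi_max=1.0, eps=0.01, m=3, p=0.05):
    """(N / sqrt(v)) * exp(-(1 - eps)/4 * c_p(N)^2 / phi_max), the quantity in (A.39)."""
    v = N**d - m - 1  # v = T - m - 1 with T = N^d (c_d = 1 assumed)
    c2 = critical_value(N, p, delta) ** 2
    return N / math.sqrt(v) * math.exp(-(1.0 - eps) / 4.0 * c2 / phi_max)


phi_max = 1.0                          # 1 + |gamma_2| with gamma_2 = 0 assumed
bound = delta_lower_bound(d=1.0, phi_max=phi_max)
delta = 2.0                            # any value above the bound would do
print("lower bound on delta:", bound)
for N in (100, 1000, 10000):
    print(N, a39_term(N, delta, d=1.0, phi_max=phi_max))
```

With d = 1 the lower bound is just above one, and for δ = 2 the term in (A.39) shrinks steadily as N grows. When d = 2 the bound is zero, so any positive δ satisfies the condition.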

Proof of Theorem 5:

By Theorem 3, $J_{α} (ρ_{N}^{2}) \to_{d} N (0, 1)$ so long as $N / T^{2} \to 0$ and $0 \leq δ_{γ} < 1 / 2,$ as $N \to \infty$ and $T \to \infty,$ jointly, where $J_{α} (ρ_{N}^{2})$ and $δ_{γ}$ are defined by Equations (61) and (53), respectively. Theorem 4 ensures that $(N - 1) ({\tilde{ρ}}_{N, T}^{2} - ρ_{N}^{2}) \to_{p} 0,$ and hence ${\hat{J}}_{α} - J_{α} (ρ_{N}^{2}) \to_{p} 0,$ as N and $T \to \infty,$ when d > 2/3 and $δ > \frac{(2 - d)}{(1 - ϵ)} φ_{max}$ for some small $ϵ > 0,$ where $φ_{max} \leq 1 + | γ_{2, ε_{η}} |$. Under these conditions, ${\hat{J}}_{α}$ has the same limit distribution as $J_{α} (ρ_{N}^{2})$ (by Lemma 4), which establishes the result. ▪

Proof of Theorem 6:

The steps in the proof are similar to those used to derive the limiting distribution of ${\hat{J}}_{α}$ under the null hypothesis. First, Lemma 22 establishes that, under Assumptions 1–3 and the local alternatives (68), $N^{- 1 / 2} \sum_{i = 1}^{N} (z_{i, a}^{2} - 1) \to_{d} N (ϕ^{2}, 2 ω^{2}),$ as $N \to \infty$ and $T \to \infty,$ jointly, where $z_{i, a}^{2}$ is defined by (S.97) in the Supplementary Material, $ω^{2} = 1 + \lim_{N \to \infty} (N - 1) ρ_{N}^{2},$ and $ρ_{N}^{2}$ is defined by Equation (60). Also, by Lemma 23, we have $N^{- 1 / 2} \sum_{i = 1}^{N} (z_{i, a}^{2} - t_{i}^{2}) = o_{p} (1)$. Finally, ${\hat{J}}_{α} - J_{α} = o_{p} (1)$, since the consistency of the MT estimator ${\tilde{ρ}}_{N, T}^{2}$ established by Theorem 4 is not affected by the introduction of local alternatives, as the MT estimator is computed from the regression residuals of the alternative model. This completes the proof of Theorem 6. ▪

Footnotes

This is a revised and updated version of the article entitled “Testing CAPM with a Large Number of Assets,” initially released in April 2012 as IZA Discussion Papers No. 6469. We would like to thank two anonymous referees and the Editor, Dacheng Xiu, for valuable comments. We are grateful to Ron Smith, Natalia Bailey, and Jay Shanken and other participants at the American Finance Association Meeting in San Diego, in January 2013 for helpful comments. The first author wishes to acknowledge partial support from the ESRC Grant No. ES/I031626/1. The second author acknowledges partial support from the JSPS KAKENHI (grant numbers 20H01484, 20H05631, 21H00700, and 21H04397).

Cross-sectional tests of CAPM have been considered by Douglas (1967); Black, Jensen, and Scholes (1972); and Fama and MacBeth (1973), among others. An early review of the literature can be found in Jensen (1972), and more recently in Fama and French (2004).

There exists a large literature in statistics and econometrics on the estimation of high-dimensional covariance matrices using regularization techniques such as shrinkage, adaptive thresholding, or other dimension-reducing procedures that impose certain structures on the variance matrix, such as sparsity or factor structures. See, for example, Wong, Carter, and Kohn (2003); Ledoit and Wolf (2004); Huang et al. (2006); BL; Fan, Fan, and Lv (2008); Cai and Liu (2011); Fan, Liao, and Mincheva (2011, 2013); and BPS.

Monte Carlo experiments reported by Feng et al. (2022) also show significant over-rejection of the null by the GOS test when T = 50 and N = 500. These authors do not report simulation results for larger values of N as they increase T to 100 and 200. It is therefore unclear if the over-rejection continues when N is also increased beyond 500 when T = 100. As we also note in the article, increasing T to avoid over-rejection increases the likelihood of breaks in factor loadings which could be another source of over-rejection.

Some researchers have focused on testing the restrictions $λ - μ = 0$ ⁠, allowing λ₀ to be unrestricted. See, for example, Shanken (1992).

Note that the GRS test is also based on the same null hypothesis, $H_{0} : α_{i} = 0$ ⁠, and assumes zero pricing errors.

Noting that ${(1 + {\bar{f}}^{'} {\hat{Ω}}^{- 1} \bar{f})}^{- 1} = T^{- 1} (τ_{T}^{'} M_{F} τ_{T})$ ⁠, where $\bar{f} = T^{- 1} \sum_{t = 1}^{T} f_{t}$ and $\hat{Ω} = T^{- 1} \sum_{t = 1}^{T} (f_{t} - \bar{f}) {(f_{t} - \bar{f})}^{'}$ ⁠, it is easily seen that Equation (17) can be written as the widely used expression of the GRS statistic, $\frac{T - N - m}{N} {(1 + {\bar{f}}^{'} {\hat{Ω}}^{- 1} \bar{f})}^{- 1} {\hat{α}}^{'} {\hat{V}}^{- 1} \hat{α}$ ⁠. As discussed in GRS, ${\hat{α}}^{'} {\hat{V}}^{- 1} \hat{α}$ measures the ex post maximum pricing error.

Another candidate is the shrinkage estimator of V proposed by Ledoit and Wolf (2004), which we denote by ${\hat{V}}_{LW}$ ⁠, and refer to the associated SW statistic as SW_LW. Such “plug-in” approaches are subject to two important shortcomings. First, even if V can be estimated consistently, the test might perform poorly in the case of non-Gaussian errors. Notice that the standardization of the Wald statistic is carried out assuming Gaussianity. Further, consistent estimation of V in the Frobenius norm sense still requires T to rise faster than N, and in practice threshold estimators of V are not guaranteed to be invertible in finite samples where $N ≫ T$ ⁠.

Only securities with ${\hat{σ}}_{i i} > 0$ are included in ${\hat{W}}_{d}$ ⁠.

We conducted an experiment with GARCH(1,1) errors and the evidence supports our claim. The results are reported in Table 5.

See Lemma 21 in the Supplementary Material of the article.

Small sample evidence on the efficacy of using $N^{- 1 / 2} \sum_{i = 1}^{N} (t_{i}^{2} - \frac{v}{v - 2})$ over $N^{- 1 / 2} \sum_{i = 1}^{N} (t_{i}^{2} - 1)$ is reported in Table 7.

For a proof of Equation (39), see Lemma 18 in the Supplementary Material.

See, for example, Cai and Liu (2011); Fan, Liao, and Mincheva (2013); BPS, among others.

Other thresholding estimators of V proposed in the literature can also be used.

See Theorem 4 in Section 4 and its proof in the Appendix.

The robustness of the ${\hat{J}}_{α}$ test against non-Gaussianity is investigated and reported in Table 7. These results are generally supportive of setting δ = 1.

For more details, see Supplementary Section M1.1.

See Assumptions BD.1–3 in GOS.

We are grateful to Richard Luger for sharing the code to compute the resampling test.

SMB stands for “small market capitalization minus big” and HML for “high book-to-market ratio minus low.” See Fama and French (1993).

The estimates used in the generation of the factors and their volatilities are computed using monthly observations over the period May 2008–April 2018.

In all the empirical applications T < N and the GRS test cannot be computed. We have also decided to exclude other tests discussed in the Monte Carlo Section on the grounds of their substantial size distortions under the null and/or low power.

Note that the sparsity condition given by Equation (65) can be violated if $ϕ_{ρ} < 1$ ⁠.

Note that since by assumption $T = c_{d} N^{d}$ ⁠, with d > 1/2, then $\ln (N) / v = (T / (T - m - 1)) c_{d}^{- 1} N^{- d} \ln (N) \to 0$ ⁠, as $N \to \infty$ ⁠. Recall that m, the number of factors, is fixed as $T \to \infty$ ⁠.

References

Affleck-Graves and McDonald (1989). Nonnormalities and Tests of Asset Pricing Theories. The Journal of Finance, 889–908.

Affleck-Graves and McDonald (1990). Multivariate Tests of Asset Pricing: The Comparative Power of Alternative Statistics. Journal of Financial and Quantitative Analysis, 163–185.

Anderson, T. W. (2003). An Introduction to Multivariate Statistical Analysis. 3rd edn. Wiley.

Ang, Chen, and Xing (2006). Downside Risk. The Review of Financial Studies, 1191–1239.

Ang, Liu, and Schwarz (2020). Using Stocks or Portfolios in Tests of Factor Models. Journal of Financial and Quantitative Analysis, 709–750.

Bai and Saranadasa (1996). Effect of High Dimension: By an Example of a Two Sample Problem. Statistica Sinica, 311–329.

Bailey, Kapetanios, and Pesaran, M. H. (2021). Measurement of Factor Strength: Theory and Practice. Journal of Applied Econometrics, 587–613.

Bailey, Pesaran, M. H., and Smith, L. V. (2019). A Multiple Testing Approach to the Regularisation of Large Sample Correlation Matrices. Journal of Econometrics 208, 507–534.

Beaulieu, M.-C., Dufour, J.-M., and Khalaf (2007). Multivariate Tests of Mean–Variance Efficiency with Possibly Non-Gaussian Errors. Journal of Business & Economic Statistics, 398–410.

Bickel, P. J., and Levina (2008). Regularized Estimation of Large Covariance Matrices. The Annals of Statistics, 199–227.

Black, Jensen, M. C., and Scholes (1972). “The Capital Asset Pricing Model: Some Empirical Tests.” In M. C. Jensen (Ed.), Studies in the Theory of Capital Markets, pp. –121. New York: Praeger.

Breusch and Pagan (1980). The Lagrange Multiplier Test and Its Applications to Model Specification in Econometrics. Review of Economic Studies, 239–253.

Cai and Liu (2011). Adaptive Thresholding for Sparse Covariance Matrix Estimation. Journal of the American Statistical Association 106, 672–684.

Chamberlain (1983). Funds, Factors, and Diversification in Arbitrage Pricing Models. Econometrica, 1305–1323.

Chordia, Goyal, and Shanken (2017). Cross-Sectional Asset Pricing with Individual Stocks: Betas versus Characteristics. Columbia Business School.

Cremers, Halling, and Weinbaum (2015). Aggregate Jump and Volatility Risk in the Cross-Section of Stock Returns. The Journal of Finance, 577–614.

Douglas (1967). Risk in Equity Markets: An Empirical Appraisal of Market Efficiency. University Microfilms.

Fama, E. F., and French, K. R. (1993). Common Risk Factors in the Returns on Stocks and Bonds. Journal of Financial Economics.

Fama, E. F., and French, K. R. (2004). The Capital Asset Pricing Model: Theory and Evidence. Journal of Economic Perspectives.

Fama, E. F., and French, K. R. (2015). A Five-Factor Asset Pricing Model. Journal of Financial Economics 116.

Fama, E. F., and MacBeth, J. D. (1973). Risk, Return, and Equilibrium: Empirical Tests. Journal of Political Economy, 607–636.

Fan (2008). High Dimensional Covariance Matrix Estimation Using a Factor Model. Journal of Econometrics 147, 186–197.

Fan, Liao, and Mincheva (2011). High-Dimensional Covariance Matrix Estimation in Approximate Factor Models. Annals of Statistics, 3320–3356.

Fan, Liao, and Mincheva (2013). Large Covariance Estimation by Thresholding Principal Orthogonal Complements. Journal of the Royal Statistical Society. Series B, 603–680.

Fan, Liao, and Yao (2015). Power Enhancement in High-Dimensional Cross-Sectional Tests. Econometrica: Journal of the Econometric Society, 1497–1541.

Feng, Lan, and Liu (2022). High-Dimensional Test for Alpha in Linear Factor Pricing Models with Sparse Alternatives. Journal of Econometrics 229, 152–175.

Gagliardini, Ossola, and Scaillet (2016). Time-Varying Risk Premium in Large Cross-Sectional Equity Data Sets. Econometrica, 985–1046.

Gibbons, M. R., Ross, S. A., and Shanken (1989). A Test of the Efficiency of a Given Portfolio. Econometrica, 1121–1152.

Giglio and Xiu (2021). Asset Pricing with Omitted Factors. Journal of Political Economy 129, 1947–1990.

Gungor and Luger (2009). Exact Distribution-Free Tests of Mean–Variance Efficiency. Journal of Empirical Finance, 816–829.

Gungor and Luger (2016). Multivariate Tests of Mean–Variance Efficiency and Spanning with a Large Number of Assets and Time-Varying Covariances. Journal of Business & Economic Statistics, 161–175.

Huang, Yuan, and Zhou (2021). Tests of Asset Pricing Models with a Large Number of Assets. DOI: 10.2139/ssrn.3143752.

Huang, J. Z., Liu, Pourahmadi, and Liu (2006). Covariance Matrix Selection and Estimation via Penalised Normal Likelihood. Biometrika.

Hwang and Satchell, S. E. (2014). Testing Linear Factor Models on Individual Stocks Using the Average F-Test. The European Journal of Finance, 463–498.

K. S., Pesaran, and Shin (2003). Testing for Unit Roots in Heterogeneous Panels. Journal of Econometrics 115.

Jensen (1972). Studies in the Theory of Capital Markets. Praeger.

Jensen, M. C. (1968). The Performance of Mutual Funds in the Period 1945–1964. The Journal of Finance, 389–416.

Kelejian, H. H., and Prucha, I. R. (2001). On the Asymptotic Distribution of the Moran I Test Statistic with Applications. Journal of Econometrics 140, 219–257.

Lan, Feng, and Luo (2018). Testing High-Dimensional Linear Asset Pricing Models. Journal of Financial Econometrics, 191–210.

Ledoit and Wolf (2004). A Well-Conditioned Estimator for Large-Dimensional Covariance Matrices. Journal of Multivariate Analysis, 365–411.

Lieberman (1994). A Laplace Approximation to the Moments of a Ratio of Quadratic Forms. Biometrika, 681–690.

Lintner (1965). The Valuation of Risk Assets and the Selection of Risky Investments in Stock Portfolios and Capital Budgets. The Review of Economics and Statistics.

Longin and Solnik (2001). Extreme Correlation of International Equity Markets. The Journal of Finance, 649–676.

Lan and Tsai, C.-L. (2020). Testing Alphas in Conditional Time-Varying Factor Models with High-Dimensional Assets. Journal of Business & Economic Statistics, 214–227.

Pesaran, M. H., Ullah, and Yamagata (2008). A Bias-Adjusted LM Test of Error Cross-Section Independence. The Econometrics Journal, 105–127.

Raponi, Robotti, and Zaffaroni (2019). Testing Beta-Pricing Models Using Large Cross-Sections. The Review of Financial Studies, 2796–2842.

Ross, S. A. (1976). The Arbitrage Theory of Capital Asset Pricing. Journal of Economic Theory, 341–360.

Shanken (1992). On the Estimation of Beta-Pricing Models. The Review of Financial Studies.

Sharpe, W. F. (1964). Capital Asset Prices: A Theory of Market Equilibrium under Conditions of Risk. The Journal of Finance, 425–442.

Srivastava, M. S. (2008). A Test for the Mean Vector with Fewer Observations than the Dimension. Journal of Multivariate Analysis, 386–402.

Wong, Carter, C. K., and Kohn (2003). Efficient Estimation of Covariance Selection Models. Biometrika, 809–830.