Abstract

This article considers tests of alpha in linear factor pricing models when the number of securities, N, is much larger than the time dimension, T, of the individual return series. We focus on class of tests that are based on Student’s t-tests of individual securities which have a number of advantages over the existing standardized Wald type tests, and propose a test procedure that allows for non-Gaussianity and general forms of weakly cross-correlated errors. It does not require estimation of an invertible error covariance matrix, it is much faster to implement, and is valid even if N is much larger than T. We also show that the proposed test can account for some limited degree of pricing errors allowed under Ross’s arbitrage pricing theory condition. Monte Carlo evidence shows that the proposed test performs remarkably well even when T =60 and N =5000. The test is applied to monthly returns on securities in the S&P 500 at the end of each month in real time, using rolling windows of size 60. Statistically significant evidence against Sharpe–Lintner capital asset pricing model and Fama–French three and five factor models are found mainly during the period of Great Recession (2007M12–2009M06).

This article is concerned with testing for the presence of alpha in linear factor pricing models (LFPMs) such as the capital asset pricing model (CAPM) due to Sharpe (1964) and Lintner (1965), or the arbitrage pricing theory (APT) model due to Ross (1976), when factors are observed and the number of securities, N, is quite large relative to the time dimension, T, of the return series under consideration. There exists a large literature in empirical finance that tests various implications of Sharpe–Lintner model. Cross-sectional as well as time-series tests have been proposed and applied in many different contexts. Using time-series regressions, Jensen (1968) was the first to propose using standard t-statistics to test the null hypothesis that the intercept, αi, in the ordinary least squares (OLS) regression of the excess return of a given security, i, on the excess return of the market portfolio is zero.1

However, when a large number of securities are under consideration, due to dependence of the errors across securities in the LFPM regressions, the individual t-statistics are correlated which makes controlling the overall size of the test problematic. Gibbons, Ross, and Shanken (1989, GRS) propose an exact multivariate version of the test which deals with this problem if the CAPM regression errors are Gaussian and N < T. This is the standard test used in the literature, but its application has been confined to testing the market efficiency of a relatively small number of portfolios, typically 20−30, using monthly returns observed over relatively long time periods. The use of large T as a way of ensuring that N < T is also likely to increase the possibility of structural breaks in the β′s that could in turn adversely affect the performance of the GRS test.

Recently, there has been a growing body of finance literature which uses individual security returns rather than portfolio returns for the test of pricing errors. Ang, Liu, and Schwarz (2020) show that the smaller variation of beta estimates from creating portfolios may not lead to smaller variation of cross-section regression estimates. Cremers, Halling, and Weinbaum (2015) examine the pricing of both aggregate jump and volatility risk based on individual stocks rather than portfolios. Chordia, Goyal, and Shanken (2017) advocate the use of individual securities to investigate whether the source of expected return variation is from betas or security-specific characteristics.

Out of the two main assumptions that underlie the GRS test, the literature has focused on the implications of non-normal errors for the GRS test, and ways of allowing for non-normal errors when testing αi=0. Affleck-Graves and Mcdonald (1989) were among the first to consider the robustness of the GRS test to non-normal errors who, using simulation techniques, find that the size and power of GRS test can be adversely affected if the departure from non-normality of the errors is serious, but conclude that the GRS test is “reasonably robust with respect to typical levels of nonnormality.” (p. 889). More recently, Beaulieu, Dufour, and Khalaf (2007, BDK) and Gungor and Luger (2009) have proposed tests of αi=0 that allow for non-normal errors, but retain the restriction N < T. BDK develop an exact test which is applicable to a wide class of non-Gaussian error distributions, and use Monte Carlo simulations to achieve the correct size for their test. GL propose two distribution-free nonparametric sign tests in the case of single factor models that allow the error distribution to be non-normal but require it to be cross-sectionally independent and conditionally symmetrically distributed around zero.

Our primary focus in this article is on multivariate tests of H0:αi=0, for i=1,2,,N, when N > T, while allowing for non-Gaussian and weakly cross-sectionally correlated errors. The latter condition is required for consistent estimation of the error covariance matrix, V, when N is large relative to T. In the case of LFPM regressions with weakly cross-sectionally correlated errors, consistent estimation of V can be achieved by adaptive thresholding which sets to zero elements of the estimator of V that are below a given threshold. Alternatively, feasible estimators of V can be obtained by Bayesian or classical shrinkage procedures that scale down the off-diagonal elements of V relative to its diagonal elements.2Fan, Liao, and Mincheva (2011, 2013) consider consistent estimation of V in the context of an approximate factor model. They assume V is sparse and propose an adaptive thresholding estimator of V, which they show to be positive definite with satisfactory small sample properties. Fan, Liao, and Yao (2015) consider a standardized Wald (SW) test based on the estimator of V proposed by Fan, Liao, and Mincheva (2013) and derive the conditions under which the SW test of H0 can be asymptotically justified. Gungor and Luger (2016, GL) propose a simulation-based approach for testing pricing errors. They claim that their test procedure is robust against non-normality and cross-sectional dependence in the errors. Gagliardini, Ossola, and Scaillet (2016, GOS) develop two-pass regressions of individual stock returns, allowing time-varying risk premia, and propose a SW test. Lan, Feng, and Luo (2018) use random projection of the N security returns onto a smaller number of portfolios to circumvent the high-dimensional problem when testing for alphas, but require N and T to be of the same order of magnitude. Raponi, Robotti, and Zaffaroni (2019) propose a test of pricing error in cross-section regression for fixed number of time-series observations. They use a bias-corrected estimator of Shanken (1992) to standardize their test statistic. Ma et al. (2020) employ polynomial spline techniques to allow for time variations in factor loadings when testing for alphas. Feng et al. (2022) propose a max-of-square type test of alphas instead of the average used in the literature, and recommend using a combination of the two testing procedures. As noted by He et al. (2021), Bai and Saranadasa (1996, BS) consider yet another SW type test which requires N and T to be of the same order of magnitude.

In this article, we develop a test statistic that initially ignores the off-diagonal elements of V and base the test of H0 on the average of the squared t-ratios for αi=0, over i=1,2,,N. This idea was originally proposed in the working paper version of this article, independently of a similar approach subsequently followed by GOS. Despite the similarity of the two tests, as will be seen, our version of the test performs much better for all combinations of N and T considered in the literature, and delivers excellent size and power even if N is very large (around 5000), in contrast to other tests that tend to over reject as N is increased relative to T. We are also able to establish the asymptotic distribution of proposed test under much weaker conditions and without resorting to high level assumptions.3 We achieve this by making corrections to the numerator of the test statistic to ensure that the test is more accurately centered, and correct the denominator of the test statistic to allow for the effects of non-zero off-diagonal elements of the underlying error covariance matrix. The correction involves consistently estimating N1Tr(R2), where R=(ρij) is the error correlation matrix. The estimation of N1Tr(R2)=N1i=1N j=1Nρij2 is subject to the curse of dimensionality which we address by using the multiple testing (MT) threshold estimator, R˜, recently proposed by Bailey, Pesaran, and Smith (2019, BPS). We show that consistent estimation of N1Tr(R2) can be achieved under a more general specification of R when compared with tests that require a consistent estimator of the full matrix, R. We are able to establish that the resultant test is applicable more generally and continues to be valid for a wider class of error covariances, and holds even if N rises faster than T. The proposed test is also corrected for small sample effects of non-Gaussian errors, which is of particular importance in finance. We refer to this test as Jensen’s α test of LFPM and denote it by J^α. The test can also be viewed as a robust version of a SW test, in cases where the off-diagonal elements of V become relatively less important as N. Further, the implementation of the J^α test is computationally less demanding, since it does not involve estimation of an invertible high-dimensional error covariance matrix.

We note that the J^α test is not the first one which is based on the standardized squared t-ratio for αi=0. As discussed in He et al. (2021), Srivastava and Du (2008, SD) propose standardized squared t-ratio, using a different standardization from ours. As will be seen below, their standardization results in serious size distortion when N is larger than T (see the SD test discussed in Section 5). Also, Hwang and Satchell (2014) proposed a simulation-based test, using average of the squared t-ratios.

Our assumption regarding the sparsity of V advances on Chamberlain’s (1983) approximate factor model formulation of the asset model, where it is assumed that the largest eigenvalue of V (or R) is uniformly bounded in N (Chamberlain, 1983, p. 1307). We relax this assumption and allow the maximum column sum matrix norm of R to rise with N but at a rate slower than N, while controlling the overall sparsity of R by requiring N1Tr(R2) to be bounded in N. In this way, we are able to allow for two types of cross-sectional error dependence: one due to the presence of weak common factors that are not sufficiently strong to be detectable using standard estimation techniques, such as principal components and another due to the error dependence that arises from interactive and spill-over effects.

We establish that under the null hypothesis H0: αi=0, the J^α test is asymptotically distributed as N(0, 1) for T and N jointly, so long as N/T20,mN=R1=O(Nδρ), 0δρ<1/2, and N1Tr(R2) is bounded in N. The test is also shown to have power against alternatives that rise in N1/2T. We consider the implications of allowing for pricing errors on the asymptotic properties of the J^α test and show that testing H0 still allows for some very limited degree of non-zero pricing errors. The proofs are quite involved and in some parts rather tedious. For the purpose of clarity, we provide statements of the main theorems with the associated assumptions in the article, but relegate the mathematical details to the Appendix.

Small sample properties of the J^α test are investigated using Monte Carlo experiments designed specifically to match the distributional features of the residuals of Fama–French three factor regressions of individual securities in the Standard & Poor 500 (S&P 500) index. We consider the comparative test results for the following nine sample size combinations, T{60,120,240} and N={50,100,200}. The J^α test performs well for all sample size combinations with empirical size very close to the chosen nominal value of 5%, and satisfactory power. Comparing the size and power of the J^α test with the GRS test in the case of experiments with N < T for which the GRS statistics can be computed, we find that the J^α test has higher power than the GRS test in most experiments. This could be due to the non-normal errors adversely affecting the GRS test, as reported by Affleck-Graves and Mcdonald (1989, 1990). In addition, the J^α test outperforms the test proposed by GOS as well as the SW test of Fan, Liao, and Yao (2015) and the SD test of Srivastava and Du (2008). The J^α test also outperforms the simulation-based Fmax test of Gungor and Luger (2016) and the BS test of Bai and Saranadasa (1996), which are shown to be substantially undersized across the various designs, and has lower power when compared with the J^α test. Further, we carried out additional experiments using much larger values of N, namely N=500,1000, 2000, and 5000, while keeping T at 60, 120, and 240. We only considered the J^α test for these experiments and found no major evidence of size distortions even for the experiments with T =60 and N =5000.

Encouraged by the satisfactory performance of the J^α test even in cases where N is much larger than T, we applied the test to monthly returns on the securities in the S&P 500 index using rolling windows of size T =60 months. The survivorship bias problem is minimized by considering the sample of securities included in the S&P 500 at the end of each month in real time. We report the J^α test results for CAPM, three and five Fama–French factor models over the period September 1989 to April 2018, and the three sub-periods: (1) the Asian financial crisis (1997M07–1998M12), (2) the Dot-com bubble burst (2000M03–2002M10), and (3) the Great Recession (2007M12–2009M06) periods. We find that the J^α test rejects H0:αi=0, mainly during periods of major financial disruptions, particularly the period of Great Recession, with the GOS test rejecting the null for most periods, largely due to its tendency to over-reject when T is short relative to N.

The outline of the rest of the article is as follows. Section 1 sets out the LFPM, formulates the null hypothesis that underlies the tests for alphas which allow for pricing errors and weak latent or missing factors. Section 2 introduces the estimates of alpha and derives the GRS test as a point of departure for dealing with the case where N > T. Section 3 proposes the J^α test for large N panels and derives its asymptotic distribution, and Section 4 summarizes the main theoretical results. Section 5 reports on small sample properties of J^α, GRS, GOS, SW, Fmax, BS and SD tests, using Monte Carlo techniques. Section 6 presents the empirical application. Section 7 concludes. The proofs of the main theorems are provided in the Appendix, and the lemmas which are used for the proofs, as well as the additional Monte Carlo evidence and the detailed discussion on data sources, are provided in the Supplementary Material.

Notations: We use K and c to denote finite and small positive constants. If {ft}t=1 is any real sequence and {gt}t=1 is a sequences of positive real numbers, then ft=O(gt), if there exists a positive finite constant K such that |ft|/gtK for all t. ft=o(gt) if ft/gt0 as t. If {ft}t=1 and {gt}t=1 are both positive sequences of real numbers, then ft=(gt) if there exists T01 and positive finite constants C0 and C1, such that inftT0(ft/gt)C0 and suptT0(ft/gt)C1. For a N × N matrix A=(aij), the minimum and maximum eigenvalues of matrix A are denoted by λmin(A) and λmax(A), respectively, its trace by Tr(A), its maximum absolute column and row sum matrix norms by A=supij=1N|aij|, and, A1=supji=1N|aij|, respectively, its Frobenius and spectral norms by AF=Tr(AA), and A=λmax1/2(AA), respectively. For a N×1 dimensional vector, α,α=(αα)1/2.

1 The LFPM and APT Restrictions

We base our test of alpha on the following statistical factor model:
(1)
where Rit is the return on security i during period t, rtf is the risk-free rate, ft=(f1t,f2t,,fmt) is the m×1 vector of observed factors, βi=(βi1,βi2,,βim) is the associated vector of risk factors with mean μ=E(ft). Under the APT due to Ross (1976), the following restrictions are imposed:
(2)
where λ0 is the zero-beta expected excess return, λ is the m×1 vector of risk premia, and ϖi is the pricing error of security i such that
(3)
This latter condition is given by Equation (18) in Theorem II of Ross and ensures that under APT pricing errors are sparse. In this article, we consider a more general bound on the pricing errors and assume that
(4)
where the exponent δϖ measures the degrees of pervasiveness of pricing errors. Deviations from APT are measured in terms of δϖ (0δϖ<1). Large values of δϖ represent major departures from APT.
To motivate the alpha tests of αi=0 in the statistical model, we note that under Equation (1),
and for the statistical model to be compatible with the APT condition (2), we must have
(5)
Therefore, testing the null hypothesis, H0:αi=0 for all i, can be viewed as tests of the joint hypothesis λ0=0, λ=μ, (referred to as “no spanning errors” here after), and testing ϖi=0, for all i (referred to as “no pricing errors”). Under APT, the excess return regressions can be written as
(6)
where yit=Ritrtf and ϖi satisfies Equation (4).4 Under APT, the above model is often referred to as the LFPM, to be distinguished from the statistical linear factor model given by Equation (1). It is also worth noting that when testing H0 it is still possible to allow for a limited degree of non-zero pricing errors, depending on the prevalence of the pricing errors and the relative expansion rates of N and T. See Remark 8 below.5
To ensure that the risk from common factors, ft, cannot be fully diversified we assume that at least one of the observed factors is strong, in the sense that
(7)
Our test does not require all the observed factors to be strong, and allows these factors to have different degrees of strength. In a recent paper, Bailey, Kapetanios, and Pesaran (2021) find that among over 140 factors proposed in the literature only the market factor can be regarded as strong. The other factors are estimated to be semi-strong, such that the sum of their loadings in absolute terms rises with N but at the rate of δβ, where 1/2<δβ<1. Also, there is no guarantee that all relevant factors are included in the asset pricing model, and to allow for possible missing (or latent) factors, we assume that
(8)
where vt is a k×1 vector of latent common factors that are IID(0,Ik),γi=(γi1,γi2,,γik) is the associated vector of factor loadings with bounded elements, supi,s|γis|<K. The latent factors included in the error process must be weak such that
(9)

The idiosyncratic errors, ηit, are also allowed to be weakly cross-correlated. Specifically, we assume that ηt=(η1t,η2t,….,ηNt)=Qηεη,t, where εη,t=(εη,1t,εη,2t,….,εη,Nt),{εη,it} are IID processes over i and t, with zero means , unit variances, γ2,εη=E(εη,it4)3, and supi,tE(|εη,it|8+c)K<, for some c >0. We denote the correlation matrix of ηt by Rη=(ρη,ij) and note that Rη=QηQη. To ensure that ut=(u1t,u2t,,uNt) is weakly cross-correlated, we require that k, the number of weak factors, is finite, and Rη1Qη1QηK. The error specification in Equation (8) is quite general and allows for weak latent common factors as well as network and spatial error cross dependence. We note that common factors cannot substitute for network dependence and allowing for both types of dependence in the errors is important.

2 Preliminaries and the GRS Test

It proves useful to stack the panel regressions in Equation (6) by time series as well as by cross-section observations. Stacking by time-series observations we have
(10)
where yi.=(yi1,yi2,,yiT),τT=(1,1,,1),F=(f1,f2,,fT), and ui.=(ui1,ui2,,uiT). Stacking by cross-sectional observations we have
(11)
where yt=(y1t,y2t,,yNt), α=(α1,α2,,αN), B=(β1,β2,,βN), and ut=(u1t,u2t,,uNt).
For derivation of the exact GRS (Gibbons et al., 1989) test, we assume that utIIDN(0,V), namely errors, uit, are Gaussian, have zero means, and are serially uncorrelated such that E(uitujt)=0, for all i, j, and tt, with E(utut)=V, where V=(σij) is an N × N symmetric positive definite matrix. A non-Gaussian version of this assumption will be considered below. Starting with Jensen’s (1968) test of individual αi’s, we note that the OLS estimator of αi is given by
(12)
where MF=ITF(FF)1F, and is an efficient estimator despite the fact that V is not a diagonal matrix. This result follows since Equation (10) is a seemingly unrelated regression equation specification with the same set of regressors across all the N securities. It is also easily seen that
(13)
where
(14)
Stacking the N estimates in Equation (13), we have
where ui.c=t=1Tuitct, and ct is the tth element of c. Hence,
(15)
whereas before ut=(u1t,u2t,,uNt). Therefore, under Gaussianity,
Also, in the case where TN+m+1, an unbiased and invertible estimator of V is given by (TTm1)V^, where V^ is the sample covariance matrix estimator
(16)
u^t=(u^1t,u^2t,,u^Nt), u^it is the OLS residual from the regression of yit on an intercept and ft.
Under Gaussianity, u^t has a multivariate normal distribution with zero means, α^ and u^t are independently distributed, and hence using standard results from multivariate analysis it follows that (see, e.g., Theorem 5.2.2 in Anderson, 2003) the GRS statistic (see p. 1124 of GRS)
(17)
is distributed exactly as a non-central F distribution with (TNm) and N degrees of freedom, and the non-centrality parameter μα2=TNmN(τTMFτTT)αV1α, which is zero under H0:α=0.6

As noted in the introduction, the single most important limiting feature of the GRS and other related tests proposed in the literature is the requirement that T must be larger than N. Due to this, in applications of the GRS test, individual securities are grouped into (sub)portfolios and the GRS test is then typically applied to 20–30 portfolios over relatively long time periods. However, the market efficiency hypothesis implies that αi=0 for all individual securities which form the market portfolio, and it is clearly desirable to develop tests which permit N to be much larger than T. This is even more so if we would like to minimize the adverse effects of possible time variations in the βi’s.

It is also worth bearing in mind that the GRS test does not impose any restrictions on V, which is possible only because N is taken to be fixed as T. Large T is required to take account of non-Gaussian errors. While in the context of the approximate factor models advanced in Chamberlain (1983), the errors are at most weakly correlated, which places restrictions on the off-diagonal elements of V and its inverse. In addition, such restrictions are also statistically important in order to estimate V and its inverse when N > T. The test developed in this article for a large number of individual securities is therefore clearly different from the GRS test, both theoretically and statistically. Furthermore, as we shall see below, a test that exploits restrictions implied by the weak cross-sectional correlation of the errors is likely to have much better power properties than the GRS test that does not make use of such restrictions. Finally, being a multivariate F-test, the power of the GRS test is primarily driven by the time dimension, T, while for the analysis of a large number of assets or portfolios we need tests that have the correct size and are powerful for large N.

3 Large N Tests of Alpha in LFPMs

To develop large N tests of H0:α=0, we consider the following version of the GRS statistic, as set out in Equation (17),
(18)
where we have dropped the degrees of freedom adjustment term and replaced V^ by its true value. Under H0:α=0, and assuming that the errors are Gaussian we have WvχN2. Since the mean and the variance of a χN2 random variable are N and 2N, one could consider a SW test statistic defined by
(19)

Under Gaussianity and H0:α=0,SWvdN(0,1) as N. To construct tests of large N panels, a suitable estimator of V is required. But as was noted in the introduction this is possible only if we are prepared to impose restrictions on the structure of V. In the case of LFPM regressions where the errors are at most weakly cross-sectionally correlated, this can be achieved by adaptive thresholding which sets to zero elements of V that are sufficiently small, or by use of shrinkage type estimators that put a substantial amount of weight on the diagonal elements of the shrinkage estimator of V. Fan, Liao, and Mincheva (2011, 2013) consider consistent estimation of V in the context of an approximate factor model. They assume V is sparse and propose an adaptive threshold estimator, denoted as V^POET, which they show to be positive definite with satisfactory small sample properties. We refer to the feasible SW test statistic which replaces V with V^POET as SWPOET test.7

3.1 A ^Jα Test for Large N Securities

To overcome some of the above mentioned limitations of the plug-in procedures, we avoid using an estimator of V altogether and base our proposed test on diagonal elements of V, namely the N × N diagonal matrix, D=diag(σ11,σ22,,σNN), with σii=E(uit2), rather than the full covariance matrix. Specifically, we consider the statistic
(20)
and its feasible counterpart given by
(21)
where σ^ii=u^i.u^i./T. The degrees of freedom v=Tm1 are introduced to correct for small sample bias of the test.8 The infeasible statistic, Wd, can also be written as
(22)
where
(23)
It is then easily seen that
(24)
where ti denotes the standard t-ratio of αi given by
(25)
As with the panel testing strategy developed in Im, Pesaran, and Shin (2003), a standardized version of W^d, defined by Equation (24), can now be considered:
(26)
where
(27)
(28)
Under Gaussianity, the individual ti statistics are identically distributed as Student’s t with v degrees of freedom, and we have (assuming v=Tm1>4)
(29)
Using Equations (27)–(29), the standardized statistic (26) can now be written as
(30)
where
(31)
and

To make the Jα test operational, we need to provide a large N consistent estimator of θN2. Second, we need to show that, despite the fact that Jα test is standardized assuming ti has a standard t distribution, the test will continue to have satisfactory small sample performance even if such an assumption does not hold due to the non-Gaussianity of the underlying errors. More formally, in what follows we relax the Gaussianity assumption and assume that ut=Qεt, where Q is an N × N invertible matrix,εt=(ε1t,ε2t,,εNt), and {εit} is an IID process over i and t, with means zero and unit variances, and for some c >0, E(|εit|8+c) exists, for all i and t. Then, E(utut)=V=(σij)=QQ and V is an N × N symmetric positive definite matrix, with λmin(V)c>0. We allow for cross-sectional error heteroskedasticity, but assume that the errors are homoskedastic over time. This assumption can be relaxed by replacing the assumption of error independence by a suitable martingale difference assumption. This extension will not be attempted in this article.9

3.2 Sparsity Conditions on Error Correlation Matrix

As noted already, we advance on the literature by allowing V=(σij) to be approximately sparse. Equivalently, we define sparsity in terms of the elements of the correlation matrix R=(ρij), where ρij=σij/σiiσjj. We consider the following two conditions:
(32)
and
(33)
Under condition (32), mN is allowed to rise with N, but at a slower rate than N. For example, consider the case where condition (32) applies to the first p rows of R (with p fixed), and the rest of the Np rows of R are absolute summable, namely
Then, since |ρij|2|ρij|, it readily follows that
Another important case covered by our sparsity assumption is when uit has the weak factor structure given by Equation (8), with the factor loadings, γi, satisfying Equation (9). Denoting the correlation matrix of the idiosyncratic errors, ηt=(η1t,η2t,,ηNt) by Rη=(ρη,ij), and assuming that
(34)
we have Tr(N1Rη2)=O(1). It is now easily seen that Conditions (32) and (33) are also satisfied under this set up. Denoting the correlation matrix of ut by R=(ρij), we have
(35)
where γ˜i=γi/σii1/2=γi/(γiγi+ση,ii)1/2. Since |ρij|s=1k|γ˜is||γ˜js|+|ρη,ij|, then (note that ση,iiσii=γiγi+ση,ii)
Since supi,s|γ˜is|supi,s|γis|, and supsj=1N|γ˜js|supsj=1N|γjs|=O(Nδγ), and by assumption Rη<K, Condition (32) is met if δρδγ. Also, (noting that supi,s|γ˜is|1)
Hence,
and under Conditions (9) and (34), N1Tr(R2) is bounded in N if 0δγ1/2. 
Remark 1:

Our assumption of approximate sparsity allows for a sufficiently high degree of cross error correlation, which is important for the analysis of financial data, where it is not guaranteed that inclusion of observed factors in the return regressions will totally eliminate weak error correlations due to spatial and/or within sector error correlations. It is important that both factor and spatial type error correlations, representing strong and weak forms of interdependencies are taken into account when testing for alpha. By allowing the error term to include weak factors, one only needs to focus on identification of strong and semi-strong factors to be included inft. On this see also Bailey, Kapetanios, and Pesaran (2021).

3.3 Non-Gaussianity

For the discussion of the effects of non-Gaussianity on the Jα test below, it is convenient to introduce the following scaled error:
(36)
so that for each i, ξit has zero mean and unit variance. In the case where the errors are non-Gaussian the skewness and excess kurtosis of uit are given by γ1,i=E(ξit3) and γ2,i=E(ξit4)3, respectively, and could differ across i. Note that under non-Gaussian errors, ti is no longer Student t-distributed and E(ti2) and V(ti2) need not be the same across i, due to the heterogeneity of γ1,i and γ2,i over i. Using a slightly extended version of the Laplace approximation of moments of the ratio of quadratic forms by Lieberman (1994), we are able to derive the following approximations of E(ti2) and Var(ti2)10:
(37)
and
(38)
Substituting Equations (37) and (38) into Equation (26), we have the following non-Gaussian version of Jα(θN2), defined by Equation (30):
where θN2 is defined by Equation (31). When the numerator of the Jα statistic is replaced by N1/2i=1N(ti21), which is the typical mean adjustment employed by Fan, Liao, and Yao (2015) and GOS, then the order of the asymptotic error of the numerator of such test statistics becomes N/T2. This is one of the reasons why our proposed test performs better than the ones proposed in the literature, especially in cases where NT, and there are significant departures from Gaussianity. The asymptotic error of using (vv2)22(v1)(v4) for Var(ti2) under non-Gaussianity in the Jα test is O(T1), which is small for sufficiently large T.11

3.4 Allowing for Error Cross-Sectional Dependence

A second important difference between the Jα test and the other tests proposed in the literature is the inclusion of θN2 in the denominator of the test statistic to take account of error correlations. Using Equation (31), we first note that as N and T12
(39)
so long as N/T20, and 0δγ<1/2, where
(40)
Here, ρN2 is known as the average pair-wise squared correlation coefficient and plays a key role in tests of error cross-sectional correlations in panel regressions (see, e.g., Breusch and Pagan, 1980; Pesaran, Ullah, and Yamagata, 2008). To see the relationship between θN2 and the sparsity of V, we note that
which in view of Equation (39) justifies replacing 1+θN2 by N1Tr(R2) for N and T sufficiently large so long as N/T20, and 0δγ<1/2. Therefore, ignoring θN2 can lead to serious size-distortions even for large N and T panels when the errors are cross-correlated andN1Tr(R2)does not tend to zero, since the denominator of Jα will be under-estimated. The size distortion will be present even if we impose stronger sparsity conditions on V, for example, by requiring mN, defined by Equation (32), to be bounded in N. It is, therefore, important that θN2 (or ρN2) is replaced by a suitable estimator.
One possible way of estimating ρN2 would be to use sample correlation coefficients, ρ^ij, defined as
(41)
where σ^ij=T1t=1Tu^itu^jt and u^it is the residuals from the OLS regression of yi on G=(τT,F). However, such an estimator is likely to perform poorly in cases where N is large relative to T, and some form of thresholding is required, as discussed in the literature on estimation of large covariance matrices.13 Here, we consider the application of the MT approach to regularization of large covariance matrices proposed by BPS. However, BPS establish their results for yity¯i, while we need to apply the thresholding approach to u^it. Second, BPS consider exact sparsity conditions on the error covariance matrix, while we allow for much more general sparsity conditions. We extend BPS’s analysis to address both of these issues.14
The MT estimator of ρij, denoted by ρ˜ij, is given by
(42)
where v=Tm1,
(43)
p is the nominal significance level for testing ρij=0 (0<p<1), T=cdNd, where cd, δ, and d are finite positive constants. Using Equation (42), the MT estimator of ρN2 is given by
(44)
Under the sparsity conditions (32) and (33), it can be shown that (N1)(ρ˜N,T2ρN2)p0 as well as in l1-norm, so long as N/T20 (or equivalently if d >1∕2) as N and T, jointly, and if
(45)
for some small ϵ>0, where φmax1+|γ2,εη|, where γ2,εη=E(εη,it4)3,εη,it is the ith element of the N×1 error vector εη,t=Qη1ηt, with ηt=(η1t,η2t,….,ηNt).15 The critical value function, cp(N), depends on the nominal level of significance, p, and the choice of δ, subject to Condition (45). The test results are unlikely to be sensitive to the choice of p, over the conventional values in the range of 1–10%. d determines the relative expansion rate of N and T. The value of φ depends on the degree of dependence of the errors even if they are uncorrelated. In the case where the errors, εη,it, are Gaussian γ2,εη=0 and φ1, and it is sufficient to set δ=2d. However, in the non-Gaussian case, and given the evidence provided by Longin and Solnik (2001) and Ang, Chen, and Xing (2006) on the degree of nonlinear dependence of asset returns, higher values of δ might be required. In simulations and empirical exercises to be reported below, we set f(N)=N, which is equivalent to setting δ = 1.16
Accordingly, we propose the following feasible version of the Jα statistic
(46)
where ti is the t-ratio for testing αi=0, defined by Equation (25), v=Tm1, and ρ˜N,T2 is given by Equation (44). The J^α test is robust to non-Gaussian errors and allows for a relatively high degree of error cross-sectional dependence. In the next section, we provide a formal statement of the conditions under which J^α tends to a normal distribution.

3.5 Survivorship Bias

When applying the J^α test, it is important to minimize the effect of survivorship bias. To this end, the GRS type tests of alpha consider a relatively small number of portfolios over a relatively large time period to achieve sufficient power. By making use of portfolios rather than individual securities, the GRS test is less likely to suffer from survivorship bias. By comparison, tests such as the J^α test can suffer from the survivorship bias due to the fact that they are applied to individual securities directly and obtain power from increases in N as well as from T. To deal with the survivorship bias, we propose that the J^α test is applied recursively to securities that have been trading for at least T time periods (days or months) at any given time t. The set of securities included in the J^α test varies over time and dynamically takes account of exit and entry of securities in the market. The number of securities, Nτ, used in the test at any point of time, τ, depends on the choice of T, and declines as T is increased. It is clearly important that a balance is struck between T and Nτ. Since the J^α test is applicable even if N is much larger than T, and given that the power of the J^α test rises both in N and T, then it is advisable to set T such that minτ(Nτ)/T2 is sufficiently small. This procedure is followed in the empirical application discussed in Section 6, where we set T =60 and end up with Nτ in the range [464,487], giving minτ(Nτ)/T2=0.12.

3.6 Other Existing Tests

3.6.1 The GOS test

It might be helpful to compare our proposed test statistic J^α, given by Equation (46), with the one proposed by Gagliardini et al. (2016, pp. 1008–9):
(47)
where ρ^BL2 is an estimator of ρN2 based on Bickel and Levina (2008, BL) threshold estimator of ρij.17 As noted in the introduction, the GOS statistic is closely related to the J^α test statistic, and also differ from it in a number of important respects. First, GOS do not employ the degrees of freedom adjustment for the standardization of ti2, which we have shown will provide more accurate normal approximation especially when N is much larger than T. Despite the simplicity of the corrections, as can be seen from the Appendix and the Supplementary Material, the derivations and the proofs are not straightforward. Second, for the estimation of large variance–covariance matrix, the evidence in BPS shows that the MT estimator outperforms the BL estimator almost uniformly in their experiments, and our use of MT estimator of ρN2 turns out to yield much better results. Third, the BL estimation requires cross-validation, which can be computationally far more costly than the MT estimation. Finally, we derive limiting distribution of the J^α test statistic under primitive assumptions with fairly general error covariance structure, while GOS place the high level assumption of asymptotic normality of the test statistic (see their Assumption A.5) or only consider a restrictive error covariance structure (see their Appendix F).18 We believe that our error specification is valid more generally in empirical asset pricing literature where not all factors can be identified and estimated, and in consequence one needs to allow for a much wider degree of error cross-correlations to take account of weak unobserved effects.

3.6.2 The GL Fmax

GL propose a resampling test based on Fi=ti2 test statistic for αi=0, defined as
(48)

They consider various versions of the test, and recommend the use of the maximum test which we will consider in our Monte Carlo exercise. The authors claim that their resampling test procedure is robust against non-normality and cross-sectional error dependence.19 Their test effectively makes use of wild bootstrap resampling aimed at preserving the sample residual cross-sectional correlations, and deals with nuisance parameters by the introduction of a bounds testing procedure.

3.6.3 The BS and SD tests in He et al. (2021)

He et al. (2021) consider the following two test statistics. Based on BS, He et al. (2021) propose a SW type test which requires N and T to be of the same order of magnitude:
(49)
where c1=2(T1)(T2)(T1) and c2=1T1. Based on Srivastava and Du (2008), He et al. (2021) also propose a test statistic which is a standardized squared t-ratio, using different standardization from ours:
(50)
where c3=N(T1)T(T3),c4=2T2, c5=N2T1.

4 Summary of the Main Theoretical Results

In this section, we provide the list of assumptions and a formal statement of the theorems for the size and power of the proposed J^α. First, we state the assumptions required for establishing the results. 

Assumption 1:
The m×1 vector of common observed factors, ft, in the return regressions, Equation (6), are distributed independently of the errors, uit for all i, t, and t. The number of factors, m, is fixed, and at least one of the factors is strong, in the sense that
(51)
and the factors satisfy ftftK<, for all t. The (m+1)×(m+1) matrix T1GG, with G=(τT,F), is a positive definite matrix for all T, and as T, and τTMFτT>0, where MF=ITF(FF)1F.
 
Assumption 2:
The errors, uit, in Equation (6), have the following mixed weak-factor spatial representation
(52)
where γi=(γi1,γi2,,γik) is a k×1 vector of factor loadings, vt=(v1t,v2t,,vkt) is a k×1 vector of unobserved common factors, and ηit are the idiosyncratic components.

  • The unobserved factors vt are serially independent and the k elements are independent of each other, such that vtIID(0,Ik),γ2,v=E(vst4)3, and sups,tE(vst8+c)<K, for some c > 0. The factor loadings, γis for s=1,2,,k, are bounded, supi,s|γis|<K, and the factors, vt, are weak in the sense that
    (53)
  • For any i and j, the T pairs of realizations, {(ηi1,ηj1),(ηi2,ηj2),,(ηiT,ηjT)}, are independent draws from a common bivariate distribution with mean E(ηit)=0, Var(ηit)=ση,ii,0<c<ση,iiK, and the covariance E(ηitηjt)=ση,ij.

Writing the error factor specification, Equation (52), in matrix notation we have
(54)
where ut=(u1t,u2t,,uNt),Γ=(γ1,γ2,,γN), and ηt=(η1t,η2t,,ηNt). Under Assumption 2, and denoting E(ηtηt)=Vη=(ση,ij), we have
(55)
 
Assumption 3:
The covariance matrices V and Vη defined by Equation (55) are N × N symmetric, positive definite matrices with λmin(V)λmin(Vη)c,
(56)
where Q and Qη are the Cholesky factors of V and Vη, respectively. Matrix Qη is row and column bounded in the sense that
(57)

Here, {εit} and {εη,it} are IID processes over i and t, with zero means, unit variances, γ2,εη=E(εη,it4)3, supi,tE(|εit|8+c)K<, and supi,tE(|εη,it|8+c)K<, for some c >0.

 
Remark 2:

The above assumptions allow the returns on individual securities to be strongly cross-sectionally correlated through the observed factors, ft, and allow for weak error cross-correlations once the effects of strong factors are removed.

 
Remark 3:
Under Condition (57)
(58)
nevertheless due to the weak factors we have
and allow the overall error variance matrix, V, to be approximately sparse, in contrast to the literature that requiresV<K. The relaxation of the sparsity condition onVis particularly important in finance where security returns could be affected by weak unobserved factors.
 
Remark 4:

The high-order moment conditions in Assumption 3 allow us to relax the Gaussianity assumption while at the same time ensuring that our test is applicable even if N is much larger than T.

 
Remark 5:

Assumptions 2(ii) and 3 ensure that the sample cross-correlation coefficients of the residuals, ρ^ij, have an Edgeworth expansion which is needed for consistent estimation ofρN2, defined by Equation (40). For further details, see BPS.

Our main theoretical results are set out in the following theorems. The proofs of these theorems are provided in the Appendix, and necessary lemmas for the proofs are given in the Supplementary Material. 

Theorem 1:
Consider the return regression (6), and the statisticqNT=N1/2i=1N(zi21), wherezi2is defined by Equation (23). Suppose that Assumptions 1–3 hold, andN1Tr(R2)is bounded in N, whereR=(ρij),ρij=E(ξitξjt), andξit=uit/σii1/2is the standardized error of the return regression Equation (6). Then, underH0:αi=0,for all i,
(59)
asN andT,jointly, where
and
(60)
 
Theorem 2:
Consider the regression model (6), and the statistic SNT, given below, wherezi2andti2are defined by Equations (23) and (25), respectively. Suppose that Assumptions 1–3 hold. Then, under the null hypothesis, H0: αi=0for all i,
asNandTjointly, so long asN/T20,0δγ<1/2, whereδγis defined by Equation (53).
 
Theorem 3:
Consider the regression model (6), and suppose thatAssumptions 1–3 hold. Then, underH0:αi=0,for all i,
(61)
so long asN/T20,and0δγ<1/2,asNandT,jointly, where ti, ρN2, andδγare defined by Equations (25), (60), and (53), respectively, withv=Tm1.
 
Theorem 4:
Let
(62)
where
(63)
ρij=E(ξitξjt),ξit=uit/σii1/2, v=Tm1,ρ^ijis defined by Equation (41)
(64)
p is the nominal p-value (0<p<1), f(N)=Nδ, andT=cdNd, where cd, δ, and d are finite positive constants. Suppose that Assumptions 1–3 hold and
(65)

then, (N1)E|ρ˜N,T2ρN2|0, as N andT, which implies(N1)(ρ˜N,T2ρN2)p0, ifN/T2=(N12d)0, (or ifd>1/2), and ifδ>(2d)(1ϵ)φmax, for some smallϵ>0, whereφmax1+|γ2,εη|andγ2,εη=E(εη,it4)3(Assumption 3).

 
Theorem 5:
Consider the panel regression model (6) in asset returns and suppose that Assumptions 1–3 hold. Consider the test statistic
(66)
where ti is given by Equation (25), v=Tm1,ρ˜N,T2is defined by Equation (62), using the thresholdcp(N)given by Equation (64), with p (0<p<1), T=cdNd, where cd, δ, and d are finite positive constants, δ>(2d)(1ϵ)φmax, for some smallϵ>0, whereφmax1+|γ2,εη|andγ2,εη=E(εη,it4)3. Then, underH0:αi=0for all i,
(67)
ifN/T20, as N andT, jointly.
To investigate the power properties of the J^α test, we consider the local alternatives
(68)
and establish the following theorem.
 
Theorem 6:
Consider the panel regression model (6) in asset returns, and suppose that conditions ofTheorem 5 apply, andinf(σii)>c>0. Then, under the local alternatives, H0α, defined by Equation (68),
(69)
whereϕ2=limNϕN2and
(70)
 
Remark 7:

This theorem establishes that theJ^αtest is consistent (in the sense that its power tends to unity), ifϕ2>0. It is also of interest that the power of theJ^αtest increases uniformly with N and T, in contrast to the power of the GRS test that rises withT, only.

 
Remark 8:
The above theorem also sheds light on the effects of allowing for pricing errors on the size and power of theJ^αtest. It is clear that adding pricing errorsϖito αi in Equation (68) will increaseϕN2and hence the power of the test. But this will be at the expense of size distortions since the null of the test isH0:αi=0while if we allow for the pricing errors the null will beH0:αi=ϖi, subject to the APT conditioni=1Nϖi2=O(Nδϖ), withδϖ=0. (see Equation (4)). UnderH0and the alternatives, H0ain Equation (68) we have
and
These two conditions hold simultaneously ifN1/2T1i=1Nςi2=O(Nδϖ), which in turn implies that

SettingT=(Nd), we now haveϕN2=O(Nδϖ+d1/2)and theJ^αtest will have the correct size underH0ifd<1/2δϖ. Under Ross’s APT condition whereδϖ=0, it is required that d < 1∕2. But to allow for non-Gaussian errors and weak error cross-sectional dependence we require d > 1∕2 so thatN/T20,which is one of the conditions ofTheorem 5. Hence, we would expect some size distortions if we allow pricing errors that satisfy the APT condition ofRoss (1976). To avoid size distortions in the presence of pricing errors, we need to consider stronger restrictions on pricing errors so that they decline with N, for example, ϖi=O(Nϵ). Under this specification, sincei=1Nϖi2=O(N12ϵ), thenδϖ=12ϵ, and pricing errors can be accommodated in our analysis ifϵ>d/2+1/4. SinceTheorem 5 requiresd>1/2,then we must haveϵ>1/2.

 
Remark 9:

Pricing errors cannot be allowed for in the case of the GRS test since it requires N < T, and with N fixed it is not possible to distinguish αi fromϖiin the LFPM given by Equation (6).

5 Small Sample Evidence Based on Monte Carlo Experiments

We examine the finite sample properties of the J^α test by Monte Carlo experiments, and compare its performance to the existing tests, which are discussed in Section 3.6. Specifically, we consider the GRS test, the GOS test, and a feasible version of the SW test, as well as the distribution-free Fmax test and the BS and SD tests, which are defined by Equations (3), (47), (19), (48), (49), and (50), respectively. Computational details of these tests are given in Section M1.1 of the Supplementary Material.

5.1 Monte Carlo Designs and Experiments

We consider the following data generating process (DGP):
(71)
for i=1,2,..,N; t=1,2,,T, where ft for l =1, 2, 3 are the observed factors, and
(72)
in which vt is the missing factor and ηit is the idiosyncratic component of the return process defined below. The scalar coefficient κ is introduced so that the overall fit of the panel can be controlled to match the average fit of the return regressions defined by RNT2=N1i=1NRiT2, where RiT2 is the R-squared of the regression for rit, computed for a given sample. We calibrate κ=6.5 for N =500 and T =120 to match RNT2=0.30 for the model without omitted common component and spatial errors. The value of κ is fixed throughout the experiments.
The observed factors are calibrated to closely match the three Fama and French (1993, FF3) factors (market factor, HML, and SMB) and are generated as20
where (ρf,1,ρf,2,ρf,3)=(0.1,0.2,0.2),et=htξt with ξtIIDN(0,1) follow GARCH(1,1) models:
where (ω1,ω2,ω3)=(20.25,6.33,5.98),(ϱ1,ϱ2,ϱ3)=(0.61,0.70,0.31), and (φ1,φ2,φ3)=(0.31,0.21,0.10).21

To calibrate the empirical FF3 model, we estimated it using S&P500 security level monthly excess return for 120 months ending on April 2018. We chose the series with the full sample period, which left 457 securities. The results are summarized in Table 1.

Table 1.

Descriptive statistics of Fama–French three factor regression results

Average β estimates for FF3 factors
Average skewness and excess kurtosis of the residuals
β^MKTβ^HMLβ^SMBSkewnessExcess kurtosis
Mean1.050.070.180.322.76
SD0.430.570.450.875.61
Median1.020.000.170.141.19
Min0.19−1.46−1.95−1.53−0.53
Max2.922.911.996.3457.57
Average β estimates for FF3 factors
Average skewness and excess kurtosis of the residuals
β^MKTβ^HMLβ^SMBSkewnessExcess kurtosis
Mean1.050.070.180.322.76
SD0.430.570.450.875.61
Median1.020.000.170.141.19
Min0.19−1.46−1.95−1.53−0.53
Max2.922.911.996.3457.57
Table 1.

Descriptive statistics of Fama–French three factor regression results

Average β estimates for FF3 factors
Average skewness and excess kurtosis of the residuals
β^MKTβ^HMLβ^SMBSkewnessExcess kurtosis
Mean1.050.070.180.322.76
SD0.430.570.450.875.61
Median1.020.000.170.141.19
Min0.19−1.46−1.95−1.53−0.53
Max2.922.911.996.3457.57
Average β estimates for FF3 factors
Average skewness and excess kurtosis of the residuals
β^MKTβ^HMLβ^SMBSkewnessExcess kurtosis
Mean1.050.070.180.322.76
SD0.430.570.450.875.61
Median1.020.000.170.141.19
Min0.19−1.46−1.95−1.53−0.53
Max2.922.911.996.3457.57

We generate the factor loadings as IIDU(0.3,1.8) for the market factor, IIDU(1.0,1.0) for the HML factor, and IIDU(0.6,0.9) for the SMB factor. In this way, we ensure that the means and standard deviations of the betas match their empirical counterparts and sufficient ranges of the estimates of βs reported in Table 1 for the FF3 model are covered in the experiments.

The latent factor vt is generated as IID(0, 1) and its loadings γi are generated to ensure a given factor strength denoted by the exponent δγ. We generate γi as
and to avoid systematic errors we then randomly reshuffle γi over i before assigning them to the individual returns, rit. Our theoretical derivations suggest that the size of our proposed J^α test should be under control so long as δγ<1/2. Accordingly, we consider the values of δγ=0, 1/4, and 1/2. Allowing for latent factors is important since in practice researchers cannot be sure that they have included all relevant risk factors in their models. The problem of missing (or latent) factors continues to apply even if we extend the list of observed factors as it is done in the recent literature. See, for example, Giglio and Xiu (2021) and the recent paper by Bailey, Kapetanios, and Pesaran (2021) who consider the estimation of factor strength.
In addition to allowing for latent factors, we also consider network (or spatial) type cross-sectional error dependence by generating the idiosyncratic errors εη,it as
(73)
which can be solved for ηt=(η1t,η2t,,ηNt) as
where εη,t=(εη,1t,εη,2t,,εη,Nt), ψ={0.0,0.25},Dη=diag(ση1,ση2,,σηN). We adopt a rook form of W=(wij), where all elements in W are zero except wi+1,i=wj1,j=0.5 for i=1,2,,n2 and j=3,4,,n, with w1,2=wn,n1=1, and standardized such that wii = 0 and j=1Nwij=1. Case of error cross-sectional independence arises for the parameter values ψ=0 and δγ=0. We allow for error cross-sectional heteroskedasticity by generating σηi2 as IID (1+χ2,i2)/3, and consider Gaussian (1)εη,itIIDN(0,1), as well as non-Gaussian errors, (2)εη,itIIDtν,it[ν/(ν2)]1/2, where tν,it are independent draws from a t-distribution with ν degrees of freedom. In light of the properties of the empirical distribution of the FF3 regression residuals, for t distribution error, we choose ν = 8, so that the value of excess kurtosis, 1.5, falls between the sample mean and sample median shown in Table 1.

All the N return series are generated from t=49,48,….0,1,2,….,T, with f,50=0 and h,50=1 for =1,2,3. The first 50 observations are dropped to minimize the effects of the initial values and observations rit, ft=(f1t,f2t,f3t), for t=1,2,,T are used in the MC experiments. Further details are provided in the Supplementary Material.

To estimate size of the tests, we set αi=0 for all i. To investigate power, we consider alternatives based on Equation (5), setting λ0=0, namely

For the scenario called “Power 1,” we set λ=μ, and generated αi as αi=ϖiIIDN(0,1) for i=1,2,,Nα with Nα=Nδα; αi=0 for i=Nα+1,Nα+2,,N. We considered the values δα=0.7. In another scenario called “Power 2,” we assume there are no pricing errors and set ϖi=0 for all i, but consider the case where λμ=c(2.92,0.63,9.96), that match the estimates reported in Table 1 of GOS (p. 1011) for c =1. To make the power of the tests for “Power 2” comparable for “Power 1,” we set c =0.1. We do not consider the case both λμ and ϖi0, as it is clear that in this case higher power will be achieved.

All combinations of T =60, 120, 240 and N =50, 100, 200 (and 500, 1000, 2000, 5000 for the J^α test) are considered. All tests are conducted at the 5% significance level and all experiments are based on R =2000 replications. To compute ρ˜N,T2 which enters the denominator of the J^α statistic, given by Equation (46), we consider p={0.05,0.1} and δ={1,2}. The results are very insensitive to the choice of the values of (p,δ) and the case for (p,δ)=(0.05,1) is reported. It is worth noting that that the choice of p when computing ρ˜N,T2 is not governed or affected by the choice of the nominal size of the J^α test.

5.2 Size and Power

Table 2 reports the size and power of the J^α, GRS, GOS, SW, Fmax, BS, and SD tests in the case of normal errors, under various degrees of cross-sectional error correlations, as measured by the exponent, δγ.

Table 2.

Size and power of the J^α and other tests with normal errors

Panel A: Size (αi=0 for all i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
GRS
605.04.15.3
1205.84.34.94.34.93.7
2404.34.94.54.85.44.95.94.65.1
GOS
6017.423.530.317.322.531.516.923.829.9
12011.312.313.99.812.214.49.611.714.7
2407.28.99.37.48.48.67.78.49.6
SW
6017.423.530.317.422.631.517.824.330.2
12011.312.313.910.012.214.422.919.616.0
2407.28.99.37.48.78.610.814.320.9
Fmax
600.40.20.10.10.00.20.40.20.0
1200.20.10.10.10.20.00.10.10.0
2400.10.20.20.10.10.20.10.10.1
BS
604.24.04.63.44.43.93.94.44.3
1203.42.92.72.72.92.42.93.53.5
2402.02.42.02.62.52.03.22.93.0
SD
6010.912.013.210.212.113.59.311.211.9
1207.97.78.37.17.98.56.48.18.6
2405.06.76.75.76.35.85.96.77.3
Panel A: Size (αi=0 for all i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
GRS
605.04.15.3
1205.84.34.94.34.93.7
2404.34.94.54.85.44.95.94.65.1
GOS
6017.423.530.317.322.531.516.923.829.9
12011.312.313.99.812.214.49.611.714.7
2407.28.99.37.48.48.67.78.49.6
SW
6017.423.530.317.422.631.517.824.330.2
12011.312.313.910.012.214.422.919.616.0
2407.28.99.37.48.78.610.814.320.9
Fmax
600.40.20.10.10.00.20.40.20.0
1200.20.10.10.10.20.00.10.10.0
2400.10.20.20.10.10.20.10.10.1
BS
604.24.04.63.44.43.93.94.44.3
1203.42.92.72.72.92.42.93.53.5
2402.02.42.02.62.52.03.22.93.0
SD
6010.912.013.210.212.113.59.311.211.9
1207.97.78.37.17.98.56.48.18.6
2405.06.76.75.76.35.85.96.77.3
Panel B: Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
GRS
6014.713.414.5
12082.848.980.149.379.648.5
24099.099.895.599.099.895.699.099.795.4
GOS
6083.193.098.680.391.797.971.786.096.0
12095.199.299.994.599.1100.089.297.699.5
24099.6100.0100.099.4100.0100.099.199.9100.0
SW
6083.193.098.680.491.797.972.786.596.1
12095.199.299.994.599.1100.094.698.699.7
24099.6100.0100.099.4100.0100.099.6100.0100.0
Fmax
6017.620.325.316.018.820.511.216.116.5
12053.265.876.050.063.672.738.250.365.0
24087.995.799.287.094.898.877.890.496.6
BS
6039.849.463.138.049.458.828.939.748.9
12073.286.295.071.085.794.163.279.790.1
24096.399.4100.095.599.6100.092.898.699.9
SD
6076.787.995.672.785.593.560.975.487.5
12094.498.899.893.098.799.986.396.499.1
24099.599.9100.099.4100.0100.098.799.9100.0
Panel B: Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
GRS
6014.713.414.5
12082.848.980.149.379.648.5
24099.099.895.599.099.895.699.099.795.4
GOS
6083.193.098.680.391.797.971.786.096.0
12095.199.299.994.599.1100.089.297.699.5
24099.6100.0100.099.4100.0100.099.199.9100.0
SW
6083.193.098.680.491.797.972.786.596.1
12095.199.299.994.599.1100.094.698.699.7
24099.6100.0100.099.4100.0100.099.6100.0100.0
Fmax
6017.620.325.316.018.820.511.216.116.5
12053.265.876.050.063.672.738.250.365.0
24087.995.799.287.094.898.877.890.496.6
BS
6039.849.463.138.049.458.828.939.748.9
12073.286.295.071.085.794.163.279.790.1
24096.399.4100.095.599.6100.092.898.699.9
SD
6076.787.995.672.785.593.560.975.487.5
12094.498.899.893.098.799.986.396.499.1
24099.599.9100.099.4100.0100.098.799.9100.0
Panel C: Power 2 (αi=βi(λ-μ) with (λ-μ)=0.1(2.92,0.63,9.96))
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6058.481.596.356.279.496.549.075.694.9
12094.499.7100.093.099.6100.090.099.4100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0
GRS
6011.812.312.4
12078.147.775.546.576.945.1
24099.9100.099.399.8100.099.099.8100.099.1
GOS
6076.294.899.775.094.099.972.093.299.7
12096.5100.0100.096.199.7100.094.099.9100.0
240100.0100.0100.0100.0100.0100.0100.0100.0100.0
SW
6077.493.899.978.193.799.875.692.599.8
12097.199.8100.095.7100.0100.095.799.7100.0
240100.0100.0100.099.9100.0100.099.9100.0100.0
Fmax
601.61.91.71.51.51.51.31.41.5
1207.69.010.36.67.69.17.57.78.9
24035.243.755.431.444.556.729.342.354.9
BS
6025.844.670.923.442.169.418.733.857.7
12060.688.099.057.885.499.447.577.797.6
24096.3100.0100.095.2100.0100.091.999.6100.0
SD
6067.689.099.065.987.498.959.483.897.9
12095.199.8100.094.599.7100.090.899.8100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0
Panel C: Power 2 (αi=βi(λ-μ) with (λ-μ)=0.1(2.92,0.63,9.96))
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6058.481.596.356.279.496.549.075.694.9
12094.499.7100.093.099.6100.090.099.4100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0
GRS
6011.812.312.4
12078.147.775.546.576.945.1
24099.9100.099.399.8100.099.099.8100.099.1
GOS
6076.294.899.775.094.099.972.093.299.7
12096.5100.0100.096.199.7100.094.099.9100.0
240100.0100.0100.0100.0100.0100.0100.0100.0100.0
SW
6077.493.899.978.193.799.875.692.599.8
12097.199.8100.095.7100.0100.095.799.7100.0
240100.0100.0100.099.9100.0100.099.9100.0100.0
Fmax
601.61.91.71.51.51.51.31.41.5
1207.69.010.36.67.69.17.57.78.9
24035.243.755.431.444.556.729.342.354.9
BS
6025.844.670.923.442.169.418.733.857.7
12060.688.099.057.885.499.447.577.797.6
24096.3100.0100.095.2100.0100.091.999.6100.0
SD
6067.689.099.065.987.498.959.483.897.9
12095.199.8100.094.599.7100.090.899.8100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0

Notes: This table summarizes the size and power of J^α, GRS, GOS, SW, Fmax, BS, and SD tests of αi=0 for i=1,2,,N, in the case of three-factor models. The observations are generated as yit=αi+=13βift+uit, i=1,2,..,N;t=1,2,,T,ft=μf+ρff,t1+et, where et=htξt, ht=μh+ρ1hh,t1+ρ2he,t12,ξtIIDN(0,1),t=49,,T with f,50=0 and h,50=0,=1,2,3. The idiosyncratic errors are generated as uit=γivt+σηiεη,it, where εη,itIIDN(0,1),vtIIDN(0,1) and σηi2IID(1+χ2,i2)/3. The first Nδγ(<N) γi are generated as Uniform(0.7,0.9), and the remaining elements are set to 0. We consider the values δγ=0,1/4, and 1/2. J^α is the proposed test; GRS is the F-test due to Gibbons et al. (1989) which is distributed as FN,TNm, which is applicable when T>N+4. “–” signifies that the GRS statistic cannot be computed. GOS is the test proposed by Gagliardini et al. (2016) defined in Equation (47); SW is the test based on the POET estimator of Fan et al. (2013). Fmax is proposed by GL, BS and SD are tests of He et al. (2021), which are defined in the Supplementary Material. Values of J^α, GOS, SW, BS, and SD are compared with a positive one-sided critical value of the standard normal distribution. All tests are conducted at the 5% significance level. Experiments are based on 2000 replications.

Table 2.

Size and power of the J^α and other tests with normal errors

Panel A: Size (αi=0 for all i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
GRS
605.04.15.3
1205.84.34.94.34.93.7
2404.34.94.54.85.44.95.94.65.1
GOS
6017.423.530.317.322.531.516.923.829.9
12011.312.313.99.812.214.49.611.714.7
2407.28.99.37.48.48.67.78.49.6
SW
6017.423.530.317.422.631.517.824.330.2
12011.312.313.910.012.214.422.919.616.0
2407.28.99.37.48.78.610.814.320.9
Fmax
600.40.20.10.10.00.20.40.20.0
1200.20.10.10.10.20.00.10.10.0
2400.10.20.20.10.10.20.10.10.1
BS
604.24.04.63.44.43.93.94.44.3
1203.42.92.72.72.92.42.93.53.5
2402.02.42.02.62.52.03.22.93.0
SD
6010.912.013.210.212.113.59.311.211.9
1207.97.78.37.17.98.56.48.18.6
2405.06.76.75.76.35.85.96.77.3
Panel A: Size (αi=0 for all i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
GRS
605.04.15.3
1205.84.34.94.34.93.7
2404.34.94.54.85.44.95.94.65.1
GOS
6017.423.530.317.322.531.516.923.829.9
12011.312.313.99.812.214.49.611.714.7
2407.28.99.37.48.48.67.78.49.6
SW
6017.423.530.317.422.631.517.824.330.2
12011.312.313.910.012.214.422.919.616.0
2407.28.99.37.48.78.610.814.320.9
Fmax
600.40.20.10.10.00.20.40.20.0
1200.20.10.10.10.20.00.10.10.0
2400.10.20.20.10.10.20.10.10.1
BS
604.24.04.63.44.43.93.94.44.3
1203.42.92.72.72.92.42.93.53.5
2402.02.42.02.62.52.03.22.93.0
SD
6010.912.013.210.212.113.59.311.211.9
1207.97.78.37.17.98.56.48.18.6
2405.06.76.75.76.35.85.96.77.3
Panel B: Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
GRS
6014.713.414.5
12082.848.980.149.379.648.5
24099.099.895.599.099.895.699.099.795.4
GOS
6083.193.098.680.391.797.971.786.096.0
12095.199.299.994.599.1100.089.297.699.5
24099.6100.0100.099.4100.0100.099.199.9100.0
SW
6083.193.098.680.491.797.972.786.596.1
12095.199.299.994.599.1100.094.698.699.7
24099.6100.0100.099.4100.0100.099.6100.0100.0
Fmax
6017.620.325.316.018.820.511.216.116.5
12053.265.876.050.063.672.738.250.365.0
24087.995.799.287.094.898.877.890.496.6
BS
6039.849.463.138.049.458.828.939.748.9
12073.286.295.071.085.794.163.279.790.1
24096.399.4100.095.599.6100.092.898.699.9
SD
6076.787.995.672.785.593.560.975.487.5
12094.498.899.893.098.799.986.396.499.1
24099.599.9100.099.4100.0100.098.799.9100.0
Panel B: Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
GRS
6014.713.414.5
12082.848.980.149.379.648.5
24099.099.895.599.099.895.699.099.795.4
GOS
6083.193.098.680.391.797.971.786.096.0
12095.199.299.994.599.1100.089.297.699.5
24099.6100.0100.099.4100.0100.099.199.9100.0
SW
6083.193.098.680.491.797.972.786.596.1
12095.199.299.994.599.1100.094.698.699.7
24099.6100.0100.099.4100.0100.099.6100.0100.0
Fmax
6017.620.325.316.018.820.511.216.116.5
12053.265.876.050.063.672.738.250.365.0
24087.995.799.287.094.898.877.890.496.6
BS
6039.849.463.138.049.458.828.939.748.9
12073.286.295.071.085.794.163.279.790.1
24096.399.4100.095.599.6100.092.898.699.9
SD
6076.787.995.672.785.593.560.975.487.5
12094.498.899.893.098.799.986.396.499.1
24099.599.9100.099.4100.0100.098.799.9100.0
Panel C: Power 2 (αi=βi(λ-μ) with (λ-μ)=0.1(2.92,0.63,9.96))
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6058.481.596.356.279.496.549.075.694.9
12094.499.7100.093.099.6100.090.099.4100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0
GRS
6011.812.312.4
12078.147.775.546.576.945.1
24099.9100.099.399.8100.099.099.8100.099.1
GOS
6076.294.899.775.094.099.972.093.299.7
12096.5100.0100.096.199.7100.094.099.9100.0
240100.0100.0100.0100.0100.0100.0100.0100.0100.0
SW
6077.493.899.978.193.799.875.692.599.8
12097.199.8100.095.7100.0100.095.799.7100.0
240100.0100.0100.099.9100.0100.099.9100.0100.0
Fmax
601.61.91.71.51.51.51.31.41.5
1207.69.010.36.67.69.17.57.78.9
24035.243.755.431.444.556.729.342.354.9
BS
6025.844.670.923.442.169.418.733.857.7
12060.688.099.057.885.499.447.577.797.6
24096.3100.0100.095.2100.0100.091.999.6100.0
SD
6067.689.099.065.987.498.959.483.897.9
12095.199.8100.094.599.7100.090.899.8100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0
Panel C: Power 2 (αi=βi(λ-μ) with (λ-μ)=0.1(2.92,0.63,9.96))
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
6058.481.596.356.279.496.549.075.694.9
12094.499.7100.093.099.6100.090.099.4100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0
GRS
6011.812.312.4
12078.147.775.546.576.945.1
24099.9100.099.399.8100.099.099.8100.099.1
GOS
6076.294.899.775.094.099.972.093.299.7
12096.5100.0100.096.199.7100.094.099.9100.0
240100.0100.0100.0100.0100.0100.0100.0100.0100.0
SW
6077.493.899.978.193.799.875.692.599.8
12097.199.8100.095.7100.0100.095.799.7100.0
240100.0100.0100.099.9100.0100.099.9100.0100.0
Fmax
601.61.91.71.51.51.51.31.41.5
1207.69.010.36.67.69.17.57.78.9
24035.243.755.431.444.556.729.342.354.9
BS
6025.844.670.923.442.169.418.733.857.7
12060.688.099.057.885.499.447.577.797.6
24096.3100.0100.095.2100.0100.091.999.6100.0
SD
6067.689.099.065.987.498.959.483.897.9
12095.199.8100.094.599.7100.090.899.8100.0
240100.0100.0100.0100.0100.0100.099.9100.0100.0

Notes: This table summarizes the size and power of J^α, GRS, GOS, SW, Fmax, BS, and SD tests of αi=0 for i=1,2,,N, in the case of three-factor models. The observations are generated as yit=αi+=13βift+uit, i=1,2,..,N;t=1,2,,T,ft=μf+ρff,t1+et, where et=htξt, ht=μh+ρ1hh,t1+ρ2he,t12,ξtIIDN(0,1),t=49,,T with f,50=0 and h,50=0,=1,2,3. The idiosyncratic errors are generated as uit=γivt+σηiεη,it, where εη,itIIDN(0,1),vtIIDN(0,1) and σηi2IID(1+χ2,i2)/3. The first Nδγ(<N) γi are generated as Uniform(0.7,0.9), and the remaining elements are set to 0. We consider the values δγ=0,1/4, and 1/2. J^α is the proposed test; GRS is the F-test due to Gibbons et al. (1989) which is distributed as FN,TNm, which is applicable when T>N+4. “–” signifies that the GRS statistic cannot be computed. GOS is the test proposed by Gagliardini et al. (2016) defined in Equation (47); SW is the test based on the POET estimator of Fan et al. (2013). Fmax is proposed by GL, BS and SD are tests of He et al. (2021), which are defined in the Supplementary Material. Values of J^α, GOS, SW, BS, and SD are compared with a positive one-sided critical value of the standard normal distribution. All tests are conducted at the 5% significance level. Experiments are based on 2000 replications.

First, consider Panel A of Table 2 which reports the size of the tests. The GRS test when applicable (namely when T > N) is an exact test and has the correct size. The empirical size of the J^α test is also very close to the 5% nominal level for all combinations of N and T. Even when N =200 and δγ=0.5, the size of the J^α test lies in the range 5.9–6.4% for different values of T. In contrast, both GOS and SW tests grossly over-reject the null hypothesis, and the degree of the over-rejection becomes more serious as N increases for a given T. In line with the discussion in Section 3.4, the size distortion of these tests is mitigated when T increases. The Fmax test severely under-rejects the null hypothesis, with the size ranging between 0.0% and 0.4%. Although less pronounced than the Fmax test, the BS test is very conservative and the size steadily drops as T (and N) rises. Again, although less pronounced than the GOS and SW tests, the SD test tends to over-reject the null hypothesis and the degree of the over-rejection becomes more serious as N increases for a given T.

The power of the tests based on the “Power 1” design is reported in Panel B of Table 2. The power of J^α test is substantially higher than that of the GRS test. This is in line with our discussion at the end of Section 1, and reflects the fact that GRS assumes an arbitrary degree of cross-sectional error correlations and thus relies on a large time dimension to achieve a reasonably high power. In contrast, the power of the J^α test is driven largely by the cross-sectional dimension. The power comparison of the GOS, SW, and SD tests with the J^α test seems inappropriate, given their large size-distortions. Having said this, it is perhaps remarkable that the power of the J^α test is comparable to the unadjusted power of the GOS, SWPOET, and SWLW tests. The power of the Fmax and BS tests is uniformly lower than the power of the J^α test, likely due to the conservative nature of these tests. The power of the tests based on the “Power 2” design is reported in Panel C of Table 2. The properties of the tests with the “Power 2” design reported in Panel C of Table 2 are qualitatively very similar to those of the “Power 1” design. A detailed discussion of Table 2 is therefore omitted.

We now consider the case in which the errors are non-normal. The size results are summarized in Table 3. The results show that the size of the J^α test and the GRS test, as well as the Fmax, BS, and SD tests, is hardly affected by non-normality. The over-rejection of the GOS and SW tests tends to be somewhat magnified by non-normality.

Table 3.

Size of the J^α and other tests with non-normal errors

Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.94.65.65.06.25.05.56.67.0
1205.74.85.24.36.26.05.85.75.1
2405.85.75.44.75.65.46.56.85.8
GRS
605.04.55.4
1204.95.14.84.73.65.1
2405.54.74.23.75.04.75.45.65.0
GOS
6017.122.230.015.521.729.217.022.932.6
1209.510.814.09.511.914.38.912.414.4
2408.18.38.96.67.99.08.19.29.1
SW
6017.122.130.115.521.729.218.523.532.8
1209.510.814.09.511.814.419.719.915.5
2408.18.38.96.68.09.011.117.724.6
Fmax
600.00.20.10.20.10.10.10.20.2
1200.10.10.10.00.10.10.00.10.1
2400.20.10.20.20.20.10.10.30.1
BS
603.93.64.62.94.43.53.54.54.7
1203.22.03.32.53.22.12.92.53.4
2402.21.82.22.12.62.13.02.63.0
SD
6010.811.313.09.412.212.79.612.113.3
1206.76.38.55.78.38.76.67.47.8
2405.96.16.44.86.06.66.37.16.8
Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.94.65.65.06.25.05.56.67.0
1205.74.85.24.36.26.05.85.75.1
2405.85.75.44.75.65.46.56.85.8
GRS
605.04.55.4
1204.95.14.84.73.65.1
2405.54.74.23.75.04.75.45.65.0
GOS
6017.122.230.015.521.729.217.022.932.6
1209.510.814.09.511.914.38.912.414.4
2408.18.38.96.67.99.08.19.29.1
SW
6017.122.130.115.521.729.218.523.532.8
1209.510.814.09.511.814.419.719.915.5
2408.18.38.96.68.09.011.117.724.6
Fmax
600.00.20.10.20.10.10.10.20.2
1200.10.10.10.00.10.10.00.10.1
2400.20.10.20.20.20.10.10.30.1
BS
603.93.64.62.94.43.53.54.54.7
1203.22.03.32.53.22.12.92.53.4
2402.21.82.22.12.62.13.02.63.0
SD
6010.811.313.09.412.212.79.612.113.3
1206.76.38.55.78.38.76.67.47.8
2405.96.16.44.86.06.66.37.16.8

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that uit=γivt+σηiεη,it, where εη,itis independently drawn from standardized student t-distribution with eight degrees of freedom.

Table 3.

Size of the J^α and other tests with non-normal errors

Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.94.65.65.06.25.05.56.67.0
1205.74.85.24.36.26.05.85.75.1
2405.85.75.44.75.65.46.56.85.8
GRS
605.04.55.4
1204.95.14.84.73.65.1
2405.54.74.23.75.04.75.45.65.0
GOS
6017.122.230.015.521.729.217.022.932.6
1209.510.814.09.511.914.38.912.414.4
2408.18.38.96.67.99.08.19.29.1
SW
6017.122.130.115.521.729.218.523.532.8
1209.510.814.09.511.814.419.719.915.5
2408.18.38.96.68.09.011.117.724.6
Fmax
600.00.20.10.20.10.10.10.20.2
1200.10.10.10.00.10.10.00.10.1
2400.20.10.20.20.20.10.10.30.1
BS
603.93.64.62.94.43.53.54.54.7
1203.22.03.32.53.22.12.92.53.4
2402.21.82.22.12.62.13.02.63.0
SD
6010.811.313.09.412.212.79.612.113.3
1206.76.38.55.78.38.76.67.47.8
2405.96.16.44.86.06.66.37.16.8
Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.94.65.65.06.25.05.56.67.0
1205.74.85.24.36.26.05.85.75.1
2405.85.75.44.75.65.46.56.85.8
GRS
605.04.55.4
1204.95.14.84.73.65.1
2405.54.74.23.75.04.75.45.65.0
GOS
6017.122.230.015.521.729.217.022.932.6
1209.510.814.09.511.914.38.912.414.4
2408.18.38.96.67.99.08.19.29.1
SW
6017.122.130.115.521.729.218.523.532.8
1209.510.814.09.511.814.419.719.915.5
2408.18.38.96.68.09.011.117.724.6
Fmax
600.00.20.10.20.10.10.10.20.2
1200.10.10.10.00.10.10.00.10.1
2400.20.10.20.20.20.10.10.30.1
BS
603.93.64.62.94.43.53.54.54.7
1203.22.03.32.53.22.12.92.53.4
2402.21.82.22.12.62.13.02.63.0
SD
6010.811.313.09.412.212.79.612.113.3
1206.76.38.55.78.38.76.67.47.8
2405.96.16.44.86.06.66.37.16.8

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that uit=γivt+σηiεη,it, where εη,itis independently drawn from standardized student t-distribution with eight degrees of freedom.

Furthermore, the behavior of the test statistics is examined under the same DGP as that examined in Table 2, except that a spatial autoregressive component was incorporated into the error generation process. The results with such mixed factor-spatial errors are reported in Table 4. As can be seen, the size of the J^α test and GRS test is well controlled, with a slight over-rejection for T =60, which disappears when T is increased to 120. In contrast, the size distortion of GOS and SW seems to be amplified with this design. The size properties of the Fmax, BS, and SD tests remain similar to those in Table 2.

Table 4.

Size of the J^α and other tests, spatially correlated errors

Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
607.37.17.85.87.06.16.76.86.4
1206.16.56.16.05.25.76.56.26.6
2406.56.15.65.84.95.96.97.05.9
GRS
604.44.14.9
1205.55.44.45.25.45.5
2405.75.04.35.05.05.35.64.54.1
GOS
6017.423.932.317.724.031.119.324.530.9
12011.413.816.511.012.615.210.911.516.9
2408.910.29.88.68.610.88.59.89.4
SW
6017.523.932.217.824.131.220.525.531.0
12011.913.816.512.613.015.444.815.718.9
24017.712.811.315.814.312.920.344.926.5
Fmax
600.20.20.00.30.10.10.30.10.1
1200.10.10.10.20.10.00.00.20.1
2400.10.00.10.10.00.20.10.10.2
BS
604.04.23.83.83.63.54.04.43.6
1203.13.23.42.83.02.63.03.23.6
2402.73.02.42.92.42.43.03.42.5
SD
609.812.013.49.411.312.39.510.611.6
1206.87.77.96.46.97.77.47.08.0
2406.46.76.45.65.26.86.47.16.3
Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
607.37.17.85.87.06.16.76.86.4
1206.16.56.16.05.25.76.56.26.6
2406.56.15.65.84.95.96.97.05.9
GRS
604.44.14.9
1205.55.44.45.25.45.5
2405.75.04.35.05.05.35.64.54.1
GOS
6017.423.932.317.724.031.119.324.530.9
12011.413.816.511.012.615.210.911.516.9
2408.910.29.88.68.610.88.59.89.4
SW
6017.523.932.217.824.131.220.525.531.0
12011.913.816.512.613.015.444.815.718.9
24017.712.811.315.814.312.920.344.926.5
Fmax
600.20.20.00.30.10.10.30.10.1
1200.10.10.10.20.10.00.00.20.1
2400.10.00.10.10.00.20.10.10.2
BS
604.04.23.83.83.63.54.04.43.6
1203.13.23.42.83.02.63.03.23.6
2402.73.02.42.92.42.43.03.42.5
SD
609.812.013.49.411.312.39.510.611.6
1206.87.77.96.46.97.77.47.08.0
2406.46.76.45.65.26.86.47.16.3

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that uit=γivt+ηit with ηit=ψj=1Nwijηjt+σηiεη,it. We have chosen the value ψ=1/4 and a rook form for W=(wij), namely, all elements in W are zero except wi+1,i=wj1,j=0.5 for i=1,2,,N2 and j=3,4…,N, with w1,2=wN,N1=1.

Table 4.

Size of the J^α and other tests, spatially correlated errors

Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
607.37.17.85.87.06.16.76.86.4
1206.16.56.16.05.25.76.56.26.6
2406.56.15.65.84.95.96.97.05.9
GRS
604.44.14.9
1205.55.44.45.25.45.5
2405.75.04.35.05.05.35.64.54.1
GOS
6017.423.932.317.724.031.119.324.530.9
12011.413.816.511.012.615.210.911.516.9
2408.910.29.88.68.610.88.59.89.4
SW
6017.523.932.217.824.131.220.525.531.0
12011.913.816.512.613.015.444.815.718.9
24017.712.811.315.814.312.920.344.926.5
Fmax
600.20.20.00.30.10.10.30.10.1
1200.10.10.10.20.10.00.00.20.1
2400.10.00.10.10.00.20.10.10.2
BS
604.04.23.83.83.63.54.04.43.6
1203.13.23.42.83.02.63.03.23.6
2402.73.02.42.92.42.43.03.42.5
SD
609.812.013.49.411.312.39.510.611.6
1206.87.77.96.46.97.77.47.08.0
2406.46.76.45.65.26.86.47.16.3
Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
607.37.17.85.87.06.16.76.86.4
1206.16.56.16.05.25.76.56.26.6
2406.56.15.65.84.95.96.97.05.9
GRS
604.44.14.9
1205.55.44.45.25.45.5
2405.75.04.35.05.05.35.64.54.1
GOS
6017.423.932.317.724.031.119.324.530.9
12011.413.816.511.012.615.210.911.516.9
2408.910.29.88.68.610.88.59.89.4
SW
6017.523.932.217.824.131.220.525.531.0
12011.913.816.512.613.015.444.815.718.9
24017.712.811.315.814.312.920.344.926.5
Fmax
600.20.20.00.30.10.10.30.10.1
1200.10.10.10.20.10.00.00.20.1
2400.10.00.10.10.00.20.10.10.2
BS
604.04.23.83.83.63.54.04.43.6
1203.13.23.42.83.02.63.03.23.6
2402.73.02.42.92.42.43.03.42.5
SD
609.812.013.49.411.312.39.510.611.6
1206.87.77.96.46.97.77.47.08.0
2406.46.76.45.65.26.86.47.16.3

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that uit=γivt+ηit with ηit=ψj=1Nwijηjt+σηiεη,it. We have chosen the value ψ=1/4 and a rook form for W=(wij), namely, all elements in W are zero except wi+1,i=wj1,j=0.5 for i=1,2,,N2 and j=3,4…,N, with w1,2=wN,N1=1.

Since the autoregressive conditional heteroskedasticity is commonly found in security returns, the effect of cross-sectionally correlated errors with GARCH(1,1) processes is also investigated. The size properties of the tests are summarized in Table 5. The results are almost identical to those using unconditionally time-series homoskedastic (but cross-sectionally heteroskedastic) errors reported in Table 2. This is to be expected as the LFPM is a static model and unconditional homoskedastic GARCH errors do not affect our theoretical results.

Table 5.

Size of the J^α and other tests, GARCH(1,1) errors

Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.64.95.65.95.65.06.06.15.5
1205.25.85.95.85.04.65.45.65.3
2406.14.96.05.85.64.95.66.84.8
GRS
603.94.84.6
1203.74.85.35.64.94.9
2404.55.05.84.85.45.55.05.45.3
GOS
6015.321.529.917.920.632.518.522.729.8
1209.511.914.010.110.513.710.512.414.9
2408.27.39.38.28.98.97.810.19.8
SW
6016.122.529.616.122.129.419.023.531.5
1209.811.215.19.711.415.121.223.316.8
2407.78.88.17.88.78.511.116.927.4
Fmax
600.10.00.00.10.10.00.10.00.1
1200.10.20.10.10.10.10.10.00.1
2400.00.00.00.10.10.00.00.10.1
BS
604.04.03.83.73.34.34.03.84.0
1202.93.33.92.82.82.93.03.12.3
2402.61.62.02.72.62.32.72.62.4
SD
608.710.812.59.911.213.410.310.811.3
1206.68.29.17.27.27.76.47.47.3
2406.35.56.85.96.66.65.67.36.1
Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.64.95.65.95.65.06.06.15.5
1205.25.85.95.85.04.65.45.65.3
2406.14.96.05.85.64.95.66.84.8
GRS
603.94.84.6
1203.74.85.35.64.94.9
2404.55.05.84.85.45.55.05.45.3
GOS
6015.321.529.917.920.632.518.522.729.8
1209.511.914.010.110.513.710.512.414.9
2408.27.39.38.28.98.97.810.19.8
SW
6016.122.529.616.122.129.419.023.531.5
1209.811.215.19.711.415.121.223.316.8
2407.78.88.17.88.78.511.116.927.4
Fmax
600.10.00.00.10.10.00.10.00.1
1200.10.20.10.10.10.10.10.00.1
2400.00.00.00.10.10.00.00.10.1
BS
604.04.03.83.73.34.34.03.84.0
1202.93.33.92.82.82.93.03.12.3
2402.61.62.02.72.62.32.72.62.4
SD
608.710.812.59.911.213.410.310.811.3
1206.68.29.17.27.27.76.47.47.3
2406.35.56.85.96.66.65.67.36.1

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that uit=γivt+εη,it with εη,it=ωitζit and ζitIIDN(0,1), where ωit=σηi2(1ϱφ)+ϱωi,t1+φεη,it12. We set ϱ=0.2 and φ=0.6. First 50 time-series observations of εη,it are discarded.

Table 5.

Size of the J^α and other tests, GARCH(1,1) errors

Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.64.95.65.95.65.06.06.15.5
1205.25.85.95.85.04.65.45.65.3
2406.14.96.05.85.64.95.66.84.8
GRS
603.94.84.6
1203.74.85.35.64.94.9
2404.55.05.84.85.45.55.05.45.3
GOS
6015.321.529.917.920.632.518.522.729.8
1209.511.914.010.110.513.710.512.414.9
2408.27.39.38.28.98.97.810.19.8
SW
6016.122.529.616.122.129.419.023.531.5
1209.811.215.19.711.415.121.223.316.8
2407.78.88.17.88.78.511.116.927.4
Fmax
600.10.00.00.10.10.00.10.00.1
1200.10.20.10.10.10.10.10.00.1
2400.00.00.00.10.10.00.00.10.1
BS
604.04.03.83.73.34.34.03.84.0
1202.93.33.92.82.82.93.03.12.3
2402.61.62.02.72.62.32.72.62.4
SD
608.710.812.59.911.213.410.310.811.3
1206.68.29.17.27.27.76.47.47.3
2406.35.56.85.96.66.65.67.36.1
Size:αi=0 for all i
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200
J^α
605.64.95.65.95.65.06.06.15.5
1205.25.85.95.85.04.65.45.65.3
2406.14.96.05.85.64.95.66.84.8
GRS
603.94.84.6
1203.74.85.35.64.94.9
2404.55.05.84.85.45.55.05.45.3
GOS
6015.321.529.917.920.632.518.522.729.8
1209.511.914.010.110.513.710.512.414.9
2408.27.39.38.28.98.97.810.19.8
SW
6016.122.529.616.122.129.419.023.531.5
1209.811.215.19.711.415.121.223.316.8
2407.78.88.17.88.78.511.116.927.4
Fmax
600.10.00.00.10.10.00.10.00.1
1200.10.20.10.10.10.10.10.00.1
2400.00.00.00.10.10.00.00.10.1
BS
604.04.03.83.73.34.34.03.84.0
1202.93.33.92.82.82.93.03.12.3
2402.61.62.02.72.62.32.72.62.4
SD
608.710.812.59.911.213.410.310.811.3
1206.68.29.17.27.27.76.47.47.3
2406.35.56.85.96.66.65.67.36.1

Notes: See the note to Table 2. The DGP is the same as in Table 2, except that uit=γivt+εη,it with εη,it=ωitζit and ζitIIDN(0,1), where ωit=σηi2(1ϱφ)+ϱωi,t1+φεη,it12. We set ϱ=0.2 and φ=0.6. First 50 time-series observations of εη,it are discarded.

The experimental results so far confirm that the finite sample performance of the J^α test is superior to the other tests we have considered. In the light of these promising results, we further investigate the properties of J-alpha tests, in particular the sensitivity of the choice of the values for {δ,p} and the effectiveness of the standardization employed by the J^α.

First, we examine the sensitivity of the test to the choice of the value of {δ,p}. As mentioned, the J^α we have considered employs δ = 1 and p =0.1. To check whether this choice is appropriate, in the next experiment, we consider four combinations of {δ,p} using δ=1,2,p=0.05,0.01. Table 6 summarizes the size and power results. As can be seen, the choice of p has little effect on the size and power characteristics. Meanwhile, the performance of the test is slightly sensitive to the choice of δ, but this effect quickly disappears as T increases. These experimental results support the use of the J^α test with δ = 1 and p =0.1.

Table 6.

Size and power of the J^α tests for p={0.1,0.05} and δ={1,2} with normal errors

δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200

Size (αi=0 for all i)
J^α(p=0.1,δ=1)
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
J^α(p=0.1,δ=2)
606.65.75.06.26.16.26.07.66.8
1206.65.64.76.05.95.36.06.56.5
2405.05.95.35.75.84.86.06.36.4
J^α(p=0.05,δ=1)
606.45.64.86.16.16.15.66.95.9
1206.55.64.75.95.95.35.96.26.2
2404.95.95.35.75.84.86.06.26.4
J^α(p=0.05,δ=2)
606.65.75.06.26.16.26.17.66.9
1206.65.64.76.05.95.36.06.66.5
2405.05.95.35.75.84.86.06.36.4


Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
J^α(p=0.1,δ=1)
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.1,δ=2)
6070.782.091.064.978.487.155.067.978.7
12093.698.599.791.998.399.884.995.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=1)
6070.581.990.964.878.387.053.866.277.7
12093.698.599.791.898.399.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=2)
6070.782.091.065.078.487.155.268.078.8
12093.698.599.791.998.399.885.095.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200

Size (αi=0 for all i)
J^α(p=0.1,δ=1)
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
J^α(p=0.1,δ=2)
606.65.75.06.26.16.26.07.66.8
1206.65.64.76.05.95.36.06.56.5
2405.05.95.35.75.84.86.06.36.4
J^α(p=0.05,δ=1)
606.45.64.86.16.16.15.66.95.9
1206.55.64.75.95.95.35.96.26.2
2404.95.95.35.75.84.86.06.26.4
J^α(p=0.05,δ=2)
606.65.75.06.26.16.26.17.66.9
1206.65.64.76.05.95.36.06.66.5
2405.05.95.35.75.84.86.06.36.4


Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
J^α(p=0.1,δ=1)
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.1,δ=2)
6070.782.091.064.978.487.155.067.978.7
12093.698.599.791.998.399.884.995.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=1)
6070.581.990.964.878.387.053.866.277.7
12093.698.599.791.898.399.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=2)
6070.782.091.065.078.487.155.268.078.8
12093.698.599.791.998.399.885.095.598.6
24099.599.9100.099.4100.0100.098.899.9100.0

Notes: See the note to Table 2. The DGP is the same as in Table 2. The p and δ are for the MT estimator ρ˜ij=ρ^ijI[|vρ^ij|>cp(N)], where cp(N)=Φ1(1p2Nδ).

Table 6.

Size and power of the J^α tests for p={0.1,0.05} and δ={1,2} with normal errors

δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200

Size (αi=0 for all i)
J^α(p=0.1,δ=1)
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
J^α(p=0.1,δ=2)
606.65.75.06.26.16.26.07.66.8
1206.65.64.76.05.95.36.06.56.5
2405.05.95.35.75.84.86.06.36.4
J^α(p=0.05,δ=1)
606.45.64.86.16.16.15.66.95.9
1206.55.64.75.95.95.35.96.26.2
2404.95.95.35.75.84.86.06.26.4
J^α(p=0.05,δ=2)
606.65.75.06.26.16.26.17.66.9
1206.65.64.76.05.95.36.06.66.5
2405.05.95.35.75.84.86.06.36.4


Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
J^α(p=0.1,δ=1)
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.1,δ=2)
6070.782.091.064.978.487.155.067.978.7
12093.698.599.791.998.399.884.995.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=1)
6070.581.990.964.878.387.053.866.277.7
12093.698.599.791.898.399.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=2)
6070.782.091.065.078.487.155.268.078.8
12093.698.599.791.998.399.885.095.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
δγ=0
δγ=1/4
δγ=1/2
(T, N)501002005010020050100200

Size (αi=0 for all i)
J^α(p=0.1,δ=1)
606.45.64.76.16.16.15.56.85.9
1206.55.64.75.95.95.35.86.16.1
2404.95.85.25.75.84.76.06.26.4
J^α(p=0.1,δ=2)
606.65.75.06.26.16.26.07.66.8
1206.65.64.76.05.95.36.06.56.5
2405.05.95.35.75.84.86.06.36.4
J^α(p=0.05,δ=1)
606.45.64.86.16.16.15.66.95.9
1206.55.64.75.95.95.35.96.26.2
2404.95.95.35.75.84.86.06.26.4
J^α(p=0.05,δ=2)
606.65.75.06.26.16.26.17.66.9
1206.65.64.76.05.95.36.06.66.5
2405.05.95.35.75.84.86.06.36.4


Power 1 (αi=ϖiN(0,1) for i=1,,N0.7 and αi=0 for other i)
J^α(p=0.1,δ=1)
6070.381.790.864.678.186.953.466.077.0
12093.698.599.791.798.299.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.1,δ=2)
6070.782.091.064.978.487.155.067.978.7
12093.698.599.791.998.399.884.995.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=1)
6070.581.990.964.878.387.053.866.277.7
12093.698.599.791.898.399.884.795.598.6
24099.599.9100.099.4100.0100.098.899.9100.0
J^α(p=0.05,δ=2)
6070.782.091.065.078.487.155.268.078.8
12093.698.599.791.998.399.885.095.598.6
24099.599.9100.099.4100.0100.098.899.9100.0

Notes: See the note to Table 2. The DGP is the same as in Table 2. The p and δ are for the MT estimator ρ˜ij=ρ^ijI[|vρ^ij|>cp(N)], where cp(N)=Φ1(1p2Nδ).

Finally, an experiment was conducted to check the effectiveness of the standardization employed in the J^α. In particular, we check the effectiveness of the centering ti2v/(v2) employed by the J^α test compared with ti21 employed by GOS, and the usefulness of estimating the cross-correlation of ti2 with the MT estimator ρ˜N, respectively. For this purpose, two J-alpha test variants, J˜α and Jα(0), are considered on top of the J^α statistic. J˜α is identical to J^α, but replaces ti2v/(v2) by ti21. The second statistic, Jα(0), sets ρ˜N equal to zero (i.e., does not control for cross-correlation). In the present experiment, to investigate the behavior of the J^α test in more challenging environments, N is considered with larger values, that is, N=500,1000,2000 and 5000, while T is set to 60, 120, and 240 as before. The results are reported in Table 7, which reveal that the centering using v/(v2) as well as the control of error cross-correlations by the MT estimator play a very significant role in controlling the size of the test for large N (and large T as shown in Panel A of Table 2).

Table 7.

Size of the J^α tests, for very large N with normal and non-normal errors

δγ=0
δγ=1/4
δγ=1/2
(T, N)500100020005000500100020005000500100020005000

Panel A: Normal errors
J˜α
6014.519.429.452.413.019.329.553.314.318.728.251.5
1208.69.212.521.78.98.911.721.68.79.111.119.1
2406.67.47.111.36.97.17.711.76.77.17.010.8
Jα(0)
606.95.34.35.25.55.75.25.17.57.46.97.8
1205.14.44.95.05.74.55.34.67.16.15.87.2
2405.05.04.25.25.15.14.15.06.96.66.07.1
J^α
606.85.34.25.15.55.65.15.16.56.36.17.2
1205.14.24.85.05.64.45.24.55.64.54.45.8
2405.05.04.15.15.05.14.15.05.75.24.35.6


Panel B: Non-normal errors
J˜α
6013.718.528.152.013.117.728.651.312.618.525.749.7
1209.010.112.221.29.49.512.421.78.79.611.719.9
2406.37.37.912.26.77.47.512.27.77.78.110.0
Jα(0)
605.65.04.14.14.94.64.04.67.35.96.15.8
1205.75.44.84.75.34.85.14.97.76.25.76.0
2405.25.44.75.45.34.84.55.37.77.26.06.5
J^α
605.55.04.04.04.94.54.04.66.45.35.45.4
1205.65.44.64.75.24.75.04.96.44.74.54.9
2405.25.44.65.45.14.84.45.26.25.74.75.0
δγ=0
δγ=1/4
δγ=1/2
(T, N)500100020005000500100020005000500100020005000

Panel A: Normal errors
J˜α
6014.519.429.452.413.019.329.553.314.318.728.251.5
1208.69.212.521.78.98.911.721.68.79.111.119.1
2406.67.47.111.36.97.17.711.76.77.17.010.8
Jα(0)
606.95.34.35.25.55.75.25.17.57.46.97.8
1205.14.44.95.05.74.55.34.67.16.15.87.2
2405.05.04.25.25.15.14.15.06.96.66.07.1
J^α
606.85.34.25.15.55.65.15.16.56.36.17.2
1205.14.24.85.05.64.45.24.55.64.54.45.8
2405.05.04.15.15.05.14.15.05.75.24.35.6


Panel B: Non-normal errors
J˜α
6013.718.528.152.013.117.728.651.312.618.525.749.7
1209.010.112.221.29.49.512.421.78.79.611.719.9
2406.37.37.912.26.77.47.512.27.77.78.110.0
Jα(0)
605.65.04.14.14.94.64.04.67.35.96.15.8
1205.75.44.84.75.34.85.14.97.76.25.76.0
2405.25.44.75.45.34.84.55.37.77.26.06.5
J^α
605.55.04.04.04.94.54.04.66.45.35.45.4
1205.65.44.64.75.24.75.04.96.44.74.54.9
2405.25.44.65.45.14.84.45.26.25.74.75.0

Notes: See the note to Table 2. The DGPs are the same as in Table 2 for normal errors and in Table 3 for non-normal errors. For the purpose of comparison to J^α, we also provide results for J˜α test, which controls for error cross-correlations as the J^α test but demean ti2 by 1 rather than v/(v2). The Jα(0) test is defined by Equation (61) with ρN2=0, which does not control for error cross-correlations.

Table 7.

Size of the J^α tests, for very large N with normal and non-normal errors

δγ=0
δγ=1/4
δγ=1/2
(T, N)500100020005000500100020005000500100020005000

Panel A: Normal errors
J˜α
6014.519.429.452.413.019.329.553.314.318.728.251.5
1208.69.212.521.78.98.911.721.68.79.111.119.1
2406.67.47.111.36.97.17.711.76.77.17.010.8
Jα(0)
606.95.34.35.25.55.75.25.17.57.46.97.8
1205.14.44.95.05.74.55.34.67.16.15.87.2
2405.05.04.25.25.15.14.15.06.96.66.07.1
J^α
606.85.34.25.15.55.65.15.16.56.36.17.2
1205.14.24.85.05.64.45.24.55.64.54.45.8
2405.05.04.15.15.05.14.15.05.75.24.35.6


Panel B: Non-normal errors
J˜α
6013.718.528.152.013.117.728.651.312.618.525.749.7
1209.010.112.221.29.49.512.421.78.79.611.719.9
2406.37.37.912.26.77.47.512.27.77.78.110.0
Jα(0)
605.65.04.14.14.94.64.04.67.35.96.15.8
1205.75.44.84.75.34.85.14.97.76.25.76.0
2405.25.44.75.45.34.84.55.37.77.26.06.5
J^α
605.55.04.04.04.94.54.04.66.45.35.45.4
1205.65.44.64.75.24.75.04.96.44.74.54.9
2405.25.44.65.45.14.84.45.26.25.74.75.0
δγ=0
δγ=1/4
δγ=1/2
(T, N)500100020005000500100020005000500100020005000

Panel A: Normal errors
J˜α
6014.519.429.452.413.019.329.553.314.318.728.251.5
1208.69.212.521.78.98.911.721.68.79.111.119.1
2406.67.47.111.36.97.17.711.76.77.17.010.8
Jα(0)
606.95.34.35.25.55.75.25.17.57.46.97.8
1205.14.44.95.05.74.55.34.67.16.15.87.2
2405.05.04.25.25.15.14.15.06.96.66.07.1
J^α
606.85.34.25.15.55.65.15.16.56.36.17.2
1205.14.24.85.05.64.45.24.55.64.54.45.8
2405.05.04.15.15.05.14.15.05.75.24.35.6


Panel B: Non-normal errors
J˜α
6013.718.528.152.013.117.728.651.312.618.525.749.7
1209.010.112.221.29.49.512.421.78.79.611.719.9
2406.37.37.912.26.77.47.512.27.77.78.110.0
Jα(0)
605.65.04.14.14.94.64.04.67.35.96.15.8
1205.75.44.84.75.34.85.14.97.76.25.76.0
2405.25.44.75.45.34.84.55.37.77.26.06.5
J^α
605.55.04.04.04.94.54.04.66.45.35.45.4
1205.65.44.64.75.24.75.04.96.44.74.54.9
2405.25.44.65.45.14.84.45.26.25.74.75.0

Notes: See the note to Table 2. The DGPs are the same as in Table 2 for normal errors and in Table 3 for non-normal errors. For the purpose of comparison to J^α, we also provide results for J˜α test, which controls for error cross-correlations as the J^α test but demean ti2 by 1 rather than v/(v2). The Jα(0) test is defined by Equation (61) with ρN2=0, which does not control for error cross-correlations.

6 Empirical Application

6.1 Data Description

We consider the application of our proposed J^α test to the securities in the S&P 500 index of large cap U.S. equities market. Since the index is primarily intended as a leading indicator of U.S. equities, the composition of the index is monitored by S&P to ensure the widest possible overall market representation while reducing the index turnover to a minimum. Changes to the composition of the index are governed by published guidelines. In particular, a security is included if its market capitalization currently exceeds US$5.3 billion, is financially viable, and at least 50% of their equity is publicly floated. Companies that substantially violate one or more of the criteria for index inclusion, or are involved in merger, acquisition, or significant restructuring are replaced by other companies.

In order to take account for the change to the composition of the index over time, we compiled returns on all the 500 securities that constitute the S&P 500 index each month over the period January 1984 to April 2018. The monthly return of security i for month t is computed as rit=100(PitPi,t1)/Pi,t1+DYit/12, where Pit is the end of the month price of the security and DYit is the percent per annum dividend yield on the security. Note that index i depends on the month in which the security i is a constituent of S&P 500, τ, say, which is suppressed for notational simplicity.

The time-series data on the safe rate of return, and the market factors are obtained from Ken French’s data library web page. The one-month U.S. treasury bill rate is chosen as the risk-free rate (rft), the value-weighted return on all NYSE, AMEX, and NASDAQ stocks (from CRSP) is used as a proxy for the market return (rmt), the average return on the three small portfolios minus the average return on the three big portfolios (SMBt), the average return on two value portfolios minus the average return on two growth portfolios (HMLt), the difference between the returns on diversified portfolios of stocks with robust and weak profitability (RMWt), and the difference between the returns on diversified portfolios of the stocks of low and high investment firms (CMAt). SMB, HML, RMW, and CMA are based on the stocks listed on the NYSE, AMEX, and NASDAQ. All data are measured in percent per month. See Section M1.3 in the Supplementary Material for further details.

6.2 Month End Test Results (September 1989–April 2018)

Encouraged by the satisfactory performance of the J^α test, even in cases where N is much larger than T, we apply the J^α test that allows for non-Gaussian and cross-correlated errors to all securities in the S&P 500 index at the end of each month spanning the period September 1989–April 2018.22 In this way, we minimize the possibility of survivorship bias since the sample of securities considered at the end of each month is decided in real time. As far as the choice of T is concerned, to reduce the impact of possible time variations in betas, we select a relatively short time period of T =60 months. Accordingly, we estimated the CAPM, Fama and French (1993) three factor (FF3), and Fama and French (2015) five factor (FF5) regressions. The estimated FF5 regression is
(74)
for t=1,2,,60,i=1,2,,Nτ, and the month ends, τ, from September 1989 to April 2018. The CAPM regression includes the first factor and the FF3 regression uses the first three factors in Equation (74) as regressors, respectively. All securities in the S&P 500 index are included except those with less than 60 months of observations and/or with five consecutive zeros in the middle of sample periods. See the Supplementary Material for discussions on the statistical properties of the regression residuals.

Table 8 reports the rejection frequencies of the J^α and GOS tests based on the CAPM, FF3, and FF5 models over the month ends, for the full sample periods, and three market disruption periods: (1) the Asian financial crisis (1997M07–1998M12), (2) the Dot-com bubble burst (2000M03–2002M10), and (3) the Great Recession (2007M12–2009M06) periods. Depending on the factor model (CAPM, FF3, or FF5) and nominal size (5% or 1%) considered, the J^α test rejects the null hypothesis H0:αi=0, from 24% to 30% of the total number of tests carried out, which is much smaller than the rejection rates of the GOS test that lie between 39% and 72%. The high rejection rates and their wide range in the GOS test may be due to the tendency of this test to over-reject when T is relatively small, as documented by Monte Carlo experiments in Section 5.

Table 8.

Empirical application: rejection frequencies of the J^α and GOS tests

TestJ^α test
GOS test
Factor modelsCAPMFF3FF5CAPMFF3FF5
Significance level of 0.05
 Full sample period (1989M09–2018M04)0.280.270.300.420.570.72
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.060.220.390.330.831.00
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.500.660.090.721.00
 (3) The Great Recession (2007M12–2009M06)0.840.950.741.001.000.95
Significance level of 0.01
 Full sample period (1989M09–2018M04)0.240.270.240.390.490.62
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.000.110.280.280.830.67
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.250.560.030.591.00
 (3) The Great Recession (2007M12–2009M06)0.790.840.680.951.000.89
TestJ^α test
GOS test
Factor modelsCAPMFF3FF5CAPMFF3FF5
Significance level of 0.05
 Full sample period (1989M09–2018M04)0.280.270.300.420.570.72
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.060.220.390.330.831.00
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.500.660.090.721.00
 (3) The Great Recession (2007M12–2009M06)0.840.950.741.001.000.95
Significance level of 0.01
 Full sample period (1989M09–2018M04)0.240.270.240.390.490.62
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.000.110.280.280.830.67
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.250.560.030.591.00
 (3) The Great Recession (2007M12–2009M06)0.790.840.680.951.000.89

Notes: This table provides rejection frequencies of the J^α and GOS tests with the significance levels of 0.05 and 0.01, applied to CAPM, FF3, and FF5 regressions of securities in the S&P 500 index using rolling T =60 monthly estimation windows over the month ends during the full sample period and during the three market disruption periods.

Table 8.

Empirical application: rejection frequencies of the J^α and GOS tests

TestJ^α test
GOS test
Factor modelsCAPMFF3FF5CAPMFF3FF5
Significance level of 0.05
 Full sample period (1989M09–2018M04)0.280.270.300.420.570.72
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.060.220.390.330.831.00
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.500.660.090.721.00
 (3) The Great Recession (2007M12–2009M06)0.840.950.741.001.000.95
Significance level of 0.01
 Full sample period (1989M09–2018M04)0.240.270.240.390.490.62
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.000.110.280.280.830.67
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.250.560.030.591.00
 (3) The Great Recession (2007M12–2009M06)0.790.840.680.951.000.89
TestJ^α test
GOS test
Factor modelsCAPMFF3FF5CAPMFF3FF5
Significance level of 0.05
 Full sample period (1989M09–2018M04)0.280.270.300.420.570.72
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.060.220.390.330.831.00
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.500.660.090.721.00
 (3) The Great Recession (2007M12–2009M06)0.840.950.741.001.000.95
Significance level of 0.01
 Full sample period (1989M09–2018M04)0.240.270.240.390.490.62
 Three market disruption periods:
 (1) Asian financial crisis (1997M07–1998M12)0.000.110.280.280.830.67
 (2) The Dot-com Bubble Burst (2000M03–2002M10)0.000.250.560.030.591.00
 (3) The Great Recession (2007M12–2009M06)0.790.840.680.951.000.89

Notes: This table provides rejection frequencies of the J^α and GOS tests with the significance levels of 0.05 and 0.01, applied to CAPM, FF3, and FF5 regressions of securities in the S&P 500 index using rolling T =60 monthly estimation windows over the month ends during the full sample period and during the three market disruption periods.

As to be expected, rejection rates in the top panel of Table 8 (based on 5% level) are larger than those in the bottom panel (based on 1% level), but the differences are of second-order importance, particularly when compared with the choice of the underlying asset pricing models. Focusing on the test results based on the 5% level, we note wide variations in the test outcomes across models (CAPM, FF3, and FF5) particularly in the case of sub-samples representing the Asian Financial Crisis and the Dot-com Bubble. The test outcomes for these two sub-samples critically depend on the choice of the asset pricing model, although as for the full sample results the GOS test gives much larger rejection rates. Given the sensitivity of the test outcomes to the choice of the asset pricing model, no firm conclusions can be made in relation to these financial crises. The results based on the J^α only lead to substantial rejections only in the case of Dot-com Bubble period and when we base the test on the FF5 model.

The situation is very different when we consider the Great Recession period, where we find substantial rejection of the null of market efficiency irrespective of the model choice. Using the J^α there is no pattern to the rejection rates across the models, and using CAPM given a rejection rate of 84% when compared with 95% for FF3 and 74% for FF5. The GOS rejection rates are much higher (100% for CAPM and FF3 and 95% for FF5). Due to its over-rejection tendency, the GOS test seems to be less discriminatory when we compare the GOS rejection rates across the different sample periods. This is particularly so in the case of the GOS tests based on the FF5 model. Overall, both tests provide strong evidence of pricing errors during the Great Recession, but J^α test appears to provide more sensible results than the GOS test in this application.

7 Conclusion

In this article, we propose a simple test of LFPMs, the J^α test, when the number of securities, N, is large relative to the time dimension, T, of the return series. It is shown that the J^α test is more robust against error cross-sectional correlation than the SW tests based on an adaptive thresholding estimator of V, which is considered by Fan, Liao, and Yao (2015). It allows N to be much larger than T, when compared with alternative tests proposed in the literature. The proposed test also allows for a wide class of error dependencies including mixed weak-factor spatial autoregressive processes, and is shown to be robust to random time-variations in betas.

Using Monte Carlo experiments, designed specifically to match the distributional features of the residuals of Fama–French three factor regressions of individual securities in the S&P 500 index, we show that the proposed J^α test performs well even when N is much larger than T, and outperforms other existing tests such as the tests of GOS et al. (2015) and GL. Also, in cases where N < T and the standard F-test due to GRS can be computed, we still find that the J^α test has much higher power, especially when T is relatively small.

Application of the J^α test to all securities in the S&P 500 index with 60 months of return data at the end of each month over the period September 1989–April 2018 clearly illustrates the utility of the proposed test. Statistically significant evidence against Sharpe–Lintner CAPM and Fama–French three and five factor models is found mainly during periods of financial crises and market disruptions.

Supplemental Data

Supplemental data are available at https://www.datahostingsite.com.

Appendix: Proofs of the Theorems

In this Appendix, we provide proofs of the theorems set out in Section 4 of the article. These proofs make use of lemmas which are provided, together with their proofs, in the Supplementary Material.

For further clarity and convenience, we summarize some repeatedly used notations below:
(A.1)
(A.2)
where F is a T × m matrix and τT=(1,1,,1) is a T×1 vector of ones. Also, before providing a proof of Theorem 1, we state a theorem due to Kelejian and Prucha (2001, KP) which is used to establish it. 
Lemma 1 (Central Limit Theorem for Linear Quadratic Forms):
Consider the following linear quadratic form
where{εi, i=1,2,,N}are real-valued random variables, and aij and bi denote real-valued coefficients of the quadratic and linear forms. Suppose the following assumptions hold: Assumption KP1:εi, fori=1,2,,N, have zero means and are independently distributed across i. Assumption KP2:Ais symmetric andsupij=1N|aij|<K. Also, N1i=1N|bi|2+ε0<Kfor someε0>0. Assumption KP3:supiE|εi|4+ε0<Kfor someε0>0. Then, assuming thatN1Var(QN)cfor some c > 0,

Proof: See KP (Theorem 1, p. 227). ▪

 
Proof of Theorem 1:
Noting that HF=hh, where h=(h1,h2,,hT)=MFτT, we can write
with wT=τTMFτT. Then,
where Dσ=diag(σ11,σ22,,σNN). Using Equation (54)
(A.3)
where
(A.4)
Consider the first term, aNT, and note that
where
(A.5)
Equivalently, letting dT=wT1/21=1Thtvt, and noting that for any conformable real symmetric positive semi-definite matrices A and B, Tr(AB)Tr(A)λmax(B) (this result is repeatedly used below), we have
But since ht are given constants such that t=1Tht2=wT, and by assumption vt is IID(0,Ik), it then readily follows that dTdTp1, and hence
Also, it is clear from Equation (A.5) that |γ˜is|1 and |γ˜is||γis|, and
and hence by Assumption 2, N1/2i=1Nγ˜iγ˜i=O(Nδγ1/2), and overall aNT=Op(Nδγ1/2). Similarly,
Since by Assumption, ηit and vt (and hence dT) are independently distributed, it follows that E(bNT)=0. Consider now Var(bNT), and note that for given values of γi we have (recall that ηit is independent over t and t=1Tht2=wT)
Also, E(dTdT)=E[(wT1/21=1Thtvt)(wT1/21=1Thtvt)]=Ik and
Further,
Therefore (recalling that supj,s|γ˜js|<K and |γ˜is||γis|),
But by Condition (57) in Assumption 3 and ση,ii>c>0 imply supji=1N|ρη,ij|<K (also see Equation (58)) and by Equation (53), we have supsi=1N|γis|=O(Nδγ). Then, it follows that Var(bNT)=O(Nδγ1) and bNT=O(Nδγ/21/2). Therefore, bNT is dominated by aNT and using these results in Equation (A.3) we have
(A.6)
Now using Equation (56), we can express the above as
where εη,tIID(0,IN). After some re-arrangement of the terms we now obtain
(A.7)
where
(A.8)
First consider the deterministic component of qNT, and using Equation (55) and under Assumption 3, we have
(A.9)
where Γ˜=(γ˜1,γ˜2,,γ˜N). Then,
But, as before,
(A.10)
Hence,
and Equation (A.7) can be written as
(A.11)
where
(A.12)
We now apply the central limit theorem for linear quadratic forms due to KP to zNT, which is reproduced for convenience as Lemma 1. We first establish the conditions required by KP’s theorem (see Lemma 1). To this end, we first note that E(xT)=0, and
Denote the ith element of xT by xi,T and note that it is given by xi,T=wT1/2t=1Thtεη,it=wT1/2hεη,i, where εη,i=(εη,i1 εη,i2,,εη,iT), with an abuse of the notation. Then, xi,T=wT1/2εη,iMFτT and xi,T2=wT1εη,iHFεη,i; hence, for a given T, the elements of xT have zero means, a unit variance, and are independently distributed as required by KP’s theorem. Using results on the moments of quadratic forms, it is also easily established that E(xi,T6)=wT3E(εη,iHFεη,i)3=15+O(v1)K uniformly over i (see Lemma 11), and hence condition KP1 of the KP theorem (Lemma 1) is met. Consider now matrix A˜ defined by Equation (A.12) and note that it is symmetric and we have
and using Equation (A.8)
But under Condition (57) and noting that σii>c>0, then
and condition KP2 of Lemma 1 is met. To establish condition KP3, we note that
Using Equation (A.9), let B=Dσ1/2QηQηDσ1/2, and note that
(A.13)
Also,
and in view of Equation (57), we have
and hence (using Equation (A.10)):
(A.14)
Also (recalling that |γ˜is||γis|),
(A.15)
Hence, using Equations (A.14) and (A.15) in Equation (A.13), we have
Also, in view of Equation (A.8)
To summarize
which also yield (recall that δγ<1/2)
Therefore,
(A.16)
which is bounded in N under the assumptions that N1Tr(R2) is bounded in N and 0δγ<1/2. Furthermore, it is readily seen that
Finally, using Equation (A.12)
Consider
Since, by assumption, εη,t are serially independent, then using the results on moments of the quadratic forms, we have
where γ2,εη=E(εη,it4)3 and by assumption |γ2,εη|<K. Also,
For r=tt=r,
Similarly, for r=tt=r, we have E[(εη,tA˜εη,t)(εη,tA˜εη,t)]=Tr(A˜2). Using these results
But (t=1Tr=1Tht2hr2)=(t=1Tht2)2, i=1Na˜ii=Tr(A˜)=0,i=1Nj=1Na˜ija˜ji=Tr(A˜2), and we have
and, further noting that t=1Tht2=wT, then
and using Equation (A.16)
where by assumption N1Tr(R2) is bounded in N. Also, using Equation (S.15) in Lemma 8, t=1Tht4=O(T), and
Therefore,
(A.17)
which is bounded for any N and T, so long as N1Tr(R2) is bounded in N and 0δγ<1/2. Also, using Equation (A.11), and under the same conditions, and as N and T, in any order,
as required. This result also ensures that condition KP3 of Lemma 1 is satisfied and therefore, we also have qNTdN(0,2ω2), as N and T, in any order. ▪ 
Proof of Theorem 2:
We have
(A.18)
where zi2=ξiHFξi/wT, with ξi=ui./σii1/2 being the standardized error of the return equation (6) and wT=τTMFτT and σ^ii=u^i.u^i./T. Write Xi=σii1σ˜ii and note that by assumption σii>0, and by construction only securities with σ^ii>c>0 are included in the J^α test, so that
(A.19)
where Xi=ξiMGξi/v, with v=Tm1 and MG=(mtt), defined by Equation (A.1). Also, by Equation (37), E(ti2)=E(zi2/Xi)=v/(v2)+O(T3/2) for each i, and by Lemma 11, E(zi2)=E(ξiHFξi/wT)=wT1Tr(HF)=1, for all i. Thus, we have
(A.20)
Next, for all i=1,2,,N, we have Xi>0, and Equation (A.19) can be written as
where
(A.21)
and
(A.22)
But since Xi>c>0 and zi2(1Xi)20, then
and
(A.23)
But
Now using results from Lemma 11, we have
which yields
(A.24)
Using this result in Equation (A.23), we obtain
and by Markov inequality we have S2,NTp0, so long as N/T20. Therefore, to establish SNTp0, it is sufficient to show that S1,NTp0. By Lemma 17, we have
where zη,i2=ηiHFηi/(wTση,ii)>0,Xη,i=ηiMGηi/(vση,ii)>0. Using results on the moments of quadratic forms, by Lemma 15, we have
where γ2,εη=E(εη,it4)3 (and |γ2,εη|<K by assumption), q˜η,i=qη,i/ση,ii1/2 with qη,i being such that Qη=(qη,i),Qη defined by Equation (56). But as 0mtt1 (MG=(mtt)) by Lemma 8, v1wT1t=1Tht2mttv1wT1t=1Tht2=v1 as t=1Tht2=wT, and also that 0=1Nq˜η,i41, as =1Nq˜η,i2=1 (since =1Nqη,i2=ση,ii), and |γ2,εη|K, we have
Furthermore,
We first note that
As has shown above,
uniformly over i. Next, consider
(A.25)
But, using results on the moments of quadratic forms, by Lemma 11, we have
(A.26)
uniformly over i. Substituting Equation (A.26) into Equation (A.25), we have
therefore,
uniformly over i. We conclude that
Secondly, by Lemma 16,

In sum, under Assumptions 1–3, SNTp0, so long as 0δγ<1/2,N/T20 as N and T, jointly.▪ 

Proof of Theorem 3:
Under Assumptions 1–3, using Theorem 2 we have
where zi2 is defined by Equation (22), so long as (N1)ρN2=O(1),N/T20, and 0δγ<1/2, as N and T, jointly. Under these conditions (by Lemma 4), it follows that N1/2i=1N(ti2vv2)/[2(1+(N1)ρN2)]1/2 has the same limit distribution as N1/2i=1N(zi21)/[2(1+(N1)ρN2)]1/2, which is shown to be standard normal by Theorem 1, and the desired result now follows, observing that limT(vv2)22(v1)v4=2. ▪
 
Proof of Theorem 4:
Let ψNT=1Ni,j=1N(ρ˜ij2ρij2), and note that
and since |ρ˜ij|<1 and |ρij|<1, it also follows that
(A.27)

Further, letting Iij=I[|ρ^ij|>v1/2cp(N)], we have
and hence
(A.28)
Now using Equation (41), we note that
where u^i.=MGui.. Also, since MG is an (T × T) idempotent matrix of rank v=Tm1, there exists an orthogonal T × T transformation matrix L (LL=IT), defined by
(A.29)
Hence, setting
(A.30)
ρ^ij can be written equivalently in terms of the first v elements of ζi.=(ζi1,ζi2,,ζiT) as (see Lemma 19)
where ζit=t=1Tlttξit and ltt is the (t,t) element of L. Also, as shown in Lemma 19, for each i, ζit’s are independently distributed over t, and
Furthermore, by Lemma 19
(A.31)
(A.32)
where
and
Hence, using Equation (A.31), |E(ρ^ij)ρij|1v|aij|+O(T2), and we have the following bound on the second term of Equation (A.28):
Furthermore, since κij are bounded, and by assumption i,j=1N|ρij|=O(N), we have
But
and hence
(A.33)
Also,
and as established in Lemma 20 (see (S.80) in the Supplementary Material), we have
which if used in Equation (A.33) yields
Overall, for the second term of Equation (A.28), we have
and since by assumption δγ1/2, and N/T20, as N and T, then
(A.34)
To deal with the first and the third terms of Equation (A.28), we need to distinguish between values of |ρij| that are strictly away from zero, namely those values that satisfy the condition |ρij|>ρmin>0, and those values that are zero or very close to zero. Note that for values of |ρij| sufficiently close to zero, in the sense that |ρij|κNϕρ, for some κ>0 and ϕρ>1, we have23
Therefore, without loss of generality, we only consider the case where |ρij|>ρmin>0, for all i and j. In this case, we have
(A.35)
Further, since E(1Iij)=Pr[|ρ^ij|v1/2cp(N)], then using result (A.7) in Lemma 4 of BPS (2017, supplement) we have (for some small ϵ>0)
Using this result in Equation (A.35) now yields
where bmax=supijbij<K, which can be written equivalently as
Noting that cp2(N)/v and ln(N)/v have the same rate of convergence and both 0, as N and T, it then follows that24
(A.36)
Finally, consider the first term of Equation (A.28) and write it as
(A.37)
where zij=[ρ^ijE(ρ^ij)]/Var(ρ^ij), and Var(ρ^ij) is given by Equation (A.32). Also, by Cauchy–Schwarz inequality (noting that E(zij2)=1)
Using this result and Var(ρ^ij) from Equation (A.32) in Equation (A.37) and distinguishing between non-zero and near zero values of ρij, we have
Under the sparsity conditions, Equations (32) and (33), the maximum number of non-zero |ρij| is given by mN2, and we have
(A.38)
where mN=O(Nδρ). Hence, since by assumption δρ<1/2, then it follows that A120, as N and v. For A11, which relates to the near zero values of |ρij|, making use of result (A.5) in Lemma 4 of BPS (2017, supplement) we have
where φmax=maxijφij<K. Then for A1 to tend to zero it is sufficient that (note that N1mN20, since δρ<1/2)
(A.39)
To obtain a sufficient condition for Equation (A.39) to hold, set T=cdNd and note that (recall that v=Tm1 and T/(Tm1)<K, since m is fixed as T)
But by result (b) of Lemma 2 of BPS (2017, supplement), limNcp2(N)/log(N)=2δ, and Condition (A.39) is met if δ(1ϵ)/2φmax(1d/2)>0, or equivalently if δ>(2d)(1ϵ)φmax. Therefore, under this condition, A110, and together with Equation (A.38) establishes that A10. Therefore, using this result, Equations (A.34) and (A.36) in Equation (A.28) we have E|ψNT|0, as required, and in turn implies ψNTp0, by Markov inequality. Finally, using (S.79) in the Supplementary Material established in Lemma 20, and setting γi=0, for all i, and ση,ij=0, for all ij, to ensure that ρij=0, for all ij, we have
where ltr is the (t, r) element of the T × T orthonormal matrix L defined by Equation (A.29), qη,i is such that Qη=(qη,i),Qη defined by Equation (56). Also, |ση,ii/σii|1,r=1Tltr4(r=1Tltr2)21,=1Nq˜η,i2==1Nqη,i2/ση,ii=1, and

Hence, supijφij1+|γ2,εη|, as required. ▪ 

Proof of Theorem 5:

By Theorem 3, Jα(ρN2)dN(0,1) so long as N/T20, and 0δγ<1/2, as N and T, jointly, where Jα(ρN2) and δγ are defined by Equations (61) and (53), respectively. Since Theorem 4 ensures that J^αJα(ρN2)p0, as (N1)(ρ˜N,T2ρN2)p0 when d >2/3, as N and T, and δ>(2d)(1ϵ)φmax, for some small ϵ>0, where φmax1+|γ2,εη|, under these conditions, J^α has the same limit distribution as Jα(ρN2)(by Lemma 4), which establishes the result. ▪

 
Proof of Theorem 6:

The steps in the proof are similar to the ones in deriving the limiting distribution of J^α under the null hypothesis. First, Lemma 22 provides the proof of the result, under Assumptions 1–3, and under the local alternatives (68), N1/2i=1N(zi,a21)dN(ϕ2,2ω2), as N  and T, jointly, where zi,a2 defined by (S.97) in the Supplementary Material, ω2=1+limN(N1)ρN2, ρN2 is defined by Equation (60). Also, by Lemma 23, we have N1/2i=1N(zi,a2ti2)=op(1). Finally, J^αJα=op(1), since the consistency result of the MT estimator ρ˜N,T2 given by Theorem 4 will not be affected by the introduction of local alternatives, as the MT estimator is obtained based on the regression residuals of the alternative model. This completes the proof of Theorem 6. ▪

Footnotes

*

This is a revised and updated version of the article entitled “Testing CAPM with a Large Number of Assets,” initially released in April 2012 as IZA Discussion Papers No. 6469. We would like to thank two anonymous referees and the Editor, Dacheng Xiu, for valuable comments. We are grateful to Ron Smith, Natalia Bailey, and Jay Shanken and other participants at the American Finance Association Meeting in San Diego, in January 2013 for helpful comments. The first author wishes to acknowledge partial support from the ESRC Grant No. ES/I031626/1. The second author acknowledges partial support from the JSPS KAKENHI (grant numbers 20H01484, 20H05631, 21H00700, and 21H04397).

1

Cross-sectional tests of CAPM have been considered by Douglas (1967); Black, Jensen, and Scholes (1972); and Fama and MacBeth (1973), among others. An early review of the literature can be found in Jensen (1972), and more recently in Fama and French (2004).

2

There exists a large literature in statistics and econometrics on estimation of high-dimensional covariance matrices which use regularization techniques such as shrinkage, adaptive thresholding, or other dimension-reducing procedures that impose certain structures on the variance matrix such as sparsity, or factor structures. See, for example, Wong, Carter, and Kohn (2003); Ledoit and Wolf (2004); HuAng et al. (2006); BL; Fan, Fan, and Lv (2008); Cai and Liu (2011); Fan, Liao, and Mincheva (2011, 2013); and BPS.

3

Monte Carlo experiments reported by Feng et al. (2022) also show significant over-rejection of the null by the GOS test when T = 50 and N = 500. These authors do not report simulation results for larger values of N as they increase T to 100 and 200. It is therefore unclear if the over-rejection continues when N is also increased beyond 500 when T = 100. As we also note in the article, increasing T to avoid over-rejection increases the likelihood of breaks in factor loadings which could be another source of over-rejection.

4

Some researchers have focused on testing the restrictions λμ=0, allowing λ0 to be unrestricted. See, for example, Shanken (1992).

5

Note that the GRS test is also based on the same null hypothesis, H0: αi=0, and assumes zero pricing errors.

6

Noting that (1+f¯Ω^1f¯)1=T1(τTMFτT), where f¯=T1t=1Tft and Ω^=T1t=1T(ftf¯)(ftf¯), it is easily seen that Equation (17) can be written as the widely used expression of the GRS statistic, TNmN(1+f¯Ω^1f¯)1α^V^1α^. As discussed in GRS, α^V^1α^ measures the ex post maximum pricing error.

7

Another candidate is the shrinkage estimator of V proposed by Ledoit and Wolf (2004), which we denote by V^LW, and refer to the associated SW statistic as SWLW. Such “plug-in” approaches are subject to two important shortcomings. First, even if V can be estimated consistently, the test might perform poorly in the case of non-Gaussian errors. Notice that the standardization of the Wald statistic is carried out assuming Gaussianity. Further, consistent estimation of V in the Frobenius norm sense still requires T to rise faster than N, and in practice threshold estimators of V are not guaranteed to be invertible in finite samples where NT.

8

Only securities with σ^ii>0 are included in W^d.

9

We conducted an experiment with GARCH(1,1) errors and the evidence supports our claim. The results are reported in Table 5.

10

See Lemma 21 in the Supplementary Material of the article.

11

Small sample evidence on the efficacy of using N1/2i=1N(ti2vv2) over N1/2i=1N(ti21) is reported in Table 7.

12

For a proof of Equation (39), see Lemma 18 in the Supplementary Material.

13

See, for example, Cai and Liu (2011); Fan, Liao, and Mincheva (2013); BPS, among others.

14

Other thresholding estimators of V proposed in the literature can also be used.

15

See Theorem 4 in Section 4 and its proof in the Appendix.

16

The robustness of the Ja test against non-Gaussianity is investigated and reported in Table 7. These results are generally supportive of setting δ = 1.

17

For more details, see Supplementary Section M1.1.

18

See Assumptions BD.1–3 in GOS.

19

We are grateful to Richard Luger for sharing the code to compute the resampling test.

20

SMB stands for “small market capitalization minus big” and HML for “high book-to-market ratio minus low.” See Fama and French (1993).

21

The estimates used in the generation of the factors and their volatilities are computed using monthly observations over the period May 2008–April 2018.

22

In all the empirical applications T < N and the GRS test cannot be computed. We have also decided to exclude other tests discussed in the Monte Carlo Section on the grounds of their substantial size distortion of the null and/or low power.

23

Note that the sparsity condition given by Equation (65) can be violated if ϕρ<1.

24

Note that since by assumption T=cdNd, with d > 1/2, then ln(N)/v=(T/(Tm1))cd1Ndln(N)0, as N. Recall that m, the number of factors, is fixed as T.

References

Affleck-Graves
J.
,
Mcdonald
B.
1989
.
Nonnormalities and Tests of Asset Pricing Theories
.
The Journal of Finance
44
:
889
908
.

Affleck-Graves
J.
,
Mcdonald
B.
1990
.
Multivariate Tests of Asset Pricing: The Comparative Power of Alternative Statistics
.
Journal of Financial and Quantitative Analysis
25
:
163
185
.

Anderson
T. W.
2003
.
An Introduction to Multivariate Statistical Analysis
. 3rd edn.
Wiley
.

Ang
A.
,
Chen
J.
,
Xing
Y.
2006
.
Downside Risk
.
The Review of Financial Studies
19
:
1191
1239
.

Ang
A.
,
Liu
J.
,
Schwarz
K.
2020
.
Using Stocks or Portfolios in Tests of Factor Models
.
Journal of Financial and Quantitative Analysis
55
:
709
750
.

Bai
Z.
,
Saranadasa
H.
1996
.
Effect of High Dimension: By an Example of a Two Sample Problem
.
Statistica Sinica
6
:
311
329
.

Bailey
N.
,
Kapetanios
G.
,
Pesaran
M. H.
2021
.
Measurement of Factor Strength: Theory and Practice
.
Journal of Applied Econometrics
36
:
587
613
.

Bailey
N.
,
Pesaran
M. H.
,
Smith
L. V.
2019
.
A multiple Testing Approach to the Regularisation of Large Sample Correlation Matrices
.
Journal of Econometrics
208
:
507
534
.

Beaulieu
M.-C.
,
Dufour
J.-M.
,
Khalaf
L.
2007
.
Multivariate Tests of Mean–Variance Efficiency with Possibly Non-Gaussian Errors
.
Journal of Business & Economic Statistics
25
:
398
410
.

Bickel
P. J.
,
Levina
E.
2008
.
Regularized Estimation of Large Covariance Matrices
.
The Annals of Statistics
36
:
199
227
.

Black
F.
,
Jensen
M. C.
,
Scholes
M.
1972
. “
The Capital Asset Pricing Model: Some Empirical Tests
.” In M. C. Jensen (Ed.), Studies in the Theory of Capital Markets, pp.
79
121
. New York:
Praeger
.

Breusch
T.
,
Pagan
A.
1980
.
The Lagrange Multiplier Test and Its Applications to Model Specification in Econometrics
.
Review of Economic Studies
47
:
239
253
.

Cai
T.
,
Liu
W.
2011
.
Adaptive Thresholding for Sparse Covariance Matrix Estimation
.
Journal of the American Statistical Association
106
:
672
684
.

Chamberlain
G.
1983
.
Funds, Factors, and Diversification in Arbitrage Pricing Models
.
Econometrica
51
:
1305
1323
.

Chordia
T.
,
Goyal
A.
,
Shanken
J.
2017
.
Cross-Sectional Asset Pricing with Individual Stocks: Betas versus Characteristics
.
Columbia Business School
.

Cremers
M.
,
Halling
M.
,
Weinbaum
D.
2015
.
Aggregate Jump and Volatility Risk in the Cross-Section of Stock Returns
.
The Journal of Finance
70
:
577
614
.

Douglas
G.
1967
.
Risk in Equity Markets: An Empirical Appraisal of Market Efficiency
.
University Microfilms
.

Fama
E. F.
,
French
K. R.
1993
.
Common Risk Factors in the Returns on Stocks and Bonds
.
Journal of Financial Economics
33
:
3
56
.

Fama
E. F.
,
French
K. R.
2004
.
The Capital Asset Pricing Model: Theory and Evidence
.
Journal of Economic Perspectives
18
:
25
46
.

Fama
E. F.
,
French
K. R.
2015
.
A Five-Factor Asset Pricing Model
.
Journal of Financial Economics
116
:
1
22
.

Fama
E. F.
,
MacBeth
J. D.
1973
.
Risk, Return, and Equilibrium: Empirical Tests
.
Journal of Political Economy
81
:
607
636
.

Fan
J.
,
Fan
Y.
,
Lv
J.
2008
.
High Dimensional Covariance Matrix Estimation Using a Factor Model
.
Journal of Econometrics
147
:
186
197
.

Fan
J.
,
Liao
Y.
,
Mincheva
M.
2011
.
High-Dimensional Covariance Matrix Estimation in Approximate Factor Models
.
Annals of Statistics
39
:
3320
3356
.

Fan
J.
,
Liao
Y.
,
Mincheva
M.
2013
.
Large Covariance Estimation by Thresholding Principal Orthogonal Complements
.
Journal of the Royal Statistical Society. Series B
75
:
603
680
.

Fan
J.
,
Liao
Y.
,
Yao
J.
2015
.
Power Enhancement in High-Dimensional Cross-Sectional Tests
.
Econometrica: Journal of the Econometric Society
83
:
1497
1541
.

Feng
L.
,
Lan
W.
,
Liu
B.
,
Ma
Y.
2022
.
High-Dimensional Test for Alpha in Linear Factor Pricing Models with Sparse Alternatives
.
Journal of Econometrics
229
:
152
175
.

Gagliardini
P.
,
Ossola
E.
,
Scaillet
O.
2016
.
Time-Varying Risk Premium in Large Cross-Sectional Equity Data Sets
.
Econometrica
84
:
985
1046
.

Gibbons
M. R.
,
Ross
S. A.
,
Shanken
J.
1989
.
A test of the Efficiency of a Given Portfolio
.
Econometrica
57
:
1121
1152
.

Giglio
S.
,
Xiu
D.
2021
.
Asset Pricing with Omitted Factors
.
Journal of Political Economy
129
:
1947
1990
.

Gungor
S.
,
Luger
R.
2009
.
Exact Distribution-Free Tests of Mean–Variance Efficiency
.
Journal of Empirical Finance
16
:
816
829
.

Gungor
S.
,
Luger
R.
2016
.
Multivariate Tests of Mean–Variance Efficiency and Spanning with a Large Number of Assets and Time-Varying Covariances
.
Journal of Business & Economic Statistics
34
:
161
175
.

He
A.
,
Huang
D.
,
Yuan
M.
,
Zhou
G.
2021
. Tests of Asset Pricing Models with a Large Number of Assets. DOI:.

Huang
J. Z.
,
Liu
N.
,
Pourahmadi
M.
,
Liu
L.
2006
.
Covariance Matrix Selection and Estimation via Penalised Normal Likelihood
.
Biometrika
93
:
85
98
.

Hwang
S.
,
Satchell
S. E.
2014
.
Testing Linear Factor Models on Individual Stocks Using the Average F-Test
.
The European Journal of Finance
20
:
463
498
.

Im
K. S.
,
Pesaran
M.
,
Shin
Y.
2003
.
Testing for Unit Roots in Heterogeneous Panels
.
Journal of Econometrics
115
:
53
74
.

Jensen
M.
1972
.
Studies in the Theory of Capital Markets
.
Praeger
.

Jensen
M. C.
1968
.
The Performance of Mutual Funds in the Period 1945–1964
.
The Journal of Finance
23
:
389
416
.

Kelejian
H. H.
,
Prucha
I. R.
2001
.
On the Asymptotic Distribution of the Moran I Test Statistic with Applications
.
Journal of Econometrics
140
:
219
257
.

Lan
W.
,
Feng
L.
,
Luo
R.
2018
.
Testing High-Dimensional Linear Asset Pricing Models
.
Journal of Financial Econometrics
16
:
191
210
.

Ledoit
O.
,
Wolf
M.
2004
.
A Well-Conditioned Estimator for Large-Dimensional Covariance Matrices
.
Journal of Multivariate Analysis
88
:
365
411
.

Lieberman
O.
1994
.
A Laplace Approximation to the Moments of a Ratio of Quadratic Forms
.
Biometrika
81
:
681
690
.

Lintner
J.
1965
.
The Valuation of Risk Assets and the Selection of Risky Investments in Stock Portfolios and Capital Budgets
.
The Review of Economics and Statistics
47
:
13
37
.

Longin
F.
,
Solnik
B.
2001
.
Extreme Correlation of International Equity Markets
.
The Journal of Finance
56
:
649
676
.

Ma
S.
,
Lan
W.
,
Su
L.
,
Tsai
C.-L.
2020
.
Testing Alphas in Conditional Time-Varying Factor Models with High-Dimensional Assets
.
Journal of Business & Economic Statistics
38
:
214
227
.

Pesaran
M. H.
,
Ullah
A.
,
Yamagata
T.
2008
.
A Bias-Adjusted LM Test of Error Cross-Section Independence
.
The Econometrics Journal
11
:
105
127
.

Raponi
V.
,
Robotti
C.
,
Zaffaroni
P.
2019
.
Testing Beta-Pricing Models Using Large Cross-Sections
.
The Review of Financial Studies
33
:
2796
2842
.

Ross
S. A.
1976
.
The Arbitrage Theory of Capital Asset Pricing
.
Journal of Economic Theory
13
:
341
360
.

Shanken
J.
1992
.
On the Estimation of Beta-Pricing Models
.
The Review of Financial Studies
5
:
1
33
.

Sharpe
W. F.
1964
.
Capital Asset Prices: A Theory of Market Equilibrium under Conditions of Risk
.
The Journal of Finance
19
:
425
442
.

Srivastava
M. S.
,
Du
M.
2008
.
A Test for the Mean Vector with Fewer Observations than the Dimension
.
Journal of Multivariate Analysis
99
:
386
402
.

Wong
F.
,
Carter
C. K.
,
Kohn
R.
2003
.
Efficient Estimation of Covariance Selection Models
.
Biometrika
90
:
809
830
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data