Changbo Zhu, Jane-Ling Wang, Testing homogeneity: the trouble with sparse functional data, Journal of the Royal Statistical Society Series B: Statistical Methodology, Volume 85, Issue 3, July 2023, Pages 705–731, https://doi.org/10.1093/jrsssb/qkad021
Abstract
Testing the homogeneity between two samples of functional data is an important task. While this is feasible for intensely measured functional data, we explain why it is challenging for sparsely measured functional data and show what can be done for such data. In particular, we show that testing the marginal homogeneity based on point-wise distributions is feasible under some mild constraints and propose a new two-sample statistic that works well with both intensively and sparsely measured functional data. The proposed test statistic is formulated upon energy distance, and the convergence rate of the test statistic to its population version is derived along with the consistency of the associated permutation test. The aptness of our method is demonstrated on both synthetic and real data sets.
1 Introduction
Two-sample testing of equality of distributions, which is the homogeneity hypothesis, is fundamental in statistics and has a long history that dates back to Cramér (1928), Von Mises (1928), Kolmogorov (1933), and Smirnov (1939). The literature on this topic can be categorized by the data type considered. Classical tests designed for low-dimensional data include Bickel (1969), Friedman and Rafsky (1979), Bickel and Breiman (1983), Schilling (1986), Henze (1988) among others. For recent developments that are applicable to data of arbitrary dimension, we refer to energy distance (ED) (Székely & Rizzo, 2004) and maximum mean discrepancy (Gretton et al., 2012; Sejdinovic et al., 2013). To suit high-dimensional regimes (Aoshima et al., 2018; Hall et al., 2005; Zhong et al., 2021), extensions of ED and maximum mean discrepancy were studied in Chakraborty and Zhang (2021), Gao and Shao (2021), and Zhu and Shao (2021). Some other interesting developments of ED for data residing in a metric space include Klebanov (2006) and Lyons (2013). In this paper, we focus on functional data that are random samples of functions on a real interval, e.g., (Davidian et al., 2004; Hsing & Eubank, 2015; Ramsay & Silverman, 2005; Wang et al., 2016).
Two-sample inference for functional data is gaining more attention due to the explosion of data that can be represented as functions. A substantial literature has been devoted to comparing the mean and covariance functions between two groups of curves or testing the nullity of model coefficients, see Fan and Lin (1998), Cuevas et al. (2004), Cox and Lee (2008), Panaretos et al. (2010), Zhang et al. (2010), Horváth and Kokoszka (2012), Zhang and Liang (2014), Staicu et al. (2015), Pini and Vantini (2016), Paparoditis and Sapatinas (2016), Wang et al. (2018), Guo et al. (2019), Zhang et al. (2019), He et al. (2020), Yuan et al. (2020), and Wang (2021). However, the underlying distributions of two random functions can have the same mean and covariance function, but differ in other aspects. Testing homogeneity, which refers to a hypothesis testing procedure to determine the equality of the underlying distributions of any two random objects, is thus of particular interest and practical importance. The literature on testing the homogeneity for functional data is much smaller and restricted to the two-sample case based on either fully observed (Benko et al., 2009; Cabaña et al., 2017; Krzyśko & Smaga, 2021; Wynne & Duncan, 2022) or intensively measured functional data (Hall & Keilegom, 2007; Jiang et al., 2019).
For intensively measured functional data, a presmoothing step is often applied to each individual curve in order to construct a smooth curve before carrying out subsequent analysis, which may reduce mean squared error (Ferraty et al., 2012) or, with luck, remove the noise, a.k.a. measurement error, in the observed discrete data. This results in a two-stage procedure: smoothing first and then testing homogeneity based on the presmoothed curves. For example, Hall and Keilegom (2007) and Jiang et al. (2019) adopted such an approach. In addition, the ED in Székely and Rizzo (2004) could be extended to the space of functions to characterize the distributional differences of random functions, provided the functional data are fully observed without errors (Klebanov, 2006).
Specifically, the ED between two random functions X and Y is defined as
$$\mathrm{ED}(X,Y) = 2\,\mathbb{E}\|X-Y\| - \mathbb{E}\|X-X'\| - \mathbb{E}\|Y-Y'\|, \qquad (1)$$
where $X'$ and $Y'$ are i.i.d. copies of X and Y, respectively, and $\|\cdot\|$ denotes the $L^2$ norm of a function on [0, 1]. According to the results of Klebanov (2006) and Lyons (2013), $\mathrm{ED}(X,Y)$ fully characterizes the distributions of X and Y in the sense that $\mathrm{ED}(X,Y) \ge 0$ and $\mathrm{ED}(X,Y) = 0$ if and only if $X \stackrel{d}{=} Y$, where we use $\stackrel{d}{=}$ to indicate that two random elements are identically distributed. Given two samples of functional data $\{X_i\}_{i=1}^{n}$ and $\{Y_j\}_{j=1}^{m}$ which are intensively measured at some discrete time points, the reconstructed functions, denoted as $\hat{X}_i$ and $\hat{Y}_j$, can be obtained using the aforementioned presmoothing procedure. Then, $\mathrm{ED}(X,Y)$ can be estimated by the U-type statistic
$$\frac{2}{nm}\sum_{i=1}^{n}\sum_{j=1}^{m}\|\hat{X}_i-\hat{Y}_j\| - \frac{1}{n(n-1)}\sum_{i\neq i'}\|\hat{X}_i-\hat{X}_{i'}\| - \frac{1}{m(m-1)}\sum_{j\neq j'}\|\hat{Y}_j-\hat{Y}_{j'}\|. \qquad (2)$$
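To make the two-stage procedure concrete, here is a minimal sketch (in Python with NumPy) of the U-type energy statistic above, assuming the curves have already been presmoothed onto a common grid; the helper names and the trapezoidal approximation of the $L^2$ norm are illustrative choices rather than part of the formal development.

```python
import numpy as np

def l2_dist(f, g, grid):
    """Approximate L2[0, 1] distance between two curves sampled on a common grid."""
    return np.sqrt(np.trapz((f - g) ** 2, grid))

def energy_statistic(X, Y, grid):
    """U-type two-sample energy statistic for presmoothed curves.

    X : (n, p) array, rows are presmoothed curves evaluated on `grid`
    Y : (m, p) array; requires n, m >= 2 for the within-sample terms.
    """
    n, m = X.shape[0], Y.shape[0]
    cross = np.mean([l2_dist(X[i], Y[j], grid) for i in range(n) for j in range(m)])
    within_x = np.sum([l2_dist(X[i], X[k], grid)
                       for i in range(n) for k in range(n) if i != k]) / (n * (n - 1))
    within_y = np.sum([l2_dist(Y[j], Y[l], grid)
                       for j in range(m) for l in range(m) if j != l]) / (m * (m - 1))
    return 2.0 * cross - within_x - within_y
```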
While the above presmoothing procedure to reconstruct the original curves may be promising for intensely measured functional data, it has, to our knowledge, not yet been utilized, perhaps due to the technical and practical challenges of implementing this approach: the level of intensity of the measurement schedule and the proper amount of smoothing are both critical. First, the distance between the reconstructed functional data and its target (the true curve) needs to be tracked and reflected in the subsequent calculations. Second, such a distance would depend on the intensity of the measurements and the bandwidth used in the presmoothing stage. Neither is easy to nail down in practice.
Furthermore, in real-world applications, such as longitudinal studies, each subject may have only a few measurements, leading to sparse functional data (Yao et al., 2005a). Here, presmoothing individual data no longer works and one must borrow information from all subjects to reconstruct the trajectory of an individual subject. The principal analysis through conditional estimation (PACE) approach in Yao et al. (2005a) offers such an imputation method, yet it does not lead to consistent estimates of the true curve for sparsely observed functional data as there are not enough data available for each individual. Consequently, the quantities in equation (1) are not consistently estimable as these expectations are outside the corresponding norms. Pomann et al. (2016) reduced the problem to testing the homogeneity of the scores of the two processes by assuming that the random functions are finite dimensional. Such an approach would not be consistent either as the scores still cannot be consistently estimated for sparse functional data. To our knowledge, there exists no consistent test of homogeneity for sparse functional data. In fact, it is not feasible to test full homogeneity based on sparsely observed functional data as there are simply not enough data for such an ambitious goal.
This seems disappointing since, although it has long been recognized that sparse functional data are much more challenging to handle than intensively measured functional data, much progress has been made to resolve this challenge. For instance, both the mean and covariance function can be estimated consistently at a certain rate (Li & Hsing, 2010; Yao et al., 2005a; Zhang & Wang, 2016) for sparsely observed functional data. Moreover, the regression coefficient function in a functional linear model can also be estimated consistently with rates (Yao et al., 2005b). This motivated us to explore a less stringent concept of homogeneity that can be tested consistently for sparse functional data. In this paper, we provide the answer by proposing a test of marginal homogeneity for two independent samples of functional data. For ease of presentation, we assume that the random functions are defined on the unit interval [0, 1].
Definition of marginal homogeneity. Two random functions X and Y defined on [0, 1] are marginally homogeneous if $X(t) \stackrel{d}{=} Y(t)$ for almost every $t \in [0, 1]$, i.e., if the marginal distributions of $X(t)$ and $Y(t)$ coincide at almost all time points.
From this definition we can see that unlike testing homogeneity that involves testing the entire distribution of functional data, testing marginal homogeneity only involves simultaneously testing the marginal distributions at all time points. This is a much more manageable task that works for all sampling designs, be it intensively or sparsely observed functional data, and it is often adequate in many applications. Testing marginal homogeneity is not new in the literature and has been investigated by Chakraborty and Zhang (2021) and Zhu and Shao (2021) for high-dimensional data. They show that the marginal tests can be more powerful than their joint counterparts under the high-dimensional regime. In a larger context, the idea of aggregating marginal information originates from Zhu et al. (2020), where they consider a related problem of testing the independence between two high-dimensional random vectors.
For real applications of testing marginal homogeneity, taking the analysis of biomarkers over time in clinical research as an example, comparing differences between marginal distributions of the treatment and control groups may be sufficient to establish the treatment effect. To contrast stocks in two different sectors, the differences between marginal distributions might be more important than the differences between joint distributions. In addition, differences between marginal distributions can be seen as the main effect of differences between distributions. Thus, it makes good sense to test marginal homogeneity, especially in situations where joint distribution testing is not feasible or inefficient.
Let λ be the Lebesgue measure on [0, 1]. The focus of this paper is to test
$$H_0: X(t) \stackrel{d}{=} Y(t) \ \text{for } \lambda\text{-almost all } t \in [0,1] \quad \text{versus} \quad H_1: \lambda\big\{t \in [0,1]: X(t) \stackrel{d}{\neq} Y(t)\big\} > 0. \qquad (3)$$
This can be accomplished through the marginal energy distance (MED) defined as
$$\mathrm{MED}(X,Y) = \int_0^1 \big\{ 2\,\mathbb{E}|X(t)-Y(t)| - \mathbb{E}|X(t)-X'(t)| - \mathbb{E}|Y(t)-Y'(t)| \big\}\, dt, \qquad (4)$$
where $X'$ and $Y'$ are independent copies of X and Y, respectively. Indeed, MED is a metric for marginal distributions in the sense that $\mathrm{MED}(X,Y) \ge 0$ and $\mathrm{MED}(X,Y) = 0$ if and only if $X(t) \stackrel{d}{=} Y(t)$ for almost all $t \in [0,1]$. A key feature of our approach is that it can consistently estimate $\mathrm{MED}(X,Y)$ for all types of sampling plans. Moreover, the pointwise expectations $\mathbb{E}|X(t)-Y(t)|$, $\mathbb{E}|X(t)-X'(t)|$, and $\mathbb{E}|Y(t)-Y'(t)|$ can all be reconstructed consistently for both intensively and sparsely observed functional data. Such a unified procedure for all kinds of sampling schemes may be more practical as the separation between intensively and sparsely observed functional data is usually unclear in practical applications. Moreover, it could happen that while some of the subjects are intensively observed, others are sparsely observed. In the extremely sparse case, our method can still work even if each subject has only one measurement.
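As a point of reference, the following sketch evaluates the sample analogue of (4) in the idealized situation where both samples are observed on the same dense grid: the pointwise energy distance is computed at each grid point and then integrated over t. This is only an illustration under that idealized design; the estimator developed in Section 2 instead uses local linear smoothers so that sparse, asynchronous designs are also covered.

```python
import numpy as np

def pointwise_energy(x, y):
    """Univariate energy distance between samples x and y (V-statistic form)."""
    cross = np.mean(np.abs(x[:, None] - y[None, :]))
    within_x = np.mean(np.abs(x[:, None] - x[None, :]))
    within_y = np.mean(np.abs(y[:, None] - y[None, :]))
    return 2.0 * cross - within_x - within_y

def med_dense(X, Y, grid):
    """MED estimate for curves fully observed on a common grid.

    X : (n, p) array of curve values, Y : (m, p) array, grid : (p,) array in [0, 1].
    """
    pointwise = np.array([pointwise_energy(X[:, k], Y[:, k]) for k in range(len(grid))])
    return np.trapz(pointwise, grid)   # integrate the pointwise energy distance over t
```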
Measurement errors (or noise) are common for functional data, so it is important to accommodate them. If noise is left unattended, there will be bias in the estimates of MED as the observed distributions are no longer the true distributions of X and Y. One might hope that the measurement errors can be averaged out during the estimation of the pointwise expectations in (4), in analogy to estimating the mean function or covariance function (Yao et al., 2005a). However, this is not the case. To see why, let $\tilde{X}(t) = X(t) + \epsilon(t)$ and $\tilde{Y}(t) = Y(t) + \epsilon'(t)$, where $\epsilon$ and $\epsilon'$ are independent white noise. When estimating the mean or covariance function at any fixed time t, it holds that $\mathbb{E}\{\tilde{X}(t)\} = \mathbb{E}\{X(t)\}$ and $\mathrm{cov}\{\tilde{X}(s), \tilde{X}(t)\} = \mathrm{cov}\{X(s), X(t)\}$ for $s \neq t$. But $\mathbb{E}|\tilde{X}(t) - \tilde{Y}(t)| \neq \mathbb{E}|X(t) - Y(t)|$ in general. Likewise, we can see that the ED in (1) would face the same challenge in handling measurement errors unless these errors were removed in a presmoothing step before carrying out the test. So the challenge with measurement errors is not triggered by the use of the pointwise absolute distance in MED; the $L^2$ norm in ED faces the same challenge.
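A quick numerical illustration of this point, under an assumed Gaussian toy model that is not part of the paper's examples: adding independent mean-zero noise to both samples leaves the pointwise means untouched but inflates the pointwise expected absolute difference.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
x = rng.normal(0.0, 1.0, n)          # X(t) at a fixed t
y = rng.normal(0.0, 1.0, n)          # Y(t) at a fixed t, same marginal as X(t)
eps_x = rng.normal(0.0, 0.5, n)      # independent mean-zero measurement errors
eps_y = rng.normal(0.0, 0.5, n)

print(np.mean(x + eps_x) - np.mean(x))              # ~0: the mean is unaffected by noise
print(np.mean(np.abs(x - y)))                        # ~1.128 = 2/sqrt(pi): E|X(t) - Y(t)|
print(np.mean(np.abs((x + eps_x) - (y + eps_y))))    # larger (~1.26): noise does not average out
```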
For intensely measured functional data, a presmoothing step is often used to handle measurement errors in the observed data in the hope that smoothing will remove the error. However, this is a delicate issue, as it is difficult to know the amount of smoothing needed for the subsequent analysis to retain the same convergence rate as if the true functional data were fully observed without errors. For instance, Zhang and Chen (2007) studied the effects of smoothing to obtain reconstructed curves and showed that, in order to retain the same convergence rate of mean estimation as for fully observed functional data, the number of measurements per subject that generates the curves must be of higher order than the number of independent subjects. This requires functional data that are intensively sampled well beyond the ultra-dense (or dense) functional data that have been studied in the literature (Zhang & Wang, 2016).
In this paper, we propose a new way of handling measurement errors so that the MED-based testing procedure is still consistent in the presence of measurement errors. The key idea is to show that when the measurement errors of X and of Y are identically distributed, the MED-based approach can still be applied to the contaminated data with consistency guaranteed under mild assumptions (cf. Corollary 3). When the error distributions differ, we propose an error-augmentation approach, which can be applied jointly with our unified estimation procedure.
To conduct hypothesis testing, MED-based approaches can be implemented as permutation tests. We refer to the book by Lehmann and Romano (2005) for an extensive introduction to permutation tests. The use of permutation tests for distance/kernel-based statistics is not new in the literature. The consistency of the permutation test for distance covariance (Székely et al., 2007) or the Hilbert–Schmidt independence criterion (Gretton et al., 2007) has been investigated by Pfister et al. (2018), Rindt et al. (2021), and Kim et al. (2022) for low-dimensional data, where distance covariance and the Hilbert–Schmidt independence criterion are distance-based and kernel-based independence measures, respectively. Under the high dimension, low sample size setting (Hall et al., 2005), Zhu and Shao (2021) studied the size and power behaviours of the permutation test for ED. The difference in the proof of consistency of the permutation test between longitudinal data and vector-valued data is substantial. For instance, MED estimated from longitudinal data is an integration of locally weighted averages, where the measurement time points in the weights are also random and hence require special handling in the proof.
The rest of the paper is organized as follows. Section 2 contains the main methodology and supporting theory about testing marginal homogeneity. Numerical studies are presented in Section 3. The conclusion is in Section 4. All technical details are postponed to Section 5.
2 Testing marginal homogeneity
We first consider the case where there are no measurement errors and postpone the discussion of measurement errors to the end of this section. Let $X_1, \ldots, X_n$ and $Y_1, \ldots, Y_m$ be i.i.d. copies of X and Y, respectively. In practice, the functions are only observed at some discrete points:
This sampling plan allows the two samples to be measured on different schedules, and additionally each subject within a sample has its own measurement schedule. This is a realistic assumption, but the consequence is that two-dimensional smoothers will be needed to estimate the targets. Fortunately, our estimator attains the same convergence rate as a one-dimensional smoothing method. This intriguing phenomenon will be explained later.
For notational convenience, denote and . Let be the combined observations, where for , is a vector of length ,
The observations corresponding to X and Y are defined as and , respectively.
To estimate MED in (4), note that we actually have no direct observations of the one-dimensional functions $\mathbb{E}|X(t)-Y(t)|$, $\mathbb{E}|X(t)-X'(t)|$, and $\mathbb{E}|Y(t)-Y'(t)|$, due to the longitudinal design where X and Y are observed at different time points. Thus, the sampling schedules for X and Y are not synchronized. A consequence of such asynchronized functional/longitudinal data is that a one-dimensional smoothing method, which is typically employed to estimate a one-dimensional target function such as $\mathbb{E}|X(t)-Y(t)|$, does not work here because the difference $|X_i(t)-Y_j(t)|$ cannot be observed at any single point t. However, a workaround is to estimate the following two-dimensional functions first:
then set s = t in all three estimators. Since the two-dimensional functions can all be recovered by some local linear smoother, MED admits consistent estimates for both intensively and sparsely observed functional data. For instance, the first can be estimated by , where
and can be estimated by , where
The third function can be estimated similarly by an analogous estimator. In the above, the quantities in the weights should be understood as the respective lengths of the observation vectors, and K is a one-dimensional kernel with bandwidth h. The sample estimate of MED is then constructed by evaluating the three two-dimensional estimators on the diagonal s = t, plugging them into (4), and integrating the resulting pointwise energy distance over t.
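The sketch below gives a simplified kernel-weighted (Nadaraya–Watson-type) version of this construction for the cross-sample function $\mathbb{E}|X(t)-Y(t)|$; the actual estimators (5) and (6) are local linear smoothers, and the variable names (T, U for the time points and values of the X-sample, S, V for the Y-sample) are illustrative. Each subject contributes all of its measurements, so information is pooled across subjects even when some subjects have a single observation; the within-sample functions are estimated analogously from pairs of observations taken from distinct subjects of the same sample.

```python
import numpy as np

def epanechnikov(u):
    return 0.75 * np.maximum(1.0 - u ** 2, 0.0)

def smooth_cross_abs(T, U, S, V, t_grid, h):
    """Kernel-weighted estimate of g(t) = E|X(t) - Y(t)| on t_grid.

    T, U : lists of 1-d arrays, time points and values for the X-subjects
    S, V : lists of 1-d arrays, time points and values for the Y-subjects
    """
    # pool all cross-sample pairs (one observation from an X-subject, one from a Y-subject)
    pairs = [(ti, ui, sj, vj)
             for Ti, Ui in zip(T, U) for Sj, Vj in zip(S, V)
             for ti, ui in zip(Ti, Ui) for sj, vj in zip(Sj, Vj)]
    ti, ui, sj, vj = map(np.asarray, zip(*pairs))
    est = np.empty(len(t_grid))
    for k, t in enumerate(t_grid):
        # weight pairs whose two time points are both close to t (i.e., the diagonal s = t)
        w = epanechnikov((ti - t) / h) * epanechnikov((sj - t) / h)
        est[k] = np.sum(w * np.abs(ui - vj)) / max(np.sum(w), 1e-12)
    return est
```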
For hypothesis testing, the critical value or p-value can be determined by permutations (Lehmann & Romano, 2005). To be more specific, let π be a permutation of the indices of the combined sample. There are $(n+m)!$ such permutations in total and we denote the set of permutations as . For , define the permutation of on as
Write the statistic based on the permuted sample as , let be i.i.d. and uniformly sampled from , and define the permutation-based p-value as
Then, the level-α permutation test rejects the null hypothesis whenever the permutation-based p-value is at most α.
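A minimal sketch of this permutation scheme, with illustrative helper names (`med_statistic` stands for whichever estimator of MED is used, e.g., the smoother-based one above): subject-level data are pooled, group labels are shuffled, the statistic is recomputed for each shuffle, and the p-value applies the standard finite-sample correction.

```python
import numpy as np

def permutation_pvalue(sample_x, sample_y, med_statistic, n_perm=200, seed=0):
    """Permutation p-value for a generic two-sample statistic on subject-level data."""
    rng = np.random.default_rng(seed)
    pooled = list(sample_x) + list(sample_y)
    n = len(sample_x)
    observed = med_statistic(sample_x, sample_y)
    count = 0
    for _ in range(n_perm):
        idx = rng.permutation(len(pooled))          # shuffle subjects between the two groups
        perm_x = [pooled[i] for i in idx[:n]]
        perm_y = [pooled[i] for i in idx[n:]]
        if med_statistic(perm_x, perm_y) >= observed:
            count += 1
    return (1 + count) / (1 + n_perm)               # standard finite-sample permutation p-value

def permutation_test(sample_x, sample_y, med_statistic, alpha=0.05, n_perm=200):
    """Level-alpha test: reject when the permutation p-value is at most alpha."""
    return permutation_pvalue(sample_x, sample_y, med_statistic, n_perm) <= alpha
```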
2.1 Convergence theory
In this subsection, we show that the proposed estimator of MED is consistent and develop its convergence rate.
- (A.1) The kernel function K is symmetric, Lipschitz continuous, supported on , and satisfies
- (A.2)
Let , and denote the density functions of by , , respectively. There exist constants c and C such that for any .
- (A.3)
are mutually independent.
- (A.4)
The second-order partial derivatives of are bounded on .
- (A.5)
and .
The next assumption specifies the relationship between the number of observations per subject and the decay rate of the bandwidth parameters.
The following theorem states that we can consistently estimate MED with sparse observations.
Given two sequences of positive real numbers $\{a_k\}$ and $\{b_k\}$, we say that $a_k$ and $b_k$ are of the same order as $k \to \infty$ (denoted as $a_k \asymp b_k$) if there exist constants $0 < c_1 \le c_2 < \infty$ such that $c_1 b_k \le a_k$ and $a_k \le c_2 b_k$. The convergence rates of the MED estimator for different sampling plans are provided in the following corollary.
Under Assumptions 1 and 2, and further assume .
- When for all , where is a constant, and , we have
- When for all and , we have
2.2 Validity of the permutation test and power analysis
We now justify the permutation-based test for sparsely observed functional data. Under the null hypothesis and the mild assumption that are i.i.d across subjects, the size of the test can be guaranteed by the fact that the distribution of the sample is invariant under permutation. For a rigorous argument, see Theorem 15.2.1 in Lehmann and Romano (2005). Thus, the permutation test based on the test statistic (7) produces a legitimate size of the test. The power analysis is much more challenging and will be presented below.
Let be a fixed permutation and , be the estimated functions from algorithms (5) and (6) using permuted samples . The conditions on the decay rate of bandwidth parameters that ensure the convergence of for any fixed permutation π are summarized below.
Let Π be a random permutation uniformly sampled from . If the sample is randomly shuffled, it holds that and for any . For the sample estimate based on the permuted sparse observations, we show that converges to 0 in probability.
For the original data that have not been permuted, , which is strictly positive under the alternative hypothesis. On the other hand, we know from Theorem 2 that the permuted statistic converges to 0 in probability. This suggests that, under mild assumptions, the probability of rejecting the null approaches 1 as the sample sizes tend to infinity. We make this idea rigorous in the following theorem.
Since Assumption 3 implies Assumption 2, Theorem 1 holds under the assumption of Theorem 3 and it facilitates the proof of Theorem 3.
2.3 Handling of measurement errors
In the presence of measurement errors, the actually observed data are
where , , and are mean 0 independent univariate random variables. Denote the combined noisy observations by , where
The local linear smoothers described in equations (5) and (6) are then applied with the input data and replaced, respectively, by and . The resulting outputs are denoted as , leading to the estimator
Correspondingly, the proposed test with contaminated data is
where . To study the convergence of , define the two-dimensional functions , , and as
where and are independent and identical copies of and , respectively. The target of is shown to be
An unpleasant fact is that involves the errors, which cannot be easily removed due to the presence of the absolute value function in (11). The handling of measurement errors in both method and theory is thus very different here from conventional approaches for functional data, where one does not deal with the absolute value function. The ED with the $L^2$-norm in equation (2) also has this issue. Thus, measurement errors would also be a challenge for the full homogeneity test even if we could approximate the $L^2$ norm in the ED well.
To show the approximation error of , we need the following assumptions.
- (D.1)
and .
- (D.2)
are mutually independent.
- (D.3)
The second-order partial derivatives of are bounded on .
By the property of ED, it holds that and for almost all . Then, we show that under the following assumptions, the condition that would imply the homogeneity of and .
Suppose that for any
- (E.1)
, are continuous random variables with density functions , , respectively.
- (E.2)
, are i.i.d continuous random variables with characteristic function and the real zeros of have Lebesgue measure 0.
- (E.3)
are mutually independent.
For common distributions, such as the Gaussian and Cauchy, the characteristic functions are of exponential form and have no real zeros. Some other random variables, such as the exponential, chi-square, and gamma, have characteristic functions of the form with only a finite number of real zeros. Since it is common to assume Gaussian measurement errors, the restriction on the real zeros of the characteristic function in Assumption 5(E.2) is very mild.
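For concreteness, the characteristic functions referred to here take the standard forms
$$\varphi_{N(\mu,\sigma^2)}(t) = \exp\!\big(i\mu t - \tfrac{1}{2}\sigma^2 t^2\big), \qquad \varphi_{\mathrm{Cauchy}(\mu,\gamma)}(t) = \exp\!\big(i\mu t - \gamma |t|\big), \qquad \varphi_{\mathrm{Gamma}(k,\theta)}(t) = (1 - i\theta t)^{-k},$$
none of which has any real zero, so the Lebesgue-measure-zero condition on the real zeros in Assumption 5(E.2) is satisfied by measurement errors from these families.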
Based on the above theorem, we have the important property that for almost all . As discussed before, can be consistently estimated by and the test can be conducted via permutations. Consequently, data contaminated with measurement errors can still be used to test marginal homogeneity as long as the measurement errors of the two samples are identically distributed. We make this statement rigorous below.
The circumstance of identically distributed errors among the two samples is a strong assumption that nevertheless can be satisfied in many real situations, for example, when the curves and are measured by the same instrument. The primary biliary cirrhosis (PBC) data in Section 3.2 underscore this phenomenon.
When the error distributions of the two samples differ, not all is lost and we show that some workarounds exist. In particular, we propose an error-augmentation method that raises the noise of one sample to the same level as that of the other sample.
For instance, suppose that and . Then the variances and can be estimated consistently using the R package ‘fdapace’ (Carroll et al., 2021) under both intensive and sparse designs with estimates and , respectively. Yao et al. (2005a) showed that
where and are the bandwidth parameters for estimating the covariance function and the diagonal function , respectively. A different estimator for that has a better convergence rate is provided in Lin and Wang (2022). An analogous result holds for . Then, by adding additional Gaussian white noise, we obtain the error-augmented data , as follows:
where . With being the combined error-augmented data, the proposed test is
where .
The normal error assumption is common in practice. The variance augmentation approach also works for any parametric family of distributions that is closed under convolution (i.e., the sum of two independent distributions from this family is also a member of the family) and has the property that the first two moments of a distribution determine the distribution.
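A minimal sketch of this error-augmentation step under the Gaussian assumption (the function name and the use of plug-in variance estimates as inputs are illustrative; in practice the error variances would be estimated as described above): independent Gaussian noise is added to the less noisy sample so that both samples end up with approximately the larger error variance.

```python
import numpy as np

def augment_errors(values_x, values_y, sigma2_x_hat, sigma2_y_hat, seed=0):
    """Add Gaussian noise to the less noisy sample so both error variances match.

    values_x, values_y : lists of 1-d arrays of observed (noisy) measurements per subject
    sigma2_x_hat, sigma2_y_hat : estimated measurement-error variances of the two samples
    """
    rng = np.random.default_rng(seed)
    gap = sigma2_y_hat - sigma2_x_hat
    if gap > 0:       # X-sample is less noisy: raise its noise level
        values_x = [v + rng.normal(0.0, np.sqrt(gap), size=v.shape) for v in values_x]
    elif gap < 0:     # Y-sample is less noisy
        values_y = [v + rng.normal(0.0, np.sqrt(-gap), size=v.shape) for v in values_y]
    return values_x, values_y
```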
3 Numerical studies
In this section, we examine the proposed testing procedure for both synthetic and real data sets.
3.1 Performance on simulated data
For simulations, we set and perform 500 Monte Carlo replications with 200 permutations for each test. We select the bandwidth parameters as in estimating MED. Commonly used methods in the literature, such as cross-validation and generalized cross-validation, could be used, but they require more computing resources and their success is not guaranteed. Therefore, we adopted a simple default bandwidth, which uses 10% of the data range, as specified in the R package fdapace (Carroll et al., 2021). This choice performed satisfactorily. For real data, the best practice is to explore various bandwidths, including subjective ones based on visual inspection of the data, and decide which one fits the data best without evidence of under- or over-fitting. The following example is used to examine the size of our test.
The quantities are used to control the sparsity level. We consider three designs with different sparsity levels: the numbers of observations per subject are uniformly selected from {1, 2} (extremely sparse), {4, 5, 6} (medium sparse), or {8, 9, 10} (not so sparse) for all subjects. The variances and quantify the magnitude of the measurement errors. We set when and when . If the two error variances differ, the error augmentation method described in Section 2.3 is used. To examine whether our test works for noise-contaminated situations, we add Gaussian white noise when applying the error augmentation method regardless of whether or not the measurement errors follow a Gaussian distribution.
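For illustration, the following sketch generates one sample of sparse, noisy functional observations in the spirit of this design; the Gaussian process used (mean zero, exponential covariance) is a stand-in and not the process of Example 1, while the sparsity sets {1, 2}, {4, 5, 6}, and {8, 9, 10} correspond to the three designs above.

```python
import numpy as np

def generate_sparse_sample(n, sparsity_set, sigma2, seed=0):
    """Sparse noisy observations from a placeholder Gaussian process on [0, 1].

    Returns per-subject time points and noisy values; the exponential covariance
    exp(-|s - t| / 0.3) is an illustrative choice, not the process of Example 1.
    """
    rng = np.random.default_rng(seed)
    times, values = [], []
    for _ in range(n):
        ni = rng.choice(sparsity_set)                    # number of observations for this subject
        t = np.sort(rng.uniform(0.0, 1.0, ni))           # subject-specific measurement schedule
        cov = np.exp(-np.abs(t[:, None] - t[None, :]) / 0.3)
        x = rng.multivariate_normal(np.zeros(ni), cov)   # latent trajectory at the observed times
        values.append(x + rng.normal(0.0, np.sqrt(sigma2), ni))  # add measurement error
        times.append(t)
    return times, values

# e.g. the "medium sparse" design with 200 subjects and an illustrative error variance of 0.1
T, U = generate_sparse_sample(200, [4, 5, 6], 0.1)
```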
Table 1 contains the size comparison results under the sparse designs. As a baseline method for comparison, the functional principal component analysis (FPCA) approach is also included, where we first impute the principal scores and then apply the ED on the imputed scores. To be more specific, the first step is to reconstruct the principal scores using the R package 'fdapace' (Carroll et al., 2021). Then, a two-sample test for multivariate data is applied to the recovered scores. For this, we choose the ED-based procedure and conduct the hypothesis testing via the R package 'energy' (Rizzo & Szekely, 2021). The FPCA approach has two drawbacks. First, the scores cannot be estimated consistently; second, the infinite-dimensional vector of scores has to be truncated for computational purposes, which causes information loss. When the Gaussian errors have different variances, the same error-augmentation method is applied to the FPCA approach. From Table 1, we see that the MED-based methods have satisfactory size for all cases, while the FPCA approach leads to sizes larger than the nominal level for the very sparse case and sizes smaller than the nominal level for the less sparse case, meaning that the size of the FPCA approach is often different from the nominal level. This may be due to the inherent inaccuracy in the imputed scores. When the error variances differ, we apply the error-augmentation approach and add additional Gaussian noise, even when the measurement errors follow scaled Student t-distributions. The closeness of the size to the nominal level reveals the robustness of our test. The following example is used to examine the power of our test.
[Table 1. Size of the FPCA- and MED-based tests, with and without error augmentation, for Normal and Student t measurement errors, sample sizes of 150, 200, and 300, and numbers of observations per subject drawn from {1, 2}, {4, 5, 6}, or {8, 9, 10}. The numerical entries could not be recovered from the source. Note. MED = marginal energy distance.]
By selecting and such that , we have . Under this scenario, and have the same marginal mean and variance, but different marginal distributions. In this example, we set , , when and when . The power comparison results are provided in Table 2. Since the size of the FPCA approach is often different from the nominal level, the power comparison with it is not so meaningful, as a fair comparison requires similar actual sizes. We have thus not included FPCA in the power analysis. Instead, we compare the power of MED-based tests between sparse and intensively sampled data, where the sparse data are uniformly sampled from the intensively sampled data. The same error-augmentation approach is applied when the error variances differ, regardless of design intensity. When the measurement errors follow scaled Student t-distributions, MED-based tests have a moderate power loss relative to intensively sampled data when , but the powers catch up quickly when . We note that when the measurement errors follow scaled t-distributions with , the error-augmentation approach is applied by adding additional Gaussian noise to the observations. The high powers for the medium and not so sparse cases demonstrate the robustness of our tests against the additional noise. For the extremely sparse case (one or two observations per subject) with Gaussian measurement errors, as the sample sizes increase to 300, the power grows to 0.836 when and to 0.556 when .
[Table 2. Power of the MED-based tests, with and without error augmentation, for Normal and Student t measurement errors and sample sizes of 150, 200, and 300, comparing sparse and intensively sampled designs. The numerical entries could not be recovered from the source. Note. MED = marginal energy distance.]
The stochastic processes are i.i.d. copies of X, are i.i.d. copies of Y, where for , X is a Gaussian process with mean 0 and covariance structure , and Y is a Gaussian process with mean and covariance structure . When and , X and Y have the same mean but different covariance structures. When and , X and Y have the same covariance structure but different means. In the two-sample context, the magnitudes of α and β determine the level of deviation from the null hypothesis. Larger values lead to an easier testing problem, so we can explore the power performance under various alternative hypotheses. We consider sampling designs similar to Example 1 with Gaussian measurement errors.
For this example, we consider the case with noise level . We compare the results for both sparsely () and intensively sampled data (), where the sparse data are subsampled from the intensively sampled data. The simulation results are shown in Figure 1, which reveals that the MED-based approach enjoys strong power growth for intensively sampled data, and comparable power growth for sparsely sampled data (see the left panel of Figure 1), when testing the equal-mean hypothesis. The result in the right panel of Figure 1 suggests that the sampling frequency has a more prominent effect on the power of testing equal covariances.
[Figure 1. Power of the MED-based test under the equal-mean (left panel) and equal-covariance (right panel) alternatives, for sparsely and intensively sampled data.]
3.2 Applications to real data
In this subsection, we apply the proposed MED-based tests to two real data sets. PBC data: The first data set is the PBC data from the Mayo Clinic (Fleming & Harrington, 2005). This data set is from a clinical trial studying PBC of the liver. There were 312 patients assigned to either the treatment or the control group. The drug D-penicillamine was given to the treatment group. Here, we are interested in testing the equality of the marginal distributions of prothrombin time, which is a blood test that measures how long it takes blood to clot. The trajectories of prothrombin time for different subjects are plotted in Figure 2, and there are on average six measurements per subject. For our tests, the bandwidth is set to be 2. Here, the equal error-distribution assumption seems reasonable (the estimated error variances for the treatment and control groups are 0.96 and 1.009, respectively). Using 200 permutations, the p-value of the MED-based test is 0.54, which means that there is not enough evidence to conclude that the marginal distributions of prothrombin time differ between the two groups. This conclusion is consistent with existing knowledge that D-penicillamine is ineffective in treating PBC of the liver.
[Figure 2. Observed trajectories of prothrombin time for subjects in the treatment and control groups of the PBC data.]
Strawberry data: In the food industry, there is continuing interest in distinguishing pure fruit purees from adulterated ones (Holland et al., 1998). One practical way to detect adulteration is to look at the spectra of the fruit purees. Here, we are interested in testing the equality of the marginal distributions between the spectra of strawberry purees (authentic samples) and non-strawberry purees (adulterated strawberries and other fruits). The strawberry data can be downloaded from the UCR Time Series Classification Archive (Dau et al., 2018; https://www.cs.ucr.edu/~eamonn/time_series_data_2018/). The single-beam spectra of the purees were normalized to background spectra of water and then transformed into absorbance units. The spectral range was truncated to 899–1802 (235 data points). The two samples of spectra are plotted in Figure 2, and more information about this data set can be found in Holland et al. (1998). The estimated variances of the measurement errors are 0.000279 and 0.00031 for the two samples, which indicates that there are practically no measurement errors. To check the performance of our method, we analyse the data using all 235 measurements as well as sparse subsamples that contain 2–10 observations per subject. The R package 'energy' is applied to the complete data. Both tests are conducted with 200 permutations and have p-value 0.005. Thus, we have strong evidence to conclude that the marginal distributions between the spectra of strawberry and non-strawberry purees are significantly different, and our test produced similar results regardless of the sampling plan.
4 Conclusion
The literature on testing homogeneity for functional data is scarce probably because most approaches rely on intensive measurement schedules and the hope that measurement errors could be addressed by presmoothing the data. Since reconstruction of noise-free functional data is not feasible for sparsely observed functional data, a test of homogeneity is infeasible. In this work, we show what is feasible for sparse functional data, a.k.a. longitudinal data, and propose a test of marginal homogeneity that adapts to the sampling plan and provides the corresponding convergence rate. Our test is based on ED with a focus on testing the marginal homogeneity. To the best of our knowledge, this is the only nonparametric test with theoretical guarantees under sparse designs, which are ubiquitous.
There are several twists in our approach, including the handling of asynchronized longitudinal data and the unconventional way that measurement errors affect the method and theory. The asynchronization of the data can be overcome completely, as demonstrated in Section 2.1, but the handling of measurement errors requires some compromise when the distributions of the measurement errors differ between the two samples. This is the price one pays for the lack of data and is not due to the use of the pointwise absolute distance associated with testing marginal homogeneity, as the $L^2$ norm used for testing full homogeneity would face the same challenge with measurement errors unless a presmoothing step had been employed to eliminate them. As we mentioned in Section 1, this would require a super-intensive sampling plan well beyond the usual requirement for dense or ultra-dense functional data (Zhang & Wang, 2016). While the new approach may involve error augmentation, numerical results show that the efficiency loss is minimal. Moreover, such an augmentation strategy is not uncommon. For instance, an error augmentation method has also been adopted in the SIMEX approach (Cook & Stefanski, 1994) to deal with measurement errors for vector data.
While testing marginal homogeneity has its own merits and advantages over a full-fledged test of homogeneity, our intention is not to particularly endorse it. Rather, we point out what is feasible and infeasible for sparsely or intensively measured functional data and develop theoretical support for the proposed test. To the best of our knowledge, we are the first to provide the convergence rate for the permuted statistics for sparse functional data. This proof and the proof of consistency for the proposed permutation test are non-conventional and different from the multivariate/high-dimensional case.
5 Technical details
5.1 Proof of Theorem 1
Here, we show that
and can be bounded similarly. For , set
where ; otherwise . The weighted raw data are denoted as
If there is no confusion, we will simply write , instead of , for simplicity. Both (5) and (6) admit closed-form solutions and some algebra shows that for
Here, are two-dimensional functions defined as
where for , have the following expressions:
By some straight-forward calculations, for
where for ,
and
Lemma 3 entails that the denominator in (13) is bounded away from 0 with high probability. Thus, for , we have
It can be easily seen that if , and if . By Taylor expansion
where
5.2 Proof of Theorem 2
For any permutation π, let be the estimated function based on the permuted sample
Correspondingly, the explicit form of depends on the following quantities:
For , let be defined similarly, with replaced by , respectively. Then it can be shown that admits a similar form as with replaced by , .
The following lemma shows that as well as converge to their mean functions, where Π is a random permutation sampled uniformly from .
Under the assumptions of Theorem 2, for any , we have
- for any fixed permutation ,where C is a constant that depends on . Moreover, satisfies the same bound.
- For any random permutation Π sampled from uniformly,where C is a constant that depends on . Moreover, can attain the same rate as above.
Under the assumptions of Theorem 2, for any , we have
- for any fixed permutation ,
- For any random permutation Π sampled from uniformly,
(ii) The result follows from the fact that the upper bounds in (17) and (18) hold uniformly for all permutations in . □
Next, we resume the proof of Theorem 2. For any , we set
By symmetry of the kernel K, if or . Consequently, by Lemmas 2 and 3, if ,
Otherwise, would converge to a positive constant in probability. This also implies that the denominator of is bounded away from 0 with high probability. Thus,
which entails that
5.3 Proof of Theorem 3
Under , set . By Theorems 1 and 2,
5.4 Proof of Corollary 2
Under Assumptions 1, 2, and 4, the result can be shown by adopting similar arguments as in the proof of Theorem 1 by replacing , , , , with , , , , , respectively.
5.5 Proof of Theorem 4
5.6 Proof of Corollary 3
Funding
This research was supported by NIH grant UH3OD023313 (ECHO programme) and NSF DMS-2210891 and DMS-1914917.
Data availability
The data are available in the book ‘Counting Process and Survival Analysis’ by Thomas R. Fleming and David P. Harrington (2005) [https://onlinelibrary.wiley.com/doi/book/10.1002/9781118150672] and from the UCR Time Series Classification Archive [https://www.cs.ucr.edu/eamonn/time_series_data_2018/].
References
Author notes
Conflict of interest: We have no conflicts of interest to disclose.