Summary

Whole-brain connectome data characterize the connections among distributed neural populations as a set of edges in a large network, and neuroscience research aims to systematically investigate associations between the brain connectome and clinical or experimental conditions as covariates. A covariate is often related to a number of edges connecting multiple brain areas in an organized structure. However, in practice, neither the covariate-related edges nor the structure is known. Therefore, the understanding of underlying neural mechanisms relies on statistical methods that can simultaneously identify covariate-related connections and recognize their network topological structures. The task is challenging because of false-positive noise and the almost infinite number of ways edges can combine into subnetworks. To address these challenges, we propose a new statistical approach that handles multivariate edge variables as outcomes and outputs covariate-related subnetworks. We first study the graph properties of covariate-related subnetworks from a graph and combinatorics perspective and accordingly bridge the inference for individual connectome edges and covariate-related subnetworks. Next, we develop efficient algorithms to extract covariate-related subnetworks from the whole-brain connectome data with an |$\ell_0$| norm penalty. We validate the proposed methods in an extensive simulation study and benchmark our performance against existing methods. Using our proposed method, we analyze two separate resting-state functional magnetic resonance imaging data sets for schizophrenia research and obtain highly replicable disease-related subnetworks.

1. Introduction

Brain connectome analysis has attracted growing research interest in the field of neuroscience, aiming at revealing systematic neurophysiological patterns associated with human behaviors, cognition, and brain diseases (Simpson and others, 2013; Hu and others, 2022). In the past two decades, developments in neuroimaging techniques—including functional magnetic resonance imaging (fMRI) and diffusion tensor imaging—have facilitated large-scale measurements of whole-brain functional and structural connectivity (Bowman and others, 2012). In these experiments, brain neuroimaging data are collected for each participant to form a brain connectivity network characterizing the wiring among neural populations.

The brain connectome data can be encoded as a set of weighted networks on a common set of nodes shared by all participants, where a node represents a brain area and a weighted edge delineates the strength of the functional covariation or structural linkage between brain areas (Lukemire and others, 2021; Xia and Li, 2017; Cai and others, 2019; Warnick and others, 2018). Participants with different behavioral and clinical conditions tend to exhibit distinct brain connectivity patterns at global and local levels. In these studies, statistical methods have played a central role in discovering the systematic effects of a covariate (e.g., a clinical condition) on brain networks and have led to a comprehensive understanding of the underlying neurophysiopathological mechanisms (Zhang and others, 2017; Durante and others, 2018; Kundu and others, 2018; Cao and others, 2019; Wang and others, 2019).

In the present research, we focus on statistical methods for modeling multivariate connectivity edges constrained in a connectome adjacency matrix as outcomes and clinical and demographic conditions as covariates (Simpson and others, 2019; Zhang and others, 2023). These methods are alternatives to covariate-related connectivity network methods using principal component analysis and independent component analysis techniques (e.g., Shi and Guo, 2016; Zhao and others, 2021) that take time series at multiple brain regions as the input connectome data for each participant. Therefore, the two sets of methods can provide complementary perspectives to characterize the connectome patterns associated with the covariate of interest. In the neuroimaging literature, brain network analysis often refers to the analysis of prespecified “networks” (e.g., the default mode network [DMN]), which boils down to assessing the association between the covariate and the averaged connection strength of edges in the network (Craddock and others, 2013). However, analysis of prespecified networks may miss the true covariate-related network while introducing false-positive findings. Lastly, cluster-wise multivariate edge inference methods (e.g., the network-based statistic [NBS]) have also been used widely with control of the family-wise error rate (FWER) (Zalesky and others, 2010). The covariate-related networks in NBS are formed by a three-step procedure: (i) performing statistical analysis on each edge and attaining corresponding test statistics; (ii) applying a threshold to the test statistics of all edges and then searching for the maximally connected networks of suprathreshold edges in the whole-brain connectome; (iii) conducting permutation tests for detected networks to control the FWER. However, based on graph theory, a small proportion of suprathreshold covariate-related edges under a sound threshold can almost surely connect all nodes in the connectome (Stepanov, 1970), and selecting subnetworks as a set of covariate-related edges involving all brain areas is less biologically meaningful (Craddock and others, 2013). Consequently, utilizing current methods for covariate-related network analysis (e.g., NBS) can result in subpar inferential accuracy, because false-positive noise hinders the extraction of subnetworks and statistical theory for testing extracted subnetworks is lacking.

To fill this gap, we propose a procedure called Statistical Inference for Covariate-Related Subnetworks (SICERS). We first define a covariate-related subnetwork as a set of edges associated with the covariate of interest that constitutes an organized graph structure (e.g., a community or interconnected communities). Evaluating the network-level effect of a covariate on the whole-brain connectome requires (i) identifying subnetworks concentrated with covariate-related edges and (ii) allocating a large proportion of covariate-related edges to covariate-related subnetworks. We then show that, given no network-level effect of a covariate, the chance of discovering a moderate-sized subnetwork concentrated with covariate-related edges is close to zero (by Lemma 2.1). In other words, detecting a reasonably sized and dense covariate-related subnetwork suggests a true effect on a brain network. This property is critical and motivates our method developments for subnetwork extraction and network-level inference. We demonstrate the SICERS procedure in Figure 1.

Fig. 1.

The SICERS pipeline: (a) define brain regions as nodes and connectivity metrics between each pair of nodes as edges; (b) and (c) calculate the connectivity matrix for each single subject in a study, where each off-diagonal element in the matrix represents the connectivity strength between two nodes, then identify differential connectivity patterns between clinical groups; (d) plot the edge-wise statistical inference, where each off-diagonal element is a negatively logarithmically transformed |$p$|-value (e.g., a two-sample-test |$p$|-value per edge between clinical groups; a hotter point in the heatmap suggests a larger group-wise difference); (e) reveal the disease-related subnetwork detected by SICERS; (f) show the corresponding three-dimensional (3D) brain image. Note that (e) was obtained by reordering the nodes in (d) by listing the detected subnetwork first (i.e., these two graphs are isomorphic).

The article makes several contributions. First, our method provides a new tool for handling multivariate edge variables in brain connectome data for covariate-related subnetwork analysis. We develop a strategy to consolidate edge- and network-level analysis from a graph and combinatorics perspective, and propose an inference approach that is designed to test data-driven subnetworks (i.e., not prespecified) using a graph probabilistic model. Second, we develop an efficient algorithm to optimize the objective function for covariate-related subnetwork detection, which integrates dense subgraph extraction and community detection by imposing an |$\ell_0$| penalty on network edges. The |$\ell_0$| shrinkage term can effectively reduce the impact of false-positive noise and thus minimizes false-positive subnetwork detection. Our algorithm is also compatible with computationally intensive inference methods (e.g., permutation tests). Lastly, our findings in the data example reveal a novel schizophrenia-disrupted brain connectivity network that links three primary disease-related subnetworks including the DMN, salience network (SN), and central executive network (CEN) (Manoliu and others, 2014).

2. Methods

2.1. Background: brain connectome data and edge-wise inference

We denote |$\boldsymbol{Y}_{n\times T}^s$| as the region-level fMRI time series for a participant |$s=1, \ldots, S$|, where |$n$| is the number of regions of interest and |$T$| is the length of the time series. We assume that fMRI data are registered to a common template and thus brain regions are identical across participants (e.g., the Brainnetome Atlas by Fan and others, 2016). Let |$a^s_{ij}$| denote the connection strength between a pair of regions |$1 \leq i<j \leq n$|, which can be calculated as the correlation (or partial correlation/spectral coherence) between the two corresponding region-wise time series. A weighted |$n \times n$| adjacency matrix |$\boldsymbol{A}^s=\{a^s_{ij}\}$| records all |$n(n-1)/2$| pair-wise connectivity measures for participant |$s$|. |$\boldsymbol{A}^s$| maps to a population-level brain connectome structural graph |$G=\{V, E\}$|, where the node set |$V$| (|$|V|=n$|) represents regions of interest and the edge set |$E=\{e_{ij}\}$| indicates the functional connections between regions. |$V$| and |$E$| are identical across participants because we assume that the neurobiological definitions of brain regions and connectivity are shared across participants (Simpson and others, 2013). The entries of |$\boldsymbol{A}^s$| are multivariate random variables capturing the connection strengths of an individual and thus serve as outcomes. In addition, for each participant, we observe a vector of profiling covariates (e.g., clinical status and demographic variables), denoted by |$\boldsymbol{x}_s= (x_{s,1}, \ldots, x_{s,L})^T$|.
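
For illustration, the following minimal Python sketch (not part of the SICERS software; the function and variable names are ours) shows how a subject-level connectivity matrix |$\boldsymbol{A}^s$| can be computed from region-level time series via Pearson correlation with an optional Fisher's Z transform.

import numpy as np

def connectivity_matrix(Y, fisher_z=True):
    # Y: (n, T) array of region-level fMRI time series for one participant.
    # Returns the weighted adjacency matrix A^s (n x n) with a zero diagonal.
    A = np.corrcoef(Y)                                     # Pearson correlation between region pairs
    if fisher_z:
        A = np.arctanh(np.clip(A, -0.999999, 0.999999))    # Fisher's Z transform
    np.fill_diagonal(A, 0.0)
    return A

# Example with simulated signals: n = 90 regions, T = 200 time points
rng = np.random.default_rng(0)
A_s = connectivity_matrix(rng.standard_normal((90, 200)))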

Our goal is to assess the associations between |$\boldsymbol{A}^s$| and |$\boldsymbol{x}_s$|⁠, revealing the underlying covariate-related neural connectomic mechanisms. Naturally, one can apply commonly used multivariate statistical models (e.g., multiple testing, regularized correction, and low-rank regression models) to identify a set of edges associated with covariates of interest. Consider a generalized linear model (Zhang and others, 2023):

|$g\{\mathrm{E}(a^s_{ij})\} = \beta_{ij,0} + \sum_{l=1}^{L} x_{s,l}\,\beta_{ij,l}, \qquad 1 \leq i < j \leq n, \; s = 1, \ldots, S.$| (2.1)

Without loss of generality, we focus on one covariate of interest (e.g., clinical diagnosis) while adjusting for other confounding variables. Because subnetworks can vary among different covariates, we can perform covariate-specific subnetwork analysis and apply the procedure for each covariate of interest or interaction. Let |$\boldsymbol{B} = \{\beta_{ij}\}$| be the associated matrix of regression coefficients of interest. Then, the corresponding edge-wise hypotheses are

|$H_{(i,j);0}: \beta_{ij} = 0 \quad \mbox{versus} \quad H_{(i,j);1}: \beta_{ij} \neq 0, \qquad 1 \leq i < j \leq n.$| (2.2)

As our interest is in identifying covariate-related subnetworks rather than individual parameters with |$\beta_{ij} \neq 0$|, we present |$\{ \beta_{ij} \}$| as a binary graph |$G^\beta= \{V, E^\beta\}$|, where |$e^\beta_{ij}=1$| if |$\beta_{ij} \neq 0$| and |$e^\beta_{ij}=0$| otherwise. In practice, we note that covariate-related edges |$\{ \beta_{ij} \neq 0 \}$| contribute only a small proportion of edges in the brain connectome (Chen and others, 2016), and, more importantly, their appearance in the network is concentrated in a few block-structured subnetworks (i.e., |$G^\beta$| is not random). Our proposed SICERS method provides novel procedures for estimation and inference on these latent covariate-related subnetworks to answer the fundamental scientific question of how a covariate of interest systematically affects the brain connectome.
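
As an illustration of this edge-wise step, the sketch below fits a Gaussian linear model to each edge and stores |$-\log(p_{ij})$| in an inference matrix; this is a hedged example with names of our own choosing, and other generalized linear models or test statistics could be substituted.

import numpy as np
from scipy import stats

def edgewise_glm_pvalues(A_stack, X, k):
    # A_stack: (S, n, n) array of subject connectivity matrices.
    # X: (S, L + 1) design matrix including an intercept column.
    # k: column index of the covariate of interest.
    # Returns W, an (n, n) matrix with w_ij = -log(p_ij) for H_{(i,j);0}: beta_ij = 0.
    S, n, _ = A_stack.shape
    df = S - X.shape[1]
    XtX_inv = np.linalg.inv(X.T @ X)
    W = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            y = A_stack[:, i, j]
            beta = XtX_inv @ X.T @ y                  # least-squares estimate
            resid = y - X @ beta
            sigma2 = resid @ resid / df
            se = np.sqrt(sigma2 * XtX_inv[k, k])
            t_stat = beta[k] / se
            p = 2 * stats.t.sf(abs(t_stat), df)
            W[i, j] = W[j, i] = -np.log(p)
    return W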

2.2. The general model

As a starting point, we propose a general graph model that decomposes the population brain connectome structural graph into subnetworks related to the covariate and a subgraph that is not related to the covariate:

|$G=\left(\cup_{c=1}^C G_c\right) \cup G_0,$| (2.3)

where each |$G_c= \{V_c, E_c\}$| is a covariate-related subnetwork, |$C$| is the number of subnetworks (neural subsystems) altered by the covariate, and |$G_0=\{V_0,E_0 \}$| is the remainder of |$G$| (i.e., covariate irrelevant). Specifically, |$V=\cup_{c=1}^C V_c \cup V_0$|, |$V_c\cap V_{c'}=\emptyset$|, and |$E=\cup_{c=1}^C E_c \cup E_0$|. In other words, |$G$| is formed by the union of |$C$| mutually disjoint covariate-related subnetworks |$G_1,\ldots, G_C$| and singleton nodes that do not belong to any subnetwork. |$G_0$| is defined as the union of singleton nodes and edges not in |$\left(\cup_{c=1}^C G_c\right)$|. |$G_c$| distinguishes itself from |$G_0$| in that the density of covariate-related edges is higher in |$G_c$| than in |$G_0$|, that is, |$\sum_{i,j \in G_c}I(\beta_{ij} \neq 0) / |E_c| >\sum_{i,j \in G_0}I(\beta_{ij} \neq 0)/(|E|-\sum_{c=1}^C|E_c|)$|. We further define the |$\ell_0$| graph norm as the number of edges of a covariate-related subnetwork |$G_c$|, that is, |$\|G_c\|_0=|E_c|$|. Consequently, |$\|G_0\|_0=0$| and |$\sum_{c=1}^C |E_c| < n(n-1)/2$|, assuming that |$\left(\cup_{c=1}^C G_c\right)$| covers all nonisolated covariate-related edges.

Our model is closely related to but distinct from classical network models. When |$C=0$|, the model can be viewed as an Erdős–Rényi graph. In the general scenarios where |$C>0$|, our model differs from the traditional block structure in clustering and community-detection algorithms because our focus is on the covariate-related subnetworks |$\left(\cup_{c=1}^C G_c\right)$| while treating |$G_0$| as irrelevant information. |$G_0$| consists exclusively of singletons, with |$|V_0|=\sum_{i=1}^n I(i \in G_0)$| and |$\|G_0\|_0=0$|, which resembles the nondense component of the dense subgraph model (Wu and others). On the other hand, our method differs from dense subgraph discovery models because we simultaneously consider multiple subnetworks |$G_1,\ldots, G_C$|. Therefore, our model is a combination of a dense subgraph model and a block structure model.

2.3. Statistical inference for covariate-related subnetworks

The statistical inference for covariate-related subnetworks |$\{G_c\}$| is distinct from the classical statistical inference on a single well-defined parameter (e.g., |$\beta_{ij}$|⁠). In the context of graph models, the inference of |$G_c$| is naturally linked with graph theory and combinatorics. We propose a statistical inference framework to test covariate-related subnetworks when neither |$\{G_c\}$| nor |$C$| is known in (2.3). We consider the following test for the existence of the subnetwork structure:

|$H_{G;0}: C=0 \quad \mbox{versus} \quad H_{G;a}: C \geq 1.$| (2.4)

Here, |$C=0$| means |$\sum_{c=1}^C|E_c|=0$|, that is, no covariate-related subnetwork exists. We propose Lemma 2.1 below as a graph–combinatorics-based procedure to determine the rejection region for (2.4) based on the graph properties of subnetworks (i.e., the size and density of a subnetwork).

Specifically, we derive Lemma 2.1 for network-level inference in the context of the binary graph |$G^{\beta}$| of association parameters and the covariate-related subnetworks defined by the general model of the population connectome structural graph |$G=\left(\cup_{c=1}^C G_c\right) \cup G_0$|. Without loss of generality, we define the overall edge density of |$G^{\beta}$| and the edge density within the covariate-related subnetworks by

|$p= \frac{\sum_{1\leq i<j \leq n} e^{\beta}_{ij}}{n(n-1)/2} \quad \mbox{and} \quad q= \frac{\sum_{c=1}^C\sum_{i<j:\, i,j \in V_c} e^{\beta}_{ij}}{\sum_{c=1}^C |V_c|(|V_c|-1)/2}.$| (2.5)

Directly developing an inference method for covariate-related subnetworks is challenging, because |$\hat{G}_c$| is unknown before analyzing the sample data. We adopt the concept of the “maximum quasi-clique” in graph theory as an alternative to develop the inference theory while alleviating the required prior knowledge of |$G_c$|. In |$G^{\beta}$|, for any |$\gamma\in(p,1)$|, we call a subnetwork of this binary network a “|$\gamma$|-quasi-clique” if its observed edge density is at least |$\gamma$|. Define the maximum |$\gamma$|-quasi-clique |$G^{\beta}[\gamma]$| to be the largest-in-size |$\gamma$|-quasi-clique in |$G^{\beta}$|, and let |$|G^{\beta}[\gamma]|$| be the number of nodes in |$G^{\beta}[\gamma]$|. |$G^{\beta}[\gamma]$| can be detected using computationally efficient procedures in the existing literature; see Wu and Hao (2015) for a comprehensive review. Under the null hypothesis |$H_{G;0}$|, the graph |$G^{\beta}$| becomes an Erdős–Rényi graph with |$p=q$|. Next, we derive the probability bounds for the size of |$G^{\beta}[\gamma]$| given its density under |$H_{G;0}$| and |$H_{G;a}$|.

 
Lemma 2.1

In a binary graph |$G^{\beta}$|⁠,

  • when |$H_{G;0}: C\,{=}\,0$| is true, that is, |$p\,{=}\,q$|, assume that for any |$\gamma\in(p,1)$|, |$v_0\,{=}\,\omega(\sqrt{n})$| where |$\omega$| denotes a loose lower bound, and |$n$| is large enough such that |$\{ 4/(\gamma-p)^2+ 4/[3(\gamma-p)]\}^{-1} v_0\geq \log n$|. Then, we have |$\Pr\left(|G^{\beta}[\gamma]| \geq v_0\right) \leq 2n\cdot \exp\left( -\left\{\frac{4}{ (\gamma-p)^2} +\frac{4}{3(\gamma-p)} \right\}^{-1} v_0^2 \right)$|, which vanishes as |$n\rightarrow\infty$|;
  • when |$H_{G;a}: C\geq 1$| is true, assume that all subnetworks satisfy |$|G_c| >\omega(\sqrt{n})$| and set |$v_0=c_0\sqrt{n}$| for a small enough constant |$c_0>0$|, such that |$\min_{c=1,\ldots,C} |G_c|\geq v_0$|. Then, we have |$\Pr\left(|G^{\beta}[\gamma]| \geq v_0\right) \rightarrow 1$| as |$n\rightarrow\infty$|.

Intuitively, Lemma 2.1 states that (i) the probability that a nonsmall and dense subnetwork exists under |$H_{G;0}$| is almost zero, whereas (ii) the probability that a nonsmall and dense subnetwork exists under |$H_{G;a}$| approaches 1. Therefore, the probability bounds in Lemma 2.1 can be used to calculate both the type I error rate of an observed |$\hat{G}_c$| and the strength of the association between |$\hat{G}_c$| and the covariate. Under simple scenarios where there is only one subnetwork present in |$G$|, we can directly calculate the type I error of |$\hat{G}_c$| by |$2n\cdot \exp\left( -\left\{\frac{4}{ (\gamma-p)^2} +\frac{4}{3(\gamma-p)} \right\}^{-1}\cdot|\hat{G}_c|^2 \right)$|, and then reject |$H_{G;0}$| if the type I error is less than the significance level |$\alpha$|. However, the inference procedure in our application is more complex because multiple covariate-related subnetworks may exist. In the field of neuroimaging statistics, a widely used strategy for simultaneously testing multiple covariate-related subnetworks |$\hat{G}_1,\ldots,\hat{G}_C$| is family-wise error rate (FWER) control (e.g., permutation tests), which requires comparing subnetworks with different densities and sizes. For example, suppose |$G_c$| and |$G_{c'}$| are subnetworks with densities and sizes (|$\gamma$|, |$v_0$|) and (|$\gamma'$|, |$v_0'$|), respectively. Lemma 2.1 provides a viable solution for the comparison by calculating the two probability bounds |$2n\cdot \exp\left( -\left\{\frac{4}{ (\gamma-p)^2} +\frac{4}{3(\gamma-p)} \right\}^{-1}\cdot v_0^2 \right)$| versus |$2n\cdot \exp\left( -\left\{\frac{4}{ (\gamma'-p)^2} +\frac{4}{3(\gamma'-p)} \right\}^{-1}\cdot v_0'^2 \right)$|. The subnetwork with the lower probability bound is considered to have a stronger association with the covariate, as it is less likely to occur under |$H_{G;0}$| (see details in Section 2.5).
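
The bound above is easy to evaluate numerically. The short Python sketch below (illustrative only; the function name is ours) computes the Lemma 2.1 null bound for a candidate subnetwork and compares two subnetworks with different densities and sizes.

import numpy as np

def null_bound(n, gamma, p, v0):
    # Lemma 2.1 upper bound on observing a gamma-quasi-clique of v0 nodes
    # in an Erdos-Renyi graph of density p (requires gamma > p).
    const = 1.0 / (4.0 / (gamma - p) ** 2 + 4.0 / (3.0 * (gamma - p)))
    return 2 * n * np.exp(-const * v0 ** 2)

# The subnetwork with the smaller bound is less likely under H_{G;0} and hence
# is considered more strongly covariate-related.
b1 = null_bound(n=200, gamma=0.8, p=0.05, v0=25)
b2 = null_bound(n=200, gamma=0.6, p=0.05, v0=40)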

2.4. Extracting covariate-related subnetworks via ℓ0 graph norm penalty

We aim to extract covariate-related subnetworks |$\{G_c\}$| from the dependent variables and covariates |$\{\boldsymbol{A}^s, \boldsymbol{x}_s\}_{s=1}^S$|. We first estimate and test |$\{\beta_{ij}\}$| by (2.1). For a continuous brain connectivity measure (e.g., Fisher’s Z transformed correlation coefficients), both the classic general linear model and an autoregressive multivariate model accounting for the dependence can be applied (Bowman, 2005; Chen and others, 2020). Although the latter provides more accurate inference (particularly for a small sample), its computational cost is much higher. In practice, for a sample of hundreds of participants, we adopt the general linear model because the statistical inference results of the two methods show little difference. The statistical inferential results for |$\{\beta_{ij}\}$| (e.g., test statistics or |$p$|-values) can be stored in a matrix |$\boldsymbol{W}=\{w_{ij}\}$| (e.g., |$w_{ij} = -\log(p_{ij})$|), which is used as the input to extract covariate-related subnetworks. We adopt |$p$|-values (and |$-\log(p_{ij})$|) because of their popularity in high-dimensional data analysis [e.g., false-discovery rate (FDR) control and Manhattan plots] and their capability to discern |$\beta_{ij} \neq 0$| versus |$\beta_{ij}= 0$|. Nevertheless, our method is applicable to |$\boldsymbol{W}$| generated from any valid statistical model.

Our goal is to cover covariate-related edges using a set of well-organized subgraphs with minimal (edge) sizes, to achieve inference efficiency and accuracy in accordance with Lemma 2.1. We constrain subnetworks |$\{G_c\}$| to a plausible network structure, where |$G=\left(\cup_{c=1}^C G_c\right) \cup G_0$|, for the reasons stated in Section 2.2. Therefore, our model resembles but is distinct from the stochastic block model, where all nodes are assigned to a few communities, and from the submatrix model, which only covers |$C=1$|. This graph structure, with several small communities and a majority of singleton nodes, poses a unique challenge for estimation, which arises because only a small proportion of edges are associated with the covariate.

As a highlight of our method, while most conventional community-detection techniques for stochastic block models typically demand relatively balanced community sizes |$|V_1|, \ldots,|V_C|$| that together contain most nodes (i.e., |$|V_1|+\cdots+|V_C|=n$| or |$\approx n$|), our method addresses the case where all the blocks |$|V_c|$| put together constitute only a small portion of the whole graph, that is, |$\sum_{c=1}^C |V_c|\ll n$|. Therefore, our approach can be considered a tool for informative subnetwork extraction rather than clustering or community detection. As illustrated in Figure 2, several conventional community-detection techniques may encounter difficulties in extracting meaningful subnetworks from the inference matrix |$\boldsymbol{W}$| of an rs-fMRI brain connectome study (Adhikari and others, 2019).

Fig. 2.

Subnetwork extraction by SICERS versus classical community-detection algorithms on a |$\boldsymbol{W}$| matrix with a structure of |$G=\left(\cup_{c=1}^C G_c\right) \cup G_0$|: (a) spectral clustering with 10 communities in Von Luxburg (2007); (b) Louvain method by Blondel and others (2008); (c) INFOMAP algorithm by Rosvall and Bergstrom (2008); (d) SICERS community detection.

Extracting covariate-related subnetworks from |$\boldsymbol{W}$| appears conceptually straightforward: one could optimize a valid objective function (e.g., a likelihood function) from the conventional network community-detection literature (Bickel and Chen, 2009; Zhao and others, 2012). However, these methods tend to yield subnetworks with a small proportion of covariate-related edges because of false-positive noise. Instead, we resort to an |$\ell_0$| graph norm penalty-based objective function to extract subnetworks from |$\boldsymbol{W}$| in which most edges are associated with the covariate. Our objective function is inspired by Lemma 2.1 and is thus specifically tailored for subnetwork-wise inference. The idea is simple: for any detected subnetwork |$\hat{G}_c$|, we reward the edge weights within this subnetwork while penalizing its size (i.e., increasing density while controlling size). This objective function leads to the discovery of a set of subnetworks with maximal density and number of covariate-related edges. As shown in Lemma 2.1, the probability of observing a reasonably sized covariate-related subnetwork (a |$\gamma$|-quasi-clique) under the null is slim when the density |$\gamma$| is high. Similar ideas have been adopted in some network community-detection methods (Zhang and others, 2016). Specifically, we define

|$\textbf{U} =\{u_{ij}\} = \boldsymbol{W} * \boldsymbol{\Delta}, \qquad \boldsymbol{\Delta}=\{\delta_{ij}\},$| (2.6)

where “|$*$|” denotes the Hadamard (element-wise) matrix product and |$\delta_{ij}=1$| if nodes |$i$| and |$j$| belong to the same subnetwork |$G_c$| for some |$c$|, and |$\delta_{ij}=0$| otherwise. Clearly, |$\textbf{U}$| depends on the specified subnetwork structure of the underlying graph |$G$| through |$\boldsymbol{\Delta}=(\delta_{ij})_{i,j}$|. Define |$\|\textbf U\|_1 = \sum_{i,j}|u_{ij}|$| and |$\|\textbf U\|_0=\sum_{i,j}I(|u_{ij}|>0)$|, where |$\|\cdot\|_1$| and |$\|\cdot\|_0$| are element-wise matrix |$\ell_1$| and |$\ell_0$| norms. |$\|\textbf U\|_0$| is equivalent to the |$\ell_0$| graph norm, because we define |$\|G\|_0 =\sum_{c=1}^C \sum_{e_{ij}\in G_c} I(\delta_{ij}=1)$| in Section 2.2. Our core proposal is the following |$\ell_0$| graph norm shrinkage criterion:

|$\mathop {\arg \,\max } \limits_{C,\, G_1,\ldots,G_C} \; \log \|\textbf{U}\|_1 -\lambda_0 \log \|\textbf{U}\|_0,$| (2.7)

where |$\lambda_0$| is a tuning parameter.

Optimizing the objective function (2.7) simultaneously estimates the number of subnetworks and the subnetwork memberships of all nodes in |$G = \cup_{c=1}^C G_c \cup G_0$|. Here, we consider a singleton as a subnetwork. The objective function (2.7) maximizes the covered edge weights using minimally sized subnetworks, which is mathematically equivalent to extracting maximally sized subnetworks (quasi-cliques) while maximizing their density. Therefore, the optimization of (2.7) is governed by two goals: covering high-weight informative edges and using minimally sized subnetworks. Maximizing the first term |$\|\bf{U}\|_1$| increases sensitivity by allocating a maximal number of high-weight edges to subnetworks, which promotes larger subnetworks; this is concordant with our views in Section 2.3. In addition, prespecifying the density |$\gamma$| in Lemma 2.1 is not required because (2.7) automatically maximizes subnetwork density and size. Penalizing the |$\ell_0$| graph norm maximizes the density of the subnetworks. The second term also suppresses false-positive noise, because false-positive edges tend to be distributed in a random pattern in |$G$| rather than in an organized subgraph (Chen and others, 2015).

The two goals conflict, and their balance in the optimization procedure produces meaningful results. The balance is tuned by |$\lambda_0$|; specifically, |$\lambda_0=0$| would send all nodes to one subnetwork, whereas a large |$\lambda_0$| prefers small communities and singletons (nodes not in any community, thus contributing zero |$\ell_0$| graph norm) even over the true community structure. In our theoretical analysis, we specify the range of the tuning parameter |$\lambda_0$| (depending on |$\mu_0/\mu_1$|) in which our criterion provides a consistent estimation of the community structure, thereby controlling well the rates of both types of errors in the multiple testing procedure. In practice, we select |$\lambda_0$| based on the likelihood function of the network (see supplementary material available at Biostatistics online).
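
To make the criterion concrete, the following Python sketch (our own illustrative code, not the SICERS implementation) evaluates the |$\ell_0$| graph norm shrinkage objective for a candidate assignment of nodes to subnetworks.

import numpy as np

def l0_objective(W, labels, lam0):
    # W: (n, n) symmetric inference matrix (e.g., -log p-values).
    # labels: length-n integer array; labels[i] = c > 0 assigns node i to
    # subnetwork c, and labels[i] = 0 marks a singleton in G_0.
    labels = np.asarray(labels)
    same = (labels[:, None] == labels[None, :]) & (labels[:, None] > 0)
    np.fill_diagonal(same, False)
    U = W * same                          # Hadamard product W * Delta
    l1 = np.abs(U).sum() / 2.0            # each within-subnetwork edge counted once
    l0 = (np.abs(U) > 0).sum() / 2.0      # l0 graph norm: number of retained edges
    return -np.inf if l0 == 0 else np.log(l1) - lam0 * np.log(l0)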

We optimize (2.7) and extract covariate-related subnetworks using Algorithm 1. Specifically, we perform a grid search for |$C$|. For each value of |$C=C^\dagger$|, let |$\hat{G}(C^\dagger)$| be the network structure estimated by optimizing (2.7), and let |$\textbf{U}_{\hat{G}(C^\dagger)}$| be the corresponding matrix from the Hadamard product. |$\textbf{U}_c$| is the submatrix of |$\textbf{U}$| corresponding to |$G_c$|. The outcome provides a set of maximal subnetworks with high density. In the supplementary material available at Biostatistics online, we provide detailed implementation and theoretical results to guarantee the consistency and optimality of |$\hat{C}, (\hat{G}_c)_{c=1,\ldots,\hat{C}}$|.

 
Algorithm 1:

Subnetwork estimation

Data: Input: |$\boldsymbol{W}$|⁠; tuning parameter |$\lambda_0$|

1. For |$C ^\dagger= 2$| to |$n-1$|

2. Optimize |$\mathop {\arg \,\max } \limits_{ G(C^\dagger)} \sum_{c=1}^{C^\dagger} \frac{\|\textbf{U}_c \|_1}{\|\textbf{U}_c \|_0^{\lambda_0}}$| (see details in the supplementary material available at Biostatistics online)

3. Select |$\hat{C} = \mathop {\arg \,\max } \limits_{C^\dagger=2, \ldots, n-1} \log \| { \textbf{U}_{\hat{G}(C ^\dagger) }} \|_1 -\lambda_0 \log \| {\textbf{U}_{\hat{G}(C ^\dagger)}}\|_0$|

Output:|$\hat{C}, (\hat{G}_c)_{c=1,\ldots,\hat{C}}$|
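
The inner optimization in step 2 is nontrivial. The sketch below replaces it with a simple greedy label-swapping search purely for illustration, while keeping the overall grid-search structure of Algorithm 1; the authors' optimizer, described in the supplementary material, is more elaborate, and all names here are ours.

import numpy as np

def fit_sicers_partition(W, lam0, C_max=10, n_sweeps=20, seed=0):
    # Simplified stand-in for Algorithm 1: grid search over C with a greedy
    # label-swapping local search maximizing log||U||_1 - lam0 * log||U||_0.
    def objective(labels):
        same = (labels[:, None] == labels[None, :]) & (labels[:, None] > 0)
        np.fill_diagonal(same, False)
        U = W * same
        l1, l0 = np.abs(U).sum() / 2.0, (np.abs(U) > 0).sum() / 2.0
        return -np.inf if l0 == 0 else np.log(l1) - lam0 * np.log(l0)

    rng = np.random.default_rng(seed)
    n = W.shape[0]
    best = (-np.inf, None, None)
    for C in range(2, C_max + 1):                      # grid search over C
        labels = rng.integers(0, C + 1, size=n)        # label 0 = singleton (G_0)
        for _ in range(n_sweeps):                      # greedy local search
            for i in rng.permutation(n):
                scores = [objective(np.where(np.arange(n) == i, c, labels))
                          for c in range(C + 1)]
                labels[i] = int(np.argmax(scores))
        score = objective(labels)
        if score > best[0]:
            best = (score, C, labels.copy())
    return best  # (criterion value, C-hat, node-to-subnetwork labels)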

2.5. Testing covariate-related subnetworks

Given a set of estimated covariate-related subnetworks |$(\hat{G}_c)_{c=1,\ldots,\hat{C}}$|⁠, our goal is to assess statistical significance for each subnetwork. This is more general than the aforementioned hypothesis test (2.4) because the subnetwork-wise inference is needed for more than one subnetwork. Note that the testing hypothesis on a covariate-related subnetwork |$\hat{G}_c$| is distinct from classical statistical hypothesis tests because the parameters of the null hypothesis are based on an estimated |$\hat{G}_c$| rather than prespecified parameters. Because a subnetwork |$\hat{G}_c$| can be considered as a cluster of edges, we adopt the commonly used permutation tests to examine the significance of |$\hat{G}_c$| while controlling FWER (Zalesky and others, 2010; Nichols, 2012; Chen and others, 2015). However, the test statistic in the classic permutation tests is often trivial (e.g., using the number of supra-threshold edges in |$\hat{G}_c$| as the test statistic). Building on Lemma 2.1, we propose a new test statistic to reflect the combinatorial probability for a covariate-related subnetwork with a given density and size under the null hypothesis. We present the steps of our test in Algorithm 2.

 
Algorithm 2:

Assess the significance of |$\hat{G}_c$|

Data: Input: |$\{\boldsymbol{A}^s, \boldsymbol{x}_s \}_{s=1,\ldots, S}$|⁠, |$W$|⁠, |$\hat C>0$|⁠, |$\{\hat G_c\}_{c=1,\ldots,\hat C}$|⁠, |$g(r)$|⁠, |$\alpha$|

1. With a sound cut-off |$\hat{r}$|⁠, set the binarized graph |$G[\hat{r}]: (G[\hat{r}])_{ij}=I(w_{ij}>\hat{r})$|

2. Estimate the overall and within-subnetwork edge densities |$\hat p$| and |$\hat q$|, and set |$\hat\gamma:= \hat{q}$|

3. Calculate (Lemma 2.1) |$p$|-value-based test statistic by integrating |$r$| on its distribution |$g(r)$|⁠: |$\mathcal{T}_0 ( \hat G_c )=\int 2n\cdot \exp\left\{ -|\hat{G}_c|^2\left\{\frac{2}{ (\hat\gamma-\hat{p})^2} +\frac{2}{3(\hat\gamma-\hat{p})} \right\}^{-1} \right\}g(r) {\rm d}r$|

4. Shuffle the group labels of the data and implement SICERS |$M$| times; for each permutation |$m=1, \ldots, M$|, store the maximal test statistic |$\mathcal{T}_m=\sup_{c=1,\ldots,\hat C^m} \int 2n\cdot \exp\left\{ -|\hat{G}^m_c|^2\left\{\frac{2}{ (\hat\gamma_m-\hat{p}_m)^2} +\frac{2}{3(\hat\gamma_m-\hat{p}_m)} \right\}^{-1} \right\}g(r){\rm d}r$|

5. Calculate the percentile of |$\mathcal{T}_0$| in |$\{\mathcal{T}_m \}$| as the FWER |$q$|-value and reject the null hypothesis if |$q<\alpha$|

Output: FWER significance values for |$\{\hat G_c\}_{c=1,\ldots,\hat C}$|
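
For concreteness, the sketch below evaluates the Lemma 2.1-based statistic of step 3 at a single cutoff |$r$| rather than integrating over |$g(r)$|; the permutation loop of steps 4 and 5 would wrap this function after shuffling group labels and re-running the estimation. This is an illustrative simplification with names of our own choosing.

import numpy as np

def subnetwork_test_statistic(W, nodes, r):
    # W: (n, n) matrix of -log p-values; nodes: indices of an estimated subnetwork
    # (at least two nodes); r: cutoff for binarizing W. Smaller values indicate
    # subnetworks that are less likely under H_{G;0}.
    n = W.shape[0]
    B = (W > r).astype(float)                 # binarized graph G[r]
    np.fill_diagonal(B, 0.0)
    p_hat = B.sum() / (n * (n - 1))           # overall edge density
    sub = B[np.ix_(nodes, nodes)]
    m = len(nodes)
    gamma_hat = sub.sum() / (m * (m - 1))     # within-subnetwork density
    if gamma_hat <= p_hat:
        return 1.0                            # no enrichment over the null density
    const = 1.0 / (2.0 / (gamma_hat - p_hat) ** 2 + 2.0 / (3.0 * (gamma_hat - p_hat)))
    return 2 * n * np.exp(-const * m ** 2)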

The above procedure can also be used to test the omnibus hypothesis (2.4) for |$C=0$| versus |$C>0$|⁠, because any single reasonably sized and dense subnetwork |$\hat{G}_c$| can lead to a small |$p$|-value. The null distribution of the test statistic in (2.4) can be simulated well by the permutation procedure. Therefore, the FWER can be controlled effectively by the above permutation test, yielding a corrected |$p$|-value for each |$\hat{G}_c$| (Nichols, 2012).

3. Simulations

In this section, we evaluate the performance of our method on synthetic data and compare it with benchmarks. We assess the accuracy of SICERS at two levels: (i) subnetwork-level inference accuracy, by tallying the false-positive and false-negative covariate-related subnetworks; and (ii) edge-level accuracy, measuring the quality of significant covariate-related subnetworks and overall performance by comparing |$\hat{G}_c$| with |$G_c$| and counting the false-positive and false-negative edges. The covariate-related subnetwork inference is satisfactory only if both network- and edge-level inference are accurate.

First, we simulate brain connectome data |$\mathcal{A} = \{\boldsymbol{A}^1,...,\boldsymbol{A}^S\}$| in a common two-sample testing setting, although we can easily extend it to the regression setting. We consider two cohorts of participants with equal sample sizes. We denote healthy controls by |$s=1,\ldots,[S/2]$| and patients by |$s=[S/2]+1,\ldots,S$|, where |$[\;]$| denotes the floor operator. The number of brain regions is |$|V|=n=200$|, which corresponds to 19 900 edges. We consider two disease-related subnetworks |$G_1$| and |$G_2$| with |$|V_1|=25$| and |$|V_2|=50$|. For a patient |$s\geq [S/2]+1$| and all |$(i,j)$| with |$i,j\in V_1$| or |$i,j\in V_2$|, we set |$a_{ij}^s \sim N(\eta_2, \sigma^2)$|; for all other |$(s,i,j)$|, we set |$a_{ij}^s \sim N(\eta_1, \sigma^2)$|. We vary |$\theta=\eta_2-\eta_1$| and |$\sigma$| to emulate different effect sizes (i.e., signal-to-noise ratios). We set the standard deviation |$\sigma = 0.5, 1, 2$| given |$\theta=1$|, corresponding to Cohen’s |$d$| values of |$1.2, 0.8$|, and 0.5, respectively. Two sample sizes of |$S=480$| and |$S=240$| were used. For each combination of parameters, we simulate 100 repeated data sets.
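
The following sketch generates synthetic data under this design; it is an illustration with an assumed node layout and is not the authors' simulation code.

import numpy as np

def simulate_connectomes(S=240, n=200, V1=25, V2=50, eta1=0.0, theta=1.0, sigma=1.0, seed=0):
    # Two-group design: the second half of subjects ("patients") get mean eta1 + theta
    # on edges within each of two disease-related subnetworks; all other edges have
    # mean eta1. Placing G_1 on the first V1 nodes and G_2 on the next V2 nodes is an
    # assumption made only for this illustration.
    rng = np.random.default_rng(seed)
    A = rng.normal(eta1, sigma, size=(S, n, n))
    nodes1, nodes2 = np.arange(V1), np.arange(V1, V1 + V2)
    patients = np.arange(S // 2, S)
    for nodes in (nodes1, nodes2):
        A[np.ix_(patients, nodes, nodes)] += theta   # shift within-subnetwork edges
    A = np.triu(A, 1)                                # keep each edge once (upper triangle)
    A = A + A.transpose(0, 2, 1)                     # symmetrize; diagonal stays zero
    return A, nodes1, nodes2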

For each brain connectome data set |$\mathcal{A}$|, we perform edge-wise two-sample tests on |$\{a^1_{ij},\ldots, a^{[S/2]}_{ij}\}$| versus |$\{a^{[S/2]+1}_{ij},\ldots, a^S_{ij}\}$| and obtain the inference matrix |$\boldsymbol W$| with |$w_{ij} = -\log p_{ij}$|. We then apply SICERS to |$\boldsymbol W$|, estimating disease-related subnetworks and performing subnetwork-level statistical inference. We benchmark our approach against popular methods for brain connectivity analysis, including NBS, dense subgraph extraction algorithms (e.g., the greedy algorithm), and community-detection algorithms (e.g., Louvain).
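
A minimal sketch of this edge-wise two-sample testing step, producing the inference matrix |$\boldsymbol{W}$|, is given below (illustrative only; other edge-wise tests can be substituted).

import numpy as np
from scipy import stats

def two_sample_W(A, S_half=None):
    # A: (S, n, n) array of connectivity matrices; first S_half subjects are controls.
    # Returns W with w_ij = -log(p_ij) from an edge-wise two-sample t-test.
    S, n, _ = A.shape
    S_half = S_half or S // 2
    iu = np.triu_indices(n, 1)
    ctrl = A[:S_half][:, iu[0], iu[1]]
    pat = A[S_half:][:, iu[0], iu[1]]
    _, p = stats.ttest_ind(ctrl, pat, axis=0)
    W = np.zeros((n, n))
    W[iu] = -np.log(p)
    return W + W.T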

Subnetwork-level inference results. First, we evaluate the accuracy of SICERS at the network level. Correctly identifying a covariate-related subnetwork involves two aspects: (i) extracting a |$\hat{G}_c$| that is close to |$G_c$| and (ii) rejecting the null hypothesis for |$\hat{G}_c$|. Therefore, we consider the goal of network-level inference met if SICERS rejects the null hypothesis for an estimated subnetwork |$\hat{G}_c$| that is similar to |$G_c$| (e.g., the Jaccard index for the edge sets of |$\hat{G}_c$| and |$G_c$| is greater than 50|$\%$|). We denote an estimated subnetwork as a false-positive finding when we reject the null hypothesis but the Jaccard index between |$\hat{G}_c$| and |$G_c$| is less than 50|$\%$|. We record a false-negative finding for |$G_c$| if we fail to estimate a |$\hat{G}_c$| with a Jaccard index greater than |$50\%$| in reference to |$G_c$| and to reject its null hypothesis. We calculate the power and false-positive rate (FPR) as the proportions of true-positive and false-positive inferences across the 100 repeated data sets, with the corresponding standard errors.
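
The Jaccard criterion can be computed directly from the node sets, as in the short self-contained sketch below (the helper name is ours).

from itertools import combinations

def jaccard_edges(nodes_hat, nodes_true):
    # Jaccard index between the edge sets induced by the estimated and true node sets.
    E_hat = set(combinations(sorted(set(nodes_hat)), 2))
    E_true = set(combinations(sorted(set(nodes_true)), 2))
    if not E_hat and not E_true:
        return 1.0
    return len(E_hat & E_true) / len(E_hat | E_true)

# A detection counts as a true positive when the subnetwork is declared significant
# and jaccard_edges(...) exceeds 0.5, per the criterion described above.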

The network-level summary results are presented in Table 1. Our network-level inference is generally robust and accurate for both |$G_1$| and |$G_2$|. The power and FPR of network-level inference rely on the capability of subnetwork extraction, because the power is 0 if no |$\hat{G}_c$| is detected and the FPR is high if a significant |$\hat{G}_c$| deviates largely from |$G_c$|. The subnetwork extracted by NBS often differs from |$G_c$| because of the influence of noise, which leads to low power and a high FPR. The inference accuracy is also determined by the subnetwork size, effect size, sample size, and noise level (results for a range of network sizes are in the supplementary material available at Biostatistics online). SICERS outperforms the comparable methods owing to the superior performance of its |$\ell_0$| shrinkage-based subnetwork extraction and advanced inference approach (see Algorithm 2).

Table 1.

Network-level inference results across all settings. The power is calculated separately for each of the two subnetworks (|$G_1$| with |$|V_1|=25$| and |$G_2$| with |$|V_2|=50$|), while the FPR is based on the aggregate false-positive findings. The means (standard deviations) of power and FPR are summarized based on 100 repeated simulations. SICERS generally performs well in all settings, followed by Louvain, Dense, and NBS. The power and FPR largely rely on accurate subnetwork extraction and inference. Larger subnetwork size, effect size, and sample size improve the accuracy of subnetwork extraction and yield greater test statistics, thus increasing power and sensitivity. SICERS outperforms the other methods because the |$\ell_0$| shrinkage and our new statistical inference methods can better capture and characterize covariate-related subnetworks.

                            S = 240: Cohen's d                  S = 120: Cohen's d
Method    Measure           1.2         0.8         0.5         1.2         0.8         0.5
SICERS    Power (G_1)       1(0)        0.98(0.14)  0.84(0.37)  1(0)        1(0)        0.86(0.35)
          Power (G_2)       1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          FPR               0(0)        0.05(0.13)  0.03(0.09)  0(0)        0.03(0.09)  0.03(0.09)
Louvain   Power (G_1)       1(0)        0.92(0.27)  1(0)        1(0)        0.86(0.35)  1(0)
          Power (G_2)       1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          FPR               0.08(0.17)  0.07(0.16)  0(0)        0.11(0.2)   0.09(0.19)  0.01(0.05)
Dense     Power (G_1)       1(0)        1(0)        0.36(0.48)  1(0)        1(0)        0.06(0.24)
          Power (G_2)       1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          FPR               0(0)        0.33(0)     0.44(0.08)  0(0)        0.33(0)     0.49(0.04)
NBS       Power (G_1)       0.14(0.35)  0(0)        0(0)        0.26(0.44)  0(0)        0(0)
          Power (G_2)       1(0)        0(0)        0(0)        1(0)        0(0)        0(0)
          FPR               0.1(0.21)   1(0)        1(0)        0.08(0.2)   1(0)        1(0)

Edge-level inference results. Given significant covariate-related subnetworks, we further evaluate the deviation of |$\hat{G}_c$| from |$G_c$| by measuring the edge-level difference. The |$\hat{G}_c$| versus |$G_c$| differences are measured at the edge-level with respect to sensitivity and FDR as follows: |$\text{Sensitivity} = \frac{\sum_{i<j}I(e_{ij}\in G_c,e_{ij}\in \hat G_c)}{\sum_{i<j}I(e_{ij}\in G_c)} \text{ and FDR} = \frac{\sum_{i<j}I(e_{ij}\notin G_c,e_{ij}\in \hat G_c)}{\sum_{i<j}I(e_{ij}\in \hat G_c)}.$|
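
Both quantities can be computed from the estimated and true node sets, as in the following self-contained sketch (names are ours).

from itertools import combinations

def edge_sensitivity_fdr(nodes_hat, nodes_true):
    # Edge-level sensitivity and FDR of an estimated subnetwork against the truth,
    # using the edge sets induced by the estimated and true node subsets.
    E_hat = set(combinations(sorted(set(nodes_hat)), 2))
    E_true = set(combinations(sorted(set(nodes_true)), 2))
    tp = len(E_hat & E_true)
    sensitivity = tp / len(E_true) if E_true else float("nan")
    fdr = (len(E_hat) - tp) / len(E_hat) if E_hat else 0.0
    return sensitivity, fdr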

Table 2 summarizes the performance of all methods in all settings. In general, SICERS, the community-detection algorithm, and the dense subgraph algorithm can recover the covariate-related subnetworks. When the effect size is smaller, community-detection and subgraph extraction algorithms tend to cover a maximal number of informative edges and thus also include false-positive edges in the estimated subnetworks; therefore, the detected subnetworks may differ from the true network. SICERS is more robust to false-positive noise at small effect sizes because it imposes an |$\ell_0$| penalty term on the objective function. NBS is more sensitive to noise because its subnetwork extraction step seeks maximally connected components. Lastly, we compare the network analysis methods with the univariate method BH-FDR (|$q=0.05$|). Without the aid of graph information, the univariate inference method tends to select a high proportion of false-positive edges and fails to recognize the network structure.

Table 2.

Edge-level inference results across all settings. The TPR and FDR are calculated separately for each of the two subnetworks (|$G_1$| with |$|V_1|=25$| and |$G_2$| with |$|V_2|=50$|). The means (standard deviations) of TPR and FDR are summarized based on 100 repeated simulations. TPR is the proportion of edges in |$G_c$| that are recovered by |$\hat{G}_c$|, and FDR is the proportion of edges in |$\hat{G}_c$| that are not in |$G_c$|. TPR = 1 and FDR = 0 suggest a perfect recovery of |$G_c$| by |$\hat{G}_c$|. SICERS outperforms the comparable methods because the objective function maximizes the signal while suppressing noise, and thereby better recovers the underlying true |$G_c$|.

                            S = 240: Cohen's d                  S = 120: Cohen's d
Method    Measure           1.2         0.8         0.5         1.2         0.8         0.5
SICERS    TPR (G_1)         1(0)        0.87(0.2)   0.91(0.19)  1(0)        0.9(0.2)    0.88(0.2)
          TPR (G_2)         1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          FDR (G_1)         0(0)        0(0)        0(0.01)     0(0)        0.02(0.04)  0.02(0.04)
          FDR (G_2)         0(0)        0.03(0.04)  0.09(0.21)  0(0)        0.04(0.05)  0.09(0.19)
Louvain   TPR (G_1)         1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          TPR (G_2)         1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          FDR (G_1)         0.21(0.1)   0.58(0.12)  0.44(0.11)  0.25(0.11)  0.58(0.12)  0.41(0.16)
          FDR (G_2)         0.16(0.06)  0.03(0.03)  0.04(0.04)  0.16(0.05)  0.02(0.03)  0.03(0.03)
Dense     TPR (G_1)         1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          TPR (G_2)         1(0)        1(0)        1(0)        1(0)        1(0)        1(0)
          FDR (G_1)         0(0)        0(0)        0(0)        0(0)        0(0)        0(0)
          FDR (G_2)         0(0)        0(0)        0.35(0.27)  0(0)        0(0)        0.52(0.13)
NBS       TPR (G_1)         1(0)        NA          NA          1(0)        NA          NA
          TPR (G_2)         1(0)        NA          NA          1(0)        NA          NA
          FDR (G_1)         0.28(0.05)  NA          NA          0.21(0.1)   NA          NA
          FDR (G_2)         0.59(0.16)  NA          NA          0.54(0.21)  NA          NA
BH-FDR    TPR               1(0)        0.95(0)     0.94(0)     1(0)        0.94(0.01)  0.75(0.01)
          FDR               0.18(0.01)  0.5(0)      0.54(0)     0.18(0.01)  0.5(0)      0.54(0.01)

The average computing time of SICERS is around 14 min (greedy: 6 min and Louvain: 3 min) on a PC with Intel i7-9700K CPU and 16GB of RAM.

4. Applications to brain connectome data

4.1. Data background

We applied our SICERS method to rs-fMRI brain connectome analysis for schizophrenia research. The data were collected at the School of Medicine of the University of Maryland to investigate the associations between brain functional connectivity and schizophrenia (Adhikari and others, 2019). The imaging acquisition parameters, patient inclusion and exclusion criteria, and preprocessing steps are described in detail in the supplementary material available at Biostatistics online.

To assess the replicability of brain connectome analysis, we used two independent data sets: a primary set |$D^1$| and a validation set |$D^2$|. The primary data set |$D^1$| contained 70 schizophrenia patients (age = |$40.80\pm13.63$| years) and 70 control subjects (age = |$41.79\pm13.44$| years) matched by age (|$t=0.62$|, |$p=0.54$|) and sex ratio (|$\chi^2=0$|, |$p=1$|). The validation data set |$D^2$| contained another 30 individuals with schizophrenia (age = |$39.73\pm13.79$| years) and 30 control subjects (age = |$39.73\pm14.16$| years) matched by age (|$t=0.27$|, |$p=0.78$|) and sex ratio (|$\chi^2=0.09$|, |$p=0.77$|). The primary and validation data sets were randomly selected and shared recruitment procedures, inclusion and exclusion criteria, and imaging acquisition and preprocessing steps. Nodes of the connectome graph |$G$| were specified by the commonly used automated anatomical labeling (AAL) atlas. Time courses of all voxels within a 10-mm sphere around the centroid of each region were preprocessed as region-wise signals, followed by calculating the 4005 Pearson correlation coefficients between the time courses of the 90 AAL regions (i.e., |$n=90$| for all |$\boldsymbol{A}^s$|). We used Fisher’s |$Z$| transformation and normalization to obtain connectivity matrices. We performed statistical analysis on these data sets separately, identified the covariate-related subnetworks, and compared the significant disease-related subnetworks for |$D^1$| and |$D^2$|. We also compared the results obtained by SICERS with those of conventional edge-wise inference and commonly used network methods.

4.2. Covariate-related subnetworks

For |$D^1$|, we first conducted an edge-wise Wilcoxon rank sum test for each age- and sex-adjusted edge |$A_{ij}$| to obtain the |$p$|-value |$p_{ij}$| and the inference matrix |$W^1$| with elements |$w_{ij}=-\log (p_{ij})$|, although regression models could also be applied. We then applied SICERS to |$W^1$|, and our method detected one significant subnetwork |$\hat G_1^1$| with an empirical subnetwork |$p$|-value of less than |$0.001$|. This subnetwork contained 22 nodes, including the left medial frontal cortex, bilateral insula, bilateral anterior and middle cingulate cortices, bilateral Heschl gyrus and superior temporal cortices, bilateral paracentral and postcentral cortices, right precentral cortex, and precuneus (Figures 3(a)–(c)) (a full list of region names is given in Table S1 of the supplementary material available at Biostatistics online).

Fig. 3.

Applying SICERS to clinical data |$D^1$| (a)–(c) and replication data |$D^2$| (d)–(f). (a) A heatmap of |$-\log(p)$| of the first data set (|$D^1$|); hotter pixels indicate more differential edges between cases and controls, and there is no apparent topological pattern for these hot edges. (b) We then perform SICERS on |$D^1$| and find a significant subnetwork [the bold square, which is magnified in (c)]. (c) The enlarged disease-relevant subnetwork in |$D^1$| with region names. (d) A heatmap of |$-\log(p)$| of the second data set (|$D^2$|). (e) The disease-relevant subnetwork detected by using |$D^2$| alone. (f) The enlarged network in |$D^2$| with region names. To save space here, versions of the enlarged plots (c) and (f) with more-readable axis labels are included in the supplementary material available at Biostatistics online.

We then applied the same steps of SICERS to |$D^2$| and also detected one significant subnetwork |$\hat G_1^2$| of 21 nodes, including the left medial superior frontal gyrus, bilateral insula, bilateral anterior and middle cingulate cortices, bilateral Heschl gyrus, Rolandic operculums, supplementary motor areas, paracentral lobules, postcentral lobules, and left precuneus (Figures 3(d) and (e)). In both |$\hat G_1^1$| and |$\hat G_1^2$|⁠, most edges showed reduced connectivity in patients with schizophrenia.

4.3. Replicability of disease-related subnetworks

A remarkable feature of our method is the high replicability of its network-level findings. Specifically, we find that the disease-related subnetworks for |$D^1$| and |$D^2$| are almost identical (|$\hat G_1^2\subset \hat G_1^1$|), which would occur with near-zero probability if the significant edge-wise hypotheses |$H_{(i,j)}$| were not organized as subnetworks but rather scattered randomly. This demonstrates that the subnetwork structure detected by our method reflects not randomness but significant patterns that emerge stably across independently collected data batches.

We also applied the NBS and univariate inference methods to the same input data (|$D^1$| and |$D^2$|). Neither NBS nor BH-FDR selected significant subnetworks/edges, owing to the influence of noise. When the uncorrected |$p$|-value threshold of |$0.005$|—a commonly used threshold in the field of neuroimaging (Derado and others, 2010)—was instead applied to |$W^1$| and |$W^2$|, 430 and 22 suprathreshold edges, respectively, were reported for the two data sets. However, among the two sets of suprathreshold edges, only two edges overlapped. To summarize, for these data sets, none of the benchmark methods rejected any individual |$H_{(i,j);0}$| after multiplicity correction and thus they all reported no pattern discovery, whereas our SICERS method—by exploiting the network structure—detected a significant subnetwork structure with good replicability.

4.4. Biological insights from the covariate-related subnetwork

The brain region constellation of the covariate-related subnetwork consists of inferior frontal, superior temporal, insula, cingulate, and paracentral areas (as shown in Figure S5 in the supplementary material available at Biostatistics online). These brain regions comprise three well-known networks: the SN (bilateral), part of the DMN, and part of the CEN. A large body of literature on schizophrenia research has reported well-replicated findings on the neurobiology of schizophrenic disorders pertaining to these three networks (Orliac and others, 2013). The consensus is that functional connections within and between these networks are weaker in patients with schizophrenia than in healthy controls (Lynall and others, 2010), although the potentially confounding effects of medications in these studies have not been ruled out effectively. This aligns well with our finding that all edges in the disease-related subnetwork show decreased connectivity strengths in patients. Our findings regarding disease-related subnetworks are novel because they provide an integrated understanding of the intrinsic large-scale networks altered by the brain disorder. They reveal systematically the disruption of high-level coordination between neural populations that is linked with clinical symptoms of schizophrenia, including deficits in information processing or blunted reward (SN), language (temporal gyri), and anhedonia (CEN), and—more importantly—the integrated function formed by the interactions of these networks. In summary, our disease-related subnetwork analysis provides a comprehensive investigation of disease-specific brain networks and thus can yield new insights into the complex neurobiology of a brain disorder. We further demonstrate the utility of our method by investigating age- and sex (covariate)-related subnetworks based on 22 000 participants from the UK Biobank in the supplementary material available at Biostatistics online.

5. Discussion

We have developed a new tool—SICERS—to identify covariate-related subnetworks in brain connectome data. Our work represents a new strategy for handling multivariate edge variables as outcomes constrained in an adjacency matrix. In practice, a covariate may influence a small proportion of edge outcomes, and these edges may reside in organized subgraphs/subnetworks. Like the popular cluster-wise inference for brain activity analysis, SICERS aims to extract covariate-related subnetworks as clusters of covariate-related edges for connectome analysis. However, extracting latent covariate-related subnetworks is more challenging than extracting activity clusters of spatially adjacent voxels: a small proportion of selected edges can almost surely connect the nodes into a subnetwork that includes all nodes, and a covariate-related subnetwork involving all nodes is neither biologically sound nor statistically accurate. To address this challenge, we define a covariate-related subnetwork as a subgraph with an organized topological structure (e.g., a community) that is concentrated with covariate-related edges. Lemma 1 demonstrates that the chance of a false-positive, nontrivial, and dense subnetwork is close to zero. Using both theoretical and numerical results in Sections 3 and 4, we further show that, by leveraging this property, our subnetwork-level analysis can improve both network-level and edge-level sensitivity while controlling false-positive findings.
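The intuition behind this property can be made concrete with a back-of-the-envelope calculation (ours, not Lemma 1 itself; all constants below are hypothetical): with an edge-wise false-positive rate |$\alpha$|, the chance that any |$m$|-node subset of an |$N$|-node connectome contains at least a fraction |$\rho$| of falsely selected edges is bounded by |$\binom{N}{m}$| times a binomial tail probability, which is numerically negligible for nontrivial |$m$| and |$\rho$|.

```python
# Back-of-the-envelope union bound on a false-positive dense subnetwork
# (illustrative constants only; not the constants or the proof of Lemma 1).
from math import comb
from scipy.stats import binom

N, m, alpha, rho = 90, 20, 0.005, 0.30        # atlas size, subnetwork size, FP rate, density
n_pairs = m * (m - 1) // 2                    # possible edges inside an m-node subset
tail = binom.sf(int(rho * n_pairs) - 1, n_pairs, alpha)  # P(Binomial(n_pairs, alpha) >= rho * n_pairs)
union_bound = comb(N, m) * tail               # crude union bound over all m-node subsets

print(f"per-subset tail: {tail:.3e}, union bound over subsets: {union_bound:.3e}")
```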

We implement computationally efficient algorithms for SICERS to extract subnetworks covering the maximal number of covariate-related edges (high sensitivity) under an |$\ell_0$| penalty on subnetwork size. The |$\ell_0$| penalty ensures that the selected subnetworks are dense and suppresses false-positive edges (i.e., fewer nodes are included). Our algorithm differs from existing dense subgraph extraction algorithms because SICERS can reveal multiple subnetworks more effectively (as seen in the simulations). Our results also suggest that implementing the |$\ell_0$| penalty for multivariate edge variables can be less computationally expensive than for the traditional variable-selection setting of a vector of variables. In addition, we perform network-level statistical inference using a permutation test with tailored subnetwork-level test statistics to control the FWER (Eklund and others, 2016). Because SICERS focuses on network-level inference, it cannot capture individual covariate-related edges that are not part of subnetworks. An alternative approach is to use edge-level inference with FWER/FDR correction to identify individual covariate-related edges.
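To convey the overall strategy, the following is a deliberately simplified greedy sketch of |$\ell_0$|-penalized dense-subnetwork extraction followed by a permutation test. It is an illustration of the general idea only, not the SICERS implementation (available at https://github.com/shuochenstats/SICERS); the function `edge_stats` and the inputs `X`, `y` are user-supplied placeholders.

```python
# Simplified greedy sketch of penalized subnetwork extraction + permutation test.
import numpy as np

def greedy_subnetwork(W, lam):
    """Greedily grow a node set from the strongest edge; add a node only when its
    edge statistics into the current set exceed the l0 penalty lam on node count.
    W: symmetric node-by-node matrix of edge-level test statistics."""
    W = np.asarray(W, dtype=float).copy()
    np.fill_diagonal(W, 0.0)                       # ignore self-loops
    i, j = np.unravel_index(np.argmax(W), W.shape)
    nodes, score = {int(i), int(j)}, float(W[i, j]) - 2 * lam
    while True:
        candidates = set(range(W.shape[0])) - nodes
        if not candidates:
            break
        gains = {v: float(W[list(nodes), v].sum()) - lam for v in candidates}
        v = max(gains, key=gains.get)
        if gains[v] <= 0:                          # stopping rule: no node pays for its penalty
            break
        nodes.add(v); score += gains[v]
    return sorted(nodes), score

def permutation_test(edge_stats, X, y, lam, n_perm=500, seed=0):
    """FWER-style permutation test: recompute edge statistics under permuted
    covariate labels, re-extract the top subnetwork, and compare scores."""
    rng = np.random.default_rng(seed)
    _, obs = greedy_subnetwork(edge_stats(X, y), lam)
    null = [greedy_subnetwork(edge_stats(X, rng.permutation(y)), lam)[1]
            for _ in range(n_perm)]
    return (1 + sum(s >= obs for s in null)) / (1 + n_perm)
```

In practice, `edge_stats(X, y)` would return, for example, edge-wise regression or two-sample test statistics relating the covariate to each connectivity edge; the actual SICERS objective, penalty, and test statistic differ from this sketch.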

SICERS is generally applicable to multivariate edge variables, for example, structural and functional brain connectome data. Although we focus on a single covariate in SICERS, the method extends straightforwardly to a contrast of parameters combining multiple covariates or to a dominating factor of multiple covariates obtained by dimension-reduction techniques. The software package for SICERS is available at https://github.com/shuochenstats/SICERS.

Supplementary material

Supplementary material is available at http://biostatistics.oxfordjournals.org.

Acknowledgments

Conflict of Interest: The authors declare no conflict of interest.

Funding

This work was supported by the National Institutes of Health under Award Numbers 1DP1DA04896801, EB008432 and EB008281.

References

Adhikari, B. M., Hong, L. E., Calhoun, V. D., Du, X., Chen, S. and others. (2019). Functional network connectivity impairments and core cognitive deficits in schizophrenia. Human Brain Mapping 40, 4593–4605.

Bickel, P. J. and Chen, A. (2009). A nonparametric view of network models and Newman–Girvan and other modularities. Proceedings of the National Academy of Sciences United States of America 106, 21068–21073.

Blondel, V. D., Guillaume, J.-L., Lambiotte, R. and Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment 2008, P10008.

Bowman, F. D. (2005). Spatio-temporal modeling of localized brain activity. Biostatistics 6, 558–575.

Bowman, F. D., Zhang, L., Derado, G. and Chen, S. (2012). Determining functional connectivity using fMRI data with diffusion-based anatomical weighting. NeuroImage 62, 1769–1779.

Cai, T., Li, H., Ma, J. and Xia, Y. (2019). Differential Markov random field analysis with an application to detecting differential microbial community networks. Biometrika 106, 401–416.

Cao, X., Sandstede, B. and Luo, X. (2019). A functional data method for causal dynamic network modeling of task-related fMRI. Frontiers in Neuroscience 13.

Chen, S., Bowman, F. D. and Mayberg, H. S. (2016). A Bayesian hierarchical framework for modeling brain connectivity for neuroimaging data. Biometrics 72, 596–605.

Chen, S., Kang, J., Xing, Y. and Wang, G. (2015). A parsimonious statistical method to detect groupwise differentially expressed functional connectivity networks. Human Brain Mapping 36, 5196–5206.

Chen, S., Xing, Y., Kang, J., Kochunov, P. and Hong, L. E. (2020). Bayesian modeling of dependence in brain connectivity data. Biostatistics 21, 269–286.

Craddock, R. C., Jbabdi, S., Yan, C.-G., Vogelstein, J. T., Castellanos, F. X. and others. (2013). Imaging human connectomes at the macroscale. Nature Methods 10, 524.

Derado, G., Bowman, F. D. and Kilts, C. D. (2010). Modeling the spatial and temporal dependence in fMRI data. Biometrics 66, 949–957.

Durante, D., Dunson, D. B. and others. (2018). Bayesian inference and testing of group differences in brain networks. Bayesian Analysis 13, 29–58.

Eklund, A., Nichols, T. E. and Knutsson, H. (2016). Cluster failure: why fMRI inferences for spatial extent have inflated false-positive rates. Proceedings of the National Academy of Sciences United States of America 113, 7900–7905.

Fan, L., Li, H., Zhuo, J., Zhang, Y., Wang, J. and others. (2016). The human brainnetome atlas: a new brain atlas based on connectional architecture. Cerebral Cortex 26, 3508–3526.

Hu, Y., Zeydabadinezhad, M., Li, L. and Guo, Y. (2022). A multimodal multilevel neuroimaging model for investigating brain connectome development. Journal of the American Statistical Association 117, 1134–1148.

Kundu, S., Ming, J., Pierce, J., McDowell, J. and Guo, Y. (2018). Estimating dynamic brain functional networks using multi-subject fMRI data. NeuroImage 183, 635–649.

Lukemire, J., Kundu, S., Pagnoni, G. and Guo, Y. (2021). Bayesian joint modeling of multiple brain functional networks. Journal of the American Statistical Association 116, 518–530.

Lynall, M.-E., Bassett, D. S., Kerwin, R., McKenna, P. J., Kitzbichler, M., Muller, U. and Bullmore, E. (2010). Functional connectivity and brain networks in schizophrenia. Journal of Neuroscience 30, 9477–9487.

Manoliu, A., Riedl, V., Zherdin, A., Mühlau, M., Schwerthöffer, D. and others. (2014). Aberrant dependence of default mode/central executive network interactions on anterior insular salience network activity in schizophrenia. Schizophrenia Bulletin 40, 428–437.

Nichols, T. E. (2012). Multiple testing corrections, nonparametric methods, and random field theory. NeuroImage 62, 811–815.

Orliac, F., Naveau, M., Joliot, M., Delcroix, N. and others. (2013). Links among resting-state default-mode network, salience network, and symptomatology in schizophrenia. Schizophrenia Research 148, 74–80.

Rosvall, M. and Bergstrom, C. T. (2008). Maps of random walks on complex networks reveal community structure. Proceedings of the National Academy of Sciences United States of America 105, 1118–1123.

Shi, R. and Guo, Y. (2016). Investigating differences in brain functional networks using hierarchical covariate-adjusted independent component analysis. The Annals of Applied Statistics 10, 1930.

Simpson, S. L., Bahrami, M. and Laurienti, P. J. (2019). A mixed-modeling framework for analyzing multitask whole-brain network data. Network Neuroscience 3, 307–324.

Simpson, S. L., Bowman, F. D. and Laurienti, P. J. (2013). Analyzing complex functional brain networks: fusing statistics and network science to understand the brain. Statistics Surveys 7, 1.

Stepanov, V. E. (1970). On the probability of connectedness of a random graph g_m(t). Theory of Probability & Its Applications 15, 55–67.

Von Luxburg, U. (2007). A tutorial on spectral clustering. Statistics and Computing 17, 395–416.

Wang, W., Zhang, X. and Li, L. (2019). Common reducing subspace model and network alternation analysis. Biometrics 75, 1109–1120.

Warnick, R., Guindani, M., Erhardt, E., Allen, E., Calhoun, V. and Vannucci, M. (2018). A Bayesian approach for estimating dynamic functional network connectivity in fMRI data. Journal of the American Statistical Association 113, 134–151.

Wu, Q. and Hao, J.-K. (2015). A review on algorithms for maximum clique problems. European Journal of Operational Research 242, 693–709.

Wu, Q., Huang, X., Culbreth, A. J., Waltz, J. A., Hong, L. E. and Chen, S. Extracting brain disease-related connectome subgraphs by adaptive dense subgraph discovery. Biometrics.

Xia, Y. and Li, L. (2017). Hypothesis testing of matrix graph model with application to brain connectivity analysis. Biometrics 73, 780–791.

Zalesky, A., Fornito, A. and Bullmore, E. T. (2010). Network-based statistic: identifying differences in brain networks. NeuroImage 53, 1197–1207.

Zhang, J., Sun, W. W. and Li, L. (2023). Generalized connectivity matrix response regression with applications in brain connectivity studies. Journal of Computational and Graphical Statistics 32, 252–262.

Zhang, T., Yin, Q., Caffo, B. and others. (2017). Bayesian inference of high-dimensional, cluster-structured ordinary differential equation models with applications to brain connectivity studies. The Annals of Applied Statistics 11, 868–897.

Zhang, Y., Levina, E., Zhu, J. and others. (2016). Community detection in networks with node features. Electronic Journal of Statistics 10, 3153–3178.

Zhao, Y., Levina, E., Zhu, J. and others. (2012). Consistency of community detection in networks under degree-corrected stochastic block models. The Annals of Statistics 40, 2266–2292.

Zhao, Y., Wang, B., Caffo, B. S. and Luo, X. (2021). Covariate assisted principal regression for covariance matrix outcomes. Biostatistics 22, 629–645.
