Detecting Heterogeneity in Population Structure Across the Genome in Admixed Populations Free

Empirical type I error

α	CAnD empirical type I error (95% C.I.)
0.01	0.0118 (0.009, 0.015)
0.005	0.0053 (0.003, 0.007)
0.001	0.0004 (0, 0.0013)

Shown is CAnD empirical type I error (95% C.I.) at significance levels $α =$ 0.01, 0.005, and 0.001 based on 5000 simulated replicates. This simulation setting was conducted under the null hypothesis where the randomly drawn ancestry proportions of an admixed individual are the same for all chromosomes.

Table 1

Empirical type I error

α	CAnD empirical type I error (95% C.I.)
0.01	0.0118 (0.009, 0.015)
0.005	0.0053 (0.003, 0.007)
0.001	0.0004 (0, 0.0013)

Shown is CAnD empirical type I error (95% C.I.) at significance levels $α =$ 0.01, 0.005, and 0.001 based on 5000 simulated replicates. This simulation setting was conducted under the null hypothesis where the randomly drawn ancestry proportions of an admixed individual are the same for all chromosomes.

Power evaluation and comparison

We evaluated the power of CAnD for detecting heterogeneity in ancestry across 22 autosomal chromosomes in simulated samples with 50 admixed individuals. We also compared the power of CAnD to an ANOVA test that does not account for correlation in ancestry across chromosomes within an admixed individual. In the simulation studies, all autosomal chromosomes, except for chromosome 2, were chosen to have the same mean ancestry, on average. Chromosome 2 had a mean ancestry difference of $μ_{d}$ from the other autosomal chromosomes, and we considered values of $μ_{d}$ ranging from 0.005 to 0.2.

Empirical power results for CAnD and ANOVA at a significance level of $α = 0.01$ are given in Figure 1. CAnD has significantly higher power than ANOVA for detecting low to moderate chromosomal ancestry differences. For example, there is essentially no power to detect a mean ancestry difference of $μ_{d} = 5 %$ between chromosome 2 and all other chromosomes with ANOVA, while CAnD has power that is close to 1. The substantial loss in power with ANOVA is due to the method not accounting for correlated ancestry among chromosomes in the simulation study that has considerable between-individual variation in proportional ancestry. In practice, we expect that the CAnD test will provide higher power than ANOVA for detecting ancestry differences among chromosomes in recently admixed populations, such as Hispanics, who have large variation in continental admixture (Conomos et al. 2016).

Figure 1

Empirical power of CAnD and ANOVA tests with simulated data. Shown is the proportion of tests rejected at a significance level of 0.01 with the CAnD and ANOVA tests for mean ancestry difference values between chromosome 2 and the other 21 autosomal chromosomes ranging from 0.005 to 0.2. For each simulated mean ancestry difference setting, the proportion of tests rejected among 500 independent simulated replicates is shown, where each replicate sample contains 50 admixed individuals.

HapMap ASW ancestry

Table 2 shows the mean and SD of the local ancestry estimates for ASW by chromosome in each of the ancestral populations, and Figure 2A shows violin plots of the local ancestry results by chromosome. The ASW are largely African derived with significantly less European ancestry. Across both the autosomes and the X chromosome, proportional Native American, on average, is quite small in the ASW relative to African and European ancestry. Interestingly, RFMix estimated 57 of the 87 ASW individuals to have no Native American ancestry on the X chromosome and 11 individuals to have no European ancestry on the X chromosome. There were 9 ASW individuals estimated to have an X chromosome that is entirely African derived. Proportional African ancestry on the autosomes ranged from 0.56 to 0.97 and ranged from 0.33 to 1 on the X. The ASW ancestry patterns on the autosomes and the X can be seen in the bar plots shown in Figure 3A, which displays the proportion of ancestry for each sampled individual.

Summary of local ancestry estimates by chromosome

Table 2

Summary of local ancestry estimates by chromosome

	ASW			MXL
Chr	African	European	Native American	African	European	Native American
X	0.820 (0.139)	0.163 (0.136)	0.017 (0.047)	0.0396 (0.0521)	0.387 (0.245)	0.574 (0.248)
Autosomal-wide	0.783 (0.0861)	0.202 (0.0808)	0.0150 (0.0382)	0.0489 (0.0182)	0.508 (0.149)	0.444 (0.148)
1	0.762 (0.13)	0.228 (0.131)	0.00962 (0.0354)	0.047 (0.0389)	0.525 (0.192)	0.428 (0.191)
2	0.789 (0.132)	0.201 (0.128)	0.0106 (0.0241)	0.0457 (0.0379)	0.514 (0.195)	0.440 (0.188)
3	0.769 (0.155)	0.221 (0.154)	0.0102 (0.0369)	0.0462 (0.0345)	0.514 (0.18)	0.439 (0.183)
4	0.807 (0.136)	0.177 (0.134)	0.0164 (0.0398)	0.0461 (0.0408)	0.470 (0.212)	0.484 (0.206)
5	0.786 (0.149)	0.199 (0.146)	0.0148 (0.0536)	0.0539 (0.05)	0.528 (0.2)	0.418 (0.188)
6	0.774 (0.167)	0.201 (0.15)	0.0257 (0.0696)	0.0555 (0.0502)	0.500 (0.179)	0.445 (0.177)
7	0.804 (0.125)	0.184 (0.117)	0.012 (0.0539)	0.056 (0.047)	0.524 (0.193)	0.420 (0.188)
8	0.785 (0.163)	0.201 (0.16)	0.0141 (0.0419)	0.0397 (0.0349)	0.504 (0.187)	0.456 (0.179)
9	0.772 (0.12)	0.210 (0.116)	0.0175 (0.0506)	0.0499 (0.0521)	0.489 (0.21)	0.462 (0.213)
10	0.785 (0.145)	0.205 (0.14)	0.00997 (0.047)	0.059 (0.066)	0.502 (0.189)	0.439 (0.183)
11	0.778 (0.141)	0.212 (0.139)	0.00953 (0.0255)	0.0402 (0.0422)	0.525 (0.202)	0.435 (0.201)
12	0.779 (0.142)	0.202 (0.137)	0.0186 (0.0625)	0.0501 (0.0488)	0.511 (0.18)	0.439 (0.177)
13	0.804 (0.152)	0.180 (0.149)	0.0165 (0.032)	0.0488 (0.0424)	0.523 (0.199)	0.428 (0.201)
14	0.802 (0.162)	0.183 (0.155)	0.015 (0.0577)	0.0559 (0.0643)	0.470 (0.217)	0.474 (0.219)
15	0.817 (0.141)	0.172 (0.138)	0.0109 (0.0399)	0.0382 (0.0452)	0.528 (0.182)	0.434 (0.179)
16	0.778 (0.192)	0.201 (0.183)	0.0207 (0.0631)	0.0456 (0.0449)	0.498 (0.205)	0.457 (0.209)
17	0.772 (0.156)	0.207 (0.145)	0.0208 (0.079)	0.041 (0.043)	0.533 (0.195)	0.426 (0.192)
18	0.772 (0.21)	0.210 (0.196)	0.0184 (0.0605)	0.0501 (0.047)	0.537 (0.207)	0.413 (0.198)
19	0.780 (0.154)	0.213 (0.155)	0.00745 (0.0189)	0.0654 (0.0809)	0.506 (0.208)	0.429 (0.202)
20	0.801 (0.167)	0.187 (0.152)	0.0125 (0.0611)	0.0541 (0.0616)	0.520 (0.194)	0.426 (0.195)
21	0.760 (0.191)	0.220 (0.186)	0.0202 (0.0748)	0.0451 (0.0529)	0.475 (0.235)	0.480 (0.232)
22	0.747 (0.209)	0.234 (0.206)	0.0188 (0.0671)	0.0419 (0.0506)	0.470 (0.196)	0.488 (0.204)

	ASW			MXL
Chr	African	European	Native American	African	European	Native American
X	0.820 (0.139)	0.163 (0.136)	0.017 (0.047)	0.0396 (0.0521)	0.387 (0.245)	0.574 (0.248)
Autosomal-wide	0.783 (0.0861)	0.202 (0.0808)	0.0150 (0.0382)	0.0489 (0.0182)	0.508 (0.149)	0.444 (0.148)
1	0.762 (0.13)	0.228 (0.131)	0.00962 (0.0354)	0.047 (0.0389)	0.525 (0.192)	0.428 (0.191)
2	0.789 (0.132)	0.201 (0.128)	0.0106 (0.0241)	0.0457 (0.0379)	0.514 (0.195)	0.440 (0.188)
3	0.769 (0.155)	0.221 (0.154)	0.0102 (0.0369)	0.0462 (0.0345)	0.514 (0.18)	0.439 (0.183)
4	0.807 (0.136)	0.177 (0.134)	0.0164 (0.0398)	0.0461 (0.0408)	0.470 (0.212)	0.484 (0.206)
5	0.786 (0.149)	0.199 (0.146)	0.0148 (0.0536)	0.0539 (0.05)	0.528 (0.2)	0.418 (0.188)
6	0.774 (0.167)	0.201 (0.15)	0.0257 (0.0696)	0.0555 (0.0502)	0.500 (0.179)	0.445 (0.177)
7	0.804 (0.125)	0.184 (0.117)	0.012 (0.0539)	0.056 (0.047)	0.524 (0.193)	0.420 (0.188)
8	0.785 (0.163)	0.201 (0.16)	0.0141 (0.0419)	0.0397 (0.0349)	0.504 (0.187)	0.456 (0.179)
9	0.772 (0.12)	0.210 (0.116)	0.0175 (0.0506)	0.0499 (0.0521)	0.489 (0.21)	0.462 (0.213)
10	0.785 (0.145)	0.205 (0.14)	0.00997 (0.047)	0.059 (0.066)	0.502 (0.189)	0.439 (0.183)
11	0.778 (0.141)	0.212 (0.139)	0.00953 (0.0255)	0.0402 (0.0422)	0.525 (0.202)	0.435 (0.201)
12	0.779 (0.142)	0.202 (0.137)	0.0186 (0.0625)	0.0501 (0.0488)	0.511 (0.18)	0.439 (0.177)
13	0.804 (0.152)	0.180 (0.149)	0.0165 (0.032)	0.0488 (0.0424)	0.523 (0.199)	0.428 (0.201)
14	0.802 (0.162)	0.183 (0.155)	0.015 (0.0577)	0.0559 (0.0643)	0.470 (0.217)	0.474 (0.219)
15	0.817 (0.141)	0.172 (0.138)	0.0109 (0.0399)	0.0382 (0.0452)	0.528 (0.182)	0.434 (0.179)
16	0.778 (0.192)	0.201 (0.183)	0.0207 (0.0631)	0.0456 (0.0449)	0.498 (0.205)	0.457 (0.209)
17	0.772 (0.156)	0.207 (0.145)	0.0208 (0.079)	0.041 (0.043)	0.533 (0.195)	0.426 (0.192)
18	0.772 (0.21)	0.210 (0.196)	0.0184 (0.0605)	0.0501 (0.047)	0.537 (0.207)	0.413 (0.198)
19	0.780 (0.154)	0.213 (0.155)	0.00745 (0.0189)	0.0654 (0.0809)	0.506 (0.208)	0.429 (0.202)
20	0.801 (0.167)	0.187 (0.152)	0.0125 (0.0611)	0.0541 (0.0616)	0.520 (0.194)	0.426 (0.195)
21	0.760 (0.191)	0.220 (0.186)	0.0202 (0.0748)	0.0451 (0.0529)	0.475 (0.235)	0.480 (0.232)
22	0.747 (0.209)	0.234 (0.206)	0.0188 (0.0671)	0.0419 (0.0506)	0.470 (0.196)	0.488 (0.204)

Shown is mean (SD) of local ancestry estimates by chromosome, stratified by the ASW and MXL HapMap population samples.

Table 2

Summary of local ancestry estimates by chromosome

	ASW			MXL
Chr	African	European	Native American	African	European	Native American
X	0.820 (0.139)	0.163 (0.136)	0.017 (0.047)	0.0396 (0.0521)	0.387 (0.245)	0.574 (0.248)
Autosomal-wide	0.783 (0.0861)	0.202 (0.0808)	0.0150 (0.0382)	0.0489 (0.0182)	0.508 (0.149)	0.444 (0.148)
1	0.762 (0.13)	0.228 (0.131)	0.00962 (0.0354)	0.047 (0.0389)	0.525 (0.192)	0.428 (0.191)
2	0.789 (0.132)	0.201 (0.128)	0.0106 (0.0241)	0.0457 (0.0379)	0.514 (0.195)	0.440 (0.188)
3	0.769 (0.155)	0.221 (0.154)	0.0102 (0.0369)	0.0462 (0.0345)	0.514 (0.18)	0.439 (0.183)
4	0.807 (0.136)	0.177 (0.134)	0.0164 (0.0398)	0.0461 (0.0408)	0.470 (0.212)	0.484 (0.206)
5	0.786 (0.149)	0.199 (0.146)	0.0148 (0.0536)	0.0539 (0.05)	0.528 (0.2)	0.418 (0.188)
6	0.774 (0.167)	0.201 (0.15)	0.0257 (0.0696)	0.0555 (0.0502)	0.500 (0.179)	0.445 (0.177)
7	0.804 (0.125)	0.184 (0.117)	0.012 (0.0539)	0.056 (0.047)	0.524 (0.193)	0.420 (0.188)
8	0.785 (0.163)	0.201 (0.16)	0.0141 (0.0419)	0.0397 (0.0349)	0.504 (0.187)	0.456 (0.179)
9	0.772 (0.12)	0.210 (0.116)	0.0175 (0.0506)	0.0499 (0.0521)	0.489 (0.21)	0.462 (0.213)
10	0.785 (0.145)	0.205 (0.14)	0.00997 (0.047)	0.059 (0.066)	0.502 (0.189)	0.439 (0.183)
11	0.778 (0.141)	0.212 (0.139)	0.00953 (0.0255)	0.0402 (0.0422)	0.525 (0.202)	0.435 (0.201)
12	0.779 (0.142)	0.202 (0.137)	0.0186 (0.0625)	0.0501 (0.0488)	0.511 (0.18)	0.439 (0.177)
13	0.804 (0.152)	0.180 (0.149)	0.0165 (0.032)	0.0488 (0.0424)	0.523 (0.199)	0.428 (0.201)
14	0.802 (0.162)	0.183 (0.155)	0.015 (0.0577)	0.0559 (0.0643)	0.470 (0.217)	0.474 (0.219)
15	0.817 (0.141)	0.172 (0.138)	0.0109 (0.0399)	0.0382 (0.0452)	0.528 (0.182)	0.434 (0.179)
16	0.778 (0.192)	0.201 (0.183)	0.0207 (0.0631)	0.0456 (0.0449)	0.498 (0.205)	0.457 (0.209)
17	0.772 (0.156)	0.207 (0.145)	0.0208 (0.079)	0.041 (0.043)	0.533 (0.195)	0.426 (0.192)
18	0.772 (0.21)	0.210 (0.196)	0.0184 (0.0605)	0.0501 (0.047)	0.537 (0.207)	0.413 (0.198)
19	0.780 (0.154)	0.213 (0.155)	0.00745 (0.0189)	0.0654 (0.0809)	0.506 (0.208)	0.429 (0.202)
20	0.801 (0.167)	0.187 (0.152)	0.0125 (0.0611)	0.0541 (0.0616)	0.520 (0.194)	0.426 (0.195)
21	0.760 (0.191)	0.220 (0.186)	0.0202 (0.0748)	0.0451 (0.0529)	0.475 (0.235)	0.480 (0.232)
22	0.747 (0.209)	0.234 (0.206)	0.0188 (0.0671)	0.0419 (0.0506)	0.470 (0.196)	0.488 (0.204)

	ASW			MXL
Chr	African	European	Native American	African	European	Native American
X	0.820 (0.139)	0.163 (0.136)	0.017 (0.047)	0.0396 (0.0521)	0.387 (0.245)	0.574 (0.248)
Autosomal-wide	0.783 (0.0861)	0.202 (0.0808)	0.0150 (0.0382)	0.0489 (0.0182)	0.508 (0.149)	0.444 (0.148)
1	0.762 (0.13)	0.228 (0.131)	0.00962 (0.0354)	0.047 (0.0389)	0.525 (0.192)	0.428 (0.191)
2	0.789 (0.132)	0.201 (0.128)	0.0106 (0.0241)	0.0457 (0.0379)	0.514 (0.195)	0.440 (0.188)
3	0.769 (0.155)	0.221 (0.154)	0.0102 (0.0369)	0.0462 (0.0345)	0.514 (0.18)	0.439 (0.183)
4	0.807 (0.136)	0.177 (0.134)	0.0164 (0.0398)	0.0461 (0.0408)	0.470 (0.212)	0.484 (0.206)
5	0.786 (0.149)	0.199 (0.146)	0.0148 (0.0536)	0.0539 (0.05)	0.528 (0.2)	0.418 (0.188)
6	0.774 (0.167)	0.201 (0.15)	0.0257 (0.0696)	0.0555 (0.0502)	0.500 (0.179)	0.445 (0.177)
7	0.804 (0.125)	0.184 (0.117)	0.012 (0.0539)	0.056 (0.047)	0.524 (0.193)	0.420 (0.188)
8	0.785 (0.163)	0.201 (0.16)	0.0141 (0.0419)	0.0397 (0.0349)	0.504 (0.187)	0.456 (0.179)
9	0.772 (0.12)	0.210 (0.116)	0.0175 (0.0506)	0.0499 (0.0521)	0.489 (0.21)	0.462 (0.213)
10	0.785 (0.145)	0.205 (0.14)	0.00997 (0.047)	0.059 (0.066)	0.502 (0.189)	0.439 (0.183)
11	0.778 (0.141)	0.212 (0.139)	0.00953 (0.0255)	0.0402 (0.0422)	0.525 (0.202)	0.435 (0.201)
12	0.779 (0.142)	0.202 (0.137)	0.0186 (0.0625)	0.0501 (0.0488)	0.511 (0.18)	0.439 (0.177)
13	0.804 (0.152)	0.180 (0.149)	0.0165 (0.032)	0.0488 (0.0424)	0.523 (0.199)	0.428 (0.201)
14	0.802 (0.162)	0.183 (0.155)	0.015 (0.0577)	0.0559 (0.0643)	0.470 (0.217)	0.474 (0.219)
15	0.817 (0.141)	0.172 (0.138)	0.0109 (0.0399)	0.0382 (0.0452)	0.528 (0.182)	0.434 (0.179)
16	0.778 (0.192)	0.201 (0.183)	0.0207 (0.0631)	0.0456 (0.0449)	0.498 (0.205)	0.457 (0.209)
17	0.772 (0.156)	0.207 (0.145)	0.0208 (0.079)	0.041 (0.043)	0.533 (0.195)	0.426 (0.192)
18	0.772 (0.21)	0.210 (0.196)	0.0184 (0.0605)	0.0501 (0.047)	0.537 (0.207)	0.413 (0.198)
19	0.780 (0.154)	0.213 (0.155)	0.00745 (0.0189)	0.0654 (0.0809)	0.506 (0.208)	0.429 (0.202)
20	0.801 (0.167)	0.187 (0.152)	0.0125 (0.0611)	0.0541 (0.0616)	0.520 (0.194)	0.426 (0.195)
21	0.760 (0.191)	0.220 (0.186)	0.0202 (0.0748)	0.0451 (0.0529)	0.475 (0.235)	0.480 (0.232)
22	0.747 (0.209)	0.234 (0.206)	0.0188 (0.0671)	0.0419 (0.0506)	0.470 (0.196)	0.488 (0.204)

Shown is mean (SD) of local ancestry estimates by chromosome, stratified by the ASW and MXL HapMap population samples.

Figure 2

Local ancestry estimates by chromosome. Shown are chromosomal averaged local ancestry estimates for HapMap individuals using the RFMix software. Ancestry was estimated for each marker and then averaged across chromosomes. (A) Estimates for 87 HapMap ASW individuals. (B) Estimates for 86 HapMap MXL individuals. The reference samples for the European and African ancestries were HapMap CEU and YRI individuals, while the HGDP samples from the Americas were references for the Native American ancestry.

Figure 3

Bar plots of RFMix results. Shown are local ancestry estimates for HapMap individuals using the RFMix software. Each individual is represented by a vertical bar, where the European, African, and Native American ancestries are colored with blue, red, and green, respectively. Left and right panels represent the autosomal and X chromosome averages, respectively. (A) Estimates for 87 HapMap ASW individuals. (B) Estimates for 86 HapMap MXL individuals. The reference samples for the European and African ancestries were HapMap CEU and YRI individuals, while the HGDP samples from the Americas were references for the Native American ancestry.

We calculated the correlation of ancestry proportions across the autosomes and X chromosome for each ancestral subpopulation. The correlations between the autosomal and X chromosome proportions in the European and African ancestries are 0.20 and 0.17, respectively. Interestingly, with a correlation of 0.78, Native American ancestry between the autosomal and X chromosome is the highest despite this ancestry being the least prominent of the three. We find that the high correlation is being driven by two outlier individuals in the ASW with extremely high Native American ancestry (>0.2) on the autosomes and the X compared to the vast majority of ASW individuals who have little to no Native American ancestry. When the two outlier individuals in ASW with high Native American ancestry are excluded, the correlation between Native American ancestry on autosomes and the X chromosome is 0.029, which is similar to the correlation results of the least prominent ancestry in the MXL, as discussed in the next subsection.

HapMap MXL ancestry

From our local ancestry analysis of the 86 HapMap MXL individuals, we found the predominant ancestries to be European and Native American, as expected based on previously reported results (Thornton et al. 2012; Bryc et al. 2015), with African ancestry being quite modest with little variation. Table 2 shows the mean and SD of the average local ancestry estimates by chromosome and averaged across the autosomes within the MXL samples. Interestingly, proportional Native American ancestry is highest on the X chromosome, with a mean of 0.57, while for the autosomes, European ancestry is highest with a mean of 0.51. African ancestry on the autosomes and the X chromosome, however, is quite similar, with mean values of 0.04 and 0.05, respectively. Figure 2B shows violin plots by chromosome of the RFMix local ancestry estimates in the MXL samples. The plots illustrate the marked increase in proportional European ancestry across the autosomes and, correspondingly, a decrease in proportional Native American ancestry on the autosomes compared to the X chromosome. Figure 3B shows bar plots of the ancestral proportions within each individual. The proportion of both European and Native American ancestries on the X chromosome ranges from 0 to 1. The range and variation of the European and Native American ancestries on the X chromosome are larger than those estimated across the autosomes. Furthermore, Native American and European ancestries on the X chromosome are almost perfectly negatively correlated (corr = −0.98). Interestingly, there is one male MXL individual who has an X chromosome that is inferred to be completely Native American derived. The phased RFMix results of this individual’s mother indicate that one of her X chromosomes is entirely Native American derived while her other X chromosome is 69% Native American and 31% European, with five ancestry switches on the chromosome.

We also calculated correlations in ancestry between the average of the autosomes and the X chromosome. European and Native American ancestries have correlations of 0.71 and 0.67, respectively, between the autosomes and the X chromosome. With a correlation of 0.03, there is essentially no correlation in African ancestry between the autosomes and the X in the MXL.

Genome-wide ancestry heterogeneity testing: HapMap MXL and ASW

We applied the CAnD test to the set of 53 unrelated MXL individuals to test for heterogeneity in ancestry across all 23 chromosomes: the 22 autosomes, and the X chromosome. This CAnD test has 22 d.f. under the null hypothesis, and the genome-wide P-values for heterogeneity in African, European, and Native American ancestries are 0.592, 4.01e-05, and 9.57e-06, respectively. To gain insight into which chromosome(s) may be driving the significance of the genome-wide CAnD test for the European and Native American ancestries in the MXL, we used CAnD to test for ancestry differences between each chromosome and the pool of the ancestries of the other 22 chromosomes. Each of these tests has 1 d.f., and Figure 4 shows, by chromosome, the unadjusted (Figure 4A) and Bonferroni-adjusted (Figure 4B) CAnD P-values in the HapMap MXL for each of the three assumed ancestries. Chromosome 7 and the X chromosome have significantly larger proportions of Native American ancestry compared to the pooled Native American mean ancestry of all other chromosomes, at the 0.05 level before adjustment for multiple testing. The X chromosome also has significantly less European ancestry, at the 0.05 level, compared to the pooled autosomes. Chromosome 8 has a larger proportion of African ancestry compared to the pooled ancestry of all other chromosomes. Using a conservative Bonferroni multiple-testing correction, ancestry differences between the X chromosome and the autosomes remain significant for both the European and Native American ancestries in the MXL, while chromosomes 7 and 8 are no longer significant after Bonferroni correction.

Figure 4

Unadjusted and adjusted P-values from the CAnD test in the HapMap MXL samples. (A and B) Unadjusted (A) and adjusted (B) P-values by chromosome obtained from the CAnD test comparing the estimated ancestry for each chromosome with the mean ancestry of all remaining chromosomes, including the X chromosome, for the African, European, and Native American ancestries in the HapMap MXL samples. The adjusted P-values were calculated using the Bonferroni multiple-testing correction.

We also performed CAnD tests in the MXL excluding the X chromosome, and the overall CAnD test is not significant, with P-values of 0.532, 0.382, and 0.190 corresponding to the African, European, and Native American ancestries, respectively. These results provide additional evidence that differential ancestry on the X chromosome is driving the significant heterogeneity results of the genome-wide CAnD test. We also conducted CAnD tests for ancestry differences for each autosomal chromosome in turn compared to the pool of ancestries from the other autosomes, and none of the autosomal chromosomes are significant after Bonferroni correction (Figure S3).

In an analysis of the 45 unrelated ASW individuals, CAnD did not detect any significant differences in ancestry among the autosomal and X chromosomes. The genome-wide CAnD test for ancestry differences in the ASW had P-values of 0.122, 0.0858, and 0.243 for the African, European, and Native American ancestries, respectively (Figure S2). As previously mentioned, the autosomes and the X chromosome are predominantly African derived in the ASW, and a larger sample size is needed to achieve enough power to detect the smaller ancestry differences among chromosomes in the ASW. Indeed, in much larger population-based samples of African-Americans (Bryc et al. 2010a, 2015), increased African ancestry and decreased European ancestry have been reported for the X chromosome compared to the autosomes.

Assessing ancestry differences between the X and the autosomes: HapMap MXL and ASW

Previous studies have identified significant differences between autosomal and X chromosome ancestry proportions in individuals from admixed populations (Bryc et al. 2015), where these differences have been assessed using a pooled t-test that assumes independence in ancestry among chromosomes. As previously mentioned, CAnD can also be used to test for differences between the X chromosome and the pooled autosomes while appropriately accounting for ancestry correlations among chromosomes within an admixed individual.

Figure 5 shows histograms of the mean difference between the autosomal and X chromosome ancestry proportions for the subsets of 45 unrelated ASW (Figure 5A) and 53 unrelated MXL (Figure 5B) individuals, with a smoothed density line overlaid. The mean difference in European ancestry between the autosomes and the X chromosome is 0.12, and the mean difference for Native American ancestry is −0.13. Based on our simulation studies, we expect to have high power to detect such large differences in ancestry for a sample of this size. For the ASW samples, however, the mean difference between the X chromosome and the autosomes for the two predominant continental ancestries, African and European, is 0.04, which is a much smaller difference than observed for the two predominant ancestries in the MXL. As a result, we expect the power to detect a mean difference in ancestry between the X and the autosomes in the ASW to be much lower, compared to the MXL, for the predominant ancestries.

Figure 5

Difference in autosomal and X chromosome ancestry, by subpopulation. Shown are histograms of the difference in autosomal and X chromosome ancestry proportions among the (A) 45 unrelated HapMap ASW and (B) 53 unrelated HapMap MXL samples. The dashed line indicates the mean difference, whereas the solid line indicates zero. A smoothed density line is overlaid on each histogram.

We compared the results of the pooled t-test to a CAnD test with 1 d.f. for detecting differences in ancestry between the X chromosome and the autosomes in the HapMap ASW and MXL. As expected, no significant differences in ancestry were detected in the ASW with either method for any of the three continental ancestries. For the MXL, the pooled t-test identifies significant differences in European ancestry and Native American ancestry between the autosomes and the X chromosome, with a P-value of 0.001 for both analyses. In comparison, the CAnD test P-value is 9.17e-07 for a difference in European ancestry between the autosomes and the X chromosome in the MXL and 1.13e-06 for Native American ancestry, which is more than three orders of magnitude smaller than the P-values for the pooled t-test. There was no significant difference in African ancestry for both methods in the MXL.

Comparison of CAnD results using local vs. global ancestry estimates

We also performed a CAnD analysis in the HapMap MXL and ASW, using global ancestry estimates for each chromosome with the aforementioned FRAPPE method, which takes as input unphased genotype data and assumes independence among genetic markers on a chromosome (Figure S4). Table S1 contains the CAnD results using chromosome-wide ancestry estimates from FRAPPE as well as the previously discussed results from CAnD with local ancestry estimates from the RFMix method, which requires phased genotype data and takes into account LD among SNPs. For the ancestry heterogeneity analysis of the ASW with chromosome-wide ancestry estimates from FRAPPE, no differences in ancestry among chromosomes were detected with CAnD, similar to the CAnD results with local ancestry estimates from RFMix. Interestingly, for the MXL we found that the CAnD results for testing Native American ancestry are slightly more significant when using chromosome-wide ancestry estimates from FRAPPE compared to using local ancestry estimates from RFMix, with P-values of 9.47e-07 and 9.57e-06, respectively. However, this difference is likely due to FRAPPE ignoring LD among SNPs on a chromosome while RFMix incorporates LD in the ancestry estimation procedure. Despite methodological differences, however, inference about heterogeneity in population structure is qualitatively the same when using either local ancestry estimates from RFMix or global ancestry estimates from FRAPPE in the analyses of the ASW and MXL, as can be seen in Table S1.

We also compared autosomal-wide and X chromosome ancestry estimates from RFMix and FRAPPE, using genotype data for the HapMap MXL and ASW population samples. Table 3 shows the correlation of the ancestry estimates from the methods for each ancestral subpopulation. For the two predominant ancestries in the MXL (European and Native American) and ASW (African and European), the correlations between the ancestry estimates for the autosomes from RFMix and FRAPPE are all >0.99 and are ≥0.95 for the X chromosome. As previously mentioned, there is very little Native American ancestry and African ancestry in the ASW and MXL, respectively. Nevertheless, with a correlation of 0.99, Native American ancestry estimates on the autosomes are nearly perfectly correlated between RFMix and FRAPPE, and the correlation between the estimates is 0.90 for Native American ancestry on the X chromosome in the ASW. For proportional African ancestry in the MXL, the correlation between the two estimates is 0.893 for the autosomes and 0.93 for the X chromosome. So, for the predominant ancestries in the MXL and ASW, there appears to be little difference in estimating autosomal ancestries with FRAPPE or averaging local ancestry estimates from RFMix. There is high concordance between the methods for the predominant ancestry in ASW and MXL for the X chromosome as well. In general, there is less concordance between the methods when estimating proportional ancestries from populations with relatively small contributions to the admixed population, and local ancestry estimates, such as RFMix, are likely more accurate in inferring low levels of ancestral contribution than global ancestry methods, such as FRAPPE.

Correlation of ancestry estimates

Table 3

Correlation of ancestry estimates

	Autosomal		X chromosome
Ancestry	ASW	MXL	ASW	MXL
African	0.9990	0.8932	0.9697	0.9256
European	0.9979	0.9935	0.9548	0.9878
Native American	0.9963	0.9940	0.9001	0.9898

	Autosomal		X chromosome
Ancestry	ASW	MXL	ASW	MXL
African	0.9990	0.8932	0.9697	0.9256
European	0.9979	0.9935	0.9548	0.9878
Native American	0.9963	0.9940	0.9001	0.9898

Shown is correlation between ancestry estimates from RFMix and FRAPPE, stratified by autosomal and X chromosome estimates, in each of the population samples.

Table 3

Correlation of ancestry estimates

	Autosomal		X chromosome
Ancestry	ASW	MXL	ASW	MXL
African	0.9990	0.8932	0.9697	0.9256
European	0.9979	0.9935	0.9548	0.9878
Native American	0.9963	0.9940	0.9001	0.9898

	Autosomal		X chromosome
Ancestry	ASW	MXL	ASW	MXL
African	0.9990	0.8932	0.9697	0.9256
European	0.9979	0.9935	0.9548	0.9878
Native American	0.9963	0.9940	0.9001	0.9898

Shown is correlation between ancestry estimates from RFMix and FRAPPE, stratified by autosomal and X chromosome estimates, in each of the population samples.

Assortative mating for ancestry in the HapMap MXL

Sex-specific patterns of nonrandom mating at the time of or since admixture can result in ancestry differences between the autosomes and the X chromosome in an admixed population. Motivated by the CAnD results where significant heterogeneity between the autosomes and the X chromosome were detected in the MXL, we investigated evidence of assortative mating between pairs of individuals who are reported to have least one offspring. There are 24 such mate pairs; however, we excluded 3 mate pairs due to cryptic relatedness (as previously discussed), resulting in a subset of 21 independent MXL mate pairs included in the assortative mating analysis.

We used an empirical distribution to assess whether the observed correlations of ancestry on the autosomes and the X chromosome between mate pairs are significantly different from what would be expected under the null hypothesis of random mating. In particular, we randomly permuted the MXL mate pairs 5000 times, and for each of the 5000 permutations, we calculated the correlations in ancestry between the random mate pairs for each of the three continental ancestries (European, Native American, and African). The correlations in ancestry between mate pairs for the autosomes and the X chromosome were then used to construct empirical distributions under the null hypothesis of random mating in the MXL. The empirical distributions of ancestry correlations among mate pairs are centered ∼0 under random mating, with a standard deviation ∼0.2 for each of the three ancestries (Figure S5).

We first tested the null hypothesis vs. an alternative hypothesis of assortative mating for ancestry, using the observed correlations among mate pairs and the empirical null distributions. Table 4 shows the P-values for the autosomal and X chromosome correlations of African, European, and Native American ancestry proportions calculated from the 21 MXL mate pairs. There is significant evidence of assortative mating for European and Native American ancestries on the autosomes in the HapMap MXL, with corresponding P-values of 0.015 and 0.017, respectively. There is also significant evidence for assortative mating based on European and Native American ancestry on the X chromosome, with P-values of 0.011 and 0.007, respectively. The P-values remain significant, even after Bonferroni correction for testing three ancestries. There is no significant evidence of assortative mating for African ancestry for both the autosomes and the X chromosomes (P = 0.26 and 0.14, respectively). A two-sided test of the null hypothesis of random mating vs. an alternative hypothesis of nonrandom, e.g., assortative or disassortative mating, can also be conducted. The P-values for this test are given in Table 4 and are roughly twice the assortative mating P-values. We also performed permutation tests to assess evidence of assortative and nonrandom mating for 11 HapMap ASW mate pairs with a documented offspring. No significant evidence of assortative mating in the ASW was detected, and ASW P-values for the three continental ancestries are given in Table 4.

Ancestry correlation among mate pairs

Table 4

Ancestry correlation among mate pairs

	HapMap ASW			HapMap MXL
Chromosome Type	African	European	Native American	African	European	Native American
Autosomal
Assortative mating	0.365	0.388	0.234	0.139	0.015	0.017
Nonrandom mating	0.871	0.888	0.532	0.268	0.028	0.032
X chromosome
Assortative mating	0.842	0.788	0.564	0.256	0.011	0.007
Nonrandom mating	1.000	1.000	1.000	0.530	0.024	0.013

	HapMap ASW			HapMap MXL
Chromosome Type	African	European	Native American	African	European	Native American
Autosomal
Assortative mating	0.365	0.388	0.234	0.139	0.015	0.017
Nonrandom mating	0.871	0.888	0.532	0.268	0.028	0.032
X chromosome
Assortative mating	0.842	0.788	0.564	0.256	0.011	0.007
Nonrandom mating	1.000	1.000	1.000	0.530	0.024	0.013

Shown are P-values detecting assortative or disassortative mating for ancestry among 11 HapMap ASW and 21 HapMap MXL mate pairs, calculated on the autosomes and the X chromosome separately. The P-values are calculated from the empirical distribution created from sampling 5000 mate pairs at random. Results presented under “assortative mating” tested the hypothesis of no assortative mating, while “nonrandom mating” tested the hypothesis of neither assortative nor disassortative mating.

Table 4

Ancestry correlation among mate pairs

	HapMap ASW			HapMap MXL
Chromosome Type	African	European	Native American	African	European	Native American
Autosomal
Assortative mating	0.365	0.388	0.234	0.139	0.015	0.017
Nonrandom mating	0.871	0.888	0.532	0.268	0.028	0.032
X chromosome
Assortative mating	0.842	0.788	0.564	0.256	0.011	0.007
Nonrandom mating	1.000	1.000	1.000	0.530	0.024	0.013

	HapMap ASW			HapMap MXL
Chromosome Type	African	European	Native American	African	European	Native American
Autosomal
Assortative mating	0.365	0.388	0.234	0.139	0.015	0.017
Nonrandom mating	0.871	0.888	0.532	0.268	0.028	0.032
X chromosome
Assortative mating	0.842	0.788	0.564	0.256	0.011	0.007
Nonrandom mating	1.000	1.000	1.000	0.530	0.024	0.013

Shown are P-values detecting assortative or disassortative mating for ancestry among 11 HapMap ASW and 21 HapMap MXL mate pairs, calculated on the autosomes and the X chromosome separately. The P-values are calculated from the empirical distribution created from sampling 5000 mate pairs at random. Results presented under “assortative mating” tested the hypothesis of no assortative mating, while “nonrandom mating” tested the hypothesis of neither assortative nor disassortative mating.

Ancestry equilibrium on the X chromosome under random mating after an initial admixture event

We also investigated the number of generations required for males and females to reach ancestry equilibrium on the X chromosome in a randomly mating population. We considered the setting where there is admixing between two ancestral populations and where mate pairs at the initial admixture event consist of males with ancestry entirely from one population and females with ancestry derived from the other population. We computed proportional ancestry for each generation, assuming random mating after an initial admixing event between founder females and males under the extreme discordant ancestry setting between the two sexes at the time of admixture. Figure 6 shows the proportion of ancestry by generation in the admixed population for males and females. We find that an equilibrium of one-half is reached for autosomal ancestry in males and females in the first generation. Proportional ancestry on the X chromosome for both males and females tends to two-thirds and one-third of the founder female and male ancestries, respectively, where this equilibrium is achieved around eight generations after the initial admixing event. This equilibrium result is not surprising since females contribute two-thirds of the X chromosomes in a population. Recent work (Goldberg and Rosenberg 2015) identified a similar result (although the initial ancestry proportions at the time of admixture were not as extreme as what we consider here) and showed that the two-thirds and one-third ancestry equilibrium on the X does not hold if admixing is ongoing. Nevertheless, whether there is a single admixture event or ongoing admixture, the X chromosome and the autosomal chromosomes are not expected to have the same ancestry distribution at equilibrium in a randomly mating admixed population when the ancestry distribution for founder males is different from that for founder females at the time of the admixture event(s).

Figure 6

Ancestry proportions by generation under random mating. Shown is the proportion of ancestry for the autosomes and the X chromosome by sex, assuming females and males have opposite ancestries at the initial admixture event. After the initial admixture event, random mating is assumed. The gray line shows the equilibrium proportions on the X chromosome.

Discussion

Systematic ancestry differences at genomic loci may arise in recently admixed populations as a result of selection and ancestry-related assortative mating. Here, we developed the CAnD method for detecting heterogeneity in population structure across the genome in populations with admixed ancestry. CAnD uses inferred ancestry from genotyping data to identify chromosomes harboring genomic loci that have significantly different contributions from the underlying ancestral populations from what is expected based on genome-wide ancestry. The CAnD method takes into account correlated ancestries among chromosomes within individuals for both valid testing and improved power for detecting heterogeneity in population structure across the genome. Additional features of the CAnD method include (1) allowing for genetic data from the X chromosome to be included in a heterogeneity analysis and (2) flexibility of the method that allows for heterogeneity testing between subsets of chromosomes in the genome, such as the X chromosome vs. the pooled autosomes.

We performed simulation studies with admixture, using real genotype data from HapMap. We demonstrated that CAnD is properly calibrated with appropriate type I error under different significance levels. We also showed that the CAnD test has higher power to detect heterogeneity in ancestry genome-wide chromosomes than an ANOVA test that does not account for correlations in ancestry among chromosomes.

We applied the CAnD method to the HapMap MXL population sample where significant heterogeneity in European ancestry and Native American ancestry was detected across the genome (autosomal chromosomes and the X chromosome), with P-values of 4e-05 and 1e-05, respectively. A secondary analysis showed that the heterogeneity in ancestry across the MXL genomes detected by CAnD was largely due to elevated Native American ancestry and a deficit of European ancestry on the X chromosomes. These results are consistent with previous reports for U.S. Hispanic/Latinos (Bryc et al. 2015) and Latin Americans (Bryc et al. 2010b), where it has been suggested that the X vs. autosomal ancestry differences are likely due to sex-specific patterns of gene flow in which European male colonists contributed substantially more genetic material than European females at the time of admixture. There was no significant evidence of genetic heterogeneity with CAnD among HapMap ASW chromosomes and no significant differences in ancestry between the pooled autosomes and the X chromosome were detected. The autosomal chromosomes and the X chromosome in the ASW are largely African derived, and a much larger sample is required to have adequate power for detecting chromosomal ancestry differences in this population.

The CAnD method can incorporate estimates of local ancestry at specific locations across the genomes using software such as RFMix, as well as chromosome-wide ancestry estimates using global ancestry estimation software such as FRAPPE or ADMIXTURE. We compared the CAnD results for the HapMap MXL when using local ancestry estimates from RFMix, which requires phased genotype data, to the results when using chromosomal ancestry estimates with FRAPPE where unphased genotype data were used. Significant evidence of ancestry heterogeneity was detected with CAnD when using either local ancestry estimates from RFMix or chromosome-wide ancestry estimates from FRAPPE.

We also investigated the number of generations required for ancestry on the X chromosome to reach equilibrium in males and females after a single admixing event with two populations. In the most extreme setting where all males are from one population and all females are from the other population at the time of admixture, approximately 8 generations are required under random mating between males and females to reach ancestry equilibrium on the X. Estimates of the number of generations since admixture in the Mexican population (Johnson et al. 2011) range from 10 to 15, so it is reasonable to assume that equilibrium on the X chromosome for males and females should have been reached in the Mexican population if mating in this population is completely at random. Previous studies (Risch et al. 2009; Sebro et al. 2010), however, have shown evidence of nonrandom mating in Mexican populations. In the HapMap MXL, we detected significant evidence of assortative mating among mate pairs that produced an offspring, where the correlation of European and Native American ancestries on both the autosomes and the X chromosome is significantly higher for mate pairs than what would be expected under the null hypothesis of a random mating population. Evaluating differences in ancestry on the X chromosome between males and females may potentially be a useful tool for the detection of nonrandom mating in recently admixed populations, since under the most extreme setting of discordant ancestry between males and females at the time of admixture we find that that there should be no difference in ancestry on the X chromosome between males and females after 8 generations of random mating.

In this article, we proposed CAnD to identify heterogeneity in genome-wide ancestry. Secondary analyses can also be conducted with CAnD to identify specific chromosomes that have ancestry distributions that are significantly different from those of all other chromosomes. If local ancestry estimates are available, CAnD can also potentially be used as a fine-mapping tool for identifying chromosomal regions that may be under selection. For example, using a sliding-window approach, the CAnD test could be used to test regions on a chromosome that have systematic ancestry differences compared to the rest of the genome. This is future work to be considered.

Appendix A

Derivation of the Covariance Matrix for the CAnD Multivariate Statistic

Consider a set $G$ with m chromosomes and let $T_{k} = {(T_{k}^{1}, T_{k}^{2}, \dots, T_{k}^{m})}^{T}$ be the previously defined multivariate vector of length m for the CAnD test for a sample with n independent individuals, where $T_{k}^{c} = (1 / n) \sum_{i = 1}^{n} D_{i k}^{c} .$ Below we derive an estimate $\sum^{^}$ for $\sum,$ the covariance matrix of $T_{k}$ for testing the null hypothesis of heterogeneity in ancestry among m chromosomes in $G .$

Recall that we denote

a_{i k}^{c}

to be the ancestry proportion for subpopulation k on chromosome c for individual i. To estimate

∑_{c, c^{'}},

the covariance of

T_{k}^{c}

and

T_{k}^{c^{'}}

for

c \neq c^{'},

we must consider ancestry correlations across pairs of chromosomes within individuals. For a random individual i sampled from the population, we denote

w_{k, c c^{'}} = cov (a_{i k}^{c}, a_{i k}^{c^{'}})

to be the covariance in ancestry for a given subpopulation k between chromosomes c and

c^{'}

under the null hypothesis. Our estimator of the covariance of

T_{k}^{c}

and

T_{k}^{c^{'}},

which is the corresponding element in

∑

for chromosomes c and

c^{'},

is

\begin{matrix} ∑_{c, c^{'}}^{k} = cov (T_{k}^{c}, T_{k}^{c^{'}}) \\ = \frac{1}{n} (\frac{1}{{(m - 1)}^{2}} \sum_{M^{'} \in G_{- c^{'}}} \sum_{M \in G_{- c}} w_{k, M M^{'}} \\ - \frac{1}{m - 1} \sum_{M \in G_{- c}} w_{k, M c^{'}} \\ - \frac{1}{m - 1} \sum_{M^{'} \in G_{- c^{'}}} w_{k, M^{'} c} + w_{k, c c^{'}}), \end{matrix}

(A1)

where

G_{- c}

is the subset of all chromosomes in

G

except for c.

In practice, we must estimate

w_{k, c c^{'}} .

For a given subpopulation k and chromosome c, denote the average ancestry proportion across all individuals i to be

\bar{a_{k}^{c}} = (1 / n) \sum_{i = 1}^{n} a_{i k}^{c} .

The estimator that we propose for the covariance of subpopulation k ancestry proportions between chromosomes c and

c^{'}

is

{\hat{w}}_{k, c c^{'}} = \frac{1}{n - 1} \sum_{i = 1}^{n} (a_{i k}^{c} - \bar{a_{k}^{c}}) (a_{i k}^{c^{'}} - \bar{a_{k}^{c^{'}}}) .

(A2)

Then our estimator of

∑_{c, c^{'}}^{k}

is Equation 4 evaluated with the estimator

{\hat{w}}_{k, c c^{'}}

of Equation 5. To estimate the variance of

T_{k}^{c}

for chromosome

c \in G

under the null hypothesis,

∑_{c, c}^{k},

a similar estimator to Equation 4 can be used. However, we find that an estimator based on the sample variance of the

D_{i k}^{c}

values works well in practice, and therefore we propose using

{\sum^{^}}_{c, c}^{k} = \frac{1}{n (n - 1)} \sum_{i = 1}^{n} {(D_{i k}^{c} - T_{k}^{c})}^{2},

(A3)

where

T_{k}^{c}

is the average of

D_{i k}^{c}

across all sampled individuals.

Appendix B

CAnD Multivariate Statistic in Matrix Form

The multivariate statistic $T_{k} = {(T_{k}^{1}, T_{k}^{2}, \dots, T_{k}^{m})}^{T}$ can be written as $T_{k} = (1 / n) \sum_{i = 1}^{n} W A_{i k},$ where $A_{i k} = {(a_{i k}^{1}, \dots, a_{i k}^{m})}^{T}$ is a length m vector of subpopulation k proportional ancestries for each of individual i’s chromosomes in $G,$ and $W$ is an $m \times m$ matrix with diagonal elements equal to 1 and off-diagonal elements equal to $- 1 / (m - 1) .$ The rank of $W$ is $m - 1,$ since each row of the $m \times m$ matrix $W$ can be written as a linear combination of the other $m - 1$ rows. From this result, it follows that the corresponding CAnD statistic $C A_{k}$ given in Equation 2 follows a $χ^{2}$ distribution with $m - 1$ d.f.

Acknowledgments

The authors thank the two anonymous reviewers for helpful comments that improved the manuscript. This work was supported by National Institutes of Health grants K01 CA148958 (to T.A.T.) and P01 HG0099568 (to C.M. and T.A.T.) and Hispanic Community Health Study/Study of Latinos Genetic Analysis Center grant HHSN268201300005C (to C.M. and T.A.T.).

Footnotes

Communicating editor: E. Eskin

Supplemental material is available online at www.genetics.org/lookup/suppl/doi:10.1534/genetics.115.184184/-/DC1.

Literature Cited

Alexander

D

,

Novembre

J

,

Lange

K

,

2009

Fast model-based estimation of ancestry in unrelated individuals.

Genome Res.

19

:

1655

–

1664

.

Altshuler

D M

,

Gibbs

R A

,

Peltonen

L

,

Dermitzakis

E

,

Schaffner

S F

et al. ,

2010

Integrating common and rare genetic variation in diverse human populations.

Nature

467

:

52

–

58

.

PubMed

OpenURL Placeholder Text

Bhatia

G

,

Tandon

A

,

Patterson

N

,

Aldrich

M

,

Ambrosone

C B

et al. ,

2014

Genome-wide scan of 29,141 African Americans finds no evidence of directional selection since admixture.

Am. J. Hum. Genet.

95

:

437

–

444

.

Browning

S R

,

Browning

B L

,

2007

Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Am. J. Hum. Genet.

81

:

1084

–

1097

.

Bryc

K

,

Auton

A

,

Nelson

M R

,

Oksenberg

J R

,

Hauser

S L

et al. ,

2010

a

Genome-wide patterns of population structure and admixture in West Africans and African Americans.

Proc. Natl. Acad. Sci. USA

107

:

786

–

791

.

Crossref

Bryc

K

,

Velez

C

,

Karafet

T

,

Moreno-Estrada

A

,

Reynolds

A

et al. ,

2010

b

Genome-wide patterns of population structure and admixture among Hispanic/Latino populations.

Proc. Natl. Acad. Sci. USA

107

:

8954

–

8961

.

Crossref

Bryc

K

,

Durand

E Y

,

Macpherson

J M

,

Reich

D

,

Mountain

J L

,

2015

The genetic ancestry of African Americans, Latinos, and European Americans across the United States.

Am. J. Hum. Genet.

96

:

37

–

53

.

Conomos

M P

,

Laurie

C A

,

Stilp

A M

,

Gogarten

S M

,

McHugh

C P

et al. ,

2016

Genetic diversity and association studies in US Hispanic/Latino populations: applications in the Hispanic community health study/study of Latinos.

Am. J. Hum. Genet.

98

:

165

–

184

.

Goldberg

A

,

Rosenberg

N A

,

2015

Beyond 2/3 and 1/3: the complex signatures of sex-biased admixture on the X chromosome.

Genetics

201

:

263

–

279

.

Jin

W

,

Xu

S

,

Wang

H

,

Yu

Y

,

Shen

Y

et al. ,

2012

Genome-wide detection of natural selection in African Americans pre- and post-admixture.

Genome Res.

22

:

519

–

527

.

Johnson

N A

,

Coram

M A

,

Shriver

M D

,

Romieu

I

,

Barsh

G S

et al. ,

2011

Ancestral components of admixed genomes in a Mexican cohort.

PLoS Genet.

7

:

e1002410

.

Li

J Z

,

Absher

D M

,

Tang

H

,

Southwick

A M

,

Casto

A M

et al. ,

2008

Worldwide human relationships inferred from genome-wide patterns of variation.

Science

319

:

1100

–

1104

.

Manichaikul

A

,

Palmas

W

,

Rodriguez

C J

,

Peralta

C A

,

Divers

J

et al. ,

2012

Population structure of Hispanics in the United States: the multi-ethnic study of atherosclerosis.

PLoS Genet.

8

:

e1002640

.

Maples

B K

,

Gravel

S

,

Kenny

E E

,

Bustamante

C D

,

2013

RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference.

Am. J. Hum. Genet.

93

:

278

–

288

.

Nelis

M

,

Esko

T O

,

Mägi

R

,

Zimprich

F

,

Toncheva

D

et al. ,

2009

Genetic structure of Europeans: a view from the North-East.

PLoS One

4

:

e5472

.

Novembre

J

,

Johnson

T

,

Bryc

K

,

Kutalik

Z

,

Boyko

A

et al. ,

2008

Genes mirror geography within Europe.

Nature

456

:

98

–

101

.

Price

A L

,

Tandon

A

,

Patterson

N

,

Barnes

K C

,

Rafaels

N

et al. ,

2009

Sensitive detection of chromosomal segments of distinct ancestry in admixed populations.

PLoS Genet.

5

:

e1000519

.

Risch

N

,

Choudhry

S

,

Via

M

,

Basu

A

,

Sebro

R

et al. ,

2009

Ancestry-related assortative mating in Latino populations.

Genome Biol.

10

:

R132

.

Sebro

R

,

Hoffman

T J

,

Lange

C

,

Rogus

J J

,

Risch

N J

,

2010

Testing for non-random mating: evidence for ancestry-related assortative mating in the Framingham Heart Study.

Genet. Epidemiol.

34

:

674

–

679

.

Tang

H

,

Choudhry

S

,

Mei

R

,

Morgan

M

,

Rodriguez-Cintron

W

et al. ,

2007

Recent genetic selection in the ancestral admixture of Puerto Ricans.

Am. J. Hum. Genet.

81

:

626

–

633

.

Tang

H

,

Peng

J

,

Wang

P

,

Risch

N J

,

2005

Estimation of individual admixture: analytical and study design considerations.

Genet. Epidemiol.

28

:

289

–

301

.

Thornton

T

,

Tang

H

,

Hoffmann

T J

,

Ochs-Balcom

H M

,

Caan

B J

et al. ,

2012

Estimating kinship in admixed populations.

Am. J. Hum. Genet.

91

:

122

–

138

.

Wei

L J

,

Johnson

W E

,

1985

Combining dependent tests with incomplete repeated measurements.

Biometrika

72

:

359

–

364

.

Crossref