Genetic Architecture of Domestication-Related Traits in Maize Free

Summary statistics and heritability estimates (⁠

{\hat{h}}^{2}

⁠) for three domestication-related traits: shank length (SL), cob length (CL), and kernel row number (KRN) in the maize NCRPIS diversity and NAM panels

Statistic	NCRPIS panel			NAM
Statistic	SL^a (mm)	CL (mm)	KRN	SL^b (mm)	CL (mm)	KRN
N^c	5,002	3,381	4,776	6,903	32,031	6,266
Ng^d	2,387	2,287	2,339	2,875	4,359	2,724
Mean	95	141	14	87	129	14
Min	10	10	6	10	79	4
Max	800	270	28	354	180	24
${\hat{h}}^{2}$	0.53	0.40	0.46	0.42	0.70	0.38
SE(⁠ ${\hat{h}}^{2}$ ⁠)^e	0.049	0.045	0.049	0.031	0.021	0.031

Statistic	NCRPIS panel			NAM
Statistic	SL^a (mm)	CL (mm)	KRN	SL^b (mm)	CL (mm)	KRN
N^c	5,002	3,381	4,776	6,903	32,031	6,266
Ng^d	2,387	2,287	2,339	2,875	4,359	2,724
Mean	95	141	14	87	129	14
Min	10	10	6	10	79	4
Max	800	270	28	354	180	24
${\hat{h}}^{2}$	0.53	0.40	0.46	0.42	0.70	0.38
SE(⁠ ${\hat{h}}^{2}$ ⁠)^e	0.049	0.045	0.049	0.031	0.021	0.031

a

${\hat{h}}^{2},$ SE(⁠ ${\hat{h}}^{2}$ ⁠) of SL estimated using log-transformed data.

b

${\hat{h}}^{2},$ SE(⁠ ${\hat{h}}^{2}$ ⁠) of SL estimated using square-root-transformed data.

c

Total number of plots measured.

d

Total number of inbred lines measured.

e

Approximate standard error of heritability estimate.

Table 1

Summary statistics and heritability estimates (⁠

{\hat{h}}^{2}

⁠) for three domestication-related traits: shank length (SL), cob length (CL), and kernel row number (KRN) in the maize NCRPIS diversity and NAM panels

Statistic	NCRPIS panel			NAM
Statistic	SL^a (mm)	CL (mm)	KRN	SL^b (mm)	CL (mm)	KRN
N^c	5,002	3,381	4,776	6,903	32,031	6,266
Ng^d	2,387	2,287	2,339	2,875	4,359	2,724
Mean	95	141	14	87	129	14
Min	10	10	6	10	79	4
Max	800	270	28	354	180	24
${\hat{h}}^{2}$	0.53	0.40	0.46	0.42	0.70	0.38
SE(⁠ ${\hat{h}}^{2}$ ⁠)^e	0.049	0.045	0.049	0.031	0.021	0.031

Statistic	NCRPIS panel			NAM
Statistic	SL^a (mm)	CL (mm)	KRN	SL^b (mm)	CL (mm)	KRN
N^c	5,002	3,381	4,776	6,903	32,031	6,266
Ng^d	2,387	2,287	2,339	2,875	4,359	2,724
Mean	95	141	14	87	129	14
Min	10	10	6	10	79	4
Max	800	270	28	354	180	24
${\hat{h}}^{2}$	0.53	0.40	0.46	0.42	0.70	0.38
SE(⁠ ${\hat{h}}^{2}$ ⁠)^e	0.049	0.045	0.049	0.031	0.021	0.031

a

${\hat{h}}^{2},$ SE(⁠ ${\hat{h}}^{2}$ ⁠) of SL estimated using log-transformed data.

b

${\hat{h}}^{2},$ SE(⁠ ${\hat{h}}^{2}$ ⁠) of SL estimated using square-root-transformed data.

c

Total number of plots measured.

d

Total number of inbred lines measured.

e

Approximate standard error of heritability estimate.

QTL and association mapping

In the NAM population, we identified 10 QTL for shank length (each associated with 1.8–2.8% of the trait variation), 8 QTL for kernel row number (associated with 1.8–7.3% variation), and 20 QTL for cob length (associated with 0.8–2.9% variation; Table S1). No QTL were detected for masculinized ear tips; power of QTL detection for this trait was limited because its segregation was restricted to one biparental family. QTL analysis within this one family did not identify any QTL passing a genome-wide permutation-based threshold of α = 0.05. Comparisons between the positions of domestication-related trait QTL mapped in NAM and previously identified domestication QTL mapped in crosses between maize and teosinte revealed little correspondence between the two sets of QTL (Figure S1, Figure S2, and Figure S3). Genome-wide association scans conducted in the NCRPIS diversity panel identified 0, 5, and 10 SNPs associated with cob length, shank length, and kernel row number, respectively, at FDR < 0.05 (Table S2). In general, there was limited overlap between known domestication QTL and SNPs associated with domestication-related traits in either panel (Figure S1, Figure S2, and Figure S3); however, a few notable correspondences were observed. A SNP 270,000 bp upstream of fea2 was strongly associated with the kernel row number trait; however, the one SNP inside of the fea2 coding region was not significant. Several associations identified for shank length (SL) in NAM are in the vicinity of tb1, but ∼2 million bp downstream of the gene (Table S3). By contrast, the known upstream enhancer of tb1 is ∼59–69 kbp from the coding start site (Clark et al. 2006; Studer et al. 2011).

Testing concordance between loci associated with domestication traits within maize and loci that distinguish maize from teosinte

For each set of trait QTL and SNP associations, we compared the mean r² of associations inside vs. outside genomic regions previously identified as related to domestication. Domestication QTL were mapped in a maize-by-teosinte cross population by Briggs et al. (2007) (Table S4) and domestication selection sweep regions were identified from population genetics analyses (Hufford et al. 2012). In addition, we compared mean r² of associations for SNPs inside or outside regions defined as postdomestication “improvement” selection sweeps from population genetics analyses (Hufford et al. 2012). To remove the potentially confounding effect of variability in gene density among regions tested, we tested within regions defined using both annotation for involvement in domestication or improvement and annotation for coding or noncoding regions.

For the NCRPIS diversity panel, mean marker r² values were ∼0.0009, and the largest difference between groups was only 0.000032 (Table 2). This maximum difference was observed between coding variants inside and outside of domestication QTL for kernel row number (KRN), and the SNPs outside of the domestication QTL were associated with more variation (Table 2). In fact, the only significant differences in mean marker r² for SNPs classified according to domestication QTL were observed when SNPs outside of QTL were associated with greater mean r² values than SNPs within the QTL (Table 2). Further, there was no consistent evidence that SNPs inside domestication or improvement sweeps were associated with more variation than SNPs outside of these regions, although noncoding SNPs within sweep regions had significantly higher mean r² values for shank length than noncoding SNPs outside those regions (Table 2).

Mean SNP association r² and number of markers (N_m) inside and outside hypothesis-defined testing regions in the NCRPIS panel

Table 2

Mean SNP association r² and number of markers (N_m) inside and outside hypothesis-defined testing regions in the NCRPIS panel

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m:	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³
Domestication QTL
Hypothesis	Coding	9.27	21.5	9.18	27.2	9.20	17.8
Background	Coding	9.19	226.4	9.40	220.7	9.52	230.2
Difference	Coding	0.08		−0.22**		−0.32**
Hypothesis	Noncoding	9.09	15.4	9.16	19.6	9.43	13.6
Background	Noncoding	9.29	141.9	9.38	137.7	9.42	143.7
Difference	Noncoding	−0.20**		−0.22		0.01
Domestication sweep
Hypothesis	Coding	9.30	15.0	9.39	15.0	9.45	15.0
Background	Coding	9.20	232.9	9.37	232.9	9.50	232.9
Difference	Coding	0.10		0.02		−0.05
Hypothesis	Noncoding	9.52	10.4	9.31	10.4	9.52	10.4
Background	Noncoding	9.25	146.9	9.35	146.9	9.41	146.9
Difference	Noncoding	0.27**		−0.04**		0.11
Improvement sweep
Hypothesis	Coding	9.22	11.9	9.50	11.9	9.60	11.9
Background	Coding	9.20	236.0	9.37	236.0	9.49	236.0
Difference	Coding	0.02		0.13		0.11
Hypothesis	Noncoding	9.56	9.0	9.53	9.0	9.20	9.0
Background	Noncoding	9.25	148.4	9.34	148.4	9.43	148.4
Difference	Noncoding	0.31**		0.19		−0.23*

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m:	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³
Domestication QTL
Hypothesis	Coding	9.27	21.5	9.18	27.2	9.20	17.8
Background	Coding	9.19	226.4	9.40	220.7	9.52	230.2
Difference	Coding	0.08		−0.22**		−0.32**
Hypothesis	Noncoding	9.09	15.4	9.16	19.6	9.43	13.6
Background	Noncoding	9.29	141.9	9.38	137.7	9.42	143.7
Difference	Noncoding	−0.20**		−0.22		0.01
Domestication sweep
Hypothesis	Coding	9.30	15.0	9.39	15.0	9.45	15.0
Background	Coding	9.20	232.9	9.37	232.9	9.50	232.9
Difference	Coding	0.10		0.02		−0.05
Hypothesis	Noncoding	9.52	10.4	9.31	10.4	9.52	10.4
Background	Noncoding	9.25	146.9	9.35	146.9	9.41	146.9
Difference	Noncoding	0.27**		−0.04**		0.11
Improvement sweep
Hypothesis	Coding	9.22	11.9	9.50	11.9	9.60	11.9
Background	Coding	9.20	236.0	9.37	236.0	9.49	236.0
Difference	Coding	0.02		0.13		0.11
Hypothesis	Noncoding	9.56	9.0	9.53	9.0	9.20	9.0
Background	Noncoding	9.25	148.4	9.34	148.4	9.43	148.4
Difference	Noncoding	0.31**		0.19		−0.23*

Significantly different at * P = 0.05 and ** P = 0.01, respectively.

Table 2

Mean SNP association r² and number of markers (N_m) inside and outside hypothesis-defined testing regions in the NCRPIS panel

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m:	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³
Domestication QTL
Hypothesis	Coding	9.27	21.5	9.18	27.2	9.20	17.8
Background	Coding	9.19	226.4	9.40	220.7	9.52	230.2
Difference	Coding	0.08		−0.22**		−0.32**
Hypothesis	Noncoding	9.09	15.4	9.16	19.6	9.43	13.6
Background	Noncoding	9.29	141.9	9.38	137.7	9.42	143.7
Difference	Noncoding	−0.20**		−0.22		0.01
Domestication sweep
Hypothesis	Coding	9.30	15.0	9.39	15.0	9.45	15.0
Background	Coding	9.20	232.9	9.37	232.9	9.50	232.9
Difference	Coding	0.10		0.02		−0.05
Hypothesis	Noncoding	9.52	10.4	9.31	10.4	9.52	10.4
Background	Noncoding	9.25	146.9	9.35	146.9	9.41	146.9
Difference	Noncoding	0.27**		−0.04**		0.11
Improvement sweep
Hypothesis	Coding	9.22	11.9	9.50	11.9	9.60	11.9
Background	Coding	9.20	236.0	9.37	236.0	9.49	236.0
Difference	Coding	0.02		0.13		0.11
Hypothesis	Noncoding	9.56	9.0	9.53	9.0	9.20	9.0
Background	Noncoding	9.25	148.4	9.34	148.4	9.43	148.4
Difference	Noncoding	0.31**		0.19		−0.23*

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m:	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³	×10⁻⁴	N × 10³
Domestication QTL
Hypothesis	Coding	9.27	21.5	9.18	27.2	9.20	17.8
Background	Coding	9.19	226.4	9.40	220.7	9.52	230.2
Difference	Coding	0.08		−0.22**		−0.32**
Hypothesis	Noncoding	9.09	15.4	9.16	19.6	9.43	13.6
Background	Noncoding	9.29	141.9	9.38	137.7	9.42	143.7
Difference	Noncoding	−0.20**		−0.22		0.01
Domestication sweep
Hypothesis	Coding	9.30	15.0	9.39	15.0	9.45	15.0
Background	Coding	9.20	232.9	9.37	232.9	9.50	232.9
Difference	Coding	0.10		0.02		−0.05
Hypothesis	Noncoding	9.52	10.4	9.31	10.4	9.52	10.4
Background	Noncoding	9.25	146.9	9.35	146.9	9.41	146.9
Difference	Noncoding	0.27**		−0.04**		0.11
Improvement sweep
Hypothesis	Coding	9.22	11.9	9.50	11.9	9.60	11.9
Background	Coding	9.20	236.0	9.37	236.0	9.49	236.0
Difference	Coding	0.02		0.13		0.11
Hypothesis	Noncoding	9.56	9.0	9.53	9.0	9.20	9.0
Background	Noncoding	9.25	148.4	9.34	148.4	9.43	148.4
Difference	Noncoding	0.31**		0.19		−0.23*

Significantly different at * P = 0.05 and ** P = 0.01, respectively.

For the NAM population, the mean SNP r² values were significantly different for each comparison of hypothesis region and grouping based on coding regions (Table 3). The largest differences between categories were observed between SNPs inside and outside of domestication QTL for KRN. Domestication QTL SNPs were associated with more variation only for KRN, whereas domestication QTL SNPs had smaller mean r² values for SL and cob length (CL) (Table 3). SNP variances were larger inside than outside of hypothesis regions most consistently for domestication sweep regions, but even within this group, SNPs in noncoding domestication sweep regions had lower mean r² values associated with CL than SNPs in noncoding regions outside of domestication sweep regions (Table 3).

Mean SNP association r² and number of markers (N_m) inside and outside hypothesis-defined testing regions in the NAM panel

Table 3

Mean SNP association r² and number of markers (N_m) inside and outside hypothesis-defined testing regions in the NAM panel

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵
Domestication QTL
Hypothesis	Coding	9.97	2.1	17.2	2.9	29.7	2.1
Background	Coding	14.3	20.6	20.8	20.3	20.0	20.6
Difference	Coding	−4.3**		−3.6**		9.7**
Hypothesis	Noncoding	10.1	32.9	18.5	42.5	25.6	28.4
Background	Noncoding	14.9	201.6	25.8	196.9	22.5	206.0
Difference	Noncoding	−4.8**		−7.3*		3.1**
Domestication sweep
Hypothesis	Coding	15.3	1.2	20.7	1.2	29.6	1.2
Background	Coding	13.8	21.5	20.4	21.5	20.0	21.5
Difference	Coding	1.5**		0.3**		9.6**

Hypothesis	Noncoding	16.1	15.5	21.1	15.5	31.4	15.5
Background	Noncoding	14.2	219.1	25.0	219.1	22.2	219.1
Difference	Noncoding	1.9**		−3.9**		9.2**
Improvement sweep
Hypothesis	Coding	12.8	1.4	25.2	1.4	17.8	1.4
Background	Coding	14.0	21.6	20.2	21.6	20.8	21.6
Difference	Coding	−1.2**		5**		−3**
Hypothesis	Noncoding	13.9	10.7	31.3	10.7	21.2	10.7
Background	Noncoding	14.3	223.7	24.4	223.7	23.0	223.7
Difference	Noncoding	−0.4**		6.9**		−1.8**

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵
Domestication QTL
Hypothesis	Coding	9.97	2.1	17.2	2.9	29.7	2.1
Background	Coding	14.3	20.6	20.8	20.3	20.0	20.6
Difference	Coding	−4.3**		−3.6**		9.7**
Hypothesis	Noncoding	10.1	32.9	18.5	42.5	25.6	28.4
Background	Noncoding	14.9	201.6	25.8	196.9	22.5	206.0
Difference	Noncoding	−4.8**		−7.3*		3.1**
Domestication sweep
Hypothesis	Coding	15.3	1.2	20.7	1.2	29.6	1.2
Background	Coding	13.8	21.5	20.4	21.5	20.0	21.5
Difference	Coding	1.5**		0.3**		9.6**

Hypothesis	Noncoding	16.1	15.5	21.1	15.5	31.4	15.5
Background	Noncoding	14.2	219.1	25.0	219.1	22.2	219.1
Difference	Noncoding	1.9**		−3.9**		9.2**
Improvement sweep
Hypothesis	Coding	12.8	1.4	25.2	1.4	17.8	1.4
Background	Coding	14.0	21.6	20.2	21.6	20.8	21.6
Difference	Coding	−1.2**		5**		−3**
Hypothesis	Noncoding	13.9	10.7	31.3	10.7	21.2	10.7
Background	Noncoding	14.3	223.7	24.4	223.7	23.0	223.7
Difference	Noncoding	−0.4**		6.9**		−1.8**

Significantly different at * P = 0.05 and ** P = 0.01, respectively.

Table 3

Mean SNP association r² and number of markers (N_m) inside and outside hypothesis-defined testing regions in the NAM panel

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵
Domestication QTL
Hypothesis	Coding	9.97	2.1	17.2	2.9	29.7	2.1
Background	Coding	14.3	20.6	20.8	20.3	20.0	20.6
Difference	Coding	−4.3**		−3.6**		9.7**
Hypothesis	Noncoding	10.1	32.9	18.5	42.5	25.6	28.4
Background	Noncoding	14.9	201.6	25.8	196.9	22.5	206.0
Difference	Noncoding	−4.8**		−7.3*		3.1**
Domestication sweep
Hypothesis	Coding	15.3	1.2	20.7	1.2	29.6	1.2
Background	Coding	13.8	21.5	20.4	21.5	20.0	21.5
Difference	Coding	1.5**		0.3**		9.6**

Hypothesis	Noncoding	16.1	15.5	21.1	15.5	31.4	15.5
Background	Noncoding	14.2	219.1	25.0	219.1	22.2	219.1
Difference	Noncoding	1.9**		−3.9**		9.2**
Improvement sweep
Hypothesis	Coding	12.8	1.4	25.2	1.4	17.8	1.4
Background	Coding	14.0	21.6	20.2	21.6	20.8	21.6
Difference	Coding	−1.2**		5**		−3**
Hypothesis	Noncoding	13.9	10.7	31.3	10.7	21.2	10.7
Background	Noncoding	14.3	223.7	24.4	223.7	23.0	223.7
Difference	Noncoding	−0.4**		6.9**		−1.8**

		Shank length		Cob length		Kernel row number
		r²:	N_m:	r²:	N_m	r²:	N_m:
Hypothesis or background	Coding or noncoding	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵	×10⁻⁴	N × 10⁵
Domestication QTL
Hypothesis	Coding	9.97	2.1	17.2	2.9	29.7	2.1
Background	Coding	14.3	20.6	20.8	20.3	20.0	20.6
Difference	Coding	−4.3**		−3.6**		9.7**
Hypothesis	Noncoding	10.1	32.9	18.5	42.5	25.6	28.4
Background	Noncoding	14.9	201.6	25.8	196.9	22.5	206.0
Difference	Noncoding	−4.8**		−7.3*		3.1**
Domestication sweep
Hypothesis	Coding	15.3	1.2	20.7	1.2	29.6	1.2
Background	Coding	13.8	21.5	20.4	21.5	20.0	21.5
Difference	Coding	1.5**		0.3**		9.6**

Hypothesis	Noncoding	16.1	15.5	21.1	15.5	31.4	15.5
Background	Noncoding	14.2	219.1	25.0	219.1	22.2	219.1
Difference	Noncoding	1.9**		−3.9**		9.2**
Improvement sweep
Hypothesis	Coding	12.8	1.4	25.2	1.4	17.8	1.4
Background	Coding	14.0	21.6	20.2	21.6	20.8	21.6
Difference	Coding	−1.2**		5**		−3**
Hypothesis	Noncoding	13.9	10.7	31.3	10.7	21.2	10.7
Background	Noncoding	14.3	223.7	24.4	223.7	23.0	223.7
Difference	Noncoding	−0.4**		6.9**		−1.8**

Significantly different at * P = 0.05 and ** P = 0.01, respectively.

Association of haplotypes at known domestication genes

A number of domestication QTL have been resolved to individual genes by a combination of high-resolution genetic mapping, mutant analysis, and gene expression studies (Table 4). We identified SNPs within and nearby these genes and defined haplotypes at each domestication gene based on multiple SNP genotypes. Haplotype tests in the NCRPIS panel indicate that haplotypes containing grassy tillers 1 (gt1) were significantly associated with shank length (1.6% of variation, P < 0.05; Table 4). Haplotype additive effects on shank length ranged from +32 mm to −26 mm for gt1 (Table S5), and the inbred lines with haplotype effects that cause the largest increase in shank length represent a set of tropical and exotic germplasm distinct from the major temperate maize breeding pool (CML254, CML270, CML388, CML389, CML419, GE440, NC264, SC276Q2, SC277, SC76, TZEEI17, TZEEI20, and TZEI5). Zea apetala homolog1 (zap1) showed a significant association with cob length (5.9% of variation, P < 0.01; Table 4 and Table S6). No other candidate gene haplotypes had significant effects on trait variation.

Tests of associations between haplotypes of known domestication genes and domestication-related traits in the NCRPIS panel

Table 4

Tests of associations between haplotypes of known domestication genes and domestication-related traits in the NCRPIS panel

Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	Proportion of variance explained (%)^c
Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	SL	CL	KRN
tb1	1	265,745,979	265,747,712	Yes	265,746,572	265,751,840	5	12	15	NS	NS	—
tb1-enhancer	1	265,676,479	265,687,279	No	—	—	9	9	6	NS	NS	—
gt1	1	23,241,091	23,244,476	Yes	23,236,091	23,249,476	3	13	48	1.6*	NS	—
zagl1	1	4,862,047	4,877,625	Yes	4,862,244	4,862,765	5	5	6	—	NS	—
zap1	2	235,845,160	235,853,770	No	—	—	21	21	45	—	5.9**	—
te1	3	165,174,146	165,178,071	No	—	—	8	8	17	—	NS	—
fea2	4	133,662,510	133,664,998	Yes	133,662,368	133,664,252	2	2	6	—	—	NS

Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	Proportion of variance explained (%)^c
Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	SL	CL	KRN
tb1	1	265,745,979	265,747,712	Yes	265,746,572	265,751,840	5	12	15	NS	NS	—
tb1-enhancer	1	265,676,479	265,687,279	No	—	—	9	9	6	NS	NS	—
gt1	1	23,241,091	23,244,476	Yes	23,236,091	23,249,476	3	13	48	1.6*	NS	—
zagl1	1	4,862,047	4,877,625	Yes	4,862,244	4,862,765	5	5	6	—	NS	—
zap1	2	235,845,160	235,853,770	No	—	—	21	21	45	—	5.9**	—
te1	3	165,174,146	165,178,071	No	—	—	8	8	17	—	NS	—
fea2	4	133,662,510	133,664,998	Yes	133,662,368	133,664,252	2	2	6	—	—	NS

Chr, chromosome. NS, not significant. Significant at * P < 0.05 and ** P < 0.01, respectively.

a

Coding sequence start position (AGPv2).

b

If the region is extended, the testing region is 5 kbp extended on the left and right sides of the original position. Sometimes SNPs do not fully spread in the whole testing region, so the extended region is the actual region for testing.

c

Proportion of variance explained is calculated as $σ_{hap}^{2} / (σ_{hap}^{2} + (1 + F) σ_{Α}^{2} + σ_{residual}^{2}) .$

Table 4

Open in new tab Download slide

Tests of associations between haplotypes of known domestication genes and domestication-related traits in the NCRPIS panel

Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	Proportion of variance explained (%)^c
Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	SL	CL	KRN
tb1	1	265,745,979	265,747,712	Yes	265,746,572	265,751,840	5	12	15	NS	NS	—
tb1-enhancer	1	265,676,479	265,687,279	No	—	—	9	9	6	NS	NS	—
gt1	1	23,241,091	23,244,476	Yes	23,236,091	23,249,476	3	13	48	1.6*	NS	—
zagl1	1	4,862,047	4,877,625	Yes	4,862,244	4,862,765	5	5	6	—	NS	—
zap1	2	235,845,160	235,853,770	No	—	—	21	21	45	—	5.9**	—
te1	3	165,174,146	165,178,071	No	—	—	8	8	17	—	NS	—
fea2	4	133,662,510	133,664,998	Yes	133,662,368	133,664,252	2	2	6	—	—	NS

Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	Proportion of variance explained (%)^c
Gene name	Chr	Start^a	End	Extended testing region?^b	Extended start	Extended end	No. of SNPs in gene	No. of SNPs in testing region	No. of haplotypes tested for association	SL	CL	KRN
tb1	1	265,745,979	265,747,712	Yes	265,746,572	265,751,840	5	12	15	NS	NS	—
tb1-enhancer	1	265,676,479	265,687,279	No	—	—	9	9	6	NS	NS	—
gt1	1	23,241,091	23,244,476	Yes	23,236,091	23,249,476	3	13	48	1.6*	NS	—
zagl1	1	4,862,047	4,877,625	Yes	4,862,244	4,862,765	5	5	6	—	NS	—
zap1	2	235,845,160	235,853,770	No	—	—	21	21	45	—	5.9**	—
te1	3	165,174,146	165,178,071	No	—	—	8	8	17	—	NS	—
fea2	4	133,662,510	133,664,998	Yes	133,662,368	133,664,252	2	2	6	—	—	NS

Chr, chromosome. NS, not significant. Significant at * P < 0.05 and ** P < 0.01, respectively.

a

Coding sequence start position (AGPv2).

b

If the region is extended, the testing region is 5 kbp extended on the left and right sides of the original position. Sometimes SNPs do not fully spread in the whole testing region, so the extended region is the actual region for testing.

c

Proportion of variance explained is calculated as $σ_{hap}^{2} / (σ_{hap}^{2} + (1 + F) σ_{Α}^{2} + σ_{residual}^{2}) .$

Variance component testing

To estimate the proportion of trait genotypic variance associated with additive polygenic vs. other genetic effects (such as large-effect loci and nonadditive variance) in the NCRPIS panel, we simultaneously modeled genotypic effects with variance–covariance relationships proportional to the realized additive relationship matrix and genotypic effects with no pairwise relationships to capture genetic effects unique to each line. Among traits, 92–100% of genotypic variance was accounted for by polygenic additive background effects, with the remainder of variance attributable to a combination of nonadditive effects and large-effect loci (Table S7).

To partition total trait variance into components associated with domestication QTL, domestication sweep regions, improvement sweep regions, and the remainder of the genome, we estimated realized additive relationship matrices using SNPs in each of these regions of the genome and estimated the associated variance components in each panel (Figure 2, Figure 3, Table S8, Table S9, and Table S10). When effects associated with all four relationship matrices were fitted simultaneously in a common mixed model, the background polygenic variance component accounted for 67–80% of the total additive genetic variance in NCRPIS (Figure 2A, Table S8, and Table S10) and 71–100% in NAM (Figure 3A, Table S9, and Table S10). The increase in total heritability explained by fitting all four categories together was only 0–1% compared to simply fitting a single relationship matrix based on all SNPs together across all traits and panels (Figure 2A, Figure 3A, Table S8, and Table S9).

Figure 2

(A) The proportion of variance for shank length, cob length, and kernel row number among inbred lines of the NCRPIS panel associated with relationship matrices based on all SNPs in hypothesis-defined regions or on background SNPs. (B) Cumulative proportion of genome tagged by SNPs defining hypothesis relationship matrices and background matrices and the proportion of total additive genetic variation associated with each relationship matrix for shank length, cob length, and kernel row number among inbred lines of the NCRPIS panel. (C) Ratio of proportion of total additive genetic variation to cumulative proportion of the genome tagged by SNPs defining hypothesis and background relationship matrices for shank length, cob length, and kernel row number among inbred lines of the NCRPIS panel.

Figure 3

(A) The proportion of variance for shank length, cob length, and kernel row number among inbred lines of the NAM panel associated with relationship matrices based on all SNPs in hypothesis-defined regions or based on background SNPs. (B) Cumulative proportion of the genome tagged by SNPs defining hypothesis relationship matrices and background matrices and the proportion of total additive genetic variation associated with each relationship matrix for shank length, cob length, and kernel row number among inbred lines of the NAM panel. (C) Ratio of proportion of total additive genetic variation to cumulative proportion of the genome tagged by SNPs defining hypothesis and background relationship matrices for shank length, cob length, and kernel row number among inbred lines of the NAM panel.

Open in new tab Download slide

The relationship matrices were estimated using widely different numbers of markers, which is expected to affect the proportion of variance associated with each matrix under the null hypothesis of equal contributions to the total genetic variance. Therefore, we compared the proportion of additive variance accounted for by each matrix to the proportion of the genome represented by the hypothesis region. The proportion of additive variance associated with QTL-defined and domestication sweep-related hypothesis matrices was smaller than the proportion of genome represented by the SNPs defining those matrices (except for cob length in the NAM population; Figure 2, Figure 3, and Table S10). In contrast, the proportion of total additive variance associated with the improvement sweep-defined relationship matrix was two to five times greater than the proportion of the genome represented by the improvement sweeps (except for kernel row number variance, which was completely associated with the genomic background; Figure 2, Figure 3, and Table S10).

An alternative approach to account for differences in the proportion of the genome represented in each matrix was to fit each hypothesis-based relationship matrix along with a matched background relationship matrix based on an equally sized sample of background SNPs with the same proportion of coding and noncoding variants to estimate variance components. For each combination of hypothesis region, trait, and inbred line panel, we sampled background SNPs and fitted the mixed model 20 times to estimate the variability in variance components estimates across samples. Background polygenic effects were consistently associated with more variance than the domestication QTL, domestication sweep, or improvement sweep regions when fitting relationship matrices with matching numbers and genic composition of SNPs (Figure S4, Figure S5, Table S8, and Table S9). Among the hypothesis-defined regions, the improvement sweep regions consistently explained the largest proportion of variation, ranging from 8% to 48% of the total heritable variance when fitted with a matched background polygenic effect relationship matrix.

Discussion

Heritability and polygenic variation

Heritabilities of the three traits were relatively low to moderate, in part because the large numbers of lines tested precluded evaluating larger numbers of replicates of the experiment. The polygenic relationship matrix was associated with 40–53% of total phenotypic variation in the NCRPIS panel (Table 1). By comparison, the largest amount of variation associated with an individual SNP was estimated to be ∼3% (Table S2) and few SNPs passed stringent thresholds for association tests.

Haplotypes at the candidate gene zap1 were associated with 6% of cob length variation, suggesting that complex variation in a genomic region occasionally may account for more variation than can be associated with a single SNP, but this was the exception to the general trend of no obvious haplotype effects. Variants in zap1 were associated with ear length in teosinte (Weber et al. 2008); our results suggest some functional variation at this locus passed through the domestication bottleneck and remains in maize or new functional variants have arisen within maize. Haplotypes at candidate gene gt1 were also associated with a small amount of shank length variation in maize. Although this locus was not detected as affecting lateral branch (shank) length in maize–teosinte crosses (Briggs et al. 2007), Wills et al. (2013) identified gt1 as conferring the major difference in the number of ears produce by maize and teosinte and observed that haplotypic variation at this locus suggests only a partial sweep due to selection under domestication. Some of the teosinte-type variation at this locus may even have a favorable effect in maize by increasing the number of ears by a small amount, and it is possible that these same variants have small effects on shank length.

The tb1 gene and its linked enhancer played a key role in changing the morphology of maize, including reducing the length of lateral branches, during the domestication process (Studer et al. 2011; Tsiantis 2011). Thus, tb1 is an obvious candidate for explaining the variation among shank (lateral branch) lengths in maize. However, we observed no QTL or SNP association in NAM around tb1. We also did not identify an association for shank length near the gene in the diversity panel GWAS, and SNPs inside of the tb1 coding region and its enhancer were not significant. Direct testing of haplotypes defined by SNPs surrounding tb1 (encompassing a 5268-bp region) and encompassing the tb1 enhancer region suggested that these haplotypes are not significantly associated with shank length for the NCRPIS panel.

Although we identified a few individual SNPs and haplotypes associated with significant amounts of variation for domestication traits in maize, their effects were small. Power of association tests is influenced by sample size, allele frequency, effect size, and marker density; therefore, it is possible that some rare alleles of large effect were not detected in the GWAS scans, resulting in “missing heritability” (Manolio et al. 2009). Many SNPs are rare in the NCRPIS panel (Romay et al. 2013) and their effects are difficult to estimate accurately. Since the NAM population is derived from 25 founders crossed to a common reference parent, the minimum allele frequency expected is ∼2%, or 100 lines, which is sufficiently large to detect large-effect alleles. However, our evaluations of two of the domestication traits were based on smaller subsets of NAM, so power of detection of variants private to individual families is smaller for those traits. Interactions between causal variants at different loci (epistasis) and with environments will also make their detection more difficult (Manolio et al. 2009). Nevertheless, the lack of strong effects associated with any individual SNPs and the relatively large proportion of variation associated with the genomic background indicate that the genetic architecture of variation for domestication traits within maize is distinct from the genetic control of differences between maize and teosinte, which is dominated by a relatively few large-effect loci.

We found no evidence that QTL or SNP associations for these traits were more likely to be near domestication QTL or that markers in domestication QTL explained more trait variation than markers outside of these regions (Figure 2, Figure 3, Table 2, and Table 3). No consistent pattern of increased SNP effects was observed for SNPs inside domestication or improvement sweep regions (Table 2 and Table 3). The comparison of average SNP effects averaged over all SNPs in a group has limitations; many of these effect estimates are expected to be poor, and the mean value estimated is expected to be an upwardly biased estimate of the true mean effect size of individual SNPs. However, by averaging over many thousands of loci within each class, we expect the biases to cancel out when comparing mean effect sizes of different classes.

Partitioning of the genetic variance into components due to specific hypothesis-based regions is likely a more reliable method for comparing the influence of different genomic regions that are highly polygenic. Using this approach, we observed that improvement sweep regions showed a consistently higher proportion of the total heritable variance than other hypothesis-defined regions and often substantially more than the proportion of the genome represented by SNPs defining the improvement sweep relationship matrix (Figure 2, Figure 3, Table S8, Table S9, and Table S10). When we fitted specific hypothesis-based relationship matrices along with background matrices sampled with matching SNP numbers and proportion of coding SNPs, the variance associated with hypothesis-based relationship matrices was always lower than the matching background (Figure S4 and Figure S5). However, although we took care to control for the sample size and gene density of the SNPs used to compute the hypothesis and background relationship matrices, we expect that the markers used for the hypothesis matrix have higher linkage disequilibrium and relatively less explanatory power than equally sized samples of SNPs from the rest of the genome because they were sampled from restricted genomic blocks. The higher levels of linkage disequilibrium expected among the improvement sweep SNPs would downwardly bias the proportion of total additive variance they can explain relative to an equally sized random sample of SNPs from the rest of the genome. Therefore, these results are congruent with enrichment of improvement sweep-related regions of the maize genome for functional variants affecting domestication-related traits, although the effects of individual variants appear to be quite small and the precise magnitude of the enrichment remains difficult to assess.

The generally reduced contribution of domestication QTL regions, and to a lesser extent the domestication sweep regions, to domestication-related traits variation in maize is likely a direct result of selection purging variants that favor the teosinte morphology in these regions. Theory and analysis of response to long-term artificial selection in a number of plant and animal species indicate that initial generations of selection response are due to standing variation in the initial population, but that genetic variation in later generations is usually mostly due to the effects of new mutations (Keightley 2004; Walsh 2004). Thus, mutation is expected to be an important generator of genetic variation over the several thousand generations of selection and evolution of distinct maize types from a common ancestral population following the domestication bottleneck. Our results suggest that if new mutations that occurred after domestication are responsible for some of the observed genetic variation in domestication traits, they occur at genes not involved in domestication.

The increased contribution of improvement sweep regions to variation in these traits may be due to divergent selection for functional alleles in these regions. Although modern inbreds are significantly differentiated from landraces in these regions, the level of differentiation is lower than the mean differentiation between landraces and teosinte in the domestication sweep regions (Hufford et al. 2012). Thus, more sequence variation exists among inbreds in improvement sweep regions than in domestication sweep regions. However, less variation among inbreds exists in both domestication and improvement sweep regions than in the rest of the genome. This suggests that functional variants for domestication traits in improvement sweep regions may be targets of selection, but divergent selection maintains some variation for such variants. For example, some maize varieties have small kernel row numbers (because this is associated with larger seed size); others with small cob lengths are maintained because they have favored kernel types. Historical selection may have favored more kernel rows and longer cobs in general, but diverse inbred lines sampled from different regions may include contributions from populations selected in the opposite direction, resulting in an overall signal of selection near variants that affect these traits at the same time as these variants contribute disproportionately to the observed trait variation.

Acknowledgments

We thank Jeff Glaubitz for help selecting SNPs from the HapMap 3 database for relationship matrix estimation. S.X. was supported by National Institutes of Environmental Health Sciences training grant T32 ES007329 to the North Carolina State University Bioinformatics Research Center and National Science Foundation (NSF) award IOS-1127076; J.B.H. was supported by NSF awards IOS-1127076 and IOS-1238014 and by the U.S. Department of Agriculture, Agricultural Research Service.

Footnotes

Communicating editor: A. H. Paterson

Supplemental material is available online at www.genetics.org/lookup/suppl/doi:10.1534/genetics.116.191106/-/DC1.

Literature Cited

Bass, A. J., A. Dabney, and D. Robinson, 2015 Qvalue: Q-Value Estimation for False Discovery Rate Control. R Package version 2.2.2. Available at: http://github.com/jdstorey/qvalue.

Bian

Y

,

Yang

Q

,

Balint-Kurti

P

,

Wisser

R J

,

Holland

J B

,

2014

Limits on the reproducibility of marker associations with southern leaf blight resistance in the maize nested association mapping population.

BMC Genomics

15

:

1068

.

Bradbury

P J

,

Zhang

Z

,

Kroon

D E

,

Casstevens

T M

,

Ramdoss

Y

et al. ,

2007

TASSEL: software for association mapping of complex traits in diverse samples.

Bioinformatics

23

:

2633

–

2635

.

Briggs

W H

,

McMullen

M D

,

Gaut

B S

,

Doebley

J

,

2007

Linkage mapping of domestication loci in a large Maize–Teosinte backcross resource.

Genetics

177

:

1915

–

1928

.

Broman

K W

,

Wu

H

,

Sen

S

,

Churchill

G A

,

2003

R/qtl: QTL mapping in experimental crosses.

Bioinformatics

19

:

889

–

890

.

Browning

B L

,

Browning

S R

,

2009

A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals.

Am. J. Hum. Genet.

84

:

210

–

223

.

Buckler

E S

,

Holland

J B

,

Bradbury

P J

,

Acharya

C B

,

Brown

P J

et al. ,

2009

The genetic architecture of maize flowering time.

Science

325

:

714

–

718

.

Bukowski

R

,

Guo

X

,

Lu

Y

,

Zou

C

,

He

B

et al. ,

2015

Construction of the third generation Zea mays haplotype map.

bioRxiv

: .

10.1101/026963

Chia

J

,

Song

C

,

Bradbury

P J

,

Costich

D

,

de Leon

N

et al. ,

2012

Maize HapMap2 identifies extant variation from a genome in flux.

Nat. Genet.

44

:

803

–

807

.

Clark

R M

,

Linton

E

,

Messing

J

,

Doebley

J F

,

2003

Pattern of diversity in the genomic region near the maize domestication gene tb1.

Proc. Natl. Acad. Sci. USA

101

:

700

–

707

.

Clark

R M

,

Wagler

T N

,

Quijada

P

,

Doebley

J

,

2006

A distant upstream enhancer at the maize domestication gene tb1 has pleiotropic effects on plant and inflorescent architecture.

Nat. Genet.

38

:

594

–

597

.

Darwin

C R

,

1868

The Variation of Animals and Plants Under Domestication

.

John Murray

,

London

.

Doebley

J

,

2004

The genetics of maize evolution.

Annu. Rev. Genet.

38

:

37

–

59

.

Doebley

J

,

Stec

A

,

Wendel

J

,

Edwards

M

,

1990

Genetic and morphological analysis of a maize-teosinte F2 population: implications for the origin of maize.

Proc. Natl. Acad. Sci. USA

87

:

9888

–

9892

.

Doebley

J

,

Stec

A

,

Kent

B

,

1995

Suppressor of sessile spikelets 1 (Sos1): a dominant mutant affecting inflorescence development in maize.

Am. J. Bot.

82

:

571

–

577

.

Doebley

J

,

Stec

A

,

Hubbard

L

,

1997

The evolution of apical dominance in maize.

Nature

386

:

485

–

488

.

Doebley

J F

,

Gaut

B S

,

Smith

B D

,

2006

The molecular genetics of crop domestication.

Cell

127

:

1309

–

1321

.

Elshire

R J

,

Glaubitz

J C

,

Sun

Q

,

Poland

J A

,

Kawamoto

K

et al. ,

2011

A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species.

PLoS ONE

6

:

e19379

.

Flint-Garcia

S

,

Thuillet

A

,

Yu

J

,

Pressoir

G

,

Romero

S

et al. ,

2005

Maize association population: a high-resolution platform for quantitative trait locus dissection.

Plant J.

44

:

1054

–

1064

.

Galinat

W C

,

1983

The origin of maize as shown by key morphological traits of its ancestor, teosinte.

Maydica

28

:

121

–

138

.

Gilmour

A R

,

Gogel

B J

,

Cullis

B R

,

Thompson

R

,

2009

ASReml User Guide Release 3.0

.

VSN International

,

Hemel Hempstead, UK

.

Glaubitz

J C

,

Casstevens

T M

,

Lu

F

,

Harriman

J

,

Elshire

R J

et al. ,

2014

TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline.

PLoS ONE

9

:

1

–

11

.

Gusev

A

,

Lee

S

,

Trynka

G

,

Finucane

H

,

Vilhjálmsson

B

et al. ,

2014

Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases.

Am. J. Hum. Genet.

95

:

535

–

552

.

Harlan

J R

,

1992

Crops & Man

.

American Society of Agronomy

,

Madison, WI

.

Holland

J B

,

Coles

N D

,

2011

QTL controlling masculinization of ear tips in a maize (Zea mays L.) intraspecific cross.

G3

1

:

337

–

341

.

Hufford

M B

,

Xu

X

,

van Heerwaarden

J

,

Pyhajarvi

T

,

Chia

J

et al. ,

2012

Comparative population genomics of maize domestication and improvement.

Nat. Genet.

44

:

808

–

811

.

Hung

H

,

Browne

C

,

Guill

K

,

Coles

N

,

Eller

M

et al. ,

2011

The relationship between parental genetic or phenotypic divergence and progeny variation in the maize nested association mapping population.

Heredity

108

:

490

–

499

.

Hung

H

,

Shannon

L M

,

Tian

F

,

Bradbury

P J

,

Chen

C

et al. ,

2012

ZmCCT and the genetic basis of day-length adaptation underlying the postdomestication spread of maize.

Proc. Natl. Acad. Sci. USA

109

:

E1913

–

E1921

.

Iltis

H H

,

1983

From teosinte to maize: the catastrophic sexual transmutation.

Science

222

:

886

–

894

.

Keightley

P D

,

2004

Mutational variation and long-term selection response

, pp.

227

–

247

in

Plant Breeding Reviews

,

Vol. 24

, Part 1, edited by

Janick

J

.

John Wiley & Sons

,

New York

.

Kump

K L

,

Bradbury

P J

,

Buckler

E S

,

Belcher

A R

,

Oropeza-Rosas

M

et al. ,

2011

Genome-wide association study of quantitative resistance to southern leaf blight in the maize nested association mapping population.

Nat. Genet.

43

:

163

–

168

.

Manolio

T A

,

Collins

F S

,

Cox

N J

,

Goldstein

D B

,

Hindorff

L A

et al. ,

2009

Finding the missing heritability of complex diseases

.

Nature

461

:

747

–

753

.

Matsuoka

Y

,

Vigouroux

Y

,

Goodman

M M

,

Sanchez G.

J

,

Buckler

E

et al. ,

2002

A single domestication for maize shown by multilocus microsatellite genotyping.

Proc. Natl. Acad. Sci. USA

99

:

6080

–

6084

.

McMullen

M D

,

Kresovich

S

,

Villeda

H S

,

Bradbury

P

,

Li

H

et al. ,

2009

Genetic properties of the maize nested association mapping population.

Science

325

:

737

–

740

.

Meyer

R S

,

Purugganan

M D

,

2013

Evolution of crop species: genetics of domestication and diversification.

Nat. Rev. Genet.

14

:

840

–

852

.

Oakey

H

,

Verbyla

A

,

Pitchford

W

,

Cullis

B

,

Kuchel

H

,

2006

Joint modeling of additive and non-additive genetic line effects in single field trials.

Theor. Appl. Genet.

113

:

809

–

819

.

Oakey

H

,

Verbyla

A P

,

Cullis

B R

,

Wei

X

,

Pitchford

W S

,

2007

Joint modeling of additive and non-additive (genetic line) effects in multi-environment trials.

Theor. Appl. Genet.

114

:

1319

–

1332

.

Ogut

F

,

Bian

Y

,

Bradbury

P J

,

Holland

J B

,

2015

Joint-multiple family linkage analysis predicts within-family variation better than single-family analysis of the maize nested association mapping population.

Heredity

114

:

552

–

563

.

Purcell

S

,

Neale

B

,

Todd-Brown

K

,

Thomas

L

,

Ferreira

M

et al. ,

2007

PLINK: a tool set for whole-genome association and population-based linkage analyses.

Am. J. Hum. Genet.

81

:

559

–

575

.

R Core Team, 2016 R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at: https://www.R-project.org/.

Rodgers-Melnick

E

,

Vera

D L

,

Bass

H W

,

Buckler

E S

,

2016

Open chromatin reveals the functional maize genome.

Proc. Natl. Acad. Sci. USA

113

:

E3177

–

E3184

.

Romay

M C

,

Millard

M J

,

Glaubitz

J C

,

Peiffer

J A

,

Swarts

K L

et al. ,

2013

Comprehensive genotyping of the USA national maize inbred seed bank.

Genome Biol.

14

:

R55

.

SAS Institute

, 2011

SAS/STAT 9.3 User’s Guide

.

SAS Institute

,

Cary, NC

.

Speed

D

,

Balding

D J

,

2014

MultiBLUP: improved SNP-based prediction for complex traits.

Genome Res.

24

:

1550

–

1557

.

Speed

D

,

Hemani

G

,

Johnson

M

,

Balding

D

,

2012

Improved heritability estimation from genome-wide SNPs.

Am. J. Hum. Genet.

91

:

1011

–

1021

.

Studer

A J

,

Doebley

J F

,

2012

Evidence for a natural allelic series at the maize domestication locus teosinte branched1.

Genetics

191

:

951

–

958

.

Studer

A

,

Zhao

Q

,

Ross-Ibarra

J

,

Doebley

J

,

2011

Identification of a functional transposon insertion in the maize domestication gene tb1.

Nat. Genet.

43

:

1160

–

1163

.

Swarts

K

,

Li

H

,

Navarro

J A R

,

An

D

,

Romay

M C

et al. ,

2014

Novel methods to optimize genotypic imputation for low-coverage, next-generation sequence data in crop plants.

Plant Genome

7

:

3

.

Tian

F

,

Bradbury

P J

,

Brown

P J

,

Hung

H

,

Sun

Q

et al. ,

2011

Genome-wide association study of leaf architecture in the maize nested association mapping population.

Nat. Genet.

43

:

159

–

162

.

Tsiantis

M

,

2011

A transposon in tb1 drove maize domestication.

Nat. Genet.

43

:

1048

–

1050

.

van Heerwaarden

J

,

Doebley

J

,

Briggs

W H

,

Glaubitz

J C

,

Goodman

M M

et al. ,

2011

Genetic signals of origin, spread, and introgression in a large sample of maize landraces.

Proc. Natl. Acad. Sci. USA

108

:

1088

–

1092

.

VanRaden

P M

,

2008

Efficient methods to compute genomic predictions.

J. Dairy Sci.

91

:

4414

–

4423

.

Walsh

B

,

2004

Population- and quantitative-genetic models of selection limits

, pp.

177

–

225

in

Plant Breeding Reviews

,

Vol. 24

, Part 1, edited by

Janick

J

.

John Wiley & Sons

,

New York

.

Wang

R

,

Stec

A

,

Hey

J

,

Lukens

L

,

Doebley

J

,

1999

The limits of selection during maize domestication.

Nature

398

:

236

–

239

.

Weber

A L

,

Briggs

W H

,

Rucker

J

,

Baltazar

B M

,

de Jesús Sánchez-Gonzalez

J

et al. ,

2008

The genetic architecture of complex traits in teosinte (Zea mays ssp. parviglumis): new evidence from association mapping.

Genetics

180

:

1221

–

1232

.

Wills

D M

,

Whipple

C J

,

Takuno

S

,

Kursel

L E

,

Shannon

L M

et al. ,

2013

From many, one: genetic control of prolificacy during maize domestication.

PLoS Genet.

9

:

e1003604

.

Wright

S I

,

Bi

I V

,

Schroeder

S G

,

Yamasaki

M

,

Doebley

J F

et al. ,

2005

The effects of artificial selection on the maize genome.

Science

308

:

1310

–

1314

.

Yan

J

,

Crouch

J

,

Warburton

M

,

2011

Association mapping for enhancing maize (Zea mays L.) genetic improvement.

Crop Sci.

51

:

433

–

449

.

Yu

J

,

Holland

J B

,

McMullen

M D

,

Buckler

E S

,

2008

Genetic design and statistical power of nested association mapping in maize.

Genetics

178

:

539

–

551

.

Zhang

Z W

,

Ersoz

E

,

Lai

C Q

,

Todhunter

R J

,

Tiwari

H K

et al. ,

2010

Mixed linear model approach adapted for genome-wide association studies.

Nat. Genet.

42

:

355

–

360

.

PubMed

Zila

C T

,

Ogut

F

,

Romay

M C

,

Gardner

C A

,

Buckler

E S

et al. ,

2014

Genome-wide association study of fusarium ear rot disease in the U.S.A. maize inbred line collection.

BMC Plant Biol.

14

:

1

–

15

.