An assessment of the hypervariable domains of the 16S rRNA genes for their value in determining microbial community diversity: the paradox of traditional ecological indices

Table 1. Richness, diversity and evenness mean indices based on single hypervariable domains of the 16S rRNA genes for natural sagebrush (NSB) soil at three depths and irrigated mouldboard-ploughed (IMP) soils at 0–30 cm

Sample	Richness (S)	Diversity (H)	Evenness (E)
V1 domain
NSB 0–5 cm	10.78 (± 1.30)^a	1.90 (± 0.11)^c	0.80 (± 0.03)^f
NSB 5–15 cm	12.44 (± 1.51)^b	2.09 (± 0.11)^d	0.83 (± 0.03)^f,g
NSB 15–30 cm	11.78 (± 0.67)^a,b	2.11 (± 0.09)^d	0.86 (± 0.03)^g
IMP 0–30 cm	10.44 (± 0.72)^a	1.65 (± 0.09)^e	0.70 (± 0.03)^h
ANOVA	F=6.13, P<0.01	F=37.81, P<0.01	F=38.63, P<0.01
V1+V2 domains
NSB 0–5 cm	7.11 (± 1.05)^i	1.76 (± 0.12)^k	0.90 (± 0.04)^n
NSB 5–15 cm	11.33 (± 2.17)^j	2.28 (± 0.19)^l	0.95 (± 0.02)^o
NSB 15–30 cm	9.89 (± 1.69)^j	2.14 (± 0.14)^l	0.94 (± 0.03)^n,o
IMP 0–30 cm	9.56 (± 0.72)^j	2.03 (± 0.10)^m	0.90 (± 0.03)^n
ANOVA	F=11.96, P<0.01	F=21.97, P<0.01	F=6.39, P<0.01
V3 domain
NSB 0–5 cm	4.57 (± 1.13)^p	1.18 (± 0.24)^r	0.81 (± 0.10)^s
NSB 5–15 cm	3.00 (± 0.00)^q	1.01 (± 0.02)^r	0.92 (± 0.02)^t
NSB 15–30 cm	3.67 (± 0.50)^p,q	1.12 (± 0.11)^r	0.87 (± 0.06)^s,t
IMP 0–30 cm	4.44 (± 0.52)^p	1.36 (± 0.14)^r	0.91 (± 0.02)^t
ANOVA	F=7.31, P<0.01	F=6.88, P<0.01	F=4.91, P<0.01

Richness (S) = no. of peaks in each sample; diversity (H) = −Σ(p_i)ln(p_i), where p_i is the relative ratio of individual peak heights; evenness (E) = H/H_max, where H_max = ln(S); numbers in parentheses are ± SD of the calculated means (n=9). Within each domain set, indices followed by the same letter are not significantly different from each other using Bonferroni post-hoc comparisons, P<0.05.
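
The index definitions in the footnote translate directly into a few lines of code. The following is a minimal Python sketch, not the authors' implementation. The function name `diversity_indices`, the treatment of zero-height peaks as absent, and the convention of returning E = 0 for a single-peak profile (where ln(S) = 0 and E is undefined) are assumptions made for illustration.

```python
import math

def diversity_indices(peak_heights):
    """Return (S, H, E) for one profile, given its raw peak heights.

    S = number of peaks, H = -sum(p_i * ln(p_i)), E = H / ln(S).
    """
    # Assumption: a peak with zero height counts as absent.
    peaks = [h for h in peak_heights if h > 0]
    richness = len(peaks)
    total = sum(peaks)
    # p_i is each peak height relative to the summed peak height.
    proportions = [h / total for h in peaks]
    shannon = -sum(p * math.log(p) for p in proportions)
    # Assumption: E is reported as 0.0 when ln(S) = 0 (fewer than two peaks).
    evenness = shannon / math.log(richness) if richness > 1 else 0.0
    return richness, shannon, evenness

# Hypothetical profile: one dominant peak and three minor peaks.
S, H, E = diversity_indices([70, 10, 10, 10])
print(S, round(H, 2), round(E, 2))
```

Because E divides H by ln(S), a profile with few but equally tall peaks can score a higher evenness than a richer profile with one dominant peak, which is worth keeping in mind when comparing the V3 rows with the V1 rows.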

The three data sets, V1, V3 and V1+V2, were combined and the indices recalculated (Table 2). IMP was significantly different from NSB 5–15 and NSB 15–30 with respect to diversity. None of the NSB evenness indices differed significantly from each other, but IMP was significantly different from NSB 15–30. Indices were then recalculated using only two domains. For the combined V1 and V3 domains, there were no significant differences in richness between samples, but IMP diversity and evenness were significantly different from all NSB samples. The same was true for the V1 and V1+V2 data set. When V3 and V1+V2 were combined, IMP was not significantly different from NSB 5–15 or NSB 15–30; IMP was, however, significantly different from NSB 0–5. Evenness did not differ between any of the samples.

Table 2. The richness, diversity, and evenness mean values for various combinations of the domain data sets produced using amplicon length heterogeneity PCR profiling

Sample	Richness (S)	Diversity (H)	Evenness (E)
V1, V3, and V1+V2 domains
NSB 0–5 cm	20.33 (± 1.15)^a,b	2.40 (± 0.05)^c,d	0.80 (± 0.01)^f
NSB 5–15 cm	25.67 (± 3.78)^a,b	2.63 (± 0.18)^c	0.81 (± 0.04)^f
NSB 15–30 cm	24.67 (± 0.57)^a,b	2.68 (± 0.12)^e	0.84 (± 0.03)^f
IMP 0–30 cm	19.33 (± 1.52)^a	2.26 (± 0.11)^c,d	0.76 (± 0.02)^f,g
ANOVA	F=6.42, P<0.02	F=7.52, P<0.01	F=4.63, P<0.04
V1, V3 domains
NSB 0–5 cm	12.89 (± 1.76)^h	2.03 (± 0.14)^i,j	0.80 (± 0.04)^l
NSB 5–15 cm	14.11 (± 2.31)^h	2.16 (± 0.12)^i,j	0.82 (± 0.03)^l
NSB 15–30 cm	14.78 (± 0.83)^h	2.25 (± 0.10)^j	0.83 (± 0.03)^l
IMP 0–30 cm	14.00 (± 0.50)^h	1.85 (± 0.07)^k	0.70 (± 0.03)^m
ANOVA	F=2.34, P<0.09	F=21.52, P<0.01	F=28.58, P<0.01
V1, V1+V2 domains
NSB 0–5 cm	17.67 (± 1.80)^n	2.32 (± 0.08)^p	0.81 (± 0.02)^s
NSB 5–15 cm	21.89 (± 2.14)^o	2.52 (± 0.15)^q	0.81 (± 0.03)^s
NSB 15–30 cm	18.33 (± 1.11)^n	2.33 (± 0.08)^p	0.80 (± 0.02)^s
IMP 0–30 cm	18.89 (± 0.92)^n	2.13 (± 0.09)^r	0.73 (± 0.03)^t
ANOVA	F=12.55, P<0.01	F=19.11, P<0.01	F=22.25, P<0.01
V1+V2, V3 domains
NSB 0–5 cm	9.33 (± 0.57)^u	2.10 (± 0.03)^w	0.94 (± 0.01)^y
NSB 5–15 cm	15.67 (± 3.05)^v	2.51 (± 0.27)^w,x	0.91 (± 0.05)^y
NSB 15–30 cm	12.67 (± 0.57)^u,v	2.44 (± 0.08)^w,x	0.96 (± 0.02)^y
IMP 0–30 cm	15.00 (± 0.00)^v	2.56 (± 0.01)^x	0.95 (± 0.01)^y
ANOVA	F=9.82, P<0.01	F=6.21, P<0.02	F=1.79, P<0.23

Richness (S) = no. of peaks in each sample; diversity (H) = −Σ(p_i)ln(p_i), where p_i is the relative ratio of individual peak heights; evenness (E) = H/H_max, where H_max = ln(S); numbers in parentheses are ± SD of the calculated means (n=9). Within each domain set, indices followed by the same letter are not significantly different from each other using Bonferroni post-hoc comparisons, P<0.05.

Multidimensional scaling

Multidimensional scaling tightly clustered the NSB samples apart from IMP with V1 and the three-domain data (Figs 1a and 1e). Similarly, distinct NSB and IMP groups were present with V1 and V3 combined data (Fig. 1d). The V1 and V3 data also indicated the subtle differences related to depth. The V1+V2 or V3 domain data were able to distinguish NSB from IMP samples but the clustering was not as pronounced. All other domain combinations followed the same MDS clustering trend as the V1 and V3 data, with only slight variations in the spacing of the clusters within the three different depth NSB groups (data not shown).

Fig. 1. Multidimensional scaling of (a) V1 domain, (b) V1+V2 domain, (c) V3 domain, (d) V1 and V3 domains combined, and (e) the concatenation of three hypervariable domains, V1, V1+V2 and V3. Δ represents NSB 0–5 cm; ▿ represents NSB 5–15 cm; ◻ represents NSB 15–30 cm; and ▪ represents IMP 0–30 cm.

ANOSIM is a measure of the dissimilarity of a priori defined groups. Global R-values near zero indicate no difference among groups, and R-values at or near one indicate that samples within groups are more similar to each other than to samples from different groups. In this study, the global R-values for NSB groups compared with IMP for any combination of domains were R ≥ 0.80, P<0.001, indicating that NSB always grouped distinctly from IMP. SIMPER analysis of the combined V1 and V3 domains (Table 3) shows the major amplicons that contributed to the dissimilarity among the soil groups. How consistently an amplicon was able to differentiate among the groups was indicated by the ratio of the average dissimilarity to the standard deviation (Dis:SD). A large ratio indicates that the amplicon consistently and substantially contributed to the differences between LH-PCR profiles.
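
To make the interpretation of the global R-value concrete, the sketch below computes R from a dissimilarity matrix using the commonly cited rank-based definition, R = (mean between-group rank − mean within-group rank) / (M/2), where M = n(n−1)/2 is the number of sample pairs. The text above does not spell out this formula, so the definition, the function names `average_ranks` and `anosim_r`, and the toy matrix are assumptions for illustration only. The permutation test that yields the P-value is omitted.

```python
from itertools import combinations

def average_ranks(values):
    """Rank values from 1 (smallest); tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def anosim_r(dissimilarity, groups):
    """Global R from a square dissimilarity matrix and one label per sample.

    Assumes at least one within-group and one between-group pair exist.
    """
    pairs = list(combinations(range(len(groups)), 2))
    ranks = average_ranks([dissimilarity[i][j] for i, j in pairs])
    within = [r for r, (i, j) in zip(ranks, pairs) if groups[i] == groups[j]]
    between = [r for r, (i, j) in zip(ranks, pairs) if groups[i] != groups[j]]
    mean_within = sum(within) / len(within)
    mean_between = sum(between) / len(between)
    return (mean_between - mean_within) / (len(pairs) / 2)

# Hypothetical dissimilarities: two tight groups that are far apart.
toy = [
    [0.00, 0.10, 0.80, 0.90],
    [0.10, 0.00, 0.70, 0.85],
    [0.80, 0.70, 0.00, 0.20],
    [0.90, 0.85, 0.20, 0.00],
]
print(anosim_r(toy, ["NSB", "NSB", "IMP", "IMP"]))
```

In this toy case every between-group dissimilarity outranks every within-group one, so R takes its maximum value of 1; R falls toward zero as within-group and between-group ranks interleave.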

Table 3. The major amplicons (>50% contribution) responsible for the dissimilarity among soil groups for the V1 and V3 combined domain data. Column headings are Dis: averaged dissimilarity between paired soil groups; BP: amplicon length in base pairs; Ab₁: average abundance of the amplicon for group 1; Ab₂: average abundance of the amplicon for group 2; BP_Dis: amplicon-specific contribution to the average dissimilarity; Dis:SD: ratio of the amplicon's contribution to dissimilarity to the standard deviation of that contribution among groups; % Con: % of the average dissimilarity due to the amplicon; % Cum: the cumulative contribution of the amplicons to the dissimilarity among groups

Soil Groups	Dis	BP	Ab1	Ab2	BP_Dis	Dis:SD	%Con	%Cum
NSB0–5:NSB5–15	27.13	83	0.00	0.02	2.14	5.11	7.88	7.88
		73	0.02	0.00	2.04	3.69	7.51	15.39
		74	0.32	0.20	1.74	5.98	6.40	21.79
		86	0.13	0.22	1.63	1.75	6.00	27.79
		88	0.02	0.06	1.59	1.64	5.84	33.63
		85	0.04	0.00	1.58	0.67	5.81	39.44
		93	0.01	0.03	1.36	1.42	5.00	44.44
		91	0.01	0.02	1.29	1.75	4.75	49.19
NSB0–5:NSB15–30	19.64	90	0.00	0.04	2.34	3.60	8.65	8.65
		88	0.02	0.08	2.25	4.23	8.31	16.96
		73	0.02	0.00	2.00	3.70	7.40	24.36
		80	0.02	0.00	1.87	13.79	6.89	31.25
		74	0.32	0.19	1.78	3.49	6.56	37.81
		85	0.04	0.00	1.55	0.67	5.73	43.54
		93	0.01	0.03	1.43	1.50	5.26	48.80
NSB5–15:NSB15–30	19.64	90	0.00	0.04	2.85	17.65	14.53	14.53
		83	0.02	0.00	2.17	5.12	11.03	25.56
		94	0.04	0.02	1.49	1.24	7.60	33.16
		80	0.01	0.00	1.33	4.72	6.76	39.92
		174	0.00	0.01	1.21	4.00	6.17	46.09
NSB0–5:IMP0–30	42.16	85	0.04	0.27	5.78	2.28	13.72	13.72
		86	0.13	0.00	5.12	5.80	12.14	25.86
		84	0.12	0.00	4.81	3.45	11.41	37.26
		76	0.05	0.00	3.23	33.82	7.67	44.93
		78	0.03	0.00	2.28	7.39	5.41	50.34
NSB5–15:IMP0–30	52.49	85	0.00	0.27	7.65	19.03	14.57	14.57
		86	0.22	0.00	7.02	19.87	13.37	27.94
		84	0.17	0.00	6.20	17.07	11.81	39.75
		73	0.00	0.06	3.45	4.06	6.58	46.33
NSB15–30:IMP0–30	45.73	85	0.00	0.27	7.52	19.57	16.45	16.45
		86	0.17	0.00	5.97	11.95	13.07	29.51
		84	0.14	0.00	5.44	6.97	11.90	41.42
		73	0.00	0.06	3.39	4.07	7.42	48.84
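
The columns of Table 3 can be related to one another with a short calculation. The sketch below is an illustrative Python reconstruction of a SIMPER-style breakdown, assuming that dissimilarity is Bray–Curtis on a 0–100 scale, that contributions are averaged over all between-group sample pairs, and that the sample standard deviation is used for Dis:SD. The function name `simper`, the dictionary keys, and the toy profiles are hypothetical. It is not the software used in the study.

```python
from statistics import mean, stdev

def simper(group1, group2):
    """Break the average Bray-Curtis dissimilarity down by amplicon (bin).

    group1, group2: lists of equal-length, non-empty abundance profiles.
    Returns (average dissimilarity, rows sorted by decreasing contribution).
    """
    per_pair = []  # one list of per-bin contributions for each sample pair
    for a in group1:
        for b in group2:
            total = sum(a) + sum(b)
            per_pair.append([100.0 * abs(x - y) / total for x, y in zip(a, b)])
    avg_dis = mean(sum(contribs) for contribs in per_pair)

    rows = []
    for bin_index in range(len(group1[0])):
        contribs = [pair[bin_index] for pair in per_pair]
        bp_dis = mean(contribs)
        sd = stdev(contribs) if len(contribs) > 1 else 0.0
        rows.append({
            "bin": bin_index,
            "bp_dis": bp_dis,
            # A large ratio marks a consistently discriminating amplicon.
            "dis_sd": bp_dis / sd if sd > 0 else float("inf"),
            "pct_con": 100.0 * bp_dis / avg_dis if avg_dis > 0 else 0.0,
        })
    rows.sort(key=lambda r: r["bp_dis"], reverse=True)
    cumulative = 0.0
    for row in rows:
        cumulative += row["pct_con"]
        row["pct_cum"] = cumulative
    return avg_dis, rows

# Hypothetical relative-abundance profiles over three amplicon lengths.
g1 = [[0.5, 0.5, 0.0], [0.6, 0.4, 0.0]]
g2 = [[0.0, 0.5, 0.5], [0.0, 0.4, 0.6]]
avg, table = simper(g1, g2)
for row in table:
    print(row)
```

Under these assumptions, %Con is BP_Dis divided by Dis, so the per-amplicon contributions sum to the average dissimilarity and %Cum reaches 100 once every amplicon is listed.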

Choice of diversity measures and hypervariable domains

A study by Hill et al. (2003) addressed the suitability of ecological indices for microbial ecology studies. This report included a comprehensive review of the index selection issue using microbial community clone libraries and concluded that the Shannon index appeared to be more sensitive to overall community change and the loss of rare populations than did the Simpson dominance index (Hill et al., 2003). Since LH-PCR profiles reflect a combination of dominant and rare data points, the Shannon index was selected to measure diversity.

Unlike large clone libraries, however, microbial community profiles generated by any profiling method, including LH-PCR, only represent the minimum detectable diversity. When the data from any two or all three domains were combined, NSB samples had higher diversity than IMP, but within NSB the indices generally did not differ significantly with depth. The individual domain data did not, however, support this trend, and the interpretation differed depending on the domain queried. Both the hypervariable domain(s) queried and the indices used must therefore be chosen with careful attention to their suitability and limitations if diversity measures are to be applied appropriately to LH-PCR community profiles.

Other microbial ecology studies have addressed the issue of diversity using other molecular techniques and these same ecological indices. Dunbar et al. (2000) concluded in their soil study using terminal restriction fragment length polymorphism (TRFLP) analyses that the use of multiple restriction enzymes to generate community patterns did not necessarily increase the overall resolution of diversity. Instead, multiple digests were used to increase the confidence that similarities between samples were not a result of technical biases that may come from using only one enzyme (Dunbar et al., 1999). In addition, diversity indices were calculated individually for each enzymatic pattern. They observed that diversity indices varied based on the restriction enzyme used. Likewise, in our study, the choice of domain queried had implications for diversity measures. Unlike Dunbar et al., we observed an increase in discrimination of diversity using the combined domain data.

Similarity and MDS

Patterning the similarity/dissimilarity of the community proved to be more informative than the traditional ecological indices in discriminating treatment groups. Bray–Curtis similarity is used principally with continuous numerical data and species abundance data similar to those generated by LH-PCR profiling (Gotelli & Ellison, 2004). Clearly, in this study, the Bray–Curtis similarity and subsequent MDS ordinations strongly reflected between-site dissimilarities. The advantage of using Bray–Curtis similarity over other similarity indices based on presence/absence calculations and/or Euclidean distances is that there is less bias introduced by shared absences of amplicons (Gotelli & Ellison, 2004). The intrinsic importance of both dominant and rare community members within the community can still be reflected using this similarity index and ordination method.
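
The point about shared absences can be illustrated numerically. The Python sketch below compares Bray–Curtis dissimilarity with a simple-matching presence/absence coefficient, which is chosen here only as one example of an index that counts joint absences as agreement. The function names and the toy profiles are assumptions for illustration and do not come from the study's data.

```python
def bray_curtis_dissimilarity(a, b):
    """Bray-Curtis dissimilarity: sum|a_i - b_i| / sum(a_i + b_i), in [0, 1]."""
    total = sum(a) + sum(b)
    if total == 0:
        return 0.0  # Assumption: two empty profiles are treated as identical.
    return sum(abs(x - y) for x, y in zip(a, b)) / total

def simple_matching_similarity(a, b):
    """Fraction of bins where both profiles agree on presence or absence."""
    agree = sum((x > 0) == (y > 0) for x, y in zip(a, b))
    return agree / len(a)

# Two hypothetical profiles sharing one of their two amplicons.
p1 = [0.5, 0.5, 0.0]
p2 = [0.5, 0.0, 0.5]

# The same profiles after adding five bins in which neither has a peak.
p1_padded = p1 + [0.0] * 5
p2_padded = p2 + [0.0] * 5

print(bray_curtis_dissimilarity(p1, p2),
      bray_curtis_dissimilarity(p1_padded, p2_padded))
print(simple_matching_similarity(p1, p2),
      simple_matching_similarity(p1_padded, p2_padded))
```

Joint absences add nothing to either the numerator or the denominator of Bray–Curtis, so the padded and unpadded values are equal, whereas the simple-matching similarity rises as empty bins are added even though the profiles themselves have not changed.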

In a recent study by Bernhard et al. (2005), LH-PCR data patterned the microbial communities across a salinity gradient. Using MDS, these authors were able to discriminate clearly between the bacterial communities associated with fresh water and those in estuarine and marine waters. These different aquatic systems drive both physiological and habitat adaptation, and were reflected in structural changes (i.e. amplicons) in the microbial communities. The present study differs from that of Bernhard et al. in that the same soil ecosystem (no physiological gradient per se) was being studied, and disturbance and depth were assumed to be the ecological drivers. It has been shown previously that microbial diversity changes, and is often lower, with cropping, land management and use (Ibekwe et al., 2002) and with depth (Blume et al., 2002). Even though similar microorganisms were no doubt associated within a given Idaho soil type, LH-PCR and MDS analysis were better able to pattern the subtle community differences associated with disturbance and somewhat with depth. MDS strongly supported the clustering of the samples based on tillage. However, the discriminatory pattern produced was also a function of which domains were queried. Even though the diversity indices seemed to need only two domains (i.e. V1 combined with V3) to discriminate adequately between samples, the more robust scaling output was reflected with either the V1 or the three-domain data. However, V1 and V3 data were able to differentiate NSB from IMP samples and also to identify differences associated with depth in the NSB soil groups. Herein lies the paradox of choosing which domains and measures better represent community structural differences.

Conclusions

Univariate measures such as Shannon's information index and its associated evenness index appear to be more appropriate measures for community clone libraries since individual bacterial identification can be ascertained through sequencing (Wintzingerode et al., 1997; Hill et al., 2002). Multivariate and ordination measures such as Bray–Curtis similarity and MDS that use iterative procedures to map the similarity (or dissimilarity) produced by profiling were better measures to apply to this study. ANOSIM can test for differences between multivariate groups, and SIMPER analyses can rank the amplicons contributing the most to the dissimilarities. Because of the inherently lower resolution provided by any molecular profiling method, dynamic patterning of microbial communities can best be analysed using these nonparametric, multivariate methods.

The evolution of profiling techniques over the last few years has moved from publishing only the raw data (electropherograms or gels) to applying traditional ecological analyses to profiling data and now to using multivariate analyses. Perhaps the greatest challenge facing microbial ecologists will be to break away from the traditional ecological paradigms of diversity measures and resolve the paradox by developing new algorithms or unique metrics that will better analyse the complexities still hidden within microbial communities.

Acknowledgements

This study was funded in part by a NSF ADVANCE Fellowship (Award no. 0340695) to D.K.M. and in part by the USDA Agricultural Research Service, Northwest Irrigation and Soils Research Laboratory, Kimberly, ID to J.A.E. Lilliana Moreno, Laurie Richardson, Todd Crandall and the rest of the Mathee and Richardson laboratory members are thanked for their critical reviews and helpful comments.

References

Bernhard AE, Field KG (2000) Identification of nonpoint sources of fecal pollution in coastal waters by using host-specific 16S ribosomal DNA genetic markers from fecal anaerobes. Appl Environ Microbiol 66: 1587–1594.

Bernhard AE, Colbert D, McManus J, Field KG (2005) Microbial community dynamics based on 16S rRNA gene profiles in a Pacific Northwest estuary and its tributaries. FEMS Microbiol Ecol 52: 115–128.

Blume E, Bischoff M, Reichert JM, Moorman T, Konopka A, Turco RF (2002) Surface and subsurface microbial biomass, community structure and metabolic activity as a function of soil depth and season. Appl Soil Ecol 20: 171–181.

Cocolin L, Manzano M, Cantoni C, Comi G (2001) Denaturing gradient gel electrophoresis analysis of the 16S rRNA gene V1 region to monitor dynamic changes in the bacterial population during fermentation of Italian sausages. Appl Environ Microbiol 67: 5113–5121.

Crosby LD, Criddle CS (2003) Understanding bias in microbial community analysis techniques due to rrn operon copy number heterogeneity. BioTechniques 34: 790–802.

Diez B, Pedros-Alio C, Marsh TL, Massana R (2001) Application of denaturing gradient gel electrophoresis (DGGE) to study the diversity of marine picoeukaryotic assemblages and comparison of DGGE with other molecular techniques. Appl Environ Microbiol 67: 2942–2951.

Dunbar J, Takala S, Barns SM, Davis JA, Kuske CR (1999) Levels of bacterial community diversity in four arid soils compared by cultivation and 16S rRNA gene cloning. Appl Environ Microbiol 65: 1662–1669.

Dunbar J, Ticknor LO, Kuske CR (2000) Assessment of microbial diversity in four Southwestern United States soils by 16S rRNA gene terminal restriction fragment analysis. Appl Environ Microbiol 66: 2943–2950.

Fennell DE, Rhee S-K, Ahn Y-B, Haggblom MM, Kerkhof LJ (2004) Detection and characterization of a dehalogenating microorganism by terminal restriction fragment length polymorphism fingerprinting of 16S rRNA in a sulfidogenic, 2-bromophenol-utilizing enrichment. Appl Environ Microbiol 70: 1169–1175.

Girvan MS, Bullimore J, Pretty JN, Osborn AM, Ball AS (2003) Soil type is the primary determinant of the composition of the total and active bacterial communities in arable soils. Appl Environ Microbiol 69: 1800–1809.

Gotelli NJ, Ellison AM (2004) The Analysis of Multivariate Data, pp. 383–446. Sinauer Associates, Inc, Sunderland, MA.

Hill JE, Seipp RP, Betts M, Hawkins L, Van Kessel AG, Crosby WL, Hemmingsen SM (2002) Extensive profiling of a complex microbial community by high-throughput sequencing. Appl Environ Microbiol 68: 3055–3066.

Hill TCJ, Walsh KA, Harris JA, Bruce FM (2003) Using ecological diversity measures with bacterial communities. FEMS Microbiol Ecol 43: 1–11.

Hughes JB, Hellmann JJ, Ricketts TH, Bohannon BJM (2001) Counting the uncountable: statistical approaches to estimating microbial diversity. Appl Environ Microbiol 67: 4399–4406.

Ibekwe AM, Kennedy AC, Frohne PS, Papiernik SK, Yang C-H, Crowley DE (2002) Microbial diversity along a transect of agronomic zones. FEMS Microbiol Ecol 39: 183–191.

Istock CA, Bell JA, Ferguson N, Istock NL (1996) Bacterial species and evolution: theoretical and practical perspectives. J Ind Microbiol 17: 137–150.

Litchfield CD, Gillevet PM (2002) Microbial diversity and complexity in hypersaline environments: a preliminary assessment. J Ind Microbiol 28: 48–55.

Mills DK (2000) Molecular monitoring of microbial populations during bioremediation of contaminated soils. PhD thesis, Environmental Sciences and Public Policy, Department of Biology, Fairfax, VA.

Mills D, Fitzgerald K, Litchfield C, Gillevet P (2003) A comparison of DNA profiling techniques for monitoring nutrient impact on microbial community composition during bioremediation of petroleum-contaminated soils. J Microbiol Meth 54: 57–74.

Osborn AM, Moore ERB, Timmis KN (2000) An evaluation of terminal-restriction fragment length polymorphism (T-RFLP) analysis for the study of microbial community structure and dynamics. Environ Microbiol 2: 39–50.

Pohle GW, Thomas MLH (2001) Monitoring protocol for marine benthos: Intertidal and subtidal macrofauna. Report for Atlantic Maritime Ecological Science Cooperative, St Andrews, New Brunswick.

Ritchie NJ, Schutter ME, Dick RP, Myrold DD (2000) Use of length heterogeneity PCR and fatty acid methyl ester profiles to characterize microbial communities in soil. Appl Environ Microbiol 66: 1668–1675.

Suzuki MT, Giovannoni SJ (1996) Bias caused by template annealing in the amplification of mixtures of 16S rRNA genes by PCR. Appl Environ Microbiol 62: 625–630.

Suzuki M, Rappe MS, Giovannoni SJ (1998) Kinetic bias in estimates of coastal picoplankton community structure obtained by measurements of small-subunit rRNA gene PCR amplicon length heterogeneity. Appl Environ Microbiol 64: 4522–4529.

Tiirola MA, Mannisto MK, Puhakka JA, Kulomaa MS (2002) Isolation and characterization of Novosphingobium sp. strain mt1, a dominant polychlorophenol-degrading strain in a groundwater bioremediation system. Appl Environ Microbiol 68: 173–180.

Tiirola MA, Suvilampi JE, Kulomaa MS, Rintala JA (2003) Microbial diversity in a thermophilic aerobic biofilm process: analysis by length heterogeneity PCR (LH-PCR). Water Res 37: 2259–2269.

Torsvik V, Sorheim R, Goksoyr J (1996) Total bacterial diversity in soil and sediment communities – a review. J Ind Microbiol 17: 170–178.

Watve MG, Gangal RM (1996) Problems in measuring bacterial diversity and a possible solution. Appl Environ Microbiol 62: 4299–4301.