Bayesian Estimation of Sequence Damage in Ancient DNA

Estimates of Damage Levels in a Range of aDNA Data Sets

Data Set	Sample Size	Age Range (thousand years ago)	Aligned Length (bp)	Damage Per Site (Mean)	Damage Per Site (95% HPD)	Total Damage (Per Sequence)	Total Damage (Per Alignment)
Adélie penguin	96	0–12	313	2.42 × 10⁻³	8.41 × 10⁻⁴–4.08 × 10⁻³	0.76	72.8
Aurochs	40	2–12	379	2.31 × 10⁻⁴	3.37 × 10⁻⁸–5.89 × 10⁻⁴	0.09	3.49
Bison (with modern)	182	0–60	601	1.51 × 10⁻³	6.14 × 10⁻⁴–1.71 × 10⁻³	0.91	165.2
Bison (without modern)	154	1–60	601	2.03 × 10⁻³	1.34 × 10⁻³–2.81 × 10⁻³	1.22	187.5
Brown bear	30	10–59	130	2.75 × 10⁻³	3.63 × 10⁻⁷–5.96 × 10⁻³	0.36	10.7
Cave bear	26	27–80	288	9.69 × 10⁻⁴	4.17 × 10⁻⁷–2.27 × 10⁻³	0.28	7.3
Cave hyena	10	38–51	366	4.05 × 10⁻⁴	7.85 × 10⁻⁸–1.16 × 10⁻³	0.15	1.48
Cave lion	34	12–62	213	6.05 × 10⁻⁴	1.07 × 10⁻⁷–1.43 × 10⁻³	0.13	4.4
Horse	12	1–28	348	3.68 × 10⁻³	4.01 × 10⁻⁴–7.24 × 10⁻³	1.28	15.4
Moa	14	1–6	241	1.74 × 10⁻³	1.75 × 10⁻⁵–3.58 × 10⁻³	0.42	5.9
Musk ox	10	0–44	177	6.52 × 10⁻⁴	9.81 × 10⁻⁸–1.91 × 10⁻³	0.12	1.2
Musk ox (cytb)	10	0–44	114	9.79 × 10⁻⁴	1.01 × 10⁻⁷–2.96 × 10⁻³	0.11	1.1
Ox	36	4–8	379	3.87 × 10⁻⁴	5.11 × 10⁻⁷–8.57 × 10⁻⁴	0.15	5.3
Tuco-tuco (cytb)	45	0–10	253	3.26 × 10⁻⁴	3.37 × 10⁻⁷–7.94 × 10⁻⁴	0.08	3.7

NOTE.—Data sets were obtained from the following studies: Aurochs (Edwards et al. forthcoming), bison (Shapiro et al. 2004), brown bear (Barnes et al. 2002), cave bear (Loreille et al. 2001; Hofreiter et al. 2002; Orlando et al. 2002), cave lion (Barnett R, unpublished data), horse (Vila et al. 2001), moa (Huynen et al. 2003), musk ox (MacPhee et al. 2005), ox (Bollongino et al. 2006), and social tuco-tuco (Chan et al. 2006).

Table 1

For each species, aDNA sequences with known radiocarbon ages were collected from GenBank. The substitution model for each data set was chosen by assessment of AIC scores using Modeltest. As above, BEAST analyses were performed assuming a strict molecular clock and incorporating radiocarbon ages as prior information. A constant-size coalescent prior was placed on the tree. For each data set, the MCMC was run for 10,000,000 steps following 1,000,000 discarded burn-in steps, with samples drawn every 1,000 steps. Samples from the posterior were checked for convergence and acceptable mixing using Tracer.

Results

Simulations

In the analyses of simulated data sets with known damage rates, estimates of damage using the delta model were generally accurate for the pseudo–cave lion data sets, whereas there was a small but consistent overestimation of damage in the pseudo-bison data sets (fig. 1a). For both data sets, significantly nonzero estimates of damage levels were obtained even when there was no actual damage in the sequences.

FIG. 1.—

Results from Bayesian phylogenetic analyses of damaged sequence data generated by simulation. All error bars represent ±1 standard error. (a) Estimates of damage per site made using the delta model. The unlabeled dashed line represents y = x. (b) Estimates of the mutation rate made with and without the delta model. The 2 horizontal dashed lines indicate the true mutation rates (i.e., those used for simulation), with the top and bottom lines denoting the rates in cave lions and bison, respectively.

Analyses performed without the delta model tended to overestimate the mutation rate when damage was present (fig. 1b). The pattern of overestimation is particularly noticeable for the pseudo–cave lion sequences. Addition of the delta model removed this bias from the estimates, thereby increasing the accuracy of BEAST in recovering the true mutation rate. For the pseudo–cave lion and pseudo-bison data, the true rate was contained within the 95% highest posterior density (95% HPD) 100% and 90% of the time, respectively.
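The 95% HPD intervals and coverage figures quoted above are summaries of MCMC samples. As a minimal sketch (not the BEAST or Tracer implementation), an HPD interval can be approximated as the narrowest window containing 95% of the sorted samples, and coverage as the fraction of replicate intervals that contain the true value. The function names `hpd_interval` and `coverage` and the synthetic draws are assumptions made for illustration only.

```python
import math
import random


def hpd_interval(samples, mass=0.95):
    """Return the narrowest interval containing `mass` of the samples."""
    xs = sorted(samples)
    n = len(xs)
    k = max(1, math.ceil(mass * n))  # number of samples inside the interval
    # Slide a window of k consecutive sorted samples and keep the narrowest one.
    start = min(range(n - k + 1), key=lambda i: xs[i + k - 1] - xs[i])
    return xs[start], xs[start + k - 1]


def coverage(intervals, true_value):
    """Fraction of (low, high) intervals that contain the true value."""
    hits = sum(1 for low, high in intervals if low <= true_value <= high)
    return hits / len(intervals)


if __name__ == "__main__":
    rng = random.Random(1)
    # Synthetic right-skewed draws standing in for posterior samples (hypothetical data).
    draws = [rng.expovariate(1000.0) for _ in range(10_000)]
    low, high = hpd_interval(draws)
    print(f"95% HPD: {low:.2e} to {high:.2e}")
```

For a skewed posterior such as that of a parameter bounded at zero, the HPD interval generally differs from the equal-tailed 2.5% to 97.5% interval, which is one reason the lower HPD bounds in table 1 can lie very close to zero.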

Real Data

The estimated frequency of miscoding lesions in the real aDNA data sets ranged from 2.3 × 10⁻⁴ per site (1 miscoding lesion for every 4,329 nt) in aurochsen to 3.7 × 10⁻³ per site (1 miscoding lesion for every 272 nt) in horses (fig. 2 and table 1). The estimated totals ranged from 1.1 damaged nucleotides in the musk ox cytb data set (from a total of 1,140 nt) to 165 of 109,382 nt in the full bison data set. Based on regression analyses, no significant relationships were found between the estimated level of damage and the number of sequences, the age of the oldest sequence, or the estimated mutation rate. Three studies did not have their sequences checked by independent replication; the damage rates in these data sets were higher, but not significantly so (P = 0.19, 1-tailed t-test).
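The per-sequence and per-alignment totals in table 1 follow from the per-site estimate by simple multiplication. The sketch below reproduces that arithmetic for two rows of the table; the function name `damage_summary` is an assumption, and small differences from the published totals are expected because the table reports rounded means.

```python
def damage_summary(per_site, aligned_length_bp, n_sequences):
    """Convert a per-site damage estimate into the derived quantities of table 1."""
    per_sequence = per_site * aligned_length_bp   # expected damaged sites in one sequence
    per_alignment = per_sequence * n_sequences    # expected damaged sites in the whole alignment
    nt_per_lesion = 1.0 / per_site                # "1 miscoding lesion for every N nt"
    return per_sequence, per_alignment, nt_per_lesion


if __name__ == "__main__":
    rows = {
        "Horse": (3.68e-3, 348, 12),
        "Aurochs": (2.31e-4, 379, 40),
    }
    for name, (per_site, length, n) in rows.items():
        per_seq, per_aln, nt = damage_summary(per_site, length, n)
        print(f"{name}: {per_seq:.2f} per sequence, {per_aln:.1f} per alignment, "
              f"1 lesion per {nt:.0f} nt")
```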

FIG. 2.—

Bayesian estimates of sequence damage for 13 aDNA data sets. The error bars denote 95% HPDs. Details of the data sets are given in table 1.

For each data set, estimated mutation rates were relatively high compared with those estimated from phylogenetic studies (for a discussion of this issue, see Ho et al. 2005). These elevated rates, ranging from 11.1% per MY in horses to 112% per MY in Adélie penguins (table 2), were obtained in spite of the correction of damage-related bias through the delta parameter. The rate estimate from Adélie penguins is consistent with an earlier Bayesian analysis of the same penguin sequences (Lambert et al. 2002). In most cases, however, the mean rate estimates are not particularly meaningful because of the very large associated 95% HPDs. Mutation rates could not be estimated from the 2 musk ox alignments because of poor MCMC convergence, which may reflect the limited variation among the sequences. Subsequent analyses, which are not shown here, demonstrated that estimates of the delta parameter did not change when the mutation rate was fixed to an arbitrary value.
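For readers more used to rates expressed per site per year, the units of table 2 can be converted directly. The sketch below assumes that "% per MY" means substitutions per 100 sites per million years; the function name is hypothetical.

```python
def percent_per_my_to_per_site_per_year(rate_percent_per_my):
    """Convert a rate in % per million years to substitutions per site per year."""
    per_site_per_my = rate_percent_per_my / 100.0  # percent -> proportion of sites
    return per_site_per_my / 1_000_000.0           # per million years -> per year


if __name__ == "__main__":
    for name, rate in [("Horse", 11.1), ("Adélie penguin", 112.0)]:
        converted = percent_per_my_to_per_site_per_year(rate)
        print(f"{name}: {converted:.2e} substitutions per site per year")
```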

Table 2

Estimates of Mutation Rates in a Range of aDNA Data Sets, Made Using the Delta Model

Data Set	Mutation Rate, Mean (% per MY)	Mutation Rate, 95% HPD (% per MY)
Adélie penguin	112	31.8–198
Aurochs	63.5	15.1–118
Bison (all)	22.5	14.7–30.6
Brown bear	78.3	9.43–138
Cave bear	13.2	4.02–25.1
Cave lion	20.2	3.15–40.4
Horse	11.1	1.77–30.8
Moa	67.2	1.30–209
Musk ox	N/Aᵃ	N/Aᵃ
Musk ox (cytb)	N/Aᵃ	N/Aᵃ
Ox	13.3	0.21–40.8
Social tuco-tuco (cytb)	41.7	9.04–81.0

NOTE.—Unless otherwise indicated, all sequences are from the mitochondrial control region.

ᵃ Mutation rates could not be reliably estimated for the two musk ox data sets, due to convergence problems in the MCMC analysis. This did not affect estimates of the delta parameter.

Discussion

aDNA Damage

The results of the simulations indicate that the delta model is capable of measuring the proportion of damage in DNA sequences with reasonable accuracy. In some cases, there is a slight tendency to overestimate the actual amount of damage, possibly due to the treatment of genuine polymorphisms as damage. For this reason, the delta model appears to be very effective in placing upper limits on the amount of miscoding lesions that may be present in a data set, but it is probably inappropriate for analyzing sequences from multiple species.
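To make the quantity being estimated concrete, the following sketch applies artificial damage to a sequence under a deliberately simple assumption: each site is hit independently with probability delta, and a hit is recorded as a transition (C↔T or G↔A). This is an illustrative assumption and not necessarily the procedure used to generate the simulated data in this study; the names `apply_damage` and `TRANSITION` are hypothetical.

```python
import random

# Transition partner of each base; ambiguous characters are left untouched.
TRANSITION = {"A": "G", "G": "A", "C": "T", "T": "C"}


def apply_damage(sequence, delta, rng):
    """Return (damaged_sequence, n_damaged_sites) for a per-site damage probability."""
    damaged = []
    hits = 0
    for base in sequence:
        if base in TRANSITION and rng.random() < delta:
            damaged.append(TRANSITION[base])
            hits += 1
        else:
            damaged.append(base)
    return "".join(damaged), hits


if __name__ == "__main__":
    rng = random.Random(42)
    length, n_sequences, delta = 601, 154, 2.03e-3  # bison without modern, table 1
    template = "".join(rng.choice("ACGT") for _ in range(length))
    total = sum(apply_damage(template, delta, rng)[1] for _ in range(n_sequences))
    print(f"expected damaged sites: {delta * length * n_sequences:.1f}")
    print(f"simulated damaged sites: {total}")
```

Because each damaged site appears in only one sequence, such changes fall on terminal branches of the tree, which is why genuine singleton polymorphisms can be mistaken for damage.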

Interestingly, there appear to be few miscoding lesions in the majority of the real data sets analyzed in this study, with estimated damage rates of less than 2.0 × 10⁻³ per nucleotide, or 1 damaged site per 500 nt. The most damaged alignments were those from horse, brown bear, and Adélie penguin, which exhibited estimated damage levels of 3.68 × 10⁻³, 2.75 × 10⁻³, and 2.42 × 10⁻³ per base, respectively. As expected, excluding modern (presumably undamaged) sequences from the bison data set results in an increase in the average amount of damage estimated over the data set, although the 95% HPDs of the 2 estimates overlap to some extent. The simulations suggest that when there is no actual damage, the delta model produces mean estimates of around 0.5–2 damaged sites per 1,000 bp. The majority of aDNA data sets have estimated damage falling in this range, indicating that recent aDNA studies have been successful in addressing the problem of damage. In turn, this suggests that current practices in aDNA research, including cloning and UNG treatment, are reducing the number of spurious mutations introduced by damage.

It is also worth noting that aDNA data are often obtained by amplifying small fragments that are ultimately concatenated to provide the full-length target sequence. In many cases, these small fragments overlap to some extent, resulting in regions within the sequence that are independently replicated. Indeed, any process that generates multiple PCR amplifications of the same sequence fragment, including cloning, overlapping amplification, and replication, will serve to increase the chance of identifying inconsistent bases, thereby reducing the effect of spurious mutations on the data set.

Unfortunately, it was not possible either to quantify or qualify the effects of UNG treatment, replication of full or partial sequences, or cloning on damage rates. Most aDNA studies did not state whether sequences were treated with UNG, and all but one study performed cloning. There was no significant evidence of a higher damage rate in data sets that had not been checked by independent replication, but this may have been due to the limited sample size or the effect of the other forms of replication described above.

High estimated levels of damage in some data sets could be due to sparse sampling, which increases the probability that different sequences will not share polymorphisms. This has the effect of increasing the number of base changes assigned to terminal branches. For this reason, it is expected that the accuracy of the delta model will be greatest for large, thoroughly sampled data sets.

Overall, the delta model appears to work best when a large amount of damage is present in the sequence data. At low levels of damage, the model lacks sufficient power to distinguish between damage and genetic variation, especially if the latter makes a substantial contribution to the total sequence variation. It is also notable that the delta model produced nonzero damage estimates for the undamaged simulated data. This is partly because delta is a scale parameter (bounded at zero but with no upper bound), but it is also likely that the delta model is treating some of the genuine polymorphism as sequence errors. This problem is exacerbated in analyses of data sets comprising fewer than 10 sequences, for which there is a significant correlation between the number of tips and the estimated value of delta (results not shown). This correlation disappears for larger data sets; for this reason, our analyses were restricted to data sets comprising at least 10 sequences.

Mutation Rates

The high estimates of mutation rates, combined with the relatively low levels of sequence damage, provide a strong indication that sequence errors alone are insufficient to explain the “time dependency of molecular rate estimates” hypothesis, which postulates that molecular evolutionary rates appear to decline with calibration depth (Ho et al. 2005). Phylogenetic methods are liable to overestimate the mutation rate if spurious polymorphisms are present. This is clearly evident in the analysis of the pseudo–cave lion data sets, for which the overestimation is particularly marked because the original sequences exhibit low variation. As a result, any induced damage will form a substantial proportion of the total sequence variation, hence making a large contribution to the overestimation of the mutation rate.

We were not able to test some of the sequences that were published in the earlier years of aDNA research, when cloning was not routine (Higuchi et al. 1984; Handt et al. 1996), because these data sets are small and consist of short, fragmentary sequences. The delta model would have been particularly useful for investigating these sequences, which might have had high levels of damage because rigorous authentication criteria had not yet been adopted (for the most recent discussion of criteria, see Gilbert et al. 2005).

Future aDNA studies can profit from using the delta model to place upper credibility limits on the amount of sequence damage present in an alignment. It could also be used to assess the efficacy of damage-limiting precautions, such as cloning, UNG treatment, and high-fidelity Taq polymerases. In theory, the delta model can also be used to detect sequencing errors, provided that multiple sequences from the same locus are available and the amount of error is not negligible. With respect to the latter, for example, genome projects appear to have sequencing errors of about 1 per 10,000 bp (Hill et al. 2000; Schmutz et al. 2004), which is effectively negligible from the perspective of the delta model. In contrast, single-pass sequencing of noncoding regions can yield error rates as high as 3.1 per 1,000 bp (Hill et al. 2000), which is well within the detection range of the delta model. Damage levels are likely to be lower for coding regions, however, for two reasons. First, damage is easier to detect because of the lower amount of natural sequence variation, and second, damage tends to occur at sites that are highly polymorphic, such as the mutation hot spots in the mitochondrial control region (Gilbert, Willerslev, et al. 2003).
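One rough way to read the error rates quoted above is to compare them with the estimates that the delta model returned for undamaged simulated data (around 0.5–2 per 1,000 bp, as reported in the Discussion). The sketch below does only that comparison; treating this range as a detection baseline is an assumption made here for illustration, and the names `BASELINE_LOW`, `BASELINE_HIGH`, and `classify_rate` are hypothetical.

```python
# Mean delta estimates obtained from undamaged simulated data (per site).
BASELINE_LOW = 0.5 / 1000
BASELINE_HIGH = 2.0 / 1000


def classify_rate(rate_per_site):
    """Place an error or damage rate relative to the zero-damage baseline range."""
    if rate_per_site < BASELINE_LOW:
        return "below baseline"
    if rate_per_site <= BASELINE_HIGH:
        return "within baseline"
    return "above baseline"


if __name__ == "__main__":
    examples = {
        "genome project sequencing error": 1 / 10_000,
        "single-pass noncoding sequencing error": 3.1 / 1000,
        "bison (with modern) damage estimate": 1.51e-3,
    }
    for label, rate in examples.items():
        print(f"{label}: {rate:.2e} per site, {classify_rate(rate)}")
```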

The current delta model is simplistic and could be extended in a number of ways. For example, the damage process could be explicitly modeled in finer detail, using a time-independent damage substitution matrix. Rather than being uniform among tips, multiple delta parameters could be assigned among the sequences or could be modeled in an age-dependent manner. The present amount of available aDNA and damage data is perhaps too limited for these models to be tested reliably.
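As an illustration of what an age-dependent extension might look like, the sketch below lets the per-site damage probability grow with sample age as delta(t) = 1 − exp(−r·t), where r is a constant damage rate per site per year. This functional form, the parameter value, and the function name are hypothetical choices made here; they are not part of the delta model described in this study.

```python
import math


def age_dependent_delta(age_years, damage_rate_per_site_per_year):
    """Per-site damage probability after `age_years`, assuming constant-rate decay."""
    if age_years < 0 or damage_rate_per_site_per_year < 0:
        raise ValueError("age and rate must be non-negative")
    # -expm1(-x) equals 1 - exp(-x) but stays accurate for very small x.
    return -math.expm1(-damage_rate_per_site_per_year * age_years)


if __name__ == "__main__":
    rate = 1e-7  # hypothetical damage rate, per site per year
    for age in (0, 10_000, 30_000, 60_000):
        print(f"age {age:>6} years: delta = {age_dependent_delta(age, rate):.2e}")
```

Under such a model, modern sequences would have delta equal to zero and a single rate parameter would replace the uniform delta, so the number of parameters would not increase.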

We have demonstrated that the delta model is able to estimate levels of damage accurately from simulated data, but it would be ideal to measure damage in situ. Unfortunately, techniques have not yet been developed to measure molecular damage directly from the DNA molecule. Future advances in molecular biological techniques will undoubtedly improve our understanding of the processes causing sequence damage and increase our power to detect this damage.

We thank Ross Barnett, Ceiridwen Edwards, and Yvonne Chan for providing data. S.Y.W.H. was funded by the Leverhulme Trust, the Commonwealth Scholarship Commission, and Linacre College, Oxford. T.H.H. was funded by the Marie Curie GeneTime program. A.R. and B.S. were funded by the Royal Society.

References

Barnes, Matheus, Shapiro, Jensen, Cooper. Dynamics of Pleistocene population extinctions in Beringian brown bears. Science. 2002, vol. 295 (pg. 2267–2270)

Bollongino, Edwards, Alt, Burger, Bradley. Early history of European domestic cattle as revealed by ancient DNA. Biol Lett. 2006 (pg. 155–159)

Bower, Spencer, Matsumura, Nisbet RER, Howe. How many clones need to be sequenced from a single forensic or ancient DNA sample in order to determine a reliable consensus sequence? Nucleic Acids Res. 2005 (pg. 2549–2556)

Chan, Anderson, Hadly. Bayesian estimation of the timing and severity of a population bottleneck from ancient DNA. PLoS Genet. 2006 pg. e59

Clark, Whittam. Sequencing errors and molecular evolutionary analysis. Mol Biol Evol. 1992 (pg. 744–752)

Drummond, Rambaut. BEAST. 2003. Oxford: University of Oxford

Drummond, Ho SYW, Phillips, Rambaut. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006 pg. e88

Drummond, Nicholls, Rodrigo, Solomon. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data. Genetics. 2002, vol. 161 (pg. 1307–1320)

Edwards, Bollongino, Scheu, et al. (38 co-authors). Mitochondrial history of the aurochs (Bos primigenius primigenius) in Europe. Proc Roy Soc B. 2007, vol. 274 (pg. 1377–1385)

Felsenstein. Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol. 1981 (pg. 368–376)

Gilbert MTP, Bandelt, Hofreiter, Barnes. Assessing ancient DNA studies. Trends Ecol Evol. 2005 (pg. 541–544)

Gilbert MTP, Binladen, Miller, Wiuf, Willerslev, Poinar, Carlson, Leebens-Mack, Schuster. Recharacterization of ancient DNA miscoding lesions: insights in the era of sequencing-by-synthesis. Nucleic Acids Res. 2007

Gilbert MTP, Hansen, Willerslev, Rudbeck, Barnes, Lynnerup, Cooper. Characterization of genetic miscoding lesions caused by postmortem damage. Am J Hum Genet. 2003

Gilbert MTP, Willerslev, Hansen, Barnes, Rudbeck, Lynnerup, Cooper. Distribution patterns of postmortem damage in human mitochondrial DNA. Am J Hum Genet. 2003

Handt, Krings, Ward, Paabo. The retrieval of ancient human DNA sequences. Am J Hum Genet. 1996 (pg. 368–376)

Hansen, Willerslev, Wiuf, Mourier, Arctander. Statistical evidence for miscoding lesions in ancient DNA templates. Mol Biol Evol. 2001 (pg. 262–265)

Higuchi, Bowman, Freiberger, Ryder, Wilson. DNA sequences from the quagga, an extinct member of the horse family. Nature. 1984, vol. 312 (pg. 282–284)

Hill, Gemünd, Benes, Ansorge, Gibson. An estimate of large-scale sequencing accuracy. EMBO Rep. 2000

Ho SYW, Phillips, Cooper, Drummond. Time dependency of molecular rate estimates and systematic overestimation of recent divergence times. Mol Biol Evol. 2005 (pg. 1561–1568)

Hofreiter, Capelli, Krings, et al. (13 co-authors). Ancient DNA analyses reveal high mitochondrial DNA sequence diversity and parallel morphological evolution of late Pleistocene cave bears. Mol Biol Evol. 2002 (pg. 1244–1250)

Hofreiter, Jaenicke, Serre, Haeseler Av, Paabo. DNA sequences from multiple amplifications reveal artifacts induced by cytosine deamination in ancient DNA. Nucleic Acids Res. 2001 (pg. 4793–4799)

Höss, Jaruga, Zastawny, Dizdaroglu, Paabo. DNA damage and DNA sequence retrieval from ancient tissues. Nucleic Acids Res. 1996 (pg. 1304–1307)

Huynen, Millar, Scofield, Lambert. Nuclear DNA sequences detect species limits in ancient moa. Nature. 2003, vol. 425 (pg. 175–178)

Lambert, Ritchie, Millar, Holland, Drummond, Baroni. Rates of evolution in ancient DNA from Adélie penguins. Science. 2002, vol. 295 (pg. 2270–2273)

Lindahl. Instability and decay of the primary structure of DNA. Nature. 1993, vol. 362 (pg. 709–715)

Loreille, Orlando, Patou-Mathis, Philippe, Taberlet, Hanni. Ancient DNA analysis reveals divergence of the cave bear, Ursus spelaeus, and brown bear, Ursus arctos, lineages. Curr Biol. 2001 (pg. 200–203)

MacPhee, Tikhonov, Mol, Greenwood. Late Quaternary loss of genetic diversity in muskox (Ovibos). BMC Evol Biol. 2005

Noonan, Coop, Kudaravalli, et al. (11 co-authors). Sequencing and analysis of Neanderthal genomic DNA. Nature. 2006, vol. 314 (pg. 1113–1118)

Noonan, Hofreiter, Smith, Priest, Rohland, Rabeder, Krause, Detter, Paabo, Rubin. Genomic sequencing of Pleistocene cave bears. Science. 2005, vol. 309 (pg. 597–600)

Orlando, Bonjean, Bocherens, Thenot, Argant, Otte, Hanni. Ancient DNA and the population genetics of cave bears (Ursus spelaeus) through space and time. Mol Biol Evol. 2002 (pg. 1920–1933)

Pääbo. Ancient DNA: extraction, characterization, molecular cloning, and enzymatic amplification. Proc Natl Acad Sci USA. 1989 (pg. 1939–1943)

Poinar, Schwarz, et al. (13 co-authors). Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA. Science. 2006, vol. 311 (pg. 392–394)

Posada, Crandall. Modeltest: testing the model of DNA substitution. Bioinformatics. 1998 (pg. 817–818)

Rambaut, Grassly. Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees. Comput Appl Biosci. 1997 (pg. 235–238)

Rambaut, Drummond. Tracer. 2004. Oxford: University of Oxford