Abstract

Despite playing a critical role in evolutionary processes and outcomes, relatively little is known about rates of recombination in the vast majority of species, including squamate reptiles—the second largest order of extant vertebrates, many species of which serve as important model organisms in evolutionary and ecological studies. This paucity of data has resulted in limited resolution on questions related to the causes and consequences of rate variation between species and populations, the determinants of within-genome rate variation, as well as the general tempo of recombination rate evolution on this branch of the tree of life. In order to address these questions, it is thus necessary to begin broadening our phylogenetic sampling. We here provide the first fine-scale recombination maps for two species of spiny lizards, Sceloporus jarrovii and Sceloporus megalepidurus, which diverged at least 12 Mya. As might be expected from similarities in karyotype, population-scaled recombination landscapes are largely conserved on the broad-scale. At the same time, considerable variation exists at the fine-scale, highlighting the importance of incorporating species-specific recombination maps in future population genomic studies.

Introduction

In most sexually reproducing organisms, recombination is critically important. On the one hand, recombination ensures the proper pairing and segregation of homologous chromosomes during meiotic cell division; on the other, it creates novel combinations of alleles through the exchange of genetic material between the parental chromosomes upon which selection may act (Hill and Robertson 1966; Felsenstein 1974; Felsenstein and Yokoyama 1976; Otto and Barton 2001). Recombination also plays a pivotal role in shaping the spatial distribution of variation within a genome and modulating genetic diversity among individuals (Maynard Smith and Haigh 1974; Begun and Aquadro 1992; Charlesworth et al. 1993), facilitating adaptation to novel or changing environments (Charlesworth 1976), and contributing to the formation of new species (Nachman and Payseur 2012; Payseur 2016). Moreover, as recombination rate variation influences the performance of genome scans to identify signatures of positive selection (Booker et al. 2020), a detailed knowledge of recombination landscapes is essential for many ecological and evolutionary studies.

Recombination is a quantitative, heritable trait subject to selection (Charlesworth and Charlesworth 1985) that may exhibit plasticity due to environmental or physiological factors (Stevison et al. 2017). Although constraints appear to exist with regards to an organismal minimum and maximum recombination rate—imposed by an obligate crossover per chromosome (or chromosome arm) and by crossover interference, respectively (see review by Ritz et al. 2017 and references therein), tremendous variation in rates and patterns of recombination exists across the tree of life—between species, populations, and individuals—as well as across the genome (see review by Stapley et al. 2017 and references therein). Yet, relatively little remains known about rates of recombination in the vast majority of species, including squamate reptiles—the second largest order of extant vertebrates.

Sceloporus is a diverse genus of lizards native to North America with roughly 100 species that have been well-studied in terms of behavior (Hews and Martins 2013), habitat (Lawing et al. 2016; Rivera et al. 2020, 2021), and phylogenetic relationships (Wiens et al. 2010; Lambert and Wiens 2013; Leaché et al. 2016). Sceloporus species display an unusual variability in chromosome number—ranging from 22 to 46 chromosomes (Sites et al. 1992), resulting in a rapid differentiation among species with markedly different chromosome counts (Leaché et al. 2016). Differences in chromosome number may not only promote speciation (Hall 2009), they also impose significant constraints on genome evolution (Bedoya and Leaché 2021) and may lead to changes in broad-scale recombination landscapes (Dumas and Britton-Davidian 2002). Here, we use population genetic data to characterize and compare genome-wide recombination profiles of Sceloporus jarrovii and Sceloporusmegalepidurus—two vivaparous species that diverged at least 12 Mya and for which similar constraints on recombination might be expected due to their belonging to the same 32-chromosome clade (Leaché et al. 2016).

Materials and methods

Population sampling

We captured eight S.jarrovii and eight S. megalepidurus individuals (four males and four females per species) in the field during peak breeding season in early October 2013 in south-eastern Arizona, United States, and in late August 2013 near Veracruz, Mexico, respectively (Supplementary Figure S1). Lizards were initially placed in uniquely numbered cloth bags and later sacrificed by first cooling individuals over ice and then rapidly decapitating (IACUC protocols 492636-1 and 962836-1 to DKH). Liver samples were collected from each individual and placed in RNAeasy solution. Samples were held in a −5°C freezer while in the field (for 2–3 weeks) until permanently stored in a −20°C freezer at Indiana State University.

DNA extraction, library preparation, and sequencing

DNA was extracted from the liver samples at the Yale Center for Genomic Analysis following the chemagic DNA Tissue100 H24 prefilling VD1208504.che protocol (PerkinElmer Ref# CMG-1207). Specifically, tissue samples were lysed overnight in 1 ml chemagic lysis buffer and 50 µl Proteinase K at 56°C. The next day, samples were treated with 80 µl RNase A at 4 mg/µl (AmericanBio Ref# AB12023-00100) for 10 min at 56°C before transferring the lysates into deep well plates. DNA was extracted using the chemagic 360 Nucleic Acid Extractor (PerkinElmer). Next, samples were transferred to intermediate tubes and centrifuged at 13,000 rpm for one minute, placed on a magnet, and transferred to final tubes.

To ensure that the quantity and quality of the extracted DNA were sufficient for sequencing, DNA concentration was measured using a Qubit fluorometer (Thermo Fisher) and purity assessed by measuring 260/280 nm and 260/230 absorbance ratios on a NanoDrop. DNA was fragmented using a Covaris E220 Focused-ultrasonicator, and size-selected to an average length of 350 bp. Fragmented DNA with 3′ and 5′ overhangs was purified and dual-size selected using Agencourt AMPure XP magnetic beads. T4 DNA Polymerase and Polynucleotide Kinase were used to repair the ends of the DNA fragments to which Illumina TruSeq UD Index adapters were subsequently ligated to allow for hybridization to the flow cells.

Libraries were sequenced on an Illumina NovaSeq 6000 platform following manufacturer's protocols. Signal intensities were converted to base calls using the platform's proprietary real time analysis (RTA) software. To monitor quality during sequencing, Illumina's Phi X library was spiked into each lane at a concentration of 1% as a positive control. Last, samples were de-multiplexed using CASAVA v.1.8.2.

Reference assembly

Generating de novo assemblies remains a time-consuming and expensive endeavor. At the same time, mapping reads from individuals of one species to the genome of another, distantly related species can pose several challenges (see discussion in Pfeifer 2017). To avoid biasing our analyses toward one of the two focal species and allow for fair genomic comparisons, a genome assembly was generated from a third species, Sceloporus cowlesi, which is equally closely related to both S. jarrovii and S. megalepidurus (Leaché et al. 2016). For this purpose, tissue from a single S. cowlesi individual collected at White Sands National Park (Otero County, NM) was used for high molecular weight (HMW) DNA extraction, 10X Genomics Chromium Genome library preparation, and sequencing on an Illumina HiSeq 4000. To create a draft genome assembly, raw sequence reads were processed for quality assurance using a custom in-house pipeline, proc10xG (https://github.com/ucdavis-bioinformatics/proc10xG; last accessed November/3rd 2021), together with the Kmer Analysis Toolkit (Mapleson et al. 2017). Specifically, 10X gem barcodes were checked for expected distribution and a genome k-mer analysis was performed to estimate genome size, repeat content, and other genomic features. Next, Supernova v.2.1.1 (Weisenfeld et al. 2017) was used to generate a de novo assembly, using ∼826 million single raw reads as input (default settings). As the assembly algorithm is designed to work specifically with data generated using the 10X Genomics Chromium system, no additional processing of sequencing reads was necessary. The resulting reference assembly contained 34,570 scaffolds with an overall length of 1.91 Gb (N50 = 62,759,035 bp as determined by QUAST v.5.0.2; Mikheenko et al. 2018).

Sequence alignment

To check initial data quality, raw sequence reads were visualized using FastQC v.0.11.7 (http://www.bioinformatics.babraham.ac.uk/projects/fastqc; last accessed 27 March 2021) and adapters and low-quality regions subsequently trimmed using Trim Galore! v.0.6.1 (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/; last accessed March/27th 2021). The pre-processed reads were aligned to the S. cowlesi reference assembly using BWA mem v.0.7.17 (Li and Durbin 2009) with default parameters. Despite the use of a reference assembly from a different Sceloporus species, 96.5% and 96.9% of reads could be mapped for S. jarrovii and S. megalepidurus, respectively. Aligned reads were validated, sorted, and indexed using SAMtools v.1.9 (Li et al. 2009) and duplicates marked using Picard v.2.18.3 (http://broadinstitute.github.io/picard/; last accessed March/27th 2021). Following mapping, the mean per-individual coverage for S. jarrovii (n = 8) and S. megalepidurus (n = 8) was 10.4X and 11.2X, respectively (Supplementary Table S1).

Variant calling, genotyping, and filtering

Variants were called using the Genome Analysis Toolkit (GATK) HaplotypeCaller v.3.7.0 (Poplin et al. 2017) and jointly genotyped using GenotypeGVCFs v.4.1.2.0. In the absence of a curated dataset required to train GATK's variant filtration machine learning algorithm, variants were hard-filtered following GATK's Best Practices (Van der Auwera et al. 2013; Van der Auwera and O'Connor 2020). Specifically, GATK’s SelectVariants and VariantFiltration v.4.1.2.0 were used to filter out variants with (1) poor alignment characteristics as indicated by low alignment qualities (i.e., QD < 2.0; MQ < 20.0; MQRankSum < –12.5), (2) evidence of strand bias as estimated by Fisher’s exact test (FS > 60.0) or the symmetric odds ratio test (SOR > 3.0), or (3) evidence of a positional bias in read position (ReadPosRankSum < –8.0). In addition, as false-positive variants frequently exhibit excessive read coverage and tend to occur more often in regions where a large number of other variants were called, variants with (5) extremely low (<4X) or high (>15X) read coverage (i.e., DP ≥ 32 && DP ≤ 120) or (5) an extensive clustering (three or more variants within a 10 bp window, i.e., cluster size = 3; cluster window size = 10) were removed. Last, the dataset was limited to biallelic single nucleotide polymorphisms (SNPs) genotyped in all samples of a species (i.e., AN = 16).

Due to the inherent difficulties of reliably calling variants in nonmodel organisms, several sanity checks were performed. First, assuming a constant genome-wide mutation rate, the number of variants on each scaffold should roughly correspond to its length. Although SNP densities agreed well across long scaffolds, a preliminary analysis highlighted a large variation in SNP density (ranging from 0 to 0.035) on the smallest scaffolds, likely due to misaligned reads and artifactual variant calls (Supplementary Figure S2). In order to limit the number of false positives in this study, analyses were thus restricted to scaffolds longer than 2 Mb (i.e., a total of 88 scaffolds). Importantly, as these 88 (out of the total 34,570) scaffolds comprise 1.63 out of the 1.91 Gb assembled genome, 94.9% and 95.2% of variants were retained for S. jarrovii and S. megalepidurus, respectively.

Low-complexity and repetitive regions often result in ambiguous read alignments that can lead to erroneous variant calls (see review by Pfeifer 2017). Consequently, five different classes of repeats—LINEs, LTRs, DNA transposons, simple repeats, and low complexity regions—were annotated using RepeatMasker v.4.1.0 (http://www.repeatmasker.org; last accessed March/27th 2021) and SNPs within these regions were removed from the dataset.

As collapsed copy number variants and other misassembled regions can lead to artifactual excessive heterozygosity in the genome, VCFtools v.0.1.16 (Danecek et al. 2011) was used to filtered out SNPs with a P-value < 0.01 for Hardy-Weinberg equilibrium.

As a final sanity check, both per-sample coverage and number of variants were compared across scaffolds. As shown in Supplementary Figure S3, with the exception of three scaffolds (90, 90602, and 90921), the per-sample coverage was highly consistent. In addition, the number of variants on each scaffold was highly consistent across samples (Supplementary Figure S4).

The final dataset for S. jarrovii contained 5,927,176 segregating sites (with a transition-transversion ratio, TsTv, of 2.05) and 217,678 fixed differences to the S. cowlesi reference assembly in the accessible part of the genome (959,437,632 bp). The final dataset for S. megalepidurus contained 8,742,115 segregating sites (Ts/Tv = 1.96) and 211,825 fixed differences to the S. cowlesi reference assembly in the accessible part of the genome (980,116,223 bp).

Kinship and population structure

Genetic relatedness among individuals was inferred using the software KING (Manichaikul et al. 2010) as implemented in plink2 (Chang et al. 2015) (Supplementary Figure S5). Genetic differentiation among individuals was explored using a principal component analysis (PCA; Supplementary Figure S6) and individual ancestries were assessed using the software ADMIXTURE (Alexander et al. 2009). As SNPs in linkage disequilibrium (LD) can distort signals of population structure (Liu et al. 2020), SNPs were pruned for linkage using plink2 (Chang et al. 2015). Specifically, the plink2 command “–indep-pairwise 50 5 0.2” was run, for each 50-SNP window, to exclude one of a pair of SNPs if their pairwise association r2 > 0.2 (sliding window size: 5 SNPs). After filtering, 277,341 and 337,214 SNPs remained in the LD-pruned S. jarrovii and S. megalepidurus datasets, respectively. Next, the R package SNPRelate v.1.20.1 (Zheng et al. 2012) was used to perform a PCA (Supplementary Figure S6). In addition, ADMIXTURE v.1.3.0 (Alexander et al. 2009) was run to infer admixture proportions for 1–4 ancestral source populations (K). The best model was chosen to minimize the cross-validation error rates (Supplementary Figure S7). Finally, population genetic summary statistics (nucleotide diversity π and Tajima's D) were calculated for each species using VCFtools v.0.1.16 (Danecek et al. 2011) and pixy v.1.2.5.beta1 (Korunes and Samuk 2021) on the full dataset.

Phasing

Following Auton et al. (2012), genotypes were phased using PHASE v.2.1.1 (Stephens et al. 2001b; Stephens and Donnelly 2003) to reconstruct haplotypes. Specifically, each scaffold was partitioned into 400 SNP regions with a 100-SNP overlap between regions, and regional files in the variant calling format (.vcf) were converted into PHASE input using a modified version of vcf2PHASE.pl (vcf-conversion-tools 1.0; Zenodo: http://doi.org/10.5281/zenodo.10288; last accessed March/27th 2021). Haplotypes were reconstructed using the “recombination model” in PHASE with the following options: “200 1 300 ‐MR ‐F.05 ‐l10 ‐x5 –X5”. Overlapping phased regions were joined back together using a modified version of join_phase_blocks.pl (great-ape-recombination 1.0; Zenodo: http://dx.doi.org/10.5281/zenodo.13975; last accessed March/27th 2021) and converted into linkage format using plink v.1.90b3.45 (Purcell et al. 2007). Excluding sites that fixed during phasing, the final phased datasets contained 5,924,882 and 8,685,799 variants with a transition-transversion ratio of 2.05 and 1.96 for S. jarrovii and S. megalepidurus, respectively.

Recombination rate estimation

Population recombination rates (ρ = 4 Ne r, where Ne is the effective population size and r is the recombination rate per site per generation) were inferred using LDhat v.2.2 (McVean et al. 2002, 2004; Auton and McVean 2007)—a method that has been widely applied in the field, including in the only other study of recombination landscapes in lizards (Bourgeois et al. 2019), and that is suitable for small sample sizes (Auton et al. 2012). As the computation of two-locus coalescent likelihoods is computationally expensive, a likelihood lookup table was calculated to speed up analyses. To this end, LDhat “convert” was used to infer Watterson's infinite-sites estimator of the population-scaled mutation rate (Θ). An approximation of Watterson's Θ of 10−4 was then used to generate a likelihood lookup table using LDhat “complete” with a 101-point grid resolution. This lookup table was used to estimate recombination rates in the species following Auton et al. (2012). Specifically, each phased scaffold was partitioned into 4000 SNP regions with a 200-SNP overlap between regions. Next, LDhat “interval” was run for 60 million iterations with a block penalty of 5 and samples were taken every 40,000 iterations. After using LDhat “stat” to discard the first 20 million iterations as burn-in, recombination rate estimates were joined at the mid-points of the 200-SNP overlapping regions. Using these estimates, correlations with nucleotide diversity (π) and GC-content were calculated on the 1 Mb-, 500 kb-, and 100 kb-scale.

Results and discussion

The genomes of 16 wild-caught spiny lizards—eight S.jarrovii and eight S. megalepidurus (four males and four females per species; Supplementary Figure S1)—were sequenced to an average coverage of 10X per individual (Supplementary Table S1). Quality-controlled reads were mapped to a draft S. cowlesi reference genome (34,570 scaffolds, N50 = 62,759,035 bp) and SNPs called following the GATK Best Practices (Van der Auwera et al. 2013; Van der Auwera and O'Connor 2020). Analyzing the patterns of variation across samples and scaffolds suggested that SNPs residing on scaffolds smaller than 2 Mb were likely false positives due to misaligned reads and spurious variant calls (Supplementary Figure S2), thus they were discarded in subsequent analyses. Stringent filter criteria were employed to produce high-quality datasets containing 5.9 and 8.7 million biallelic SNPs on the remaining genomic scaffolds (i.e., 88 scaffolds comprising 1.63 out of the 1.91 Gb assembled genome) for S. jarrovii and S. megalepidurus, corresponding to a SNP density of 3.6 and 5.3/kb, respectively (see Materials and Methods for details). Per-sample coverages were relatively evenly distributed across these 88 scaffolds, suggesting that there were no significant issues caused by either the sequencing strategy (e.g., biases introduced by PCR enrichment) or genome assembly (e.g., biases due to extreme base composition) (Supplementary Figure S3). The number of identified variants was consistent across regions—a further indicator that there were no systematic sequencing or mapping errors (Supplementary Figure S4). Moreover, transition-transversion ratios of 2.05 and 1.96 for S. jarrovii and S. megalepidurus agree well with the genome-wide average of ∼2.0 seen in many organisms (e.g., Stephens et al. 2001a). Both species exhibit similar nucleotide diversity levels (S. jarrovii: 0.17%; S. megalepidurus: 0.32%)—close to the levels of diversity previously reported for different populations of S. cowlesi (0.25%–0.27%)—and Tajima's D-values ranging from −0.51 in S. jarrovii to 0.46 in S. megalepidurus, indicating a relatively unskewed site frequency spectrum (S. cowlesi: −0.17 to 0.25; Laurent et al. 2016).

Genome-wide population-scaled recombination rates for S. jarrovii and S. megalepidurus were inferred from patterns of LD using the LDhat methodology (McVean et al. 2002, 2004; Auton and McVean 2007), which relies on polymorphism data from unrelated individuals. Analyzing patterns of genetic relatedness and differentiation among individuals confirmed that all individuals included in this study were genetically unrelated (Supplementary Figure S5). Although no family relationships were detected among the sampled individuals, negative estimates of pairwise kinship coefficients indicated a putative structuring of the populations. To better understand any population structure potentially present in the samples, genetic differentiation among individuals was explored using a principal component analysis (Supplementary Figure S6) and individual ancestries were assessed using the software ADMIXTURE (Alexander et al. 2009). As expected from the sampling design, a single ancestral source population (K = 1) provided the best fit to the data whereas models with more than one source population (K > 1) led to overfitting (Supplementary Figure S7).

Although population-scaled recombination rates are generally higher in S. jarrovii than in S. megalepidurus (Figure 1A), recombination landscapes are largely conserved on the broad-scale, with a positive correlation of 0.74, 0.77, and 0.81 on the 1-, 2-, and 5 Mb-scale, respectively (Figure 2). On the fine-scale, considerable variation exists between the two species as well as along their genomes (Figure 1B, Supplementary Figures S8 and S9), with the strength of correlation decreasing with successively smaller scales from 0.57 at 100 kb to 0.19 at 1 kb (Figure 2). However, it is important to note that the variance will be larger at the fine-scale which may (at least in part) drive this observation. Recombination rates in S. jarrovii and S. megalepidurus are positively correlated with genome-wide nucleotide diversity (Figure 3A) and GC-content (Figure 3B), with observed differences in the shape of the relationship between the two species being likely driven by differences in the underlying effective population sizes. These patterns are in concordance with previous work in many vertebrates—including other squamates (Bourgeois et al. 2019; Schield et al. 2020)—and are likely caused by the pervasive effects of selection at linked sites (see reviews by Cutter and Payseur 2013; Charlesworth and Jensen 2021) and biased gene conversion (Pessia et al. 2012), respectively.

Recombination landscapes in S. jarrovii and S. megalepidurus. (A) Broad- and fine-scale recombination rates along the longest scaffold (scaffold 19) in S. jarrovii (shown in purple) and S. megalepidurus (shown in orange). Broad-scale rates were averaged over 1 Mb-regions. (B) Variation in the fine-scale recombination landscape within and between scaffolds in S. jarrovii (purple) and S. megalepidurus (orange). Only the 10 longest scaffolds are shown here; the 88 scaffolds used in this study are displayed in Supplementary Figures S8 and S9. Picture credits: squamatologist (S. jarrovii; distributed under a CC BY-NC-ND 2.0 license) and camamed (S. megalepidurus; distributed under a CC BY-NC 4.0 license).
Figure 1

Recombination landscapes in S. jarrovii and S. megalepidurus. (A) Broad- and fine-scale recombination rates along the longest scaffold (scaffold 19) in S. jarrovii (shown in purple) and S. megalepidurus (shown in orange). Broad-scale rates were averaged over 1 Mb-regions. (B) Variation in the fine-scale recombination landscape within and between scaffolds in S. jarrovii (purple) and S. megalepidurus (orange). Only the 10 longest scaffolds are shown here; the 88 scaffolds used in this study are displayed in Supplementary Figures S8 and S9. Picture credits: squamatologist (S. jarrovii; distributed under a CC BY-NC-ND 2.0 license) and camamed (S. megalepidurus; distributed under a CC BY-NC 4.0 license).

Correlation between the recombination maps of S. jarrovii and S. megalepidurus. Pearson correlation between the recombination maps of S. jarrovii and S. megalepidurus at different scales; the inlay shows the correlation at the broad (1 Mb) scale.
Figure 2

Correlation between the recombination maps of S. jarrovii and S. megalepidurus. Pearson correlation between the recombination maps of S. jarrovii and S. megalepidurus at different scales; the inlay shows the correlation at the broad (1 Mb) scale.

Relationship between genome-wide recombination rate with nucleotide diversity and GC-content. Relationship between genome-wide recombination rate with nucleotide diversity (left) and GC-content (right) in S. jarrovii (purple) and S. megalepidurus (orange) at the 1 Mb- (top), 500 kb- (middle), and 100 kb- (bottom) scale.
Figure 3

Relationship between genome-wide recombination rate with nucleotide diversity and GC-content. Relationship between genome-wide recombination rate with nucleotide diversity (left) and GC-content (right) in S. jarrovii (purple) and S. megalepidurus (orange) at the 1 Mb- (top), 500 kb- (middle), and 100 kb- (bottom) scale.

Last, it is important to note the limitations of estimating recombination rates from population-level sequencing data. First, population genetic approaches estimate historical recombination rates averaged over many generations (and hence, individuals and sexes). Second, many methods (including LDhat) assume that the population is at neutral equilibrium—an assumption that is frequently violated in nature which can lead to mis-inference (Dapper and Payseur 2018). Although methods exist that can take population demographic history into account when estimating recombination rates (e.g., pyrho; Spence and Song 2019), our analyses are limited by the scarce data available for lizards in the genus Sceloporus. Namely, it is challenging to infer the demographic history of the two species in the first place without any prior knowledge of, for example, mutation rates, effective population sizes, or even which genomic regions to use for such inference as there are no genome annotations available that could be leveraged to select regions unaffected (or at least less affected) by selection [see Johri et al. (BioRxiv) for a discussion regarding statistical inference in population genomics]. This highlights the importance of developing further genomic resources for these important model organisms to improve our understanding of recombination rate evolution in squamates.

Conclusion

As the field begins to gain a broader phylogenetic view of recombination rate variation and evolution, multiple hypotheses related to both the determinants and consequences of rate variation are anticipated to be better resolved. We here add two closely-related species of spiny lizards to this view. Despite similarities in karyotype, differences in recombination rate were observed at both the fine- and (to a lesser extent) broad-scale, highlighting the importance of including species-specific recombination maps in future population genomic analyses and genome-wide scans for targets of selection in the species. Moreover, our results suggest that major variation in the recombination landscapes of Sceloporus species with different chromosome counts remains to be discovered.

Data availability

Supplementary Table S1 and Supplementary Figure S1 provide information on the population sampling. Supplementary Figures S2–S4 display the population SNP density, per-sample coverage, and number of variants per sample across scaffolds, respectively. Supplementary Figure S5 shows the relationships among individuals. Supplementary Figure S6 displays the genetic differentiation among individuals. Supplementary Figure S7 shows the cross-validation error in the ADMIXTURE models. Supplementary Figures S8 and S9 illustrate the variation in fine-scale recombination landscape within and between scaffolds in S.jarrovii and S. megalepidurus, respectively. The sequencing data from this study is available on NCBI's Sequence Read Archive under the BioProject designation PRJNA726723. The fine-scale recombination map is available at http://spfeiferlab.org/data.

Supplementary material is available at G3 online.

Acknowledgments

We are grateful to Adam Auton for sharing his code from previous publications as well as Jeffrey Jensen and Matthew Settles for helpful comments and discussion. DNA extraction, library preparation, and sequencing was performed at the UC Davis Genome Center and the Yale Center for Genomic Analysis. Computations were performed on Arizona State University's High Performance Cluster and the Open Science Grid, which is supported by the National Science Foundation award #2030508.

Funding

This work was supported by a National Science Foundation CAREER grant to SPP (DEB-2045343), and a National Science Foundation grant to EBR (DEB-1754125).

Conflicts of interest

The authors declare that there is no conflict of interest.

Literature cited

Alexander
DH
,
Novembre
J
,
Lange
K.
2009
.
Fast model-based estimation of ancestry in unrelated individuals
.
Genome Res
.
19
:
1655
1664
.

Auton
A
,
Fledel-Alon
A
,
Pfeifer
S
,
Venn
O
,
Ségurel
L
, et al.
2012
.
A fine-scale chimpanzee genetic map from population sequencing
.
Science
.
336
:
193
198
.

Auton
A
,
McVean
G.
2007
.
Recombination rate estimation in the presence of hotspots
.
Genome Res
.
17
:
1219
1227
.

Bedoya
AM
,
Leaché
AD.
2021
.
Characterization of a pericentric inversion in plateau fence lizards (Sceloporus tristichus): evidence from chromosome-scale genomes
.
G3 (Bethesda)
.
11
:
jkab036
.

Begun
DJ
,
Aquadro
CF.
1992
.
Levels of naturally occurring DNA polymorphism correlate with recombination rates in D. melanogaster
.
Nature
.
356
:
519
520
.

Booker
T
,
Yeaman
S
,
Whitlock
MC.
2020
.
Variation in recombination rate affects detection of outliers in genome scans under neutrality
.
Mol Ecol
.
29
:
4274
4279
.

Bourgeois
Y
,
Ruggiero
RP
,
Manthey
JD
,
Boissinot
S.
2019
.
Recent secondary contacts, linked selection, and variable recombination rates shape genomic diversity in the model species Anolis carolinensis
.
Genome Biol Evol
.
11
:
2009
2022
.

Chang
CC
,
Chow
CC
,
Tellier
LCAM
,
Vattikuti
S
,
Purcell
SM
, et al.
2015
.
Second-generation PLINK: rising to the challenge of larger and richer datasets
.
GigaScience
.
4
:
1
16
.

Charlesworth
B.
1976
.
Recombination modification in a fluctuating environment
.
Genetics
.
83
:
181
195
.

Charlesworth
B
,
Charlesworth
D.
1985
.
Genetic variation in recombination in Drosophila. I. Responses to selection and preliminary genetic analysis
.
Heredity
.
54
:
71
83
.

Charlesworth
B
,
Jensen
JD.
2021
.
The Effects of Selection at Linked Sites on Patterns of Genetic Variability
.
Annual Review of Ecology, Evolution, and Systematics 52
:
177
197
.

Charlesworth
B
,
Morgan
MT
,
Charlesworth
D.
1993
.
The effect of deleterious mutations on neutral molecular variation
.
Genetics
.
134
:
1289
1303
.

Cutter
AD
,
Payseur
BA.
2013
.
Genomic signatures of selection at linked sites: unifying the disparity among species
.
Nat Rev Genet
.
14
:
262
274
.

Danecek
P
,
Auton
A
,
Abecasis
G
,
Albers
CA
,
Banks
E
, et al. ;
1000 Genomes Project Analysis Group
.
2011
.
The variant call format and VCFtools
.
Bioinformatics
.
27
:
2156
2158
.

Dapper
AL
,
Payseur
BA.
2018
.
Effects of demographic history on the detection of recombination hotspots from linkage disequilibrium
.
Mol Biol Evol
.
35
:
335
353
.

Dumas
D
,
Britton-Davidian
J.
2002
.
Chromosomal rearrangements and evolution of recombination: comparison of Chiasma distribution patterns in standard and Robertsonian populations of the house mouse
.
Genetics
.
162
:
1355
1366
.

Felsenstein
J.
1974
.
The evolutionary advantage of recombination
.
Genetics
.
78
:
737
756
.

Felsenstein
J
,
Yokoyama
S.
1976
.
The evolutionary advantage of recombination. II. Individual selection for recombination
.
Genetics
.
83
:
845
859
.

Hall
WP.
2009
.
Chromosome variation, genomics, speciation and evolution in Sceloporus lizards
.
Cytogenet Genome Res
.
127
:
143
165
.

Hews
DK
,
Martins
EP.
2013
. Visual and chemical signals of social communication: providing the link to habitat and environment. In:
Lutterschmidt
W
, editor.
Reptiles in Research: Investigations of Ecology, Physiology and Behavior from Desert to Sea
.
Hauppauge NY
:
Nova Publishers. pp. 111–141
.

Hill
WG
,
Robertson
A.
1966
.
The effect of linkage on limits to artificial selection
.
Genet Res
.
8
:
269
294
.

Johri
P
,
Aquadro
CF
,
Beaumont
M
,
Charlesworth
B
,
Excoffier
L
, et al. Statistical inference in population genomics. BioRxiv. doi:. https://www.biorxiv.org/content/10.1101/2021.10.27.466171v1.

Korunes
KL
,
Samuk
K.
2021
.
pixy: unbiased estimation of nucleotide diversity and divergence in the presence of missing data
.
Mol Ecol Resour
.
21
:
1359
1368
.

Lambert
SM
,
Wiens
JJ.
2013
.
Evolution of viviparity: a phylogenetic test of the cold-climate hypothesis in Phrynosomatid lizards
.
Evolution
.
67
:
2614
2630
.

Laurent
S
,
Pfeifer
SP
,
Settles
ML
,
Hunter
SS
,
Hardwick
KM
, et al.
2016
.
The population genomics of rapid adaptation: disentangling signatures of selection and demography in white sands lizards
.
Mol Ecol
.
25
:
306
323
.

Lawing
AM
,
Polly
PD
,
Hews
DK
,
Martins
EP.
2016
.
Including fossils in phylogenetic climate reconstructions: a deep time perspective on the climatic niche evolution and diversification of spiny lizards (Sceloporus)
.
Am Nat
.
188
:
133
148
.

Leaché
AD
,
Banbury
BL
,
Linkem
CW
,
Nieto Montes de Oca
A.
2016
.
Phylogenomics of a rapid radiation: is chromosomal evolution linked to increased diversification in North American spiny lizards (genus Sceloporus)?
BMC Evol Biol
.
16
:
63
.

Li
H
,
Durbin
R.
2009
.
Fast and accurate short read alignment with Burrows-Wheeler transform
.
Bioinformatics
.
25
:
1754
1760
.

Li
H
,
Handsaker
B
,
Wysoker
A
,
Fennell
T
,
Ruan
J
, et al. ;
1000 Genome Project Data Processing Subgroup
.
2009
.
The sequence alignment/map format and SAMtools
.
Bioinformatics
.
25
:
2078
2079
.

Liu
CC
,
Shringarpure
S
,
Lange
K
,
Novembre
J.
2020
.
Exploring population structure with admixture models and principal component analysis
.
Methods Mol Biol
.
2090
:
67
86
.

Manichaikul
A
,
Mychaleckyj
JC
,
Rich
SS
,
Daly
K
,
Sale
M
, et al.
2010
.
Robust relationship inference in genome-wide association studies
.
Bioinformatics
.
26
:
2867
2873
.

Mapleson
D
,
Accinelli
GG
,
Kettleborough
G
,
Wright
J
,
Clavijo
BJ.
2017
.
KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies
.
Bioinformatics
.
33
:
574
576
.

Maynard Smith
J
,
Haigh
J.
1974
.
The hitch-hiking effect of a favourable gene
.
Genet Res
.
23
:
23
35
.

McVean
G
,
Awadalla
P
,
Fearnhead
P.
2002
.
A coalescent-based method for detecting and estimating recombination from gene sequences
.
Genetics
.
160
:
1231
1241
.

McVean
GA
,
Myers
SR
,
Hunt
S
,
Deloukas
P
,
Bentley
DR
, et al.
2004
.
The fine-scale structure of recombination rate variation in the human genome
.
Science
.
304
:
581
584
.

Mikheenko
A
,
Prjibelski
A
,
Saveliev
V
,
Antipov
D
,
Gurevich
A.
2018
.
Versatile genome assembly evaluation with QUAST-LG
.
Bioinformatics
.
34
:
i142
i150
.

Nachman
MW
,
Payseur
BA.
2012
.
Recombination rate variation and speciation: theoretical predictions and empirical results from rabbits and mice
.
Philos Trans R Soc Lond B Biol Sci
.
367
:
409
421
.

Otto
SP
,
Barton
NH.
2001
.
Selection for recombination in small populations
.
Evolution
.
55
:
1921
1931
.

Payseur
BA.
2016
.
Genetic links between recombination and speciation
.
PLoS Genet
.
12
:
e1006066
.

Pessia
E
,
Popa
A
,
Mousset
S
,
Rezvoy
C
,
Duret
L
, et al.
2012
.
Evidence for widespread GC-biased gene conversion in eukaryotes
.
Genome Biol Evol
.
4
:
675
682
.

Pfeifer
SP.
2017
.
From next-generation resequencing reads to a high-quality variant data set
.
Heredity (Edinb)
.
118
:
111
124
.

Poplin
R
,
Ruano-Rubio
V
,
DePristo
MA
,
Fennell
TJ
,
Carneiro
MO
, et al.
2017
. Scaling accurate genetic variant discovery to tens of thousands of samples. BioRxiv, doi:.

Purcell
S
,
Neale
B
,
Todd-Brown
K
,
Thomas
L
,
Ferreira
MA
, et al.
2007
.
PLINK: a tool set for whole-genome association and population-based linkage analyses
.
Am J Hum Genet
.
81
:
559
575
.

Ritz
KR
,
Noor
MAF
,
Singh
ND.
2017
.
Variation in recombination rate: adaptive or not?
Trends Genet
.
33
:
364
374
.

Rivera
JA
,
Lawing
AM
,
Martins
EP.
2020
.
Reconstructing historical shifts in suitable habitat of Sceloporus lineages using phylogenetic niche modeling
.
J Biogeogr
.
47
:
2117
2128
.

Rivera
JA
,
Rich
HN
,
Michelle Lawing
A
,
Rosenberg
MS
,
Martins
EP.
2021
.
Occurrence data uncover patterns of allopatric divergence and interspecies interactions in the evolutionary history of Sceloporus lizards
.
Ecol Evol
.
11
:
2796
2813
.

Schield
DR
,
Pasquesi
GIM
,
Perry
BW
,
Adams
RH
,
Nikolakis
ZL
, et al.
2020
.
Snake recombination landscapes are concentrated in functional regions despite PRDM9
.
Mol Biol Evol
.
37
:
1272
1294
.

Sites
JW
,
Archie
JW
,
Cole
CJ
,
Flores-Villela
O.
1992
.
A review of phylogenetic hypotheses for lizards of the genus Sceloporus (Phrynosomatidae): implications for ecological and evolutionary studies
.
Bull Am Mus Nat Hist
.
213
:
95
110
.

Spence
JP
,
Song
YS.
2019
.
Inference and analysis of population-specific fine-scale recombination maps across 26 diverse human populations
.
Sci Adv
.
5
:
eaaw9206
.

Stapley
J
,
Feulner
PGD
,
Johnston
SE
,
Santure
AW
,
Smadja
CM.
2017
.
Variation in recombination frequency and distribution across eukaryotes: patterns and processes
.
Phil Trans R Soc B
.
372
:
20160455
.

Stephens
JC
,
Schneider
JA
,
Tanguay
DA
,
Choi
J
,
Acharya
T
, et al.
2001a
.
Haplotype variation and linkage disequilibrium in 313 human genes
.
Science
.
293
:
489
493
.

Stephens
M
,
Donnelly
P.
2003
.
A comparison of Bayesian methods for haplotype reconstruction from population genotype data
.
Am J Hum Genet
.
73
:
1162
1169
.

Stephens
M
,
Smith
NJ
,
Donnelly
P.
2001b
.
A new statistical method for haplotype reconstruction from population data
.
Am J Hum Genet
.
68
:
978
989
.

Stevison
LS
,
Sefick
S
,
Rushton
C
,
Graze
RM.
2017
.
Recombination rate plasticity: revealing mechanisms by design
.
Phil Trans R Soc B
.
372
:
20160459
.

Van der Auwera
GA
,
Carneiro
MO
,
Hartl
C
,
Poplin
R
,
Del Angel
G
, et al.
2013
.
From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline
.
Curr Protoc Bioinformatics
.
43
:
11.10.1
11.10.33
.

Van der Auwera
GA
,
O'Connor
BD.
2020
.
Genomics in the Cloud: Using Docker, GATK, and WDL in Terra
.
O'Reilly Media Incorporated
.

Weisenfeld
NI
,
Kumar
V
,
Shah
P
,
Church
DM
,
Jaffe
DB.
2017
.
Direct determination of diploid genome sequences
.
Genome Res
.
27
:
757
767
.

Wiens
JJ
,
Kuczynski
CA
,
Arif
S
,
Reeder
TW.
2010
.
Phylogenetic relationships of phrynosomatid lizards based on nuclear and mitochondrial data, and a revised phylogeny for Sceloporus
.
Mol Phylogenet Evol
.
54
:
150
161
.

Zheng
X
,
Levine
D
,
Shen
J
,
Gogarten
SM
,
Laurie
C
, et al.
2012
.
A high-performance computing toolset for relatedness and principal component analysis of SNP data
.
Bioinformatics
.
28
:
3326
3328
.

Author notes

Cyril J. Versoza, Julio A. Rivera contributed equally to this work.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Editor: J Ross-Ibarra
J Ross-Ibarra
Editor
Search for other works by this author on:

Supplementary data