Abstract

Transposable elements (TEs) can make up more than 50% of any given vertebrate's genome, with substantial variability in TE composition among lineages. TE variation is often linked to changes in gene regulation, genome size, and speciation. However, the role that genome duplication events have played in generating abrupt shifts in the composition of the mobilome over macroevolutionary timescales remains unclear. We investigated the degree to which the teleost genome duplication (TGD) shaped the diversification trajectory of the teleost mobilome. We integrate a new high coverage genome of Polypterus bichir with data from over 100 publicly available actinopterygian genomes to assess the macroevolutionary implications of genome duplication events on TE evolution in teleosts. Our results provide no evidence for a substantial shift in mobilome composition following the TGD event. Instead, the diversity of the teleost mobilome appears to have been shaped by a history of lineage-specific shifts in composition that are not correlated with commonly evoked drivers of diversification such as body size, water column usage, or latitude. Collectively, these results provide additional evidence for an emerging perspective that TGD did not catalyze bursts of diversification and innovation in the actinopterygian mobilome.

Significance

We investigate the role of the teleost genome duplication on transposable element (TE) diversification in ray-finned fishes by integrating an analysis of the mobilome from a newly sequenced genome from Polypterus bichir with analyses from over 100 ray-finned fish genomes. We reveal that teleost fish TE diversity depicts a signature of lineage-specific shifts rather than a major burst of novelty near the origin of teleosts, suggesting a complex, nuanced history of TE evolution. Our results are in line with emerging hypotheses concerning the impact of ploidy events on long-term TE evolution, providing critical context for future research into the genomic and evolutionary mechanisms influencing TE diversity across half of all living vertebrates.

Introduction

Transposable elements (TEs) can account for over 50% of a vertebrate's total genome content (Sotero-Caio et al. 2017). Aptly dubbed “jumping genes,” TEs possess the astonishing ability to rearrange and reposition themselves within a given genome (McClintock 1984), through the use of two primary strategies for transposing. Class I retrotransposons replicate through a “copy and paste mechanism” involving an RNA intermediate. This RNA molecule is reverse transcribed back into a DNA copy, which is then seamlessly integrated into the genome, leaving the original template element untouched (Hickman and Dyda 2015). In contrast, class II DNA transposons use a “cut-and-paste” technique to excise themselves from their current location and relocate to an alternate genomic location (Muñoz-López and García-Pérez 2010). Comparative studies of vertebrate TEs have revealed substantial heterogeneity in the composition of class I and class II TEs between major clades of vertebrates (Chalopin et al. 2015). These studies have implicated changes in TE composition with altered gene regulation (Torres et al. 2021; Rech et al. 2022), evolutionary changes in genome size (Böhne et al. 2008; Naville et al. 2019; Wong et al. 2019), evolutionary novelties (Senft and Macfarlan 2021), changes in life history (Niu et al. 2019), and speciation (Ricci et al. 2018; Serrato-Capuchina and Matute 2018) to name but a few. While the past decade has yielded tremendous strides towards developing an understanding of the general hallmarks of TE evolution, the evolutionary fate of TEs following whole-genome duplication events is just beginning to emerge (Parisod et al. 2010; Rodriguez and Arkhipova 2018; Papon et al. 2023).

Genome duplication events have the potential to amplify genome complexity, with emerging evidence highlighting a possible role for duplication events enabling phenotypic evolution through duplicated genes (Moriyama and Koshiba-Takeuchi 2018). Studies within individual or closely related species that vary in their level of ploidy have revealed substantial modifications that can facilitate profound epigenetic repatterning within the domains of TEs. However, the impact of such changes on the mode of TE diversification remains unclear. For example, surplus gene copies can compensate for potential losses or modifications in expression caused by TE insertions, thereby facilitating extensive genomic modification through the actions of TEs. This redundancy hypothesis (Matzke and Matzke 1998) suggests a substantial shift in TE content and composition following a genome duplication event that continues as lineages diversify (Parisod et al. 2010). An alternate hypothesis argues that genome duplication events correspond to a transitory phase for a species that is characterized by diminished population size (Lynch 2007). This bottleneck hypothesis argues that as the efficacy of selection decreases, the likelihood of moderately deleterious TE insertions within nascent polyploid genomes becoming fixed is increased. Consequently, this hypothesis would expect a pulse of TE diversification coincident with a genome duplication event (Parisod et al. 2010). Recently, a hypothesis framed around the concept of “Hopeful Monsters” suggested that the balance between the deleterious and beneficial aspects of TE proliferation can lead to occasional beneficial mutations that can promote speciation and contribute to the emergence of new traits (Turner et al. 2021). In contrast to the previous two hypotheses, this hypothesis predicts that genome duplications would result in only a limited number of TE changes, not a substantial pulse of diversification. Given that these hypotheses represent corner cases on a continuum of possibilities and alternate hypotheses, empirical comparative genomic studies at different evolutionary timescales are key to understanding the implications of genome duplication events on the evolution of TEs.

Teleost fish offer an exemplar group for the study of TEs and an ancient genome duplication event. Over 99% of the over 35,000 living species of actinopterygians are teleosts, a clade that experienced a genome duplication event in its early evolutionary history (Brunet et al. 2006; Glasauer and Neuhauss 2014). Comprising half of all living vertebrate species, teleosts have successfully radiated across virtually all aquatic habitats, including swamps (Albert et al. 2020), abyssal ocean depths (Davis et al. 2014; Miller et al. 2022), polar and high altitude regions (Near et al. 2012; Dornburg et al. 2017; De-Kayne et al. 2022; Parker et al. 2022), and caves (Wen et al. 2022). In contrast, the three remaining extant ray-finned fish lineages—Holostei (gars and bowfin, 8 species), Acipenseriformes (sturgeon and paddlefish, 29 species), and Polypteridae (bichirs and ropefishes, 14 species)—did not undergo the teleost specific genome duplication event. Recent studies of holostean genomes have suggested alternative views on the evolutionary diversification of the teleost versus nonteleost actinopterygian mobilome. On the one hand, differences in the abundance of TE types between holosteans and model teleosts such as Zebrafish (Danio rerio) and Medaka (Oryzias latipes) (Braasch et al. 2016; Thompson et al. 2021; Mallik et al. 2023) raise the possibility of an evolutionary shift following a genome duplication event in teleosts. On the other hand, the diversity of TE superfamilies is similar between holosteans and teleosts, suggesting the TGD had a limited impact on the generation of novelty in the teleost mobilome (Chalopin and Volff 2017). Further comparative studies of the ray-finned fish mobilome that include a broad representative sampling of teleosts, holosteans, acipenseriforms, and polypterids are necessary to discern the impact of the TGD on the teleost mobilome. With the growing number of genomes of ray-finned fishes deposited in public data repositories, such studies are now possible.

Over 20 years ago, the Takifugu rubripes genome represented the first ray-finned fish genome sequenced (Aparicio et al. 2002). Decreased costs of sequencing have since led to a surge of efforts to sequence additional actinopterygian genomes (Fan et al. 2020; Randhawa and Pawar 2021) that now present a wealth of resources for phylogenetically comprehensive comparative studies. In addition to the availability of hundreds of teleost genomes, there are genomes of a few for nonteleost ray-finned fishes. For example, the Sterlet (Acipenser ruthenus) genome revealed an independent historic genome duplication in this chondrostein lineage that provides an additional ancient duplication for understanding evolutionary patterns of TE diversification in actinopterygians (Cheng et al. 2019). Similarly, the sequencing of the Senegal Bichir (Polypterus senegalus) leveraged the anatomy of polypterids to provide critical insights into how vertebrates achieved the transition from water to land (Bi et al. 2021). In addition, the recently sequenced genome of a second polypterid, Erpetoichthys calabaricus (Reedfish; Assembly [ErpCal1.1; NCBI Annotation Release 100]), has been deployed alongside the P. senegalus genome to provide further insights into other aspects of early vertebrate diversification that include the evolution of keratins (Kimura and Nikaido 2021), olfactory receptors (Zhang et al. 2021), and numerous other traits (Helfenrath et al. 2021; Mikami et al. 2022). However, based on BUSCO scores, the currently sequenced genome of P. senegalus, remains incomplete. As the only Polypterus genome sequenced, the lack of an additional Polypterus genome challenges interpretation of results concerning the distribution of TEs in this lineage. Given the phylogenetic position of polypterids, additional genome sequencing efforts are of extreme value for contextualizing the diversification of ray-finned fish TEs.

Here we present a high-quality chromosome-level assembly and annotation for an additional Polypterus species, Polypterus bichir. We integrate analyses of TEs within this genome with data on TE content for all other major lineages of actinopterygians to investigate the impact of genome duplication on the early evolution of the teleost mobilome. Using a comparative phylogenetic framework, we test for associations between genome size and the composition of the mobilome, variations in mobilome composition between teleosts and nonteleost ray-finned fish lineages, as well as possible correlations between TE content and aspects of actinopterygian biodiversity such as habitat, body size, or water column occupation. We additionally reconstruct the ancestral mobilome of actinopterygians through the TGD event and assess the phylogenetic signal of class I and class II TEs. These results support an emerging perspective on the early diversification of the actinopterygian mobilome and the impact of the TGD on its evolution.

Results and Discussion

Bichir Genomes: an Example of Evolutionary Conservation or Recent Divergence?

We present a high-quality assembly of the P. bichir genome (NCBI Bioproject PRJNA811142). The results of a BLAST search of mitochondrial COI from our specimen against polypterid barcode sequences on NCBI verified the identification as Polypterus bichir lapradei, a currently not recognized subspecies based on morphology that has been suggested to represent a genetically distinct lineage (Near et al. 2014). The 10X supernova assembly from Dovetail Genomics (Scotts Valley, CA) resulted in 130,773 scaffolds forming a total final genome size of 3,962,089,718 bp. Here 13.9% of the genome (550,533,170 bp) is composed of the ambiguous base “N” and a GC content of 39.21%. During Dovetail Hi-Rise assembly, the input assembly was further incorporated into 70,587 longer scaffolds. The total length of the resulting Dovetail Hi-Rise assembly was 3,905.43 Mbp, with a contig N50 of 37.39 kbp. The N50 of the assembly was 202.693 Mbp scaffolds with a L50 of seven scaffolds. This is similar to the E. calabaricus genome which is 3.6 Gb long, with a contig N50 size of 6.8 Mb, and a scaffold N50 size of 217.7 Mbp, and the P. senegalus reference genome, which is 3.7 Gbp, with a scaffold N50 of 189.69 Mbp, and contig N50 of 4,528.14 kbp (supplementary table S1, Supplementary Material online). However, a comparison of the BUSCO analysis on the P. senegalus reference genome to the newly sequenced P. bichir genome reveal a striking difference between the two assemblies (supplementary table S2 and fig. S1, Supplementary Material online). We find nearly triple the number of actinopterygian orthologs in the P. bichir genome relative to the P. senegalus genome (supplementary fig. S1, Supplementary Material online). This suggests that the P. bichir sequence may fill additional gaps in our understanding of the genomic evolution of polypteriforms outside of the mobilome elements discussed in this study.

Contrasting patterns of synteny between our new sequenced genome and those of P. senegalus and E. calabaricus using D-genies supports a high level of synteny between these species (supplementary fig. S2, Supplementary Material online). Further, the number of P. bichir superscaffolds (18) reflects the described karyotypes of most polypterids: E. calabaricus (n = 18), Polypterus palmas (n = 18), Polypterus delhezi (n = 18), P. senegalus (n = 18), and Polypterus ornatipinnis (n = 18) (Bachmann et al. 1972; Vervoort 1980; Morescalchi et al. 2007; Krysanov and Golubtsov 2014). The only exception to this chromosome count known in polypterids is in P. weeksi (n = 19) (Vervoort 1980), the sister lineage to Polypterus ornatipinnis. While closely related vertebrate taxa can often diverge substantially in their TE content over time, we find that this is not the case in bichirs. Instead, the conservation of karyotype and synteny across these species is also reflected in the relative abundances of TEs across these species. For P. bichir, RepeatModeler quantified 53.40% of the genome to be composed of TEs. This is similar to P. senegalus with (54.66%) and E. calabaricus (60.35%) and overall posits a surprising level of genomic conservation between species of polypterids. This conservation could be a consequence of multiple factors including slow rates of molecular evolution such as those observed in gars and bowfin (Wright et al. 2022), the possible geologically recent Early Miocene crown age of Polypteridae estimated using molecular clocks (Near et al. 2014), or a combination of the two to name but one set of possibilities. Indeed, Kimura distances of P. bichir TEs (supplementary fig. S28, Supplementary Material online), reveal that the bichir genome is dominated by recent copies of mostly DNA and LINE transposons (K ∼ 5). Regardless of the mechanism, our sequencing of the P. bichir genome reveals polypterids as a candidate lineage for future studies of the mechanisms that promote genomic stability within a clade.

The Evolution of the ray-finned Fish Mobilome

Placing our characterization of the polypterid mobilome into the context of other major ray-finned fish lineages reveals a complex history of mobilome evolution over the past 400 million years (Fig. 1a). The reconstructed phylogenetic history of TE evolution in actinopterygians is not consistent with a sudden TE proliferation coincident with the TGD event. Instead, our analyses reveal that compositional TE patterns at this scale largely reflect shifts within actinopterygian subclades. For example, cichlids have similar compositional patterns relative to pufferfish, the latter of which exhibits an increase in the number of LINE elements (Fig. 1a). Independently, Lampriformes (Regalecus & Lampris) also exhibit an expansion of their LINE elements to a relative level similar to that observed in pufferfishes (Fig. 1a). The ubiquity of such patterns of clade-specific heterogeneity are strongly supported by tests of phylogenetic signal using both Pagel's lambda (λ) and Blomberg's K (K) (supplementary table S3, Supplementary Material online). In all cases, K values are significant (DNA | K = 0.478, P  =  6e-04; LTR | K  = 0.405, P  =  1e-04; LINE | K  = 0.945, P  =  1e-04; and SINE | K  = 0.529, P  =  3e-04). Likewise, quantifications of lambda values for DNA (λ  =  0.98, P  =  1.89e-13), LTR (λ  = 0.819, P  =  6.64e-11), LINE (λ  = 0.975, P  =  1.51e-19), and SINE (λ ∼ 1.0, P  =  4.37e-15) are very close to 1, indicating strong phylogenetic signal, supporting a pattern of similar relative TE abundances between closely related taxa and increasing disparity between subclades.

Major patterns of TE evolution across the evolutionary history of ray-finned fishes. a) The relative abundance of DNA, LTR, LINE, SINE transposons, and unknown transposons relative to actinopteryigiian phylogeny are shown. b) Total TE % and absolute genome size in the context of evolutionary history are compared. c) Results of a phylogenetic generalized least-squares regression to assess the relationship between genome size and TE abundance are provided. d) TE % and genome size between teleosts and nonteleost actinopterygians are compared. Shadings in (a) correspond to elements labeled in the panel. Shadings in (b) correspond to high (light or dark) and low values (dark or light) in the upper and lower panels, respectively. Shadings in (c) correspond to the labels in (a) for individual elements. Panels in (d) correspond to the quantiles of each category with dark horizontal lines indicating the mean value.
Fig. 1.

Major patterns of TE evolution across the evolutionary history of ray-finned fishes. a) The relative abundance of DNA, LTR, LINE, SINE transposons, and unknown transposons relative to actinopteryigiian phylogeny are shown. b) Total TE % and absolute genome size in the context of evolutionary history are compared. c) Results of a phylogenetic generalized least-squares regression to assess the relationship between genome size and TE abundance are provided. d) TE % and genome size between teleosts and nonteleost actinopterygians are compared. Shadings in (a) correspond to elements labeled in the panel. Shadings in (b) correspond to high (light or dark) and low values (dark or light) in the upper and lower panels, respectively. Shadings in (c) correspond to the labels in (a) for individual elements. Panels in (d) correspond to the quantiles of each category with dark horizontal lines indicating the mean value.

Contrasting relative TE content (Fig. 1b) against total genome size reveals numerous expansions and contractions of both genome size and relative TE abundances across the phylogenetic diversity of ray-finned fishes. Importantly, there is no signal of a shift in the rate of TE evolution with the origin of teleosts. Instead, the reconstructed history indicates that expansions and contractions of both relative TE abundance and genome size appear to heterogeneously occur across various teleost and nonteleost lineages. Phylogenetic regression is essential when comparing data between species because it explicitly models the covariance structure arising from shared ancestry, accounting for the statistical dependence between species that results from their evolutionary relationships. Our phylogenetic regression analyses indicate that shifts in genome size are weakly correlated with changes in the relative abundances of TEs (Fig. 1c; supplementary table S3, Supplementary Material online). This result is consistent with previous studies of ray-finned fish mobile elements that suggest a modest correlation between the relative abundance of TEs and genome size (Yuan et al. 2018; Reinar et al. 2023), and supports the suggestion that lineage-specific shifts in mobilome expansion and deletion events have clade specific effects in mobilome composition (Chalopin et al. 2015). Correspondingly, we find strong evidence that genome size in ray-finned fishes also exhibits a pattern of strong phylogenetic signal (K  = 0.659, P  =  6e-04; λ  = 0.717, P  =  4.45e-15), paralleling a major trend in genome size evolution. Lineages as disparate as angiosperms (Beaulieu et al. 2008; Alonso et al. 2015), Drosophila (Sessegolo et al. 2016), marine dinoflagellates (Lo et al. 2022), bacteria, and Archaea (Martinez-Gutierrez and Aylward 2022) exhibit strong phylogenetic signal in genome size evolution. Given that the relative abundance of TE content in a genome is weakly correlated with genome size in ray-finned fishes (Lehmann et al. 2021) Fig. 1d; and numerous other lineages (Chalopin et al. 2015), it is possible that the evolution of TEs exhibits a signature of strong phylogenetic signal across the Tree of Life (Dodsworth et al. 2015).

Multiple factors promote genome size evolution (Olmo et al. 1989), including hypothesized correlations between the molecular evolution of the mobilome and overall increases in genome size. For example, transposition rates generally exceed excision rates (Kelleher et al. 2020), enabling TEs to contribute to the enlargement of genomes and exert an influence on genome size evolution. However, whether the TGD event resulted in a shift in the mode of evolution between the teleosts and nonteleosts mobilome remains unclear. We tested for differences between teleosts and nonteleosts using a phylogenetic analysis of variance (ANOVA) (Garland et al. 1993). This test calculates a traditional ANOVA and then compares the observed F-value to a null distribution generated by simulating trait evolution on the phylogenetic tree under a scenario where the independent variable (x) has no effect on the dependent variable (y). This approach accounts for the phylogenetic relationships by distinguishing whether similarities in the dependent variable among species arise from the influence of the independent variable or from shared evolutionary history, thereby removing possible confounding effects arising from shared common ancestry by considering the dispersion of x across the phylogeny. Our phylogenetic ANOVA reveals no support for a significant difference in TE content between teleosts and nonteleosts (P = 0.06; Fig. 1d).

Additionally, phylogenetic regression analyses strongly support a correlation between the content of all mobilome elements between species. When not accounting for differences in genome size, such a correlation may be expected if TEs only scale with genome size, and is indeed supported by our analyses (P < 0.0001 in all cases). However, when accounting for differences in genome size, our results continue to support that an increase or decrease in one element will also be correlated in an increase in the other for all TE types (Fig. 2). These results suggest that TE content is not just a function of genome size, as is also reflected in our finding of only a weak correlation between these variables (Fig. 1c). On the contrary, there is a growing body of evidence that TEs and the TE silencing machinery of a host are locked in an evolutionary arms race (Simkin et al. 2013). Of particular note are studies that have investigated how changes in DNA methylation sites can provide opportunities for TEs to become more promiscuous (Yoder et al. 1997; Chernyavskaya et al. 2017), including studies that have linked changes in DNA methylation to TE derepression that acts a post zygotic barrier during the speciation process (Laporte et al. 2019). Considering opportunities for subtle shifts in methylation and other mechanisms that inhibit or enable TE propagation across the teleost Tree of Life alongside our results, this suggest that the overall relative composition of the ray-finned fish mobilome has likely been shaped more by recent lineage-specific shifts in genome evolution than historical legacy stemming from the TGD event.

Correlation between relative class I and class II mobilome content. Panels a), b), and c) display the correlations between DNA transposons and LINE, LTR, and SINE transposons resulting from phylogenetic generalized least-squares regressions, respectively. Panels d) and e) illustrate the correlations based on a phylogenetic generalized least squares regression between LTR transposons and LINE and SINE transposons, respectively. Panel f) presents the correlation based on a phylogenetic generalized least-squares regression between LINE and SINE transposons.
Fig. 2.

Correlation between relative class I and class II mobilome content. Panels a), b), and c) display the correlations between DNA transposons and LINE, LTR, and SINE transposons resulting from phylogenetic generalized least-squares regressions, respectively. Panels d) and e) illustrate the correlations based on a phylogenetic generalized least squares regression between LTR transposons and LINE and SINE transposons, respectively. Panel f) presents the correlation based on a phylogenetic generalized least-squares regression between LINE and SINE transposons.

Concerning the TGD and the Appearance of Novel Mobilome Elements

It is possible that the TGD catalyzed the evolution of major groups of novel TE's. However, when placing the 25 main superfamilies of the teleost mobilome (Chalopin et al. 2015) into the context of the phylogenetic diversity of ray-finned fishes, we reveal that this effect was muted. We estimated the ancestral mobilome of actinopterygians utilizing a model-averaged stochastic character mapping approach (Revell 2024). Out of 25 superfamilies, the presence of 24 superfamilies observed in teleosts is mirrored within nonteleosts (Fig. 3). The only exception to this is the DNA transposon superfamily Chaepev (Kapitonov and Jurka 2007). It is unlikely that this superfamily arose as a consequence of the TGD, as it is present in several arthropods as well as a range of vertebrate species including anoles (Novick et al. 2011), and lampreys (Zhang et al. 2014). Chapaev sequences have also been documented from White Sturgeon (Acipenser transmontanus) (Zhang et al. 2014), suggesting that additional species of nonteleost ray-finned fishes possess this superfamily.

Reconstructing the ancestral actinopterygian mobilome. The presence and absence of TE superfamilies (listed above) are represented by blue and gray squares, respectively, for various species on the left (top panel). The shaded area behind the fish paintings represents teleosts, while the unshaded region below represents nonteleosts with evolutionary relationships between taxa depicted by the phylogenetic tree on the far-left. These input data were used to estimate the ancestral mobilome (the probability of each TE superfamily being present or absent) for the most recent common ancestor of all Actinopterygii (A), Neopterygii (N), or Teleostei (T) using SIMMAP (bottom panel). With a resulting heatmap plotted using ggplot (Wickham 2016).
Fig. 3.

Reconstructing the ancestral actinopterygian mobilome. The presence and absence of TE superfamilies (listed above) are represented by blue and gray squares, respectively, for various species on the left (top panel). The shaded area behind the fish paintings represents teleosts, while the unshaded region below represents nonteleosts with evolutionary relationships between taxa depicted by the phylogenetic tree on the far-left. These input data were used to estimate the ancestral mobilome (the probability of each TE superfamily being present or absent) for the most recent common ancestor of all Actinopterygii (A), Neopterygii (N), or Teleostei (T) using SIMMAP (bottom panel). With a resulting heatmap plotted using ggplot (Wickham 2016).

Genome duplication events are hypothesized to form a substrate for genomic innovation (Crow et al. 2006). In contrast to this expectation, our results reveal numerous instances of numerous likely TE losses as well as independent gains. For example, we find repeated losses of TE super families such as LINE Dong, DNA TcMariner, and DNA Crypton in teleosts (Fig. 3). LTR-Pao is absent in longnose gar and bowfin, suggesting a loss in holosteans. LTR-DIRS are absent in all the three polypterids (P. bichir, P. senegalus, and E. calabaricus). In addition, we identify the TE superfamily Jockey, a LINE element, exclusively in Lepisosteus osseus and P. senegalus. Jockey has been previously confirmed in coelacanth (Chalopin et al. 2014) and lampreys, and this marks the first reported instance of its presence in actinopterygians. It is possible that this element has been lost multiple times independently, however, Jockey is known to have high rates of horizontal transfer in other lineages (Reiss et al. 2019; Tambones et al. 2019), raising the possibility that these were independent gains.

Studies of vertebrate genome evolution often assume vertical transmission as the dominant mode of evolution. However, recent work has highlighted the impact of horizontal transfer events in the evolution of the vertebrate mobilome (Zhang et al. 2020; Galbraith et al. 2022; Paulat et al. 2023). Our ancestral mobilome reconstructions of the DNA Kolobok superfamily are in line with phylogenetic patterns expected by a model of horizontal transfer (Fig. 3). In our sampling, the presence of Kolobok is limited to five distantly related teleost lineages, as well as the Longnose.

Gar (L. osseus) among nonteleost ray-finned fishes. It is possible that the Kolobok element has been repeatedly transferred throughout the evolutionary history of Actinopterygii. Such transfer events are considered more likely by recent observations of horizontal transfer at more recent time scales. For example, there is evidence of horizontal gene transfer of DNA transposons like Merlin, TcMariner, and PiggyBAC in salmonids (de Boer et al. 2007). Likewise, there has been evidence of horizontal transfer of Chapaev transposons in White Sturgeon, Pacific Bluefin Tuna (Thunnus orientalis) and Blue Catfish (Ictalurus furcatus) (Zhang et al. 2014). As the number of high-quality genomes for species of ray finned fishes continue to accumulate, future studies of the extent of horizontal transfer events across the ray-finned fish mobilome offer an exciting research prospect.

Lineage-specific Expansions of the Teleost Mobilome

TEs can play a beneficial role by enhancing an organism's ability to respond to dynamic environmental conditions (Yuan et al. 2018). Numerous studies have linked TE activity to an organism's responsiveness to environmental factors (Capy et al. 2000; Fujino et al. 2011; Hua-Van et al. 2011; Baduel et al. 2021). This association between TEs and environment suggests that there may be a correlation between relative TE abundance and specific aspects of an organism's ecological niche or life history. We tested this expectation using three commonly tested abiotic/biotic factors associated with evolutionary diversification: latitude, body size, and depth. However, we find no such association for several factors often invoked to explain diversification patterns at this evolutionary scale. Under the Brownian motion model, our phylogenetics least-squares regression (PGLS) analysis between TE content and maximum body size revealed no evidence for a strong correlation between these traits (Fig. 4a and supplementary tables S4 to S7, Supplementary Material online). Likewise, we find no statistically supported correlation between TE content and depth occupancy in marine fishes (Fig. 4b), in line with recent work suggesting that genome size and depth relationships have been driven by an association within just argentiniiforms, and that this is not a pattern across all teleosts (Medeiros et al. 2022). There is also no general pattern of a correlation between TE content and the average latitude of a species geographic distribution (Fig. 4c). The only exception to this lack of correlation between latitude and TE content occurs in SINEs (supplementary table S6, Supplementary Material online) under the Brownian motion model (P = 0.0197). This correlation is likely driven by a reduction in SINE elements in Antarctic notothenioids, which experienced an unusual bout of genomic evolution prior to their diversification in the Southern Ocean (Daane et al. 2019). Regardless, depth, body size, or latitude appear not to be predictors of mobilome evolution, even when differentiating between marine, freshwater, or estuarine fishes at this evolutionary scale. In all cases, phylogenetic regression results were largely similar between a brownian and Orstein–Uhlenbeck model of trait evolution (supplementary table S6 and S7, Supplementary Material online).

Correlation of TEs with biotic and abiotic factors. a) This panel displays the results of phylogenetic generalized least-squares (PGLS) regressions between TE percentages and median latitude b) This panel portrays the PGLS regression results between TE percentage and body size (log-transformed). Lastly, c) depicts PGLS results between TE abundance and the maximum habitat depth of the species. The colors and shapes of each data point on the plot are defined in the key.
Fig. 4.

Correlation of TEs with biotic and abiotic factors. a) This panel displays the results of phylogenetic generalized least-squares (PGLS) regressions between TE percentages and median latitude b) This panel portrays the PGLS regression results between TE percentage and body size (log-transformed). Lastly, c) depicts PGLS results between TE abundance and the maximum habitat depth of the species. The colors and shapes of each data point on the plot are defined in the key.

The TGD certainly could have presented opportunities for a burst of genomic novelty. However, the next 300 million years of ray-finned fish evolution would certainly be expected to shape the genomes of different lineages responding to different biotic and abiotic conditions. As such, it is possible that the signal of the genome duplication has eroded. For example, TEs could replace themselves in a way analogous to multigene families, in which recently diverging paralogs often replace older genes. In such a scenario, rapid rates of mobilome evolution manifest as high between-clade heterogeneity at the scale of all ray-finned fishes. Clade-specific changes in the mobilome are readily apparent when considering a phylogenetic principal components analysis (PCA) of mobilome abundances (Fig. 5). For example, PC1 corresponds to sharp divergence between Esox, salmonids, and three bichirs, from acanthomorphs (Fig. 5 and supplementary figs. S4 to S6, Supplementary Material online). Some of this divergence is possibly explained by genome duplications specific to Esox and salmonids, suggesting future studies of the mobilome in these taxa to be of high research interest. Likewise, the hundreds of millions of years of independent evolution in bichir and holosteans lineages could also have eroded some signature of TE diversification. Our sampling includes all the extant lineages of holosteans and bichirs that comprise the deepest branches in their respective phylogenies, as such there are no known additional lineages that could be sampled to polarize ancestral state reconstructions of the mobilome at deeper divergences. Regardless, the alignment of taxa with PC1 values coincides with the prevalence of high percentage of DNA transposons that comprise the TEs within fish genomes that influence genome size variation among teleosts such as zebrafish (supplementary fig. S3, Supplementary Material online), medaka, stickleback, and Tetraodon (Jiang et al. 2016). It is certainly not possible to discount a possible role for life-history shifts shaping the mobilome among closely related species or that differences between class 1 and class II replications serve as an opportunity for substantial expansion of the mobilome.

Considering TE diversity in the context of evolutionary history. a) Results of a Phylogenetic PCA on the abundances of LTRs, DNA transposons, LINEs, and SINEs that project the phylogeny and points onto a 2D plot of PC1 and PC2. b) Depicts the resulting space based on the residual variation following linear regression between major TE classes and genome size. c) Depicts the results of a PCA on the residual variation of the abundances of all 25 superfamilies accounting for differences in genome size, consisting of 13 DNA transposons, 7 LINEs, and 5 LTR subfamilies. Dark shading indicates nonteleost actinopterygians; light shading indicates teleosts. All the images of the fishes are from Wikimedia commons.
Fig. 5.

Considering TE diversity in the context of evolutionary history. a) Results of a Phylogenetic PCA on the abundances of LTRs, DNA transposons, LINEs, and SINEs that project the phylogeny and points onto a 2D plot of PC1 and PC2. b) Depicts the resulting space based on the residual variation following linear regression between major TE classes and genome size. c) Depicts the results of a PCA on the residual variation of the abundances of all 25 superfamilies accounting for differences in genome size, consisting of 13 DNA transposons, 7 LINEs, and 5 LTR subfamilies. Dark shading indicates nonteleost actinopterygians; light shading indicates teleosts. All the images of the fishes are from Wikimedia commons.

Conclusion

We find no evidence for a discernible temporal signature of TE diversification coincident with the TGD, supporting the hypothesis that this genome duplication event did not exert a long-lasting influence on TE diversification (Chalopin and Volff 2017). Our results suggest that the ray-finned fish mobilome appears to maintain a consistent profile, with substantial similarity between the mobilomes of teleosts and nonteleost ray-finned fishes. However, mobilome content between taxa varied substantially, from around 55% in Zebrafish to about 6% in pufferfish genomes. These shifts cannot be entirely explained by differences in genome size as genome compaction in smooth pufferfish like T. rubripes and Tetraodon nigroviridis is not associated with an overall reduction of retrotransposon diversity (Volff et al. 2003). Similarly, Medaka and Zebrafish have been found to have very similar L1 retrotransposon diversity, despite a large disparity in genome size (Kordis et al. 2006). Instead, these results strongly imply that the history of the teleost mobilome is one of lineage-specific evolution. In this way, the diversification of TEs may mirror the general dynamics of gene-birth found in multigene families in which there is consistent turnover of paralogs through time (Pasquier et al. 2016). If true, then the impact of ploidy events may be short-lived at best, and shifts in TEs may be more aligned with the concepts of hopeful monsters over deeper evolutionary scales.

Our analyses support the observation that early diverging aquatic vertebrates (e.g. coelacanth, ray-finned fishes, cartilaginous fish, and lamprey) contain a significantly broader spectrum of mobilome diversity in the genome relative to birds or mammals, ranging from 22 to 27 TE superfamilies. Among these, certain autonomous elements like ERVs, LINE1 retrotransposons, TcMariner, or hAT DNA transposons, as well as nonautonomous ones like V-SINE (Piskurek and Jackson 2011), are widely distributed across all the vertebrates. This suggests their likely presence in ancestral jawed vertebrate genomes. In contrast, we see a lower distribution of endoretroviruses (ERVs) in actinopterygians than mammals and birds, with the amount ranging from 0.033% in Fugu to 0.76% in zebrafish (Carducci et al. 2020). Our findings of clade-specific heterogeneity within ray-finned fishes, suggests that future studies between more closely related clades to be particularly fruitful as genomic resources become available. In particular, such studies could reveal superfamilies that have been lost or are on the path to extinction in specific lineages, similar to studies revealing such findings for L2 and Helitrons in birds (Chalopin et al. 2015) and gypsy retrotransposons in birds and mammals (Volff et al. 2003). It is clear that we are only beginning to unmask the complexity of transposon dynamics and decipher the intricate processes that contribute to TE diversification over deep evolutionary timescales.

Methods and Materials

Sample Acquisition and Sequencing

All research involving live animals was performed in accordance with relevant institutional and national guidelines and regulations, and was approved by the North Carolina State University Institutional Animal Care and Use Committee (IACUC). We acquired an individual P. bichir specimen (Yoder lab ID 0051) through the pet trade that was anesthetized using MS-222 for the extraction of blood (10 ml). The fish was then euthanized for dissection of tissue samples and the voucher specimen was deposited in the North Carolina Museum of Natural Sciences Ichthyology Collection (NCSM 111902). Blood was shipped to Dovetail Genomics, LCC (Scotts Valley, CA) for genomic DNA extraction, library preparation, sequencing, and assembly. Samples were extracted by Dovetail staff using Qiagen and Cell Culture Midi Kit (Qiagen, Gmbh), yielding DNA with an average fragment length of 95 kbp that was used in the construction of HiC sequencing libraries.

Chicago Library Preparation and Sequencing

A Chicago library was prepared using the methods as described in Putnam et al. (2016). About 500 ng of High Molecular Weight genomic DNA, with mean fragment length of 95 kbp. Chromatin was reconstituted in vitro by incorporating the DNA with purified histones and chromatin assembly factors and then fixed by formaldehyde. DpnII was used to digest the chromatin, followed by filling in the 5′ overhangs with biotinylated nucleotides, and then ligating the free blunt ends. After the ligations step, DNA was purified from protein by reversing the crosslinks. Biotinylated free ends were removed from the purified DNA by treating it. The DNA was sheared to ∼350 bp fragment size and Illumina compatible adapters along with NERNext Ultra enzymes were used to generate sequencing libraries. Streptavidin beads were used ahead of PCR enrichment of each library to isolate the biotin-containing fragments. The libraries were sequenced on an Illumina HiSeqX providing 30.58x physical coverage of the genome (1 to 100 kbp).

Dovetail Hi-C Library Preparation and Sequencing

The preparation of a dovetail Hi-C library was executed as described (Lieberman-Aiden et al. 2009). For each library, the chromatin was fixed in place in the nucleus by crosslinking with formaldehyde, and the extracted fixed chromatin was then digested with the restriction enzyme DpnII that produces 5′ overhangs. These 5′ overhangs were filled in with biotinylated nucleotides, and then the resulting blunt ends were ligated. Crosslinks were then reversed to obtain the purified DNA, and which was treated to remove excess biotin. A Hi-C library was then created by shearing the DNA to ∼350 bp mean fragment size. Sequencing libraries were generated using NEBNext Ultra enzymes and Illumina compatible adapters. Streptavidin beads were used prior to PCR enrichment of each library to isolate the biotin containing fragments. Libraries were sequenced using an Illumina HiSeqX which yielded 191 million paired end reads (2 × 150 bp) and provided 8,427.66 x physical coverage of the genome (10 to 10,000 kbp).

Scaffolding the Assembly With Hi-Rise

Sequencing reads were assembled with Hi-Rise, a software pipeline designed to scaffold genome assemblies using proximity ligation data (Putnam et al. 2016). It uses the de novo assembly, shotgun reads, Chicago library reads and Dovetail HiC library reads as input and conducts an iterative analysis. The shotgun and Chicago library sequences are first aligned to the draft input assembly using a modified SNAP read mapper (http://snap.cs.berkeley.edu), with modifications such as masking out the base pairs that followed a restriction enzyme junction. Hi-Rise then modeled the separations of Chicago read pairs mapped within draft scaffolds, using a likelihood model for genomic distance between read pairs. The likelihood model produced by HiRise was used to identify and break putative misjoins and also score and make prospective joins. Dovetail HiC library sequences were analyzed using the same methods after aligning and scaffolding the Chicago data. Once all the sequences were aligned and scaffolded, shotgun sequences were used to close the gaps between contigs.

Contamination Removal and Species Verification

Dovetail staff have noted that when pooled with other samples on Illumina sequencing platforms, 10X Chromium Genome solution libraries are susceptible to a small degree of index hopping that can result in minor incorrect assignment of sequenced reads during demultiplexing. This low level of index misassignment, typically results in sequence contaminants impacting, but limited to, small scaffolds (typically <10 kb) in the final assembly. To mitigate this, dovetail staff leveraged any uncharacteristic number of reads per barcode associated with impacted scaffolds to identify and reliably isolate them from the final assembly. This was accomplished by aligning the 10X reads to the supernova assembly, recording the barcode count for the aligned reads, and recording the number of reads that aligned to each scaffold and were tagged with the same barcode. The median number of reads per barcode for each scaffold were then calculated and scaffolds with a distinct anomalously low ratio were removed.

Currently, P. bichir is described as a single species with the subspecies Polypterus bichir bichir and Polypterus bichir lapradei no longer considered valid (Moritz and Britz 2019). This sinking of subspecies is a consequence of the high degree of morphological similarity (Britz and David Johnson 2003; Britz and Johnson 2010). However, molecular investigations have suggested the possibility that P. bichir bichir and P. bichir lapradei may be genetically distinct (Suzuki et al. 2010) and these have been treated as independent lineages (Near et al. 2014). As no genetic species delimitation analyses have been conducted between these putative genetic lineages, we extracted the mitochondrial barcode COI from our assembly and used a BLAST search against polypterid sequences on NCBI to verify identification that included barcodes from P. bichir bichir and P. bichir lapradei. This ensured that we accounted for possible future taxonomic revisions while remaining consistent with current taxonomy.

RNA Sequencing and Assembly

RNA was extracted (Qiagen RNeasy kit) from the spleen, kidney, gill, heart, eyes, and intestine of the same individual P. bichir as the genome sequence. Quantity and integrity of the isolated RNA were assessed using a NanoDrop 1000 (Thermo Fisher) and Agilent Bioanalyzer, respectively. The process of mRNA enrichment was done using oligo(dT) beads, and rRNA was removed using a Ribo-Zero kit (Epicentre, Madison, WI). Each RNA extraction was equalized for a final concentration of 180 ng/µL. Library preparation and sequencing was performed by Novogen Corporation (Sacramento, CA). Next-gen sequencing (2 × 150 bp paired end reads) was performed on a NovaSeq 6000 instrument (Illumina). Adapter sequences and poor quality reads were filtered with Trimmomatic v34 (Bolger et al. 2014). The transcriptome was de novo assembled with Trinity v2.11.0 (Grabherr et al. 2011) followed by BUSCO analysis to assess completeness of the transcriptomes (Manni et al. 2021). Raw reads and computationally assembled transcriptome sequences were deposited onto NCBI under the accession numbers SRR19537224–SRR19537230 and GKOV01000000, respectively.

Gene Ontology Assessment

We conducted a series of analyses for gene ontology assignment. First, the RNA-seq data assembled using Trinity was translated with Transdecoder, enabling the identification of potential coding regions in the transcript sequences. The longest open reading frames (ORFs) were extracted and subjected to BLASTx and BLASTp analyses against the Uniprot database (November 2021 release), yielding the top target sequences for each transcript. Subsequently, Hmmscan v.3.3.2 was employed to search for protein sequences in the Pfam-A database (November 2021 release). Signalp v.5.0b and TMHMM v.2.0c were used to detect signal peptides and transmembrane proteins, respectively. Trinotate v.3.2.2, in combination with the Trinity assembled transcriptome and the longest ORFs from Transdecoder, generated a gene transmap. This process facilitated the annotation of transcripts using Trinotate, followed by further analysis to obtain the Gene Ontology (GO) annotations. For visualization of the GO terms, the “enrichplot’ and “ggupset’ packages in R were employed. These steps collectively provided a comprehensive understanding of the functional attributes of the transcriptome data, to understand the underlying biological processes and pathways associated with the studied organisms (supplementary figs. S7 to S27, Supplementary Material online).

Annotation and Genome Quality Assessments

The P. bichir genome annotation was done using the commonly used BRAKER2 (Brůna et al. 2021). To accomplish this, we first used RepeatModeler (Version 5.8.8) (Flynn et al. 2020) to model the repeats in the genome sequence. We then used RepeatMasker (Smit et al. 2005) to mask the repeats found with RepeatModeler and remove them from the genome. The masked genome and the transcriptome aligned using HISAT2 (version 2.1.0) (Kim et al. 2019) were then used to annotate the genome using the Genemark-ET option in BRAKER. The genemark.gtf file was subsequently used by Augustus (Gabriel et al. 2021) to model the proteins.

The completeness of the protein sequences was assessed using BUSCO v. (5.5.0) (Simão et al. 2015; Seppey et al. 2019) with both the Actinopterygii (actinopterygii_odb10, busco.ezlab.org) and vertebrate (vertebrata_odb10, busco.ezlab.org) databases. As polypterids are several hundred million years divergent from all other actinopterygians (Dornburg et al. 2014; Near et al. 2014), the use of the second vertebrate-wide database allowed us to verify similar levels of assembly completeness and mitigate against a potentially teleost-biased ray-finned fish database that may not capture loci in a deeply divergent taxon. We additionally conducted an analysis of synteny between our assembly of the P. bichir genome and the previously sequenced P. senegalus and E. calabaricus genomes using D-Genies (Cabanettes and Klopp 2018). To estimate the relative ages of TEs within the genome, we generated a Kimura distance plot using Repeatmasker with the -a parameter to get the alignment file (.align). This was analyzed using “calcDivergenceFromAlign.pl’ to get the divergence summary divsum file (.divsum), which was further analyzed using “createRepeatLandscape.pl’ to create the comprehensive repeat landscape.

Comparative Analyses of the Actinopterygian Mobilome

To assemble a dataset of TE content across major ray-finned fish lineages, we integrated the results of the repeat analysis of the P. bichir genome with other publicly available analyses of TEs in ray-finned fishes. For nonteleosts, this captured TE content from two additional polypteriform genomes for E. calabaricus (Helfenrath et al. 2021) and P. senegalus (Fujito and Nonaka 2012), A. ruthenus (Du et al. 2020) as a representative of Chondrostei, as well as Lepisosteus oculatus (Braasch et al. 2016), L. osseus (Mallik et al. 2023), and Amia calva (Thompson et al. 2021) as representative holosteans. These data were integrated with data from 98 teleost genomes previously analyzed (Reinar et al. 2023), that capture the majority of major teleost lineages including Elopomorpha, Osteoglossomorpha, Otocephala, and a large number of acanthomorph and nonacanthomorph euteleosts. This study used the same RepeatModeller-based approach for TE identification as our methods above, making it appropriate to combine with our dataset. We generated a consensus sequence file using RepeatMasker to obtain detailed information TEs that were extracted from the resulting .out and .tbl files. This yielded a dataset of TE content for 105 ray-finned fish genomes. To place this data into a comparative phylogenetic framework, we first obtained a time calibrated phylogeny of all taxa from TimeTree v5 (Kumar et al. 2022). As this tree lacked resolution for acanthomorph lineages, we modified branch lengths to reflect the age estimates from a recent analysis of acanthomorph divergence times based on ultraconserved elements (Ghezelayagh et al. 2022) that is consistent with other published divergence time estimates of this superradiation (Near et al. 2013; Hughes et al. 2018). We additionally modified the topology to reflect the proposed sister relationship between Osteoglossomorpha and Elopomorpha based on genomic and transcriptomic sequence analyses (Hughes et al. 2018; Vialle et al. 2018; Hao et al. 2020; Wcisel et al. 2020).

We used the ggtree and ggtreeExtra packages (Yu 2022) in R 4.3.1 to visualize the distribution of TE abundances (LTR, LINE, SINE, DNA) across the evolutionary history of actinopterygians. We additionally used ggtree in conjunction with phytools (Revell 2012) to visualize genome size and TE content variation between species alongside a likelihood based ancestral state estimation of changes in genome size across the phylogeny conducted in phytools. To assess if changes in genome size were correlated with the changes in the overall abundance of TEs, or changes in the abundances of LTRs, LINEs, SINEs, or DNA elements, we conducted a series of phylogenetic linear regressions using the phylolm package in R. Model fits were assessed by quantification of Akaike Information Criterion (AIC) scores (supplementary table S4, Supplementary Material online) Regressions were conducted under a Brownian model of trait evolution (supplementary table S6, Supplementary Material online). To assess whether results were robust to the underlying model of character evolution, analyses were repeated using an Ornstein–Uhlenbeck (OU) model of character evolution (supplementary table S7, Supplementary Material online). A similar set of phylogenetic regressions was next conducted to assess if increases in the abundance of elements (e.g. LINEs, SINEs, etc.) were positively correlated with each other, or if negative correlations exist, allowing us to assess whether elements have antagonistic evolutionary dynamics (supplementary table S8, Supplementary Material online). Next, we performed a set of regressions assessing whether changes in the abundances of elements were correlated with changes in maximum body size, latitude, or maximum depth of occurrence. Body size and depth data for each species were taken from fishbase using the rfishbase package in R (Boettiger et al. 2012). Latitudinal data were calculated using occurrence data from the Global Biodiversity Information Facility (gbif) using rgbif v3.7.8. Although imperfect, both BM and OU models are commonly used models for comparative analyses of the above traits (Iglesias et al. 2015; Medeiros et al. 2022; Parker et al. 2022), with OU models often providing a better fit for depth data depending on the temporal scale. This fit may be a consequence of clade specific expansions into different depth zones occurring over deep time scales. We further tested for differences in TE content between teleosts and nonteleosts using a phylogenetic ANOVA in phytools phylANOVA with multiple comparison correction using the Benjamini–Hochberg procedure (Benjamini and Hochberg 1995).

To reconstruct the ancestral mobilome of early ray-finned fishes, we focused on the 25 elements most commonly studied in analyses of ray-finned fish TEs. The presence/absence data were scored for each species and each element and used as input data for a model averaged stochastic character mapping approach (Bollback 2006) implemented in phytools. This approach expands the standard phytools implementation to ancestral state estimation by allowing possible models of character change (e.g. “Equal Rates”, “All Rates Different”) in phytools to contribute to the reconstruction in relation to their Akaike weight. We assessed the degree to which variation in TE content could be explained by evolutionary history through quantification of the phylogenetic signal of overall TE abundance as well as the abundances of each TE type. This was accomplished using the phylosig function in phytools to calculate both Pagel's λ (Pagel 1999) and Blomberg et al.'s K (Blomberg et al. 2003) Values of λ are distributed between zero and 1, with a zero value representing the absence of phylogenetic signal and a value of 1 corresponding to the expectations of Brownian motion on a phylogeny. To assess statistical significance, we compared our empirical λ values to the null expectation that λ = 0 for each trait via a likelihood ratio test. Blomberg's K compliments estimates of λ, with values <1 indicating less phylogenetic signal than would be expected given a model of Brownian motion, and value >1 indicating a higher than expected coupling between the distribution of traits and the underlying phylogeny. To assess statistical significance of K, we compared our empirical K values to null distributions of expected K values based on 10,000 permutations of each trait on the phylogeny.

To visualize the major axes of variation of the ray-finned fish mobilome, we conducted a phylogenetic principal component analysis (pPCA) using the phyl.pca function in phytools. The resulting PC axes were then used with the phylomorphospace function in phytools to project the phylogeny into the resulting PCA space using ggplot2. Additionally, we conducted a pPCA on the abundances of the subtypes of all TEs based on the amounts derived from the .out files. As preliminary analyses indicated a correlation between elements and overall genome size, pPCAs were repeated on the residuals resulting from a regression of genome size versus TE content, mirroring similar approaches to accounting for traits that covary with another trait (e.g. limb proportions and body size, etc.).

Acknowledgments

We thank Kent Passingham (NC State University) for assistance with blood collection and the NCBI staff for help with sequence contamination identification. We thank Katerina Zapfe for her invaluable input on figure refinement, and Brandon Turner for helpful insights on TEs. Fish paintings used in figures produced by Julie Johnson (https://www.lifesciencestudios.com) for TJ Near.

Funding

This work was supported by the National Science Foundation (United States) (IOS1755242 to A.D. and IOS1755330 to J.A.Y.).

Data Availability

The P. bichir genome sequence is available through NCBI Bioproject PRJNA811142. Raw transcriptome reads are available through NCBI under the accession number SRR19537224, SRR19537225, SRR19537226, SRR19537227, SRR19537228, SRR19537229, SRR19537230. Computationally assembled transcriptome sequences are available on NCBI under the accession number GKOV000000000. All the files and code used for TE analysis and visualization are available on Zenodo (DOI: 10.5281/zenodo.10398557).

Supplementary Material

Supplementary material is available at Genome Biology and Evolution online.

Literature Cited

Albert
 
JS
,
Tagliacollo
 
VA
,
Dagosta
 
F
.
Diversification of neotropical freshwater fishes
.
Annu Rev Ecol Evol Syst
.
2020
:
51
(
1
):
27
53
. .

Alonso
 
C
,
Pérez
 
R
,
Bazaga
 
P
,
Herrera
 
CM
.
Global DNA cytosine methylation as an evolving trait: phylogenetic signal and correlated evolution with genome size in angiosperms
.
Front Genet
.
2015
:
6
:
4
. .

Aparicio
 
S
,
Chapman
 
J
,
Stupka
 
E
,
Putnam
 
N
,
Chia
 
J-M
,
Dehal
 
P
,
Christoffels
 
A
,
Rash
 
S
,
Hoon
 
S
,
Smit
 
A
, et al.  
Whole-genome shotgun assembly and analysis of the genome of Fugu rubripes
.
Science
.
2002
:
297
(
5585
):
1301
1310
. .

Bachmann
 
K
,
Goin
 
OB
,
Goin
 
CJ
.
The nuclear DNA of polypterus palmas
.
Copeia
.
1972
:
1972
(
2
):
363
. .

Baduel
 
P
,
Leduque
 
B
,
Ignace
 
A
,
Gy
 
I
,
Gil
 
J
, Jr
,
Loudet
 
O
,
Colot
 
V
,
Quadrana
 
L
.
Genetic and environmental modulation of transposition shapes the evolutionary potential of Arabidopsis thaliana
.
Genome Biol
.
2021
:
22
(
1
):
138
. .

Beaulieu
 
JM
,
Leitch
 
IJ
,
Patel
 
S
,
Pendharkar
 
A
,
Knight
 
CA
.
Genome size is a strong predictor of cell size and stomatal density in angiosperms
.
New Phytol
.
2008
:
179
(
4
):
975
986
. .

Benjamini
 
Y
,
Hochberg
 
Y
.
Controlling the false discovery rate: a practical and powerful approach to multiple testing
.
J R Stat Soc
.
1995
:
57
(
1
):
289
300
. .

Bi
 
X
,
Wang
 
K
,
Yang
 
L
,
Pan
 
H
,
Jiang
 
H
,
Wei
 
Q
,
Fang
 
M
,
Yu
 
H
,
Zhu
 
C
,
Cai
 
Y
, et al.  
Tracing the genetic footprints of vertebrate landing in non-teleost ray-finned fishes
.
Cell
.
2021
:
184
(
5
):
1377
1391.e14
. .

Blomberg
 
SP
,
Garland
 
T
 Jr
,
Ives
 
AR
.
Testing for phylogenetic signal in comparative data: behavioral traits are more labile
.
Evolution
.
2003
:
57
(
4
):
717
745
. .

Boettiger
 
C
,
Lang
 
DT
,
Wainwright
 
PC
.
rfishbase: exploring, manipulating and visualizing FishBase data from R
.
J Fish Biol
.
2012
:
81
(
6
):
2030
2039
. .

Böhne
 
A
,
Brunet
 
F
,
Galiana-Arnoux
 
D
,
Schultheis
 
C
,
Volff
 
J-N
.
Transposable elements as drivers of genomic and biological diversity in vertebrates
.
Chromosome Res
.
2008
:
16
(
1
):
203
215
. .

Bolger
 
AM
,
Lohse
 
M
,
Usadel
 
B
.
Trimmomatic: a flexible trimmer for illumina sequence data
.
Bioinformatics
.
2014
:
30
(
15
):
2114
2120
. .

Bollback
 
JP
.
SIMMAP: stochastic character mapping of discrete traits on phylogenies
.
BMC Bioinformatics
.
2006
:
7
(
1
):
88
. .

Braasch
 
I
,
Gehrke
 
AR
,
Smith
 
JJ
,
Kawasaki
 
K
,
Manousaki
 
T
,
Pasquier
 
J
,
Amores
 
A
,
Desvignes
 
T
,
Batzel
 
P
,
Catchen
 
J
, et al.  
The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons
.
Nat Genet
.
2016
:
48
(
4
):
427
437
. .

Britz
 
R
,
David Johnson
 
G
.
On the homology of the posteriormost gill arch in polypterids (Cladistia, Actinopterygii)
.
Zool J Linn Soc
.
2003
:
138
(
4
):
495
503
. .

Britz
 
R
,
Johnson
 
DG
. Occipito-vertebral fusion in actinopterygians: conjecture, myth and reality. Part 1: non-teleosts.
Origin and phylogenetic interrelationships of teleosts honoring gloria arratia
.
Munich (Germany
):
Verlag Dr. Friedrich Pfeil
;
2010
. p.
77
93
. https://repository.si.edu/bitstream/handle/10088/9748/vz_10Britzand_Johnson.pdf.

Brůna
 
T
,
Hoff
 
KJ
,
Lomsadze
 
A
,
Stanke
 
M
,
Borodovsky
 
M
.
BRAKER2: automatic eukaryotic genome annotation with GeneMark-EP+ and AUGUSTUS supported by a protein database
.
NAR Genom Bioinform
.
2021
:
3
(
1
):
lqaa108
. .

Brunet
 
FG
,
Roest Crollius
 
H
,
Paris
 
M
,
Aury
 
J-M
,
Gibert
 
P
,
Jaillon
 
O
,
Laudet
 
V
,
Robinson-Rechavi
 
M
.
Gene loss and evolutionary rates following whole-genome duplication in teleost fishes
.
Mol Biol Evol
.
2006
:
23
(
9
):
1808
1816
. .

Cabanettes
 
F
,
Klopp
 
C
.
D-GENIES: dot plot large genomes in an interactive, efficient and simple way
.
PeerJ
.
2018
:
6
:
e4958
. .

Capy
 
P
,
Gasperi
 
G
,
Biémont
 
C
,
Bazin
 
C
.
Stress and transposable elements: co-evolution or useful parasites?
 
Heredity (Edinb).
 
2000
:
85
(
Pt 2
):
101
106
. .

Carducci
 
F
,
Barucca
 
M
,
Canapa
 
A
,
Carotti
 
E
,
Biscotti
 
MA
.
Mobile elements in ray-finned fish genomes
.
Life (Basel)
.
2020
:
10
(
10
):
221
. .

Chalopin
 
D
,
Fan
 
S
,
Simakov
 
O
,
Meyer
 
A
,
Schartl
 
M
,
Volff
 
J-N
.
Evolutionary active transposable elements in the genome of the coelacanth
.
J Exp Zool B Mol Dev Evol
.
2014
:
322
(
6
):
322
333
. .

Chalopin
 
D
,
Naville
 
M
,
Plard
 
F
,
Galiana
 
D
,
Volff
 
J-N
.
Comparative analysis of transposable elements highlights mobilome diversity and evolution in vertebrates
.
Genome Biol Evol
.
2015
:
7
(
2
):
567
580
. .

Chalopin
 
D
,
Volff
 
J-N
.
Analysis of the spotted gar genome suggests absence of causative link between ancestral genome duplication and transposable element diversification in teleost fish
.
J Exp Zool B Mol Dev Evol
.
2017
:
328
(
7
):
629
637
. .

Cheng
 
P
,
Huang
 
Y
,
Du
 
H
,
Li
 
C
,
Lv
 
Y
,
Ruan
 
R
,
Ye
 
H
,
Bian
 
C
,
You
 
X
,
Xu
 
J
, et al.  
Draft genome and complete hox-cluster characterization of the sterlet (Acipenser ruthenus)
.
Front Genet
.
2019
:
10
:
776
. .

Chernyavskaya
 
Y
,
Mudbhary
 
R
,
Zhang
 
C
,
Tokarz
 
D
,
Jacob
 
V
,
Gopinath
 
S
,
Sun
 
X
,
Wang
 
S
,
Magnani
 
E
,
Madakashira
 
BP
, et al.  
Loss of DNA methylation in zebrafish embryos activates retrotransposons to trigger antiviral signaling
.
Development
.
2017
:
144
(
16
):
2925
2939
. .

Crow
 
KD
,
Wagner
 
GP
,
SMBE Tri-National Young Investigators
.
Proceedings of the SMBE tri-national young investigators’ workshop 2005. What is the role of genome duplication in the evolution of complexity and diversity?
 
Mol Biol Evol
.
2006
:
23
(
5
):
887
892
. .

Daane
 
JM
,
Dornburg
 
A
,
Smits
 
P
,
MacGuigan
 
DJ
,
Brent Hawkins
 
M
,
Near
 
TJ
,
William Detrich Iii
 
H
,
Harris
 
MP
.
Historical contingency shapes adaptive radiation in Antarctic fishes
.
Nat Ecol Evol
.
2019
:
3
(
7
):
1102
1109
. .

Davis
 
MP
,
Holcroft
 
NI
,
Wiley
 
EO
,
Sparks
 
JS
,
Leo Smith
 
W
.
Species-specific bioluminescence facilitates speciation in the deep sea
.
Mar Biol
.
2014
:
161
(
5
):
1139
1148
. .

de Boer
 
JG
,
Yazawa
 
R
,
Davidson
 
WS
,
Koop
 
BF
.
Bursts and horizontal evolution of DNA transposons in the speciation of pseudotetraploid salmonids
.
BMC Genomics
.
2007
:
8
(
1
):
422
. .

De-Kayne
 
R
,
Selz
 
OM
,
Marques
 
DA
,
Frei
 
D
,
Seehausen
 
O
,
Feulner
 
PGD
.
Genomic architecture of adaptive radiation and hybridization in Alpine whitefish
.
Nat Commun
.
2022
:
13
(
1
):
4479
. .

Dodsworth
 
S
,
Chase
 
MW
,
Kelly
 
LJ
,
Leitch
 
IJ
,
Macas
 
J
,
Novák
 
P
,
Piednoël
 
M
,
Weiss-Schneeweiss
 
H
,
Leitch
 
AR
.
Genomic repeat abundances contain phylogenetic signal
.
Syst Biol
.
2015
:
64
(
1
):
112
126
. .

Dornburg
 
A
,
Federman
 
S
,
Lamb
 
AD
,
Jones
 
CD
,
Near
 
TJ
.
Cradles and museums of Antarctic teleost biodiversity
.
Nat Ecol Evol
.
2017
:
1
(
9
):
1379
1384
. .

Dornburg
 
A
,
Townsend
 
JP
,
Friedman
 
M
,
Near
 
TJ
.
Phylogenetic informativeness reconciles ray-finned fish molecular divergence times
.
BMC Evol Biol
.
2014
:
14
(
1
):
169
. .

Du
 
K
,
Stöck
 
M
,
Kneitz
 
S
,
Klopp
 
C
,
Woltering
 
JM
,
Adolfi
 
MC
,
Feron
 
R
,
Prokopov
 
D
,
Makunin
 
A
,
Kichigin
 
I
, et al.  
The sterlet sturgeon genome sequence and the mechanisms of segmental rediploidization
.
Nat Ecol Evol
.
2020
:
4
(
6
):
841
852
. .

Fan
 
G
,
Song
 
Y
,
Yang
 
L
,
Huang
 
X
,
Zhang
 
S
,
Zhang
 
M
,
Yang
 
X
,
Chang
 
Y
,
Zhang
 
H
,
Li
 
Y
, et al.  
Initial data release and announcement of the 10,000 Fish Genomes Project (Fish10K)
.
Gigascience
.
2020
:
9
(
8
):
giaa080
. .

Flynn
 
JM
,
Hubley
 
R
,
Goubert
 
C
,
Rosen
 
J
,
Clark
 
AG
,
Feschotte
 
C
,
Smit
 
AF
.
RepeatModeler2 for automated genomic discovery of transposable element families
.
Proc Natl Acad Sci U S A
.
2020
:
117
(
17
):
9451
9457
. .

Fujino
 
K
,
Hashida
 
S-N
,
Ogawa
 
T
,
Natsume
 
T
,
Uchiyama
 
T
,
Mikami
 
T
,
Kishima
 
Y
.
Temperature controls nuclear import of Tam3 transposase in Antirrhinum
.
Plant J
.
2011
:
65
(
1
):
146
155
. .

Fujito
 
NT
,
Nonaka
 
M
.
Highly divergent dimorphic alleles of the proteasome subunit beta type-8 (PSMB8) gene of the bichir Polypterus senegalus: implication for evolution of the PSMB8 gene of jawed vertebrates
.
Immunogenetics
.
2012
:
64
(
6
):
447
453
. .

Gabriel
 
L
,
Hoff
 
KJ
,
Brůna
 
T
,
Borodovsky
 
M
,
Stanke
 
M
.
TSEBRA: transcript selector for BRAKER
.
BMC Bioinformatics
.
2021
:
22
(
1
):
566
. .

Galbraith
 
JD
,
Ludington
 
AJ
,
Sanders
 
KL
,
Amos
 
TG
,
Thomson
 
VA
,
Enosi Tuipulotu
 
D
,
Dunstan
 
N
,
Edwards
 
RJ
,
Suh
 
A
,
Adelson
 
DL
.
Horizontal transposon transfer and its implications for the ancestral ecology of hydrophiine snakes
.
Genes (Basel)
.
2022
:
13
(
2
):
217
. .

Garland
 
T
,
Dickerman
 
AW
,
Janis
 
CM
,
Jones
 
JA
.
Phylogenetic analysis of covariance by computer simulation
.
Syst Biol
.
1993
:
42
(
3
):
265
292
. .

Ghezelayagh
 
A
,
Harrington
 
RC
,
Burress
 
ED
,
Campbell
 
MA
,
Buckner
 
JC
,
Chakrabarty
 
P
,
Glass
 
JR
,
McCraney
 
WT
,
Unmack
 
PJ
,
Thacker
 
CE
, et al.  
Prolonged morphological expansion of spiny-rayed fishes following the end-Cretaceous
.
Nat Ecol Evol
.
2022
:
6
(
8
):
1211
1220
. .

Glasauer
 
SMK
,
Neuhauss
 
SCF
.
Whole-genome duplication in teleost fishes and its evolutionary consequences
.
Mol Genet Genomics
.
2014
:
289
(
6
):
1045
1060
. .

Grabherr
 
MG
,
Haas
 
BJ
,
Yassour
 
M
,
Levin
 
JZ
,
Thompson
 
DA
,
Amit
 
I
,
Adiconis
 
X
,
Fan
 
L
,
Raychowdhury
 
R
,
Zeng
 
Q
, et al.  
Full-length transcriptome assembly from RNA-Seq data without a reference genome
.
Nat Biotechnol
.
2011
:
29
(
7
):
644
652
. .

Hao
 
S
,
Han
 
K
,
Meng
 
L
,
Huang
 
X
,
Shi
 
C
,
Zhang
 
M
,
Wang
 
Y
,
Liu
 
Q
,
Zhang
 
Y
,
Seim
 
I
, et al.  
2020
. Three genomes of Osteoglossidae shed light on ancient teleost evolution. bioRxiv 2020.01.19.911958. http://biorxiv.org/lookup/doi/10.1101/2020.01.19.911958, 20 January, 2020, preprint: not peer reviewed.

Helfenrath
 
K
,
Sauer
 
M
,
Kamga
 
M
,
Wisniewsky
 
M
,
Burmester
 
T
,
Fabrizius
 
A
.
The more, the merrier? Multiple myoglobin genes in fish species, especially in gray bichir (Polypterus senegalus) and reedfish (Erpetoichthys calabaricus)
.
Genome Biol Evol
.
2021
:
13
(
7
):
evab078
. .

Hickman
 
AB
,
Dyda
 
F
.
Mechanisms of DNA transposition
.
Microbiol Spectr
.
2015
:
3
(
2
):
MDNA3–MDNA0034–2014
. .

Hua-Van
 
A
,
Le Rouzic
 
A
,
Boutin
 
TS
,
Filée
 
J
,
Capy
 
P
.
The struggle for life of the genome's selfish architects
.
Biol Direct
.
2011
:
6
(
1
):
19
. .

Hughes
 
LC
,
Ortí
 
G
,
Huang
 
Y
,
Sun
 
Y
,
Baldwin
 
CC
,
Thompson
 
AW
,
Arcila
 
D
,
Betancur-R
 
R
,
Li
 
C
,
Becker
 
L
, et al.  
Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data
.
Proc Natl Acad Sci U S A
.
2018
:
115
(
24
):
6249
6254
. .

Iglesias
 
TL
,
Dornburg
 
A
,
Brandley
 
MC
,
Alfaro
 
ME
,
Warren
 
DL
.
Life in the unthinking depths: energetic constraints on encephalization in marine fishes
.
J Evol Biol
.
2015
:
28
(
5
):
1080
1090
. .

Jiang
 
S
,
Cai
 
D
,
Sun
 
Y
,
Teng
 
Y
.
Isolation and characterization of putative functional long terminal repeat retrotransposons in the Pyrus genome
.
Mob DNA
.
2016
:
7
(
1
):
1
. .

Kapitonov
 
VV
,
Jurka
 
J
.
Chapaev-a novel superfamily of DNA transposons
.
Repbase Rep
.
2007
:
7
:
774
781
.

Kelleher
 
ES
,
Barbash
 
DA
,
Blumenstiel
 
JP
.
Taming the turmoil within: new insights on the containment of transposable elements
.
Trends Genet
.
2020
:
36
(
7
):
474
489
. .

Kim
 
D
,
Paggi
 
JM
,
Park
 
C
,
Bennett
 
C
,
Salzberg
 
SL
.
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype
.
Nat Biotechnol
.
2019
:
37
(
8
):
907
915
. .

Kimura
 
Y
,
Nikaido
 
M
.
Conserved keratin gene clusters in ancient fish: an evolutionary seed for terrestrial adaptation
.
Genomics
.
2021
:
113
(
1
):
1120
1128
. .

Kordis
 
D
,
Lovsin
 
N
,
Gubensek
 
F
.
Phylogenomic analysis of the L1 retrotransposons in Deuterostomia
.
Syst Biol
.
2006
:
55
(
6
):
886
901
. .

Krysanov
 
EY
,
Golubtsov
 
AS
.
Karyotypes of four fish species from the Nile and Omo-Turkana basins in Ethiopia
.
J Ichthyol
.
2014
:
54
(
10
):
889
892
. .

Kumar
 
S
,
Suleski
 
M
,
Craig
 
JM
,
Kasprowicz
 
AE
,
Sanderford
 
M
,
Li
 
M
,
Stecher
 
G
,
Hedges
 
SB
.
TimeTree 5: an expanded resource for species divergence times
.
Mol Biol Evol
.
2022
:
39
(
8
):
msac174
. .

Laporte
 
M
,
Le Luyer
 
J
,
Rougeux
 
C
,
Dion-Côté
 
A-M
,
Krick
 
M
,
Bernatchez
 
L
.
DNA methylation reprogramming, TE derepression, and postzygotic isolation of nascent animal species
.
Sci Adv
.
2019
:
5
(
10
):
eaaw1644
. .

Lehmann
 
R
,
Kovařík
 
A
,
Ocalewicz
 
K
,
Kirtiklis
 
L
,
Zuccolo
 
A
,
Tegner
 
JN
,
Wanzenböck
 
J
,
Bernatchez
 
L
,
Lamatsch
 
DK
,
Symonová
 
R
.
DNA transposon expansion is associated with genome size increase in mudminnows
.
Genome Biol Evol
.
2021
:
13
(
10
):
evab228
. .

Lieberman-Aiden
 
E
,
van Berkum
 
NL
,
Williams
 
L
,
Imakaev
 
M
,
Ragoczy
 
T
,
Telling
 
A
,
Amit
 
I
,
Lajoie
 
BR
,
Sabo
 
PJ
,
Dorschner
 
MO
, et al.  
Comprehensive mapping of long-range interactions reveals folding principles of the human genome
.
Science
.
2009
:
326
(
5950
):
289
293
. .

Lo
 
R
,
Dougan
 
KE
,
Chen
 
Y
,
Shah
 
S
,
Bhattacharya
 
D
,
Chan
 
CX
.
Alignment-free analysis of whole-genome sequences from symbiodiniaceae reveals different phylogenetic signals in distinct regions
.
Front Plant Sci
.
2022
:
13
:
815714
. .

Lynch
 
M
.
The frailty of adaptive hypotheses for the origins of organismal complexity
.
Proc Natl Acad Sci U S A
.
2007
:
104
(
Suppl 1
):
8597
8604
. .

Mallik
 
R
,
Carlson
 
KB
,
Wcisel
 
DJ
,
Fisk
 
M
,
Yoder
 
JA
,
Dornburg
 
A
.
A chromosome-level genome assembly of longnose gar, Lepisosteus osseus
.
G3 (Bethesda)
.
2023
:
13
(
7
):
jkad095
. .

Manni
 
M
,
Berkeley
 
MR
,
Seppey
 
M
,
Simão
 
FA
,
Zdobnov
 
EM
.
BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes
.
Mol Biol Evol
.
2021
:
38
(
10
):
4647
4654
. .

Martinez-Gutierrez
 
CA
,
Aylward
 
FO
.
Genome size distributions in bacteria and archaea are strongly linked to evolutionary history at broad phylogenetic scales
.
PLoS Genet
.
2022
:
18
(
5
):
e1010220
. .

Matzke
 
MA
,
Matzke
 
AJ
.
Polyploidy and transposons
.
Trends Ecol Evol
.
1998
:
13
(
6
):
241
. .

McClintock
 
B
.
The significance of responses of the genome to challenge
.
Science
.
1984
:
226
(
4676
):
792
801
. .

Medeiros
 
APM
,
Santos
 
BA
,
Betancur-R
 
R
.
Does genome size increase with water depth in marine fishes?
 
Evolution
.
2022
:
76
(
7
):
1578
1589
. .

Mikami
 
M
,
Ineno
 
T
,
Thompson
 
AW
,
Braasch
 
I
,
Ishiyama
 
M
,
Kawasaki
 
K
.
Convergent losses of SCPP genes and ganoid scales among non-teleost actinopterygians
.
Gene
.
2022
:
811
:
146091
. .

Miller
 
EC
,
Martinez
 
CM
,
Friedman
 
ST
,
Wainwright
 
PC
,
Price
 
SA
,
Tornabene
 
L
.
Alternating regimes of shallow and deep-sea diversification explain a species-richness paradox in marine fishes
.
Proc Natl Acad Sci U S A
.
2022
:
119
(
43
):
e2123544119
. .

Morescalchi
 
MA
,
Liguori
 
I
,
Rocco
 
L
,
Stingo
 
V
.
Karyotypic characterization and genomic organization of the 5S rDNA in Erpetoichthys calabaricus (Osteichthyes, Polypteridae)
.
Genetica
.
2007
:
131
(
2
):
209
216
. .

Moritz
 
T
,
Britz
 
R
.
Revision of the extant Polypteridae (Actinopterygii: Cladistia)
.
Ichthyol Explor Freshw
.
2019
:
IEF-1094
:
1
96
. .

Moriyama
 
Y
,
Koshiba-Takeuchi
 
K
.
Significance of whole-genome duplications on the emergence of evolutionary novelties
.
Brief Funct Genomics
.
2018
:
17
(
5
):
329
338
. .

Muñoz-López
 
M
,
García-Pérez
 
JL
.
DNA transposons: nature and applications in genomics
.
Curr Genomics
.
2010
:
11
(
2
):
115
128
. .

Naville
 
M
,
Henriet
 
S
,
Warren
 
I
,
Sumic
 
S
,
Reeve
 
M
,
Volff
 
J-N
,
Chourrout
 
D
.
Massive changes of genome size driven by expansions of non-autonomous transposable elements
.
Curr Biol
.
2019
:
29
(
7
):
1161
1168.e6
. .

Near
 
TJ
,
Dornburg
 
A
,
Eytan
 
RI
,
Keck
 
BP
,
Smith
 
WL
,
Kuhn
 
KL
,
Moore
 
JA
,
Price
 
SA
,
Burbrink
 
FT
,
Friedman
 
M
, et al.  
Phylogeny and tempo of diversification in the superradiation of spiny-rayed fishes
.
Proc Natl Acad Sci U S A
.
2013
:
110
(
31
):
12738
12743
. .

Near
 
TJ
,
Dornburg
 
A
,
Kuhn
 
KL
,
Eastman
 
JT
,
Pennington
 
JN
,
Patarnello
 
T
,
Zane
 
L
,
Fernández
 
DA
,
Jones
 
CD
.
Ancient climate change, antifreeze, and the evolutionary diversification of Antarctic fishes
.
Proc Natl Acad Sci U S A
.
2012
:
109
(
9
):
3434
3439
. .

Near
 
TJ
,
Dornburg
 
A
,
Tokita
 
M
,
Suzuki
 
D
,
Brandley
 
MC
,
Friedman
 
M
.
Boom and bust: ancient and recent diversification in bichirs (Polypteridae: Actinopterygii), a relictual lineage of ray-finned fishes
.
Evolution
.
2014
:
68
(
4
):
1014
1026
. .

Niu
 
X-M
,
Xu
 
Y-C
,
Li
 
Z-W
,
Bian
 
Y-T
,
Hou
 
X-H
,
Chen
 
J-F
,
Zou
 
Y-P
,
Jiang
 
J
,
Wu
 
Q
,
Ge
 
S
, et al.  
Transposable elements drive rapid phenotypic variation in
.
Proc Natl Acad Sci U S A
.
2019
:
116
(
14
):
6908
6913
. .

Novick
 
PA
,
Smith
 
JD
,
Floumanhaft
 
M
,
Ray
 
DA
,
Boissinot
 
S
.
The evolution and diversity of DNA transposons in the genome of the Lizard Anolis carolinensis
.
Genome Biol Evol
.
2011
:
3
:
1
14
. .

Olmo
 
E
,
Capriglione
 
T
,
Odierna
 
G
.
Genome size evolution in vertebrates: trends and constraints
.
Comp Biochem Physiol B
.
1989
:
92
(
3
):
447
453
. .

Pagel
 
M
.
Inferring the historical patterns of biological evolution
.
Nature
.
1999
:
401
(
6756
):
877
884
. .

Papon
 
N
,
Lasserre-Zuber
 
P
,
Rimbert
 
H
,
De Oliveira
 
R
,
Paux
 
E
,
Choulet
 
F
.
All families of transposable elements were active in the recent wheat genome evolution and polyploidy had no impact on their activity
.
Plant Genome
.
2023
:
16
(
3
):
e20347
. .

Parisod
 
C
,
Alix
 
K
,
Just
 
J
,
Petit
 
M
,
Sarilar
 
V
,
Mhiri
 
C
,
Ainouche
 
M
,
Chalhoub
 
B
,
Grandbastien
 
M-A
.
Impact of transposable elements on the organization and function of allopolyploid genomes
.
New Phytol
.
2010
:
186
(
1
):
37
45
. .

Parker
 
E
,
Zapfe
 
KL
,
Yadav
 
J
,
Frédérich
 
B
,
Jones
 
CD
,
Economo
 
EP
,
Federman
 
S
,
Near
 
TJ
,
Dornburg
 
A
.
Periodic environmental disturbance drives repeated ecomorphological diversification in an adaptive radiation of antarctic fishes
.
Am Nat
.
2022
:
200
(
6
):
E221
E236
. .

Pasquier
 
J
,
Cabau
 
C
,
Nguyen
 
T
,
Jouanno
 
E
,
Severac
 
D
,
Braasch
 
I
,
Journot
 
L
,
Pontarotti
 
P
,
Klopp
 
C
,
Postlethwait
 
JH
, et al.  
Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database
.
BMC Genomics
.
2016
:
17
(
1
):
368
. .

Paulat
 
NS
,
Storer
 
JM
,
Moreno-Santillán
 
DD
,
Osmanski
 
AB
,
Sullivan
 
KAM
,
Grimshaw
 
JR
,
Korstian
 
J
,
Halsey
 
M
,
Garcia
 
CJ
,
Crookshanks
 
C
, et al.  
Chiropterans are a hotspot for horizontal transfer of DNA transposons in mammalia
.
Mol Biol Evol
.
2023
:
40
(
5
):
msad092
. .

Piskurek
 
O
,
Jackson
 
DJ
.
Tracking the ancestry of a deeply conserved eumetazoan SINE domain
.
Mol Biol Evol
.
2011
:
28
(
10
):
2727
2730
. .

Putnam
 
NH
,
O’Connell
 
BL
,
Stites
 
JC
,
Rice
 
BJ
,
Blanchette
 
M
,
Calef
 
R
,
Troll
 
CJ
,
Fields
 
A
,
Hartley
 
PD
,
Sugnet
 
CW
, et al.  
Chromosome-scale shotgun assembly using an in vitro method for long-range linkage
.
Genome Res
.
2016
:
26
(
3
):
342
350
. .

Randhawa
 
SS
,
Pawar
 
R
.
Fish genomes: sequencing trends, taxonomy and influence of taxonomy on genome attributes
.
J Appl Ichthyol
.
2021
:
37
(
4
):
553
562
. .

Rech
 
GE
,
Radío
 
S
,
Guirao-Rico
 
S
,
Aguilera
 
L
,
Horvath
 
V
,
Green
 
L
,
Lindstadt
 
H
,
Jamilloux
 
V
,
Quesneville
 
H
,
González
 
J
.
Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila
.
Nat Commun
.
2022
:
13
(
1
):
1948
. .

Reinar
 
WB
,
Tørresen
 
OK
,
Nederbragt
 
AJ
,
Matschiner
 
M
,
Jentoft
 
S
,
Jakobsen
 
KS
.
Teleost genomic repeat landscapes in light of diversification rates and ecology
.
Mob DNA
.
2023
:
14
(
1
):
14
. .

Reiss
 
D
,
Mialdea
 
G
,
Miele
 
V
,
de Vienne
 
DM
,
Peccoud
 
J
,
Gilbert
 
C
,
Duret
 
L
,
Charlat
 
S
.
Global survey of mobile DNA horizontal transfer in arthropods reveals Lepidoptera as a prime hotspot
.
PLoS Genet
.
2019
:
15
(
2
):
e1007965
. .

Revell
 
LJ
.
Phytools: an R package for phylogenetic comparative biology (and other things)
.
Methods Ecol Evol
.
2012
:
3
(
2
):
217
223
. .

Revell
 
LJ
.
phytools 2.0: an updated R ecosystem for phylogenetic comparative methods (and other things).
 
PeerJ
.
2024
:
12(2024): e16505
.

Ricci
 
M
,
Peona
 
V
,
Guichard
 
E
,
Taccioli
 
C
,
Boattini
 
A
.
Transposable elements activity is positively related to rate of speciation in mammals
.
J Mol Evol
.
2018
:
86
(
5
):
303
310
. .

Rodriguez
 
F
,
Arkhipova
 
IR
.
Transposable elements and polyploid evolution in animals
.
Curr Opin Genet Dev
.
2018
:
49
:
115
123
. .

Senft
 
AD
,
Macfarlan
 
TS
.
Transposable elements shape the evolution of mammalian development
.
Nat Rev Genet
.
2021
:
22
(
11
):
691
711
. .

Seppey
 
M
,
Manni
 
M
,
Zdobnov
 
EM
.
BUSCO: assessing genome assembly and annotation completeness
.
Methods Mol Biol
.
2019
:
1962
:
227
245
. .

Serrato-Capuchina
 
A
,
Matute
 
DR
.
The role of transposable elements in speciation
.
Genes (Basel).
 
2018
:
9
(
5
):
254
. .

Sessegolo
 
C
,
Burlet
 
N
,
Haudry
 
A
.
Strong phylogenetic inertia on genome size and transposable element content among 26 species of flies
.
Biol Lett
.
2016
:
12
(
8
):
20160407
. .

Simão
 
FA
,
Waterhouse
 
RM
,
Ioannidis
 
P
,
Kriventseva
 
EV
,
Zdobnov
 
EM
.
BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs
.
Bioinformatics
.
2015
:
31
(
19
):
3210
3212
. .

Simkin
 
A
,
Wong
 
A
,
Poh
 
Y-P
,
Theurkauf
 
WE
,
Jensen
 
JD
.
Recurrent and recent selective sweeps in the piRNA pathway
.
Evolution
.
2013
:
67
(
4
):
1081
1090
. .

Smit
 
AFA
,
Hubley
 
R
,
Green
 
P.
RepeatMasker; 2005 [accessed 2023]. https://www.repeatmasker.org/webrepeatmaskerhelp.html.

Sotero-Caio
 
CG
,
Platt
 
RN
 2nd
,
Suh
 
A
,
Ray
 
DA
.
Evolution and diversity of transposable elements in vertebrate genomes
.
Genome Biol Evol
.
2017
:
9
(
1
):
161
177
. .

Suzuki
 
D
,
Brandley
 
MC
,
Tokita
 
M
.
The mitochondrial phylogeny of an ancient lineage of ray-finned fishes (Polypteridae) with implications for the evolution of body elongation, pelvic fin loss, and craniofacial morphology in Osteichthyes
.
BMC Evol Biol
.
2010
:
10
(
1
):
209
. .

Tambones
 
IL
,
Haudry
 
A
,
Simão
 
MC
,
Carareto
 
CMA
.
High frequency of horizontal transfer in Jockey families (LINE order) of drosophilids
.
Mob DNA
.
2019
:
10
(
1
):
43
. .

Thompson
 
AW
,
Hawkins
 
MB
,
Parey
 
E
,
Wcisel
 
DJ
,
Ota
 
T
,
Kawasaki
 
K
,
Funk
 
E
,
Losilla
 
M
,
Fitch
 
OE
,
Pan
 
Q
, et al.  
The bowfin genome illuminates the developmental evolution of ray-finned fishes
.
Nat Genet
.
2021
:
53
(
9
):
1373
1384
. .

Torres
 
DE
,
Thomma
 
BPHJ
,
Seidl
 
MF
.
Transposable elements contribute to genome dynamics and gene expression variation in the fungal plant pathogen verticillium dahliae
.
Genome Biol Evol
.
2021
:
13
(
7
):
evab135
. .

Turner
 
BA
,
Miorin
 
TR
,
Stewart
 
NB
,
Reid
 
RW
,
Moore
 
CC
,
Rogers
 
RL
.
Chromosomal rearrangements as a source of local adaptation in island Drosophila, arXiv, arXiv:2109.09801, preprint: not peer reviewed
2021
.

Vervoort
 
A
.
Karyotypes and nuclear DNA contents of polypteridae (Osteichthyes)
.
Experientia
.
1980
:
36
(
6
):
646
647
. .

Vialle
 
RA
,
de Souza
 
JES
,
Lopes
 
KDP
,
Teixeira
 
DG
,
Alves Sobrinho
 
PDA
,
Ribeiro-Dos-Santos
 
AM
,
Furtado
 
C
,
Sakamoto
 
T
,
Oliveira Silva
 
FA
,
Herculano Corrêa de Oliveira
 
E
, et al.  
Whole genome sequencing of the pirarucu (Arapaima gigas) supports independent emergence of major teleost clades
.
Genome Biol Evol
.
2018
:
10
(
9
):
2366
2379
. .

Volff
 
J-N
,
Bouneau
 
L
,
Ozouf-Costaz
 
C
,
Fischer
 
C
.
Diversity of retrotransposable elements in compact pufferfish genomes
.
Trends Genet
.
2003
:
19
(
12
):
674
678
. .

Wcisel
 
DJ
,
Howard
 
JT
 3rd
,
Yoder
 
JA
,
Dornburg
 
A
.
Transcriptome ortholog alignment sequence tools (TOAST) for phylogenomic dataset assembly
.
BMC Evol Biol
.
2020
:
20
(
1
):
41
. .

Wen
 
H
,
Luo
 
T
,
Wang
 
Y
,
Wang
 
S
,
Liu
 
T
,
Xiao
 
N
,
Zhou
 
J
.
Molecular phylogeny and historical biogeography of the cave fish genus Sinocyclocheilus (Cypriniformes: Cyprinidae) in southwest China
.
Integr Zool
.
2022
:
17
(
2
):
311
325
. .

Wickham
 
H
.
ggplot2: elegant graphics for data analysis
. New York:
Springer-Verlag
;
2016
.
[accessed 2024 Oct].
 https://ggplot2.tidyverse.org.

Wong
 
WY
,
Simakov
 
O
,
Bridge
 
DM
,
Cartwright
 
P
,
Bellantuono
 
AJ
,
Kuhn
 
A
,
Holstein
 
TW
,
David
 
CN
,
Steele
 
RE
,
Martínez
 
DE
.
Expansion of a single transposable element family is associated with genome-size increase and radiation in the genus
.
Proc Natl Acad Sci U S A
.
2019
:
116
(
46
):
22915
22917
. .

Wright
 
JJ
,
Bruce
 
SA
,
Sinopoli
 
DA
,
Palumbo
 
JR
,
Stewart
 
DJ
.
Phylogenomic analysis of the bowfin (Amia calva) reveals unrecognized species diversity in a living fossil lineage
.
Sci Rep
.
2022
:
12
(
1
):
16514
. .

Yoder
 
JA
,
Walsh
 
CP
,
Bestor
 
TH
.
Cytosine methylation and the ecology of intragenomic parasites
.
Trends Genet
.
1997
:
13
(
8
):
335
340
. .

Yu
 
G
.
Data integration, manipulation and visualization of phylogenetic trees
.
CRC Press
;
2022
.

Yuan
 
Z
,
Liu
 
S
,
Zhou
 
T
,
Tian
 
C
,
Bao
 
L
,
Dunham
 
R
,
Liu
 
Z
.
Comparative genome analysis of 52 fish species suggests differential associations of repetitive elements with their living aquatic environments
.
BMC Genomics
.
2018
:
19
(
1
):
141
. .

Zhang
 
H-H
,
Feschotte
 
C
,
Han
 
M-J
,
Zhang
 
Z
.
Recurrent horizontal transfers of Chapaev transposons in diverse invertebrate and vertebrate animals
.
Genome Biol Evol
.
2014
:
6
(
6
):
1375
1386
. .

Zhang
 
H-H
,
Peccoud
 
J
,
Xu
 
M-R-X
,
Zhang
 
X-G
,
Gilbert
 
C
.
Horizontal transfer and evolution of transposable elements in vertebrates
.
Nat Commun
.
2020
:
11
(
1
):
1362
. .

Zhang
 
Z
,
Sakuma
 
A
,
Kuraku
 
S
,
Nikaido
 
M
.
2021
. Evolutionary transition from water to land in vertebrates illuminated by basal ray-finned fish vomeronasal type 2 receptor (OlfC) genes. Preprint [Internet]. Available from: .

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Associate Editor: Andrea Betancourt
Andrea Betancourt
Associate Editor
Search for other works by this author on:

Supplementary data