Abstract

The origin of methanogenesis can be traced to the common ancestor of non-DPANN archaea, whereas haloarchaea (or Halobacteria) are believed to have evolved from a methanogenic ancestor through multiple evolutionary events. However, due to the accelerated evolution and compositional bias of proteins adapting to hypersaline habitats, Halobacteria exhibit substantial evolutionary divergence from methanogens, and the identification of the closest methanogen (either Methanonatronarchaeia or other taxa) to Halobacteria remains a subject of debate. Here, we obtained five metagenome-assembled genomes with high completeness from soda-saline lakes on the Ordos Plateau in Inner Mongolia, China, and we proposed the name Candidatus Ordosarchaeia for this novel class. Phylogenetic analyses revealed that Ca. Ordosarchaeia is firmly positioned near the median position between the Methanonatronarchaeia and Halobacteria–Hikarchaeia lineages. Functional predictions supported the transitional status of Ca. Ordosarchaeia with the metabolic potential of nonmethanogenic and aerobic chemoheterotrophy, as did remnants of the gene sequences of methylamine/dimethylamine/trimethylamine metabolism and coenzyme M biosynthesis. Based on the similarity of the methyl-coenzyme M reductase genes mcrBGADC in Methanonatronarchaeia with the phylogenetically distant methanogens, an alternative evolutionary scenario is proposed, in which Methanonatronarchaeia, Ca. Ordosarchaeia, Ca. Hikarchaeia, and Halobacteria share a common ancestor that initially lost mcr genes. However, certain members of Methanonatronarchaeia subsequently acquired mcr genes through horizontal gene transfer from distantly related methanogens. This hypothesis is supported by amalgamated likelihood estimation, phylogenetic analysis, and gene arrangement patterns. Altogether, Ca. Ordosarchaeia genomes clarify the sisterhood of Methanonatronarchaeia with Halobacteria and provide new insights into the evolution from methanogens to haloarchaea.

Introduction

Halobacteria (also named haloarchaea or extreme halophiles) was one of the first identified archaeal lineages [1]. This class was initially affiliated with the phylum Euryarchaeota [2], and it was recently classified into a new phylum named Halobacteriota using a concatenated protein phylogeny in the Genome Taxonomy Database (GTDB) [3, 4]. Halobacteria are characterized by the ability to thrive in and depend on hypersaline environments [5, 6]. Most Halobacteria species require more than 1.5 M NaCl for growth [7]. The accumulation of sufficient inorganic salt (i.e. KCl) in the cytoplasm maintains an osmotic pressure equal to that in hypersaline environments [8]. Along with this thermodynamically favorable “salt-in” strategy, intracellular proteins accumulate a high proportion of acidic amino acids, leading to an acidified proteome [9, 10]. Moreover, these acidified proteins become dependent on high salt concentrations [10]. Although separated by a short phylogenetic distance from methanogens [11–13], Halobacteria species exhibit distinctive energy metabolism from that of methanogens. They cannot produce methane, and they generally exhibit aerobic respiration [14–16]. In fact, the methanogenic origin has been widely inferred to date back to the archaeal common ancestor [12, 17, 18]. The evolutionary pathway from anaerobic methanogens to aerobic and nonmethanogenic Halobacteria is an important topic.

Initially, Halobacteria were found to be closely related to Methanomicrobia [1], and their sisterhood was inferred in several phylogenetic analyses [2, 19, 20]. Later, Methanonatronarchaeia inhabiting hypersaline environments were reported as an alternative methanogenic sister [12], and some studies agree with this statement [21, 22]. However, the phylogenetic placement of Methanonatronarchaeia is also debated for long-branch attraction in the phylum Halobacteriota [23, 24]. The long branch of Halobacteria is possibly the result of fast adaptive evolution to (hyper)saline environments [25]. By removing the fast-evolving sites from the multiple sequence alignment prior to tree reconstruction, Methanonatronarchaeia was pushed to the root of the phylum Halobacteriota [23]. However, highly heterogeneous sites also contain phylogenetic information, and their removal might cause tree reconstruction artifacts [24]. Recently, the genomes of Candidatus Hikarchaeia species were assembled, and their inclusion as a transitional phase provided novel evidence that Methanonatronarchaeia were not the most closely related methanogens [13]. In the case that either Methanomicrobia or Methanonatronarchaeia represent the most closely related methanogens, Halobacteria were considered to have lost methanogenesis-related genes in the last common ancestor, and they laterally acquired many genes, including cytochrome c oxidase genes, from bacteria or other lineages [11, 13, 19, 21].

In this study, we obtained five highly complete Candidatus Ordosarchaeia genomes by assembling metagenomes collected from soda-saline lakes on the Ordos Plateau in China. Apart from analyzing their ecological distribution and metabolic potentials, our findings suggest that Ca. Ordosarchaeia diverged from an intermediate evolutionary position between the Methanonatronarchaeia and Halobacteria–Hikarchaeia lineages. The inclusion of Ca. Ordosarchaeia relieves the long-branch attraction of Halobacteria in phylogenetic analysis. Additional analyses based on their robust phylogeny provided new insights into the evolution from methanogens to haloarchaea.

Materials and methods

Sample collection, metagenome sequencing, and genome assembly

In total, 23 brine and sediment samples were collected from the ponds (salinity: 1%–33%) of soda-saline lakes (Habor Lake and Hutong Qagan Lake) on the Ordos Plateau of the Inner Mongolia Autonomous Region, China. Five deep sediment samples were used for enrichment cultivation. The sample site, physicochemical characterization, enrichment culture, DNA extraction, and metagenomic sequencing were described in a previous study [26–28]. Read quality control, contig assembly, and draft genome binning were performed with the same parameters as in previous research [27] based on the metaWRAP (v1.2.2) pipeline [29]. The representative genomes of Ca. Ordosarchaeia were included in the metagenome-assembled genomes (MAGs).

Genomic estimation and taxonomic assignment

Genome size, contig number, N50, N90, maximum contig length, and G + C content were analyzed using the bbstats.sh script (25 July 2019) in the BBTools suite (sourceforge.net/projects/bbmap/). Genome completeness and contamination were determined using a lineage-specific workflow in CheckM (v1.1.3) [30]. Taxonomic classifications were assigned using the classify workflow in GTDB-Tk (v2.0.0) [31] based on Release 207 in GTDB. The genomes were annotated using Prokka (v1.14.6) with the settings of Archaea for annotation mode and RNAmmer (v1.2) for rRNA prediction [32, 33]. Aragorn (v1.2.41) was used to predict tRNAs [34], whereas Prodigal (v2.6.3) was used to predict coding genes [35]. The isoelectric point of each protein was predicted using the protein isoelectric point calculator IPC (January 2016) [36]. In total, 37 Halobacteriota genomes that were closely related to Ca. Ordosarchaeia were simultaneously analyzed as references (Supplementary Table S1). The average nucleotide identity between two genomes and the average amino acid identity (AAI) between two sets of predicted coding sequences from genomes were calculated using FastANI (v1.33) with default options [37] and the online AAI calculator (http://enve-omics.ce.gatech.edu/aai/index), respectively. The identities among the 16S rRNA gene sequences were estimated using BLASTN (v2.6.0+) [38].

Phylogenetic analysis

The 3412 archaeal genomes in GTDB Release 207 were selected as a reference for phylogenomic analyses. Species trees were reconstructed from a concatenated set of 53 archaeal marker proteins [39], and the multiple sequence alignment file was produced during the classification workflow of GTDB-Tk [31]. The multiple sequence alignment could be directly used to rebuild the phylogenetic trees (named “untreated”), or it could be trimmed using BMGE v2.0 to remove regions with high entropy-like scores (weighted with the BLOSUM30 similarity matrix), which are not suited for phylogenetic inference [40]. Alternatively, ClipKIT (v1.3.0) was selected to identify and retain parsimony-informative sites [41]. Maximum-likelihood trees were reconstructed using IQ-TREE (multicore version 1.6.12) with an ultrafast bootstrap approximation based on 1000 replicates, and the best-fit model was chosen according to the Bayesian Information Criterion in ModelFinder [42]. To interpret the phylogenomic relationship, the multiple sequence alignment was subjected to the removal of sequences of Ca. Ordosarchaeia lineage and the removal of acidic amino acid residues by referencing the locations in different lineages. In addition, a Bayesian Markov chain Monte Carlo (MCMC)-based phylogenic analysis was performed using MrBayes (3.2.7a x86_64) [43, 44] with the following parameters: aamodelpr option for prior probability distribution, fixed gtr; rates for likelihood model, invgamma; number of rate categories for the gamma distribution, 4; number of runs, 2; number of generations, 3 000 000; number of chains for Metropolis-coupled MCMC variant, 4; samplefreq, 100; printfreq, 100; diagnfreq, 500; checkfreq, 1000; and fraction of samples that will be discarded in convergence diagnostics, 25%. Seventy-two archaeal genomes (including the 37 Halobacteriota genomes) were elaborately selected (listed in Supplementary Table S2) as a reference for the Bayes tree. A convergent Bayes tree was obtained when the average standard deviation of split frequencies was <0.01. The roots of all trees were set between DPANN and other phyla or between TACK and Euryarchaeota according to previous research [45]. All phylogenetic trees were visualized using the online tool Interactive Tree of Life (v6.6) [46].

Functional annotation

The functions of encoding genes were predicted using an ortholog-based method. The five Ca. Ordosarchaeia genomes obtained in this research, along with 37 reference genomes, were used (Supplementary Table S1). All encoding genes were first functionally annotated according to the highest bit scores to the reference sequences in Archaeal Clusters of Orthologous Genes (arCOGs, the latest version ar18 at https://ftp.ncbi.nih.gov/pub/wolf/COGs/arCOG/) using BLASTP with threshold values of 30% identity and an evalue of 1e-5 [38]. Additionally, eggNOG-mapper (v2.0.0) was used to provide more information [47, 48], including Clusters of Orthologous Genes categories [49], Kyoto Encyclopedia of Genes and Genomes Orthology identifiers, and carbohydrate-active enzymes [50]. The provirus sequences in the genomes were predicted using VirSorter version 2.2.4 [51]. The defense systems were identified using DefenseFinder [52].

Approximation of gene content and evolutionary events in ancestors

The presence or absence of each orthogroup containing four or more protein sequences and evolutionary events in ancestors was approximated across the archaeal domain according to our previous approach [27]. The 77 archaeal genomes, including 5 genomes obtained in this research, 37 representatives of Halobacteriota, and 35 genomes from other phyla, were selected. Briefly, the sequences of identical orthogroups were aligned using MAFFT (v7.407) with the L-INS-i method of high accuracy [53] and then trimmed using the heuristic automated1 method of trimAl (1.2rev59) [54]. Next, sequences containing too many gaps were excluded with the following options: minimum overlap of a position with other positions, 0.3; and minimum percentage of the satisfied positions, 50. A UFBOOT tree as a gene tree for each orthogroup was reconstructed using IQ-TREE with the settings (−m, LG + G; −bb, 1000; −wbtl) reported in previous research [55]. The frequencies of duplication, transfer (gene transfers from the lineage inside the species tree), loss, and origination (gene transfers from the lineages outside the species tree, or true gene originations), as well as the copy number of each orthogroup at each node, were inferred by maximum-likelihood estimation using the amalgamated likelihood estimation (ALE, v0.4) approach [56]. Considering the incompleteness of some genomes, the expected fraction of missing genes per genome was introduced from the completeness estimation of checkM. The orthogroups that exhibited a threshold of 0.3 in the raw reconciliation frequencies were counted [13].

Phylogenetic analysis of marker proteins involved in methane metabolism

The methyl-coenzyme M reductase (Mcr) ABG subunits are marker proteins of anaerobic methane metabolism, including methanogenesis [19, 57, 58]. We selected the protein sequences of the McrABG subunits in Methanonatronarchaeia as seeds. The sequences of each subunit were screened using BLASTP with an evalue of 1e-3 from 3412 archaeal representative genomes in GTDB Release 207 for phylogenetic analysis. The putative alkyl-coenzyme M reductase (Acr) subunits were selected as outgroups for the phylogenetic trees according to previous research [18, 59]. The sequences were aligned using MAFFT with the L-INS-i method. After trimming using trimAl with the aforementioned settings, the maximum-likelihood trees were reconstructed using IQ-TREE with an ultrafast bootstrap approximation based on 1000 replicates, and the best-fit model was chosen.

Relative abundance and geographic distribution

The relative abundance of genomes in the metagenomes was expressed as the reads per kilobase of contigs per million reads mapped calculated using CoverM (v.0.6.0) with the genome module (https://github.com/wwood/CoverM). The global distribution was based on the location of Ca. Ordosarchaeia detected from 16S rRNA gene and genomic sequences. Five Ca. Ordosarchaeia genomes were obtained from our samples, and one related genome (assembly accession: GCA_018609935.1) was collected from GTDB. The 16S rRNA gene sequences were screened from the nt database (update date, 16 October 2022) using BLASTN [38] against those retrieved from Ca. Ordosarchaeia genomes (two 16S rRNA sequences from the six Ca. Ordosarchaeia genomes). The taxonomic assignment was identified through phylogenetic placement. The 16S phylogenetic tree was reconstructed using IQ-TREE [42] after multiple sequence alignment and trimming using MAFFT [53] and trimAl [54], respectively. All options were set as previously described. The sequences that satisfied the following conditions were regarded as Ca. Ordosarchaeia: more than 500 bp in length; identity exceeding 82% with one of the reference sequences; and closer to reference sequences in the phylogenetic tree. The location information was retrieved from the metadata or by consulting the related publication, and the latitude and longitude values were mapped using the “maps” package (v3.4.0; https://CRAN.R-project.org/package=maps) in R project (v4.2.0).

Results

Biogeographic distribution and hypersaline adaptation of Ca. Ordosarchaeia, a novel class in the phylum Halobacteriota

Five MAGs with high completeness were obtained from soda-saline lakes on the Ordos Plateau of Inner Mongolia (Table 1). The MAGs were predicted to belong to the order JAHENH01 in the GTDB taxonomy database (Supplementary Table S1). However, they exhibited comparable average amino acid and 16S rRNA gene identities with Halobacteria (43%–52% and 79%–86%, respectively) as Marine Group IV archaea, which was recently renamed a novel class Ca. Hikarchaeia (47%–52% and 85%–89%, respectively; Supplementary Table S3). In addition, they shared lower identities (42%–45% and 80%–83%, respectively) with Ca. Hikarchaeia. Since they form a separate branch from Halobacteria and Ca. Hikarchaeia in phylogenetic analyses, we proposed that they be considered a novel class, which we named Ca. Ordosarchaeia (detailed in the Supplementary Information). We mainly utilized 16S rRNA gene sequences from Ca. Ordosarchaeia to explore their diversity and global distribution patterns (Supplementary Table S4). The majority of the 47 obtained sequences, which formed a cluster with Ods01, were found in various hypersaline or saline habitats, including brine, sediment, soil, and crust, whereas only two sequences (LN870305.1 and LN870307.2) were obtained from the human respiratory tract (Fig. 1A). Moreover, one sequence (MK894681.1) from hypersaline water exhibited high similarity to BinSanityLC-kmean-bin_57-bin_0-refined_5 (one MAG reported in the Bonneville Salt Flats [60]; renamed JAHENH01 in Fig. 1A). The geographic locations of environmental samples were mapped to illustrate the distribution. Samples were collected in Eurasia, Africa, America, and Australia (Fig. 1B). In summary, Ca. Ordosarchaeia exhibited a preference for hypersaline environments and a wide global distribution.

Phylogeny and biogeography of Ca. Ordosarchaeia; (A) phylogenetic diversity of Ca. Ordosarchaeia based on the 16S rRNA gene; the maximum-likelihood tree was reconstructed using the SYM + I + G4 model; the number at the node represents the percentage of ultrafast bootstrap support (>70%) after 1000 iterations; the number in the bracket behind the class name represents the number of sequences; Ods01 was obtained in this study; (B) geographic distribution of Ca. Ordosarchaeia; the latitude and longitude were mapped to illustrate the 16S rRNA genes more than 500 bp in length and the closely related phylogenetic relationships with Ca. Ordosarchaeia MAGs; colored triangles and circles denote environmental features, whereas black squares represent the sequences obtained from the host respiratory tract (Supplementary Table S4); “Hypersaline” indicates that the sample contains a high concentration of salts, whereas “saline” indicates that the sample contains salts, but the content was not provided; the blue circle with a black border marks the sequences obtained in this study.
Figure 1

Phylogeny and biogeography of Ca. Ordosarchaeia; (A) phylogenetic diversity of Ca. Ordosarchaeia based on the 16S rRNA gene; the maximum-likelihood tree was reconstructed using the SYM + I + G4 model; the number at the node represents the percentage of ultrafast bootstrap support (>70%) after 1000 iterations; the number in the bracket behind the class name represents the number of sequences; Ods01 was obtained in this study; (B) geographic distribution of Ca. Ordosarchaeia; the latitude and longitude were mapped to illustrate the 16S rRNA genes more than 500 bp in length and the closely related phylogenetic relationships with Ca. Ordosarchaeia MAGs; colored triangles and circles denote environmental features, whereas black squares represent the sequences obtained from the host respiratory tract (Supplementary Table S4); “Hypersaline” indicates that the sample contains a high concentration of salts, whereas “saline” indicates that the sample contains salts, but the content was not provided; the blue circle with a black border marks the sequences obtained in this study.

Table 1

Genomic features of the metagenome-assembled genomes (MAGs) affiliated with Ca. Ordosarchaeia in this study.

MAGWGS accession numberSize (bp)Contigs numberN50 (bp)GC (mol%)Completeness (%)Contamination (%)Reference for metagenome
Ods01JAQZCZ0000000001 865 47813325 31660.8297.062.64[27]
Ods02JAQZDA0000000002 272 0495679 72861.4894.750.65[27]
Ods05JAQZDB0000000001 224 804315430261.9955.362.78[27]
Ods03JAQZDC0000000001 686 625416591262.6684.013.98[27]
Ods04JAQZDD0000000001 703 903411504260.0973.532.61[26]
MAGWGS accession numberSize (bp)Contigs numberN50 (bp)GC (mol%)Completeness (%)Contamination (%)Reference for metagenome
Ods01JAQZCZ0000000001 865 47813325 31660.8297.062.64[27]
Ods02JAQZDA0000000002 272 0495679 72861.4894.750.65[27]
Ods05JAQZDB0000000001 224 804315430261.9955.362.78[27]
Ods03JAQZDC0000000001 686 625416591262.6684.013.98[27]
Ods04JAQZDD0000000001 703 903411504260.0973.532.61[26]
Table 1

Genomic features of the metagenome-assembled genomes (MAGs) affiliated with Ca. Ordosarchaeia in this study.

MAGWGS accession numberSize (bp)Contigs numberN50 (bp)GC (mol%)Completeness (%)Contamination (%)Reference for metagenome
Ods01JAQZCZ0000000001 865 47813325 31660.8297.062.64[27]
Ods02JAQZDA0000000002 272 0495679 72861.4894.750.65[27]
Ods05JAQZDB0000000001 224 804315430261.9955.362.78[27]
Ods03JAQZDC0000000001 686 625416591262.6684.013.98[27]
Ods04JAQZDD0000000001 703 903411504260.0973.532.61[26]
MAGWGS accession numberSize (bp)Contigs numberN50 (bp)GC (mol%)Completeness (%)Contamination (%)Reference for metagenome
Ods01JAQZCZ0000000001 865 47813325 31660.8297.062.64[27]
Ods02JAQZDA0000000002 272 0495679 72861.4894.750.65[27]
Ods05JAQZDB0000000001 224 804315430261.9955.362.78[27]
Ods03JAQZDC0000000001 686 625416591262.6684.013.98[27]
Ods04JAQZDD0000000001 703 903411504260.0973.532.61[26]

The five MAGs obtained in this study exhibited high abundance in soda-saline lakes or enrichment cultures with high salt concentrations (Supplementary Fig. S1A). Thus, these MAGs can be considered the representative genomes of Ca. Ordosarchaeia to decipher their hypersaline adaptation and ecophysiological potential. An acid shift in the isoelectric point profile of a proteome can be indicative of the salt-in strategy [28]. The predicted proteomes based on the genome sequences revealed that the five MAGs have acid-shifted isoelectric point profiles and possess similarly low average isoelectric points as Haloferax volcanii (Supplementary Fig. S1B and C), a model species of Halobacteria as a typical salt-in halophile [61]. In addition, the five Ca. Ordosarchaeia MAGs contained high proportions of acidic amino acid residues in the predicted proteomes (Supplementary Fig. S1D). Collectively, these findings suggest that Ca. Ordosarchaeia species employ the salt-in strategy to maintain sufficient osmotic pressure within the cytoplasm for hypersaline adaptation.

Methanonatronarchaeia is the methanogen most closely related to the Ordosarchaeia–Hikarchaeia–Halobacteria branch

The phylogenetic tree based on 16S rRNA genes revealed that Ca. Ordosarchaeia is a deep lineage of Ca. Hikarchaeia and Halobacteria. Moreover, these three classes form a sister branch with Methanonatronarchaeia (Fig. 1A). To determine the placements of these four classes and their evolutionary relationships, we conducted whole genome-based phylogenetic analyses. The phylogenomic tree based on the 53 “top-ranked” archaeal marker proteins indicated that the four classes form a distinct clade from the other classes within the phylum Halobacteriota with 100% bootstrap support, and Methanonatronarchaeia was located at the root of the clade with 90% bootstrap support (Fig. 2A). To mitigate potential errors caused by long-branch attraction in the phylogeny of Halobacteriota, we employed multiple approaches for tree reconstruction. First, we removed the fast-evolving sites in the multiple sequence alignment prior to maximum-likelihood tree reconstruction using two different tools, namely, BMGE and ClipKit [40, 41]. In this analysis, the phylogenetic positions of the four classes remained unchanged (Supplementary Fig. S2). Second, we generated a consensus tree using Bayesian inference, which positioned the four classes with 100% probability for each partition, similar to the maximum-likelihood trees (Supplementary Fig. S3). Third, we performed a maximum-likelihood analysis excluding the Ca. Ordosarchaeia lineage, which resulted in a significant alteration of the tree structure. In this scenario, Methanonatronarchaeia was relocated to the root of the phylum Halobacteriota with 100% bootstrap support (Fig. 2B). These modifications collectively indicate that incorporating Ca. Ordosarchaeia at an intermediate position between Methanonatronarchaeia and Halobacteria substantially enhances the robustness of phylogenetic analyses.

Phylogenomic analyses of the classes in the phylum Halobacteriota; maximum-likelihood trees (LG + F + G4 model) with (A) and without (B) Ca. Ordosarchaeia MAGs; the number at the node represents the percentage of ultrafast bootstrap support (>70%) after 1000 iterations; both phylogenomic trees are based on the 53 archaeal marker proteins, and the reference members are listed in Supplementary Table S2; some branches are collapsed, and the number in the bracket represents the number of genomes.
Figure 2

Phylogenomic analyses of the classes in the phylum Halobacteriota; maximum-likelihood trees (LG + F + G4 model) with (A) and without (B) Ca. Ordosarchaeia MAGs; the number at the node represents the percentage of ultrafast bootstrap support (>70%) after 1000 iterations; both phylogenomic trees are based on the 53 archaeal marker proteins, and the reference members are listed in Supplementary Table S2; some branches are collapsed, and the number in the bracket represents the number of genomes.

To identify the factors influencing the phylogenetic analysis of Halobacteriota, we conducted artificial treatments to remove the acidic amino acid residues, i.e. glutamate and aspartate, in the multiple sequence alignment based on the 53 archaeal marker proteins before the reconstruction of maximum-likelihood trees. Methanonatronarchaeia was also pushed down to the root of the whole phylum by referencing the locations of the sites in each representative sequence from Halobacteria, Ca. Hikarchaeia, Ca. Ordosarchaeia, and Methanonatronarchaeia (Supplementary Fig. S4A–D). However, when considering Methanomicrobia and Methanosarcinia, the position of Methanonatronarchaeia close to Ordosarchaeia–Hikarchaeia–Halobacteria remained unchanged (Supplementary Fig. S4E and F). We observed that the most selected tree models in ModelFinder were LG + G4, but the model became LG + F + G4 when referring to the Ca. Hikarchaeia sequence. Therefore, we used the LG + G4 model for the treatment based on Ca. Hikarchaeia (Supplementary Fig. S4G). Both models exhibited the same topological structure at the class level. Collectively, the trees based on the different treatments suggest that the clade of Halobacteria, Ca. Hikarchaeia, Ca. Ordosarchaeia, and Methanonatronarchaeia share conserved acidic amino acid residues at certain sites that probably drive the evolution of this clade. In addition, the acidic amino acid residues accumulated in the early evolutionary phase might not have been driven solely by hypersaline conditions considering the slightly halophilic Ca. Hikarchaeia and MAG BinSanityLC-kmean-bin_57-bin_0-refined_5 (discussed below).

Energy metabolism comparison supports Ca. Ordosarchaeia as an intermediate phase between Methanonatronarchaeia and Halobacteria

The main difference between methanogens and haloarchaea is energy metabolism. Ca. Ordosarchaeia MAGs do not have any mcr genes, but they contain cytochrome c oxidase genes (Fig. 3). Some members (Ods01 and Ods04) even encode bacteriorhodopsin and the related protein arCOG02947 (Supplementary Fig. S5, Supplementary Tables S5 and S6). Obviously, their metabolism based on functional gene prediction was similar to that of Halobacteria even though they do not harbor cytochrome b and cytochrome bd-type quinol oxidase for cytochrome c reductase (Fig. 3). The electron transfer chain is possibly incomplete in Ca. Ordosarchaeia, or electron transfer from quinol to cytochrome c might be accomplished by different enzymes or via a different pathway.

Energy-related metabolism of Ca. Ordosarchaeia compared with reference lineages; the colored and white squares denote the presence and absence of functional genes in the genomes, respectively, and the full names of the genes and their functions are listed in Supplementary Table S6; the assembly accession numbers of the genomes are presented in Supplementary Table S1 and Supplementary Fig. S5; the abbreviations of superordinate taxonomies were as follows: Ods, Ca. Ordosarchaeia; Hik, Ca. Hikarchaeia; Hlb, Halobacteria; Mna, Methanonatronarchaeia; Mmb, Methanomicrobia; ocH, other classes in the phylum Halobacteriota; Mb, Methanobacteriota; MbA, Methanobacteriota_A; MbB, Methanobacteriota_B; Tpl, Thermoplasmatota; Tpr, Thermoproteota; opA, other phyla in archaea.
Figure 3

Energy-related metabolism of Ca. Ordosarchaeia compared with reference lineages; the colored and white squares denote the presence and absence of functional genes in the genomes, respectively, and the full names of the genes and their functions are listed in Supplementary Table S6; the assembly accession numbers of the genomes are presented in Supplementary Table S1 and Supplementary Fig. S5; the abbreviations of superordinate taxonomies were as follows: Ods, Ca. Ordosarchaeia; Hik, Ca. Hikarchaeia; Hlb, Halobacteria; Mna, Methanonatronarchaeia; Mmb, Methanomicrobia; ocH, other classes in the phylum Halobacteriota; Mb, Methanobacteriota; MbA, Methanobacteriota_A; MbB, Methanobacteriota_B; Tpl, Thermoplasmatota; Tpr, Thermoproteota; opA, other phyla in archaea.

Methanonatronarchaeia members were characterized to perform formate- or H2-dependent methyl-reduction to produce methane [12, 62]. We annotated the related genes, including formate dehydrogenases, [NiFe]-hydrogenase, multiple methyltransferases, and Mcr and CoB-CoM heterodisulfide reductase, in some Methanonatronarchaeia genomes (Fig. 3). However, we did not find the complete pathway in any Ca. Ordosarchaeia MAG, but rather, we only identified some subunits, such as methylcobalamin:coenzyme M methyltransferase subunit A (MtbA) in methylamine/dimethylamine/trimethylamine metabolism and Fe-S-cluster-containing formate dehydrogenase (FdhB) for formate utilization. Hydrotroilite (FeS × nH2O) has been reported to be necessary for both the growth and methanogenic activity of Methanonatronarchaeia, although whether hydrotroilite affects the activity of FdhB is unclear [12]. In contrast, most Methanomicrobia species do not carry the genes to reduce methylated compounds or fdhB (Fig. 3). Although the metabolic processes of Ca. Ordosarchaeia remain unidentified, the presence of mtbA and fdhB in Ca. Ordosarchaeia supports its closer relationship with Methanonatronarchaeia than with Methanomicrobia. In addition, the absence of uvrABC genes involved in nucleotide excision repair in both Ca. Ordosarchaeia and Methanonatronarchaeia is consistent with their close relationship. In comparison, these genes are widely distributed among Halobacteria, Ca. Hikarchaeia, Methanomicrobia, and other classes within the phylum Halobacteriota (see Supplementary Information).

Moreover, the five MAGs of Ca. Ordosarchaeia contain genes involved in many types of ion transporters, including the Kef- and Trk-type K+ transport systems and Mnh Na+/H+ antiporters (Supplementary Fig. S5, Supplementary Table S6A). Ion transport performed by salt-in halophiles plays an important role in ion homeostasis and osmotic pressure balance [63]. The Ca. Ordosarchaeia MAGs also featured high copy numbers of universal stress protein A (UspA), especially arCOG02053 and arCOG00449 (Supplementary Table S6A), a marker protein of adaptation to extreme (e.g. hypersaline) environments [12]. The presence of these genes further supports the hypothesis of the salt-in strategy. Additionally, we found that Ca. Ordosarchaeia MAGs harbored genes related to central carbohydrate metabolism, oxidative phosphorylation, terpenoid biosynthesis, and DNA repair but lacked the potential for carbon fixation (Supplementary Fig. S5). Therefore, it is likely that Ca. Ordosarchaeia species are chemoheterotrophs with an aerobic respiration-based lifestyle.

Methanonatronarchaeia possesses an evolutionarily distant Mcr across the robust phylogeny of archaea, including Ca. Ordosarchaeia

During archaeal evolution, Halobacteria are considered to have lost methanogenic capability from their closely related methanogenic ancestor [11–13]. The trees of relevant genes were reconciled against the robust archaeal species tree in this study to reconstruct the ancestral states using the state-of-the-art ALE approach. Orthogroups that exhibited a threshold of 0.3 in the raw reconciliation frequencies were selected to avoid missing true events [13]. In our evolutionary approximation, the ancestor of Halobacteria, Ca. Hikarchaeia, Ca. Ordosarchaeia, and Methanonatronarchaeia (AcsHHOM for short at node N147) might not have harbored all of the genes involved in methanogenesis because the frequencies of the Mcr genes mcrGA are predicted to be no more than 0.3 (Fig. 4A). The frequencies of the mcrBGA genes in the common ancestor of Methanonatronarchaeia (node N124) were greater than 0.3 but much lower than 1.0, whereas they reached or approached 1.0 in the later ancestor (node N113). Approximately 40% of the mcrB genes and more than half of the mcrGA genes of Methanonatronarchaeia were estimated to be obtained at nodes N124 and N113 through horizontal gene transfer (HGT) (Fig. 4A). Similarly, a significant proportion of mcrDC genes in Methanonatronarchaeia appeared to have been laterally acquired at node N113. According to the principle of ALE analysis, the mcr genes harbored by partial Methanonatronarchaeia members might be separated by a longer phylogenetic distance from the genes of most methanogenic lineages in the phylum Halobacteriota compared to those outside the phylum. This was supported by phylogenetic analyses of the sequences of the five Mcr proteins (Fig. 4B–F). Taken together, these results suggest that the mcr genes were likely acquired laterally by the common ancestor (node N124) or a descendant ancestor (node N113) of Methanonatronarchaeia (Supplementary Table S6B, Supplementary Fig. S6). Considering the widespread distribution of methane metabolism in archaea [19, 57, 64], we collected the amino acid sequences of the representative mcrBGA genes across the whole archaeal domain and then reconstructed their phylogenies to trace the source of these genes. In the maximum-likelihood tree based on McrA (also including the homolog alkyl-coenzyme M reductase AcrA subunit) sequences, Methanomicrobia, Methanosarcinia, Methanoliparia, Bog-38, and Methanocellia clustered together, whereas other sequences were located outside this cluster with high confidence (greater than 70% bootstrap support). In other words, Methanonatronarchaeia and ANME-1 are closer to Methanomassiliicoccales and Methanofastidiosales, and Archaeoglobi is in the branch with Thermoproteota (Fig. 5A). The phylogenetic trees based on McrB- and McrG-like sequences featured similar topological structures (Supplementary Fig. S7A and B).

Ancestral inference and phylogenetic analyses of marker proteins involved in methane metabolism; (A)inference of the marker proteins McrBGADC (abbreviated as BGADC for the five proteins) involved in methane metabolism in the main ancestors across the archaea domain; the symbols BGADC are colored to indicate that the copies of the encoding genes are calculated as <0.3; the ancestral potential was estimated by reconciling UFBOOT gene trees against an archaeal species tree using ALE; the ratios of transfers were calculated by dividing the frequencies of transfers by the frequencies of copies; the maximum-likelihood tree (LG + F + I + G4 model) of the consensus species was reconstructed using ultrafast bootstrap approximation based on the 53 archaeal marker proteins from 77 representative genomes (listed in Supplementary Table S2), and the node numbers are presented in Supplementary Fig. S6; the colored and black branches mark the different classes in the phylum Halobacteriota, and the other phyla are not presented; (B–F) phylogenies of five marker proteins from 77 representative genomes; the colored branches mark the different taxonomic sources of the proteins in the archaea.
Figure 4

Ancestral inference and phylogenetic analyses of marker proteins involved in methane metabolism; (A)inference of the marker proteins McrBGADC (abbreviated as BGADC for the five proteins) involved in methane metabolism in the main ancestors across the archaea domain; the symbols BGADC are colored to indicate that the copies of the encoding genes are calculated as <0.3; the ancestral potential was estimated by reconciling UFBOOT gene trees against an archaeal species tree using ALE; the ratios of transfers were calculated by dividing the frequencies of transfers by the frequencies of copies; the maximum-likelihood tree (LG + F + I + G4 model) of the consensus species was reconstructed using ultrafast bootstrap approximation based on the 53 archaeal marker proteins from 77 representative genomes (listed in Supplementary Table S2), and the node numbers are presented in Supplementary Fig. S6; the colored and black branches mark the different classes in the phylum Halobacteriota, and the other phyla are not presented; (B–F) phylogenies of five marker proteins from 77 representative genomes; the colored branches mark the different taxonomic sources of the proteins in the archaea.

Arrangement of mcr genes across the phylogeny of subunit A; (A) phylogenetic analysis of the McrA subunit and its homologs; the maximum-likelihood tree was reconstructed using the LG + F + I + G4 model; the circle at the node signifies >70% ultrafast bootstrap support based on 1000 iterations; some branches are collapsed, and the number in the bracket represents the number of sequences; the label is colored to indicate the sequences that were affiliated with lineages of the phylum Halobacteriota; all sequences are listed in Supplementary Table S7; the detailed approaches are described in the Methods; (B) organization comparison of representative mcr gene clusters involved in methanogenesis; (C) organizational comparison of representative mcr gene clusters involved in the anaerobic oxidation of methane and short-chain alkanes; the direction and color of each arrow denote the gene direction and function, respectively; the contig ID and position of each gene cluster are listed in Supplementary Table S8.
Figure 5

Arrangement of mcr genes across the phylogeny of subunit A; (A) phylogenetic analysis of the McrA subunit and its homologs; the maximum-likelihood tree was reconstructed using the LG + F + I + G4 model; the circle at the node signifies >70% ultrafast bootstrap support based on 1000 iterations; some branches are collapsed, and the number in the bracket represents the number of sequences; the label is colored to indicate the sequences that were affiliated with lineages of the phylum Halobacteriota; all sequences are listed in Supplementary Table S7; the detailed approaches are described in the Methods; (B) organization comparison of representative mcr gene clusters involved in methanogenesis; (C) organizational comparison of representative mcr gene clusters involved in the anaerobic oxidation of methane and short-chain alkanes; the direction and color of each arrow denote the gene direction and function, respectively; the contig ID and position of each gene cluster are listed in Supplementary Table S8.

Furthermore, we compared the gene organization within the mcr cluster. Each Methanonatronarchaeia genome carried a single mcrBDGA gene cluster organized in a head-to-tail manner (Fig. 5B). This gene arrangement, termed the BDGA type in this study, is identical to those in Methanomassiliicoccales and Methanofastidiosales, the McrABG sequences of which share high similarity. However, it differs from the gene arrangement observed in Methanosarcinia, Methanomicrobia, Methanoliparia, Bog-38, and Methanocellia, which belong to the same phylum as Methanonatronarchaeia, but their representative genomes consistently contained a BDCGA-type mcr cluster. Additionally, in Methanomicrobia, approximately half of the representative genomes carried an additional BDGA-type cluster, whereas some genomes featured other types (Fig. 5B). Unexpectedly, the multiple mcr clusters of the two lineages shape separate branches in the phylogenetic trees of McrA, McrB, and McrG (Fig. 5A, Supplementary Fig. S7). In other words, McrABGs located in different types in the same taxonomic class might have been generated through gene duplication. Furthermore, certain genomes within the phylum Methanobacteriota also exhibited two different types of mcr clusters (Fig. 5B and C). Considering the expansion and conservation of the BDCGA type in Halobacteriota and Methanobacteriota (excluding Methanonatronarchaeia and Methanofastidiosales, respectively), we hypothesize that this type was vertically inherited in both phyla and that other types were later derived from it (in the lineages possessing two or more types of mcr gene clusters), or that all of them were lost in lineages lacking mcr genes. With respect to the evolutionarily distant BDGA types in Methanonatronarchaeia, Methanofastidiosales, and Methanomassiliicoccales, they might have diverged in certain methanogens in the early phase and later horizontally transferred to other lineages (such as Methanonatronarchaeia). Furthermore, considering Methanonatronarchaeia as the sole methanogenic class in the branch of Methanonatronarchaeia, Ca. Ordosarchaeia, Ca. Hikarchaeia, and Halobacteria, the hypothesis of HGT is reasonable. In fact, another hypothesis involving the horizontal transfer of these genes is that the mcr genes were ancestrally present in the Methanonatronarchaeia lineage and that the horizontally transferred mcr genes replaced the ancestral genes. To evaluate this possibility, we reconstructed the phylogenetic tree of CoB-CoM heterodisulfide reductase subunits D and E (HdrDE) while considering the absence of coenzyme M biosynthesis in Methanonatronarchaeia and the limited distribution of MtbA in other methanogens (Fig. 3). Some Methanonatronarchaeia members were found to possess two copies of hdrD and hdrE, and one copy was found to form a gene cluster with mcrC (Supplementary Fig. S8A). In the phylogeny of HdrD, Thermoplasmatota exhibited the highest diversity of proteins, and the clustered HdrD of Methanonatronarchaeia, along with certain sequences in Methanomicrobia and other phyla, was adjacent to Thermoplasmatota but distant from the main branch of Methanomicrobia and other classes in the phylum Halobacteriota (Supplementary Fig. S8B). A similar condition was observed for another copy. However, there were insufficient HdrE sequences for the relevant analysis (Supplementary Fig. S8C). These results indicate that HdrD, McrC, and HdrE in Methanonatronarchaeia were also acquired through HGT, thereby refuting the hypothesis of orthologous displacement. In summary, we propose an alternative course of haloarchaeal evolution based on the divergent mcr gene cluster in Methanonatronarchaeia and the robust phylogenetic relationship of Methanonatronarchaeia and Ordosarchaeia–Hikarchaeia–Halobacteria.

Discussion

Ca. Ordosarchaeia is widely distributed in hypersaline habitats. The presence of acid-shifted proteomes and multiple copies of uspA genes in Ca. Ordosarchaeia species support their adaptation to hypersaline environments, although the occurrence of certain clones in the nonsaline respiratory tract appears to challenge this claim. Halobacteria species are well known for their ability to thrive in hypersaline environments [7, 65]. Nevertheless, with the development of high-throughput sequencing, certain Halobacteria species have also been discovered in the human gut, representing a nonsaline environment [66, 67]. Several cultivated Halobacteria strains, e.g. Halalkalicoccus, Haladaptatus, Halocatena, and Halomarina oriensis (Supplementary Table S9), can maintain cellular integrity even under nonsaline conditions. Importantly, these exceptions do not nullify the hypersaline characteristics of Halobacteria. Similarly, it is reasonable to assess the extreme halophilic features of Ca. Ordosarchaeia. However, it remains difficult to determine the mechanism by which these Halobacteria and Ca. Ordosarchaeia species reconcile the salt-in strategy and their dependence on high salt-dependent enzymes with nonsaline conditions or assess whether they enter a dormant state under such conditions.

The long-branch attraction in the phylogenetic analysis of Halobacteriota cannot be overlooked because the phylogenetic relationship between Halobacteria and Methanonatronarchaeia is equivocal [12, 13, 23, 24]. Facing the same problem, the phylogeny of Candidatus Nanohaloarchaeota has also been a subject of debate, with some studies placing them within the phylum Halobacteriota [25, 68]. This potential phylogenetic artifact might arise from the rapid evolution of Halobacteria and Ca. Nanohaloarchaeota in response to saline adaptation. The discovery of Ca. Ordosarchaeia provides valuable insight, as it is placed at a central position between Methanonatronarchaeia and Halobacteria in the phylogenies, and the inclusion of Ca. Ordosarchaeia helps to break the long distance between Halobacteria and Ca. Hikarchaeia and other lineages. Additionally, Methanonatronarchaeia consistently moved close to Halobacteria, Ca. Hikarchaeia, and Ca. Ordosarchaeia from the root. A recent report also supports that improved taxon sampling is crucial for obtaining robust phylogenies [69]. Apparently, when the intermediate lineage has not been discovered, it will be an effective approach to select unambiguously aligned sites (same as removing highly variable sites) to estimate phylogenetic relationships in many cases [70–72], but this might not be suitable for Halobacteria. We agree with the view that fast-evolving sites contain evolutionary signals and that their removal can sometimes lead to phylogenetic artifacts [24]. Ca. Hikarchaeia was located at the transitional stage, but the placement of Methanonatronarchaeia at the root appears to be accurate [13]. In our opinion, Ca. Hikarchaeia is located excessively close to Halobacteria but far from Methanonatronarchaeia. Therefore, including Ca. Hikarchaeia does not clarify the overall phylogeny.

Fast adaptive evolution to saline environments was considered the cause of the long-branch formation of Halobacteria [25]. In combination with the accumulation of acidic amino acids in the proteomes of salt-in lineages for hypersaline survival, the results of phylogenetic analyses after removing glutamate and aspartate appear to support hypersaline selection. Ca. Hikarchaeia species might be slight halophiles, whereas they also feature certain acidic amino acid sites similar to other extreme halophilic organisms, such as Halobacteria, Methanonatronarchaeia, and Ca. Ordosarchaeia species. Marine microorganisms generally produce or import organic osmolytes, e.g. ectoine, glycine betaine, Nε-acetyl-β-lysine, and β-glutamine, to cope with high-salinity environments [73–76]. Based on these observations, we hypothesize that the last common ancestor of Methanonatronarchaeia, Ca. Ordosarchaeia, Ca. Hikarchaeia, and Halobacteria is a slight salt-in halophile. The slight salt-in strategy could be identified by measuring the intracellular salt concentration in a pure culture of Ca. Hikarchaeia or by providing biochemical evidence to test the optimal salinity of intracellular enzymes in future studies. In fact, the salt-in strategy might have independently originated in different archaeal classes, such as the halophilic lineage Candidatus Halarchaeoplasmatales in the phylum Thermoplasmatota [28] and nanosized Ca. Nanohaloarchaeota [27, 77]. Similarly, extreme salt-in adaptation might have independently evolved in Halobacteria, Ca. Ordosarchaeia, and Methanonatronarchaeia from the slight salt-in strategy of their common ancestor. Apparently, we cannot dismiss the possibility that the common ancestor was an extreme halophile and that the classes later returned to slightly halophilic lineages.

Generally, the inference of the evolutionary process of archaeal methane metabolism relies on a consensus of phylogenetic analyses conducted with each methanogenic lineage [12, 18, 19, 59, 64]. Possibly because methanogenesis was commonly inferred to originate in the common ancestor of non-DPANN Archaea [12, 18, 57, 59, 64, 78, 79], Halobacteria were thought to have lost methanogenesis from the last common ancestor with the most closely related methanogen, and this ancestor was previously considered to harbor methanogenesis-related genes regardless of which lineage was the most closely related methanogen [12, 13]. Based on the robust phylogeny of Methanonatronarchaeia and Halobacteria in this study, we propose an alternative possibility that Methanonatronarchaeia might have laterally acquired mcrABGDC and hdrDE genes from Methanomassiliicoccales or Methanofastidiosales, considering the phylogenetically and organizationally divergent mcr gene cluster of Methanonatronarchaeia from closely related methanogens and the absence of mcr genes in Halobacteria, Ca. Hikarchaeia, Ca. Ordosarchaeia, and some Methanonatronarchaeia members. Inferring horizontal transfer of methanogenesis genes was reasonable because similar events were predicted to occur in other methanogens, e.g. Methanomassiliicoccales [18, 80]. Although Methanonatronarchaeia was generally found in hypersaline environments with neutral or alkaline pH [12, 62, 81], we believe that the lateral transfer of mcrABG can occur in extreme environments. In fact, most Methanomassiliicoccales species were reported in host-associated or nonhypersaline natural environments [82, 83], but some members are certain to be detected in hypersaline and alkaline environments as the source of HGT [26, 28, 84]. Similarly, Methanofastidiosales species were also found in soda lake sediment [85].

This study presented the novel class Ca. Ordosarchaeia within the phylum Halobacteriota and characterized its widespread distribution in hypersaline environments, as well as its adaptation strategy and metabolic potential based on its MAGs. Additionally, the inclusion of Ca. Ordosarchaeia significantly reduced the phylogenetic distance of Halobacteria and Ca. Hikarchaeia from the most closely related lineages and apparently relieves the problem of long-branch attraction in phylogenetic analyses. Based on the phylogenetic consensus and the divergent mcr gene cluster for methanogenesis found in the Methanonatronarchaeia-related branch, an alternative evolutionary scenario from methanogens to haloarchaea was proposed. Specifically, the ancestral lineage of Halobacteria and the closely related methanogen Methanonatronarchaeia underwent an initial loss of mcr genes, followed by the later acquisition of mcr from phylogenetically distant methanogens.

Author contributions

Hua Xiang and Dahe Zhao conceived the study. Shengjie Zhang and Dahe Zhao carried out the metagenomic analysis and genomic binning. Dahe Zhao performed the bioinformatic analyses on phylogenies, comparative genomics, functional annotation, evolutionary event approximation, biogeography, and statistics. Dahe Zhao prepared the figures and drafted the manuscript. Shengjie Zhang, Junyu Chen, Juanjuan Zhao, Peng An, and Hua Xiang participated in the functional interpretation, discussion and manuscript revision. All authors read and approved the final manuscript.

Conflicts of interest

The authors declare no conflict of interest.

Funding

This study was funded by the National Natural Science Foundation of China (No. 92251302 and 32000046), and supported by Central Asian Drug Discovery and Development Center of Chinese Academy of Sciences (No. CAM202202).

Data availability

The five Ca. Ordosarchaeia genomes are available from the NCBI under the BioProject identifier PRJNA924292. Metagenomic sequencing data have been deposited in BioProject identifiers PRJNA549802, PRJNA679647, and PRJNA769545. Raw data (including protein sequence, functional annotation, protein cluster, multiple sequences alignment, amalgamated likelihood estimation for history approximation, and scripts, etc.) generated in this study are available through [https://doi.org/10.6084/m9.figshare.21989720].

References

1.

Woese
CR
,
Kandler
O
,
Wheelis
ML
.
Towards a natural system of organisms: proposal for the domains archaea, bacteria, and eucarya
.
Proc Natl Acad Sci U S A
1990
;
87
:
4576
9
. https://doi.org/10.1073/pnas.87.12.4576.

2.

Adam
PS
,
Borrel
G
,
Brochier-Armanet
C
et al.
The growing tree of archaea: new perspectives on their diversity, evolution and ecology
.
ISME J
2017
;
11
:
2407
25
. https://doi.org/10.1038/ismej.2017.122.

3.

Parks
DH
,
Chuvochina
M
,
Chaumeil
P-A
et al.
A complete domain-to-species taxonomy for bacteria and archaea
.
Nat Biotechnol
2020
;
38
:
1079
86
. https://doi.org/10.1038/s41587-020-0501-8.

4.

Rinke
C
,
Chuvochina
M
,
Mussig
AJ
et al.
A standardized archaeal taxonomy for the genome taxonomy database
.
Nat Microbiol
2021
;
6
:
946
59
. https://doi.org/10.1038/s41564-021-00918-8.

5.

Larsen
H
.
Halophilic and halotolerant microorganisms - an overview and historical perspective
.
FEMS Microbiol Lett
1986
;
39
:
3
7
. https://doi.org/10.1111/j.1574-6968.1986.tb01835.x.

6.

Oren
A
.
Molecular ecology of extremely halophilic archaea and bacteria
.
FEMS Microbiol Ecol
2002
;
39
:
1
7
. https://doi.org/10.1111/j.1574-6941.2002.tb00900.x.

7.

Gupta
RS
,
Naushad
S
,
Baker
S
.
Phylogenomic analyses and molecular signatures for the class Halobacteria and its two major clades: a proposal for division of the class Halobacteria into an emended order Halobacteriales and two new orders, Haloferacales ord. nov and Natrialbales ord. nov., containing the novel families Haloferacaceae fam. nov and Natrialbaceae fam. nov
.
Int J Syst Evol Microbiol
2015
;
65
:
1050
69
. https://doi.org/10.1099/ijs.0.070136-0.

8.

Gunde-Cimerman
N
,
Plemenitaš
A
,
Oren
A
.
Strategies of adaptation of microorganisms of the three domains of life to high salt concentrations
.
FEMS Microbiol Rev
2018
;
42
:
353
75
. https://doi.org/10.1093/femsre/fuy009.

9.

Oren
A
.
Bioenergetic aspects of halophilism
.
Microbiol Mol Biol Rev
1999
;
63
:
334
48
. https://doi.org/10.1128/MMBR.63.2.334-348.1999.

10.

Lanyi
JK
.
Salt-dependent properties of proteins from extremely halophilic bacteria
.
Bacteriol Rev
1974
;
38
:
272
90
. https://doi.org/10.1128/br.38.3.272-290.1974.

11.

Nelson-Sathi
S
,
Dagan
T
,
Landan
G
et al.
Acquisition of 1,000 eubacterial genes physiologically transformed a methanogen at the origin of Haloarchaea
.
Proc Natl Acad Sci U S A
2012
;
109
:
20537
42
. https://doi.org/10.1073/pnas.1209119109.

12.

Sorokin
DY
,
Makarova
KS
,
Abbas
B
et al.
Discovery of extremely halophilic, methyl-reducing euryarchaea provides insights into the evolutionary origin of methanogenesis
.
Nat Microbiol
2017
;
2
:
17081
. https://doi.org/10.1038/nmicrobiol.2017.81.

13.

Martijn
J
,
Schon
ME
,
Lind
AE
et al.
Hikarchaeia demonstrate an intermediate stage in the methanogen-to-halophile transition
.
Nat Commun
2020
;
11
:
5490
. https://doi.org/10.1038/s41467-020-19200-2.

14.

Dundas
IE
.
Physiology of halobacteriaceae
.
Adv Microb Physiol
1977
;
15
:
85
120

15.

Scharf
B
,
Wittenberg
R
,
Engelhard
M
.
Electron transfer proteins from the haloalkaliphilic archaeon Natronobacterium pharaonis: possible components of the respiratory chain include cytochrome bc and a terminal oxidase cytochrome ba3
.
Biochemistry
1997
;
36
:
4471
9
. https://doi.org/10.1021/bi962312d.

16.

Mattar
S
,
Engelhard
M
.
Cytochrome ba3 from Natronobacterium pharaonis--an archaeal four-subunit cytochrome-c-type oxidase
.
Eur J Biochem
1997
;
250
:
332
41
. https://doi.org/10.1111/j.1432-1033.1997.0332a.x.

17.

Kelly
S
,
Wickstead
B
,
Gull
K
.
Archaeal phylogenomics provides evidence in support of a methanogenic origin of the archaea and a thaumarchaeal origin for the eukaryotes
.
Proc Biol Sci
2011
;
278
:
1009
18
. https://doi.org/10.1098/rspb.2010.1427.

18.

Wang
Y
,
Wegener
G
,
Williams
TA
et al.
A methylotrophic origin of methanogenesis and early divergence of anaerobic multicarbon alkane metabolism
.
Sci Adv
2021
;
7
:eabj1453. https://doi.org/10.1126/sciadv.abj1453.

19.

Evans
PN
,
Boyd
JA
,
Leu
AO
et al.
An evolving view of methane metabolism in the archaea
.
Nat Rev Microbiol
2019
;
17
:
219
32
. https://doi.org/10.1038/s41579-018-0136-7.

20.

Hug
LA
,
Baker
BJ
,
Anantharaman
K
et al.
A new view of the tree of life
.
Nat Microbiol
2016
;
1
:
16048
. https://doi.org/10.1038/nmicrobiol.2016.48.

21.

Baker
BJ
,
De Anda
V
,
Seitz
KW
et al.
Diversity, ecology and evolution of archaea
.
Nat Microbiol
2020
;
5
:
887
900
. https://doi.org/10.1038/s41564-020-0715-z.

22.

Spang
A
,
Caceres
EF
,
Ettema
TJ
.
Genomic exploration of the diversity, ecology, and evolution of the archaeal domain of life
.
Science
2017
;
357
:
eaaf3883
. https://doi.org/10.1126/science.aaf3883.

23.

Aouad
M
,
Borrel
G
,
Brochier-Armanet
C
et al.
Evolutionary placement of Methanonatronarchaeia
.
Nat Microbiol
2019
;
4
:
558
9
. https://doi.org/10.1038/s41564-019-0359-z.

24.

Sorokin
DY
,
Makarova
KS
,
Abbas
B
et al.
Reply to 'Evolutionary placement of Methanonatronarchaeia'
.
Nat Microbiol
2019
;
4
:
560
1
. https://doi.org/10.1038/s41564-019-0358-0.

25.

Aouad
M
,
Taib
N
,
Oudart
A
et al.
Extreme halophilic archaea derive from two distinct methanogen class II lineages
.
Mol Phylogenet Evol
2018
;
127
:
46
54
. https://doi.org/10.1016/j.ympev.2018.04.011.

26.

Zhao
D
,
Zhang
S
,
Xue
Q
et al.
Abundant taxa and favorable pathways in the microbiome of soda-saline lakes in Inner Mongolia
.
Front Microbiol
2020
;
11
:
1740
. https://doi.org/10.3389/fmicb.2020.01740.

27.

Zhao
D
,
Zhang
SJ
,
Kumar
S
et al.
Comparative genomic insights into the evolution of Halobacteria-associated "Candidatus Nanohaloarchaeota"
.
mSystems
2022
;
7
:
e0066922
. https://doi.org/10.1128/msystems.00669-22.

28.

Zhou
H
,
Zhao
D
,
Zhang
S
et al.
Metagenomic insights into the environmental adaptation and metabolism of Candidatus Halarchaeoplasmatales, one archaeal order thriving in saline lakes
.
Environ Microbiol
2022
;
24
:
2239
58
. https://doi.org/10.1111/1462-2920.15899.

29.

Uritskiy
GV
,
DiRuggiero
J
,
Taylor
J
.
MetaWRAP-a flexible pipeline for genome-resolved metagenomic data analysis
.
Microbiome
2018
;
6
:
158
. https://doi.org/10.1186/s40168-018-0541-1.

30.

Parks
DH
,
Imelfort
M
,
Skennerton
CT
et al.
CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes
.
Genome Res
2015
;
25
:
1043
55
. https://doi.org/10.1101/gr.186072.114.

31.

Chaumeil
PA
,
Mussig
AJ
,
Hugenholtz
P
et al.
GTDB-Tk v2: memory friendly classification with the genome taxonomy database
.
Bioinformatics
2022
;
38
:
5315
6
. https://doi.org/10.1093/bioinformatics/btac672.

32.

Seemann
T
.
Prokka: rapid prokaryotic genome annotation
.
Bioinformatics
2014
;
30
:
2068
9
. https://doi.org/10.1093/bioinformatics/btu153.

33.

Lagesen
K
,
Hallin
P
,
Rodland
EA
et al.
RNAmmer: consistent and rapid annotation of ribosomal RNA genes
.
Nucleic Acids Res
2007
;
35
:
3100
8
. https://doi.org/10.1093/nar/gkm160.

34.

Laslett
D
,
Canback
B
.
ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences
.
Nucleic Acids Res
2004
;
32
:
11
6
. https://doi.org/10.1093/nar/gkh152.

35.

Hyatt
D
,
Chen
GL
,
LoCascio
PF
et al.
Prodigal: prokaryotic gene recognition and translation initiation site identification
.
BMC Bioinformatics
2010
;
11
:
119
. https://doi.org/10.1186/1471-2105-11-119.

36.

Kozlowski
LP
.
IPC - isoelectric point calculator
.
Biol Direct
2016
;
11
:
55
. https://doi.org/10.1186/s13062-016-0159-9.

37.

Jain
C
,
Rodriguez-R
LM
,
Phillippy
AM
et al.
High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries
.
Nat Commun
2018
;
9
:
5114
. https://doi.org/10.1038/s41467-018-07641-9.

38.

Camacho
C
,
Coulouris
G
,
Avagyan
V
et al.
BLAST+: architecture and applications
.
BMC Bioinformatics
2009
;
10
:
421
. https://doi.org/10.1186/1471-2105-10-421.

39.

Dombrowski
N
,
Williams
TA
,
Sun
J
et al.
Undinarchaeota illuminate DPANN phylogeny and the impact of gene transfer on archaeal evolution
.
Nat Commun
2020
;
11
:
3939
. https://doi.org/10.1038/s41467-020-17408-w.

40.

Criscuolo
A
,
Gribaldo
S
.
BMGE (block mapping and gathering with entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments
.
BMC Evol Biol
2010
;
10
:
210
. https://doi.org/10.1186/1471-2148-10-210.

41.

Steenwyk
JL
,
Buida
TJ
3rd
,
Li
Y
et al.
ClipKIT: a multiple sequence alignment trimming software for accurate phylogenomic inference
.
PLoS Biol
2020
;
18
:
e3001007
. https://doi.org/10.1371/journal.pbio.3001007.

42.

Nguyen
LT
,
Schmidt
HA
,
von Haeseler
A
et al.
IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies
.
Mol Biol Evol
2015
;
32
:
268
74
. https://doi.org/10.1093/molbev/msu300.

43.

Ronquist
F
,
Teslenko
M
,
van der Mark
P
et al.
MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space
.
Syst Biol
2012
;
61
:
539
42
. https://doi.org/10.1093/sysbio/sys029.

44.

Altekar
G
,
Dwarkadas
S
,
Huelsenbeck
JP
et al.
Parallel metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference
.
Bioinformatics
2004
;
20
:
407
15
. https://doi.org/10.1093/bioinformatics/btg427.

45.

Williams
TA
,
Szollosi
GJ
,
Spang
A
et al.
Integrative modeling of gene and genome evolution roots the archaeal tree of life
.
Proc Natl Acad Sci U S A
2017
;
114
:
E4602
11
. https://doi.org/10.1073/pnas.1618463114.

46.

Letunic
I
,
Bork
P
.
Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation
.
Nucleic Acids Res
2021
;
49
:
W293
6
. https://doi.org/10.1093/nar/gkab301.

47.

Huerta-Cepas
J
,
Szklarczyk
D
,
Heller
D
et al.
eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses
.
Nucleic Acids Res
2019
;
47
:
D309
14
. https://doi.org/10.1093/nar/gky1085.

48.

Buchfink
B
,
Reuter
K
,
Drost
HG
.
Sensitive protein alignments at tree-of-life scale using DIAMOND
.
Nat Methods
2021
;
18
:
366
8
. https://doi.org/10.1038/s41592-021-01101-x.

49.

Galperin
MY
,
Wolf
YI
,
Makarova
KS
et al.
COG database update: focus on microbial diversity, model organisms, and widespread pathogens
.
Nucleic Acids Res
2021
;
49
:
D274
81
. https://doi.org/10.1093/nar/gkaa1018.

50.

Henrissat
B
,
Davies
G
.
Structural and sequence-based classification of glycoside hydrolases
.
Curr Opin Struc Biol
1997
;
7
:
637
44
. https://doi.org/10.1016/S0959-440X(97)80072-3.

51.

Guo
J
,
Bolduc
B
,
Zayed
AA
et al.
VirSorter2: a multi-classifier, expert-guided approach to detect diverse DNA and RNA viruses
.
Microbiome
2021
;
9
:
37
. https://doi.org/10.1186/s40168-020-00990-y.

52.

Tesson
F
,
Herve
A
,
Mordret
E
et al.
Systematic and quantitative view of the antiviral arsenal of prokaryotes
.
Nat Commun
2022
;
13
:
2561
. https://doi.org/10.1038/s41467-022-30269-9.

53.

Katoh
K
,
Standley
DM
.
MAFFT multiple sequence alignment software version 7: improvements in performance and usability
.
Mol Biol Evol
2013
;
30
:
772
80
. https://doi.org/10.1093/molbev/mst010.

54.

Capella-Gutierrez
S
,
Silla-Martinez
JM
,
Gabaldon
T
.
trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses
.
Bioinformatics
2009
;
25
:
1972
3
. https://doi.org/10.1093/bioinformatics/btp348.

55.

Huang
WC
,
Liu
Y
,
Zhang
X
et al.
Comparative genomic analysis reveals metabolic flexibility of Woesearchaeota
.
Nat Commun
2021
;
12
:
5281
. https://doi.org/10.1038/s41467-021-25565-9.

56.

Szollosi
GJ
,
Rosikiewicz
W
,
Boussau
B
et al.
Efficient exploration of the space of reconciled gene trees
.
Syst Biol
2013
;
62
:
901
12
. https://doi.org/10.1093/sysbio/syt054.

57.

Borrel
G
,
Adam
PS
,
McKay
LJ
et al.
Wide diversity of methane and short-chain alkane metabolisms in uncultured archaea
.
Nat Microbiol
2019
;
4
:
603
13
. https://doi.org/10.1038/s41564-019-0363-3.

58.

Wang
Y
,
Wegener
G
,
Ruff
SE
et al.
Methyl/alkyl-coenzyme M reductase-based anaerobic alkane oxidation in archaea
.
Environ Microbiol
2021
;
23
:
530
41
. https://doi.org/10.1111/1462-2920.15057.

59.

Adam
PS
,
Kolyfetis
GE
,
Bornemann
TLV
et al.
Genomic remnants of ancestral methanogenesis and hydrogenotrophy in archaea drive anaerobic carbon cycling
.
Sci Adv
2022
;
8
:eabm9651. https://doi.org/10.1126/sciadv.abm9651.

60.

McGonigle
JM
,
Bernau
JA
,
Bowen
BB
et al.
Metabolic potential of microbial communities in the hypersaline sediments of the Bonneville Salt Flats
.
mSystems
2022
;
7
:
e0084622
. https://doi.org/10.1128/msystems.00846-22.

61.

Hartman
AL
,
Norais
C
,
Badger
JH
et al.
The complete genome sequence of Haloferax volcanii DS2, a model archaeon
.
PLoS One
2010
;
5
:
e9605
. https://doi.org/10.1371/journal.pone.0009605.

62.

Sorokin
DY
,
Merkel
AY
,
Abbas
B
.
Ecology of Methanonatronarchaeia
.
Environ Microbiol
2022
;
24
:
5217
29
. https://doi.org/10.1111/1462-2920.16108.

63.

Stautz
J
,
Hellmich
Y
,
Fuss
MF
et al.
Molecular mechanisms for bacterial potassium homeostasis
.
J Mol Biol
2021
;
433
:
166968
. https://doi.org/10.1016/j.jmb.2021.166968.

64.

Ou
YF
,
Dong
HP
,
McIlroy
SJ
et al.
Expanding the phylogenetic distribution of cytochrome b-containing methanogenic archaea sheds light on the evolution of methanogenesis
.
ISME J
2022
;
16
:
2373
87
. https://doi.org/10.1038/s41396-022-01281-0.

65.

Oren
A
.
The ecology of the extremely halophilic archaea
.
FEMS Microbiol Rev
1994
;
13
:
415
39
. https://doi.org/10.1111/j.1574-6976.1994.tb00060.x.

66.

Kim
JY
,
Whon
TW
,
Lim
MY
et al.
The human gut archaeome: identification of diverse haloarchaea in Korean subjects
.
Microbiome
2020
;
8
:
114
. https://doi.org/10.1186/s40168-020-00894-x.

67.

Bai
X
,
Sun
Y
,
Li
Y
et al.
Landscape of the gut archaeome in association with geography, ethnicity, urbanization, and diet in the Chinese population
.
Microbiome
2022
;
10
:
147
. https://doi.org/10.1186/s40168-022-01335-7.

68.

Feng
Y
,
Neri
U
,
Gosselin
S
et al.
The evolutionary origins of extreme halophilic archaeal lineages
.
Genome Biol Evol
2021
;
13
. https://doi.org/10.1093/gbe/evab166.

69.

Baker
BA
,
Gutiérrez-Preciado
A
,
Río
ÁR
et al.
Several independent adaptations of archaea to hypersaline environments
.
bioRxiv
.
2023
. https://doi.org/10.1101/2023.07.03.547478.

70.

Delsuc
F
,
Brinkmann
H
,
Philippe
H
.
Phylogenomics and the reconstruction of the tree of life
.
Nat Rev Genet
2005
;
6
:
361
75
. https://doi.org/10.1038/nrg1603.

71.

Brinkmann
H
,
Philippe
H
.
Archaea sister group of bacteria? Indications from tree reconstruction artifacts in ancient phylogenies
.
Mol Biol Evol
1999
;
16
:
817
25
. https://doi.org/10.1093/oxfordjournals.molbev.a026166.

72.

Pisani
D
.
Identifying and removing fast-evolving sites using compatibility analysis: an example from the Arthropoda
.
Syst Biol
2004
;
53
:
978
89
. https://doi.org/10.1080/10635150490888877.

73.

Roesser
M
,
Muller
V
.
Osmoadaptation in bacteria and archaea: common principles and differences
.
Environ Microbiol
2001
;
3
:
743
54
. https://doi.org/10.1046/j.1462-2920.2001.00252.x.

74.

Empadinhas
N
,
da Costa
MS
.
Osmoadaptation mechanisms in prokaryotes: distribution of compatible solutes
.
Int Microbiol
2008
;
11
:
151
61

75.

Robertson
DE
,
Lai
MC
,
Gunsalus
RP
et al.
Composition, variation, and dynamics of major osmotic solutes in Methanohalophilus strain FDF1
.
Appl Environ Microbiol
1992
;
58
:
2438
43
. https://doi.org/10.1128/aem.58.8.2438-2443.1992.

76.

Lai
MC
,
Sowers
KR
,
Robertson
DE
et al.
Distribution of compatible solutes in the halophilic methanogenic archaebacteria
.
J Bacteriol
1991
;
173
:
5352
8
. https://doi.org/10.1128/jb.173.17.5352-5358.1991.

77.

La Cono
V
,
Messina
E
,
Rohde
M
et al.
Symbiosis between nanohaloarchaeon and haloarchaeon is based on utilization of different polysaccharides
.
Proc Natl Acad Sci U S A
2020
;
117
:
20223
34
. https://doi.org/10.1073/pnas.2007232117.

78.

Berghuis
BA
,
Yu
FB
,
Schulz
F
et al.
Hydrogenotrophic methanogenesis in archaeal phylum Verstraetearchaeota reveals the shared ancestry of all methanogens
.
Proc Natl Acad Sci U S A
2019
;
116
:
5037
44
. https://doi.org/10.1073/pnas.1815631116.

79.

Hua
ZS
,
Wang
YL
,
Evans
PN
et al.
Insights into the ecological roles and evolution of methyl-coenzyme M reductase-containing hot spring archaea
.
Nat Commun
2019
;
10
:
4574
. https://doi.org/10.1038/s41467-019-12574-y.

80.

Zinke
LA
,
Evans
PN
,
Santos-Medellin
C
et al.
Evidence for non-methanogenic metabolisms in globally distributed archaeal clades basal to the Methanomassiliicoccales
.
Environ Microbiol
2021
;
23
:
340
57
. https://doi.org/10.1111/1462-2920.15316.

81.

Sorokin
DY
,
Merkel
AY
,
Abbas
B
et al.
Methanonatronarchaeum thermophilum gen. nov., sp. nov. and 'Candidatus Methanohalarchaeum thermophilum', extremely halo(natrono)philic methyl-reducing methanogens from hypersaline lakes comprising a new euryarchaeal class Methanonatronarchaeia classis nov
.
Int J Syst Evol Microbiol
2018
;
68
:
2199
208
. https://doi.org/10.1099/ijsem.0.002810.

82.

Dridi
B
,
Fardeau
M-L
,
Ollivier
B
et al.
Methanomassiliicoccus luminyensis gen. nov., sp. nov., a methanogenic archaeon isolated from human faeces
.
Int J Syst Evol Microbiol
2012
;
62
:
1902
7
. https://doi.org/10.1099/ijs.0.033712-0.

83.

Cozannet
M
,
Borrel
G
,
Roussel
E
et al.
New insights into the ecology and physiology of Methanomassiliicoccales from terrestrial and aquatic environments
.
Microorganisms
2021
;
9
:
30
. https://doi.org/10.3390/microorganisms9010030.

84.

Vavourakis
CD
,
Andrei
AS
,
Mehrshad
M
et al.
A metagenomics roadmap to the uncultured genome diversity in hypersaline soda lake sediments
.
Microbiome
2018
;
6
:
168
. https://doi.org/10.1186/s40168-018-0548-7.

85.

Szuroczki
S
,
Szabo
A
,
Korponai
K
et al.
Prokaryotic community composition in a great shallow soda lake covered by large reed stands (Neusiedler See/Lake Ferto) as revealed by cultivation- and DNA-based analyses
.
FEMS Microbiol Ecol
2020
;
96
:fiaa159. https://doi.org/10.1093/femsec/fiaa159.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data