Abstract

The polar regions host a diverse array of moss species that have evolved to thrive in extreme environments. These mosses exhibit remarkable adaptations, including tolerance to freezing temperatures, desiccation, and ultraviolet radiation. Despite their ecological significance, genomic data on these organisms are still limited, impeding our understanding of their evolutionary history and adaptive mechanisms in the context of climate change. In this study, we present the first chromosome-scale genome assembly and annotation of the Arctic moss Ptychostomum knowltonii. The assembled genome is 408.8 Mb in size, anchored to 12 pseudochromosomes, with a scaffold N50 of 32.61 Mb. Repetitive elements account for 56.24% of the genome. The genome contains 28,014 protein-coding genes, with a BUSCO completeness of 96.20%. This genomic resource will enable future comparative genomic studies, enhancing our understanding of how polar mosses may respond to a warming climate and shedding light on their evolutionary trajectories in persistently extreme environments.

Significance

Genomic data from bryophytes provide a crucial foundation for investigating the vast array of resistance genes these organisms possess, making the sequencing and assembly of bryophyte genomes from various extreme habitats particularly valuable. In polar regions, mosses that are adapted to harsh conditions thrive, yet genomic data for these species remain limited. This gap in data hampers our understanding of their adaptive mechanisms in response to climate change. In this study, we present the genome assembly of the Arctic moss Ptychostomum knowltonii, which will enable future comparative analyses and provide essential insights into the adaptive strategies of mosses in extreme environments, as well as their responses to a changing climate.

Introduction

Land plants consist of two sister lineages, i.e. vascular plants and bryophytes, and the latter comprise mosses, liverworts, and hornworts. Bryophytes are a group of minute and delicate creatures, with a haploid-dominant vegetative body (the gametophyte) and an unbranched sporophyte that remains attached to the gametophyte (Shaw and Renzaglia 2004). Their gametophyte is structurally simple with no vascular tissue and true roots (Ligrone et al. 2012). However, bryophytes have diversified significantly, with some 22,000 extant species (Christenhusz and Byng 2016). They have adapted to a wide range of habitats, including terrestrial, epiphytic, and aquatic environments, and are distributed across nearly all ecosystems globally (Degola et al. 2022). Bryophytes can endure a wide range of temperatures, from freezing cold to intense heat. They have evolved mechanisms to stabilize cellular structures during freezing and to reduce metabolic activity in high temperatures. These adaptations make them a promising source of novel resistance genes with potential applications in agriculture and biotechnology. However, compared with the significant progress made in genome sequencing of vascular plants, the number of sequenced bryophyte genomes remains limited, particularly for mosses adapted to extreme environments (Szovenyi et al. 2021).

To date, the Antarctic moss Pohlia nutans is the only polar bryophyte whose genome has been sequenced, providing insights into its adaptation to Antarctic conditions through mechanisms such as DNA repair, reactive oxygen species scavenging, and flavonoid biosynthesis (Liu et al. 2022). Like Antarctic plants, Arctic plants face highly challenging environments. In addition to short growing seasons, Arctic plants must cope with temperature fluctuations, strong winds, and nutrient-poor soils (Lee 2020), as well as disturbances from glacial meltwater (Kim et al. 2022). Therefore, sequencing the genomes of Arctic bryophytes is crucial for uncovering their adaptive mechanisms in response to these extreme conditions.

Here, we report the first chromosome-scale genome assembly of an Arctic moss, Ptychostomum knowltonii (Bryaceae, Bryales) (supplementary fig. S1, Supplementary Material online). P. knowltonii (formerly Bryum knowltonii) is a green or yellow-green monoecious (synoicous) moss widely distributed across various Arctic and sub-Arctic regions. It exhibits distinct morphological and ecological adaptations, making it a valuable subject for bryophyte research (Rolston 1982; Ramsay and Spence 2005). Furthermore, Byun et al. (2021) introduced the first genetic transfection method for Arctic moss species using Bryum. However, the lack of genomic resources has hindered further development of this technique. The availability of this P. knowltonii genome will enable future comparative genomic studies, facilitate genetic transfection research, and support the discovery and application of moss resistance genes. This will enhance our understanding of how mosses adapt to extreme environments, respond to a warming climate, and evolve in perpetually cold habitats.

Results and Discussion

Genome Assembly

To achieve a chromosome-level genome assembly of P. knowltonii, we generated 30.23 Gb of long-read data (∼74× coverage) using Oxford Nanopore Technologies for preliminary assembly, 55.88 Gb of Illumina clean short-read data (∼137× coverage) for genome survey and polishing, and 103.48 Gb of Hi-C clean data (∼253× coverage) for scaffolding. After removing redundancies, potential contaminants, and unanchored organelle fragments, we obtained an optimized genome assembly. The final assembly totaled 409 Mb in size, distributed across 692 contigs, with a contig N50 of 1.23 Mb and a scaffold N50 of 32.61 Mb (Table 1). The genome size of P. knowltonii is slightly smaller than the estimated 449 Mb based on K-mer analysis (see supplementary fig. S2, Supplementary Material online). Approximately 97.75% of the contigs (399 Mb) were anchored to 12 pseudochromosomes (Fig. 1a), with ambiguous “N” bases accounting for 282,110 bp, or 0.06% of the total genome length. The genome size of P. knowltonii is smaller than that of P. nutans (698.20 Mb), a species from the same family Bryaceae (Liu et al. 2022), but larger than Bryum argenteum (259 Mb) (Gao et al. 2023).

a) Heatmap of interaction frequency of genomic fragments of P. knowltonii. b) Circus plot of the genome of P. knowltonii. a, assembled chromosomes, b, Gypsy density, c, Copia density, d, repeat density, e, gene density, f, GC content, g, chromosome synteny.
Fig. 1.

a) Heatmap of interaction frequency of genomic fragments of P. knowltonii. b) Circus plot of the genome of P. knowltonii. a, assembled chromosomes, b, Gypsy density, c, Copia density, d, repeat density, e, gene density, f, GC content, g, chromosome synteny.

Table 1

Genome assembly and annotation statistics of P. knowltonii

Genome assemblyValue
Total scaffold length (bp)408,794,622
Number of chromosome-level scaffolds12
Longest scaffold (bp)54,874,844
N50 of scaffold (bp)32,607,604
Total contig length (bp)408,512,582
Longest contig (bp)6,478,272
N50 of contig (bp)1,232,632
GC content (%)40.48
BUSCO completeness (genome mode) (%)92.20
Genome annotation
Number of PCGs28,014
Repeats in genome (%)56.24
TEs in genome (%)53.98
Mean coding sequence length (bp)229.40
Average number of exons per gene5.71
BUSCO completeness (protein mode) (%)96.20
Gene functional annotation
InterProScan (%)56.27
Gene Ontology (%)41.42
Kyoto Encyclopedia of Genes and Genomes (%)62.49
Swiss-prot (%)52.02
TrEMBL (%)76.69
With Arabidopsis thaliana blast Hit (%)56.22
Percentage of overall annotated genes (%)79.72
Genome assemblyValue
Total scaffold length (bp)408,794,622
Number of chromosome-level scaffolds12
Longest scaffold (bp)54,874,844
N50 of scaffold (bp)32,607,604
Total contig length (bp)408,512,582
Longest contig (bp)6,478,272
N50 of contig (bp)1,232,632
GC content (%)40.48
BUSCO completeness (genome mode) (%)92.20
Genome annotation
Number of PCGs28,014
Repeats in genome (%)56.24
TEs in genome (%)53.98
Mean coding sequence length (bp)229.40
Average number of exons per gene5.71
BUSCO completeness (protein mode) (%)96.20
Gene functional annotation
InterProScan (%)56.27
Gene Ontology (%)41.42
Kyoto Encyclopedia of Genes and Genomes (%)62.49
Swiss-prot (%)52.02
TrEMBL (%)76.69
With Arabidopsis thaliana blast Hit (%)56.22
Percentage of overall annotated genes (%)79.72
Table 1

Genome assembly and annotation statistics of P. knowltonii

Genome assemblyValue
Total scaffold length (bp)408,794,622
Number of chromosome-level scaffolds12
Longest scaffold (bp)54,874,844
N50 of scaffold (bp)32,607,604
Total contig length (bp)408,512,582
Longest contig (bp)6,478,272
N50 of contig (bp)1,232,632
GC content (%)40.48
BUSCO completeness (genome mode) (%)92.20
Genome annotation
Number of PCGs28,014
Repeats in genome (%)56.24
TEs in genome (%)53.98
Mean coding sequence length (bp)229.40
Average number of exons per gene5.71
BUSCO completeness (protein mode) (%)96.20
Gene functional annotation
InterProScan (%)56.27
Gene Ontology (%)41.42
Kyoto Encyclopedia of Genes and Genomes (%)62.49
Swiss-prot (%)52.02
TrEMBL (%)76.69
With Arabidopsis thaliana blast Hit (%)56.22
Percentage of overall annotated genes (%)79.72
Genome assemblyValue
Total scaffold length (bp)408,794,622
Number of chromosome-level scaffolds12
Longest scaffold (bp)54,874,844
N50 of scaffold (bp)32,607,604
Total contig length (bp)408,512,582
Longest contig (bp)6,478,272
N50 of contig (bp)1,232,632
GC content (%)40.48
BUSCO completeness (genome mode) (%)92.20
Genome annotation
Number of PCGs28,014
Repeats in genome (%)56.24
TEs in genome (%)53.98
Mean coding sequence length (bp)229.40
Average number of exons per gene5.71
BUSCO completeness (protein mode) (%)96.20
Gene functional annotation
InterProScan (%)56.27
Gene Ontology (%)41.42
Kyoto Encyclopedia of Genes and Genomes (%)62.49
Swiss-prot (%)52.02
TrEMBL (%)76.69
With Arabidopsis thaliana blast Hit (%)56.22
Percentage of overall annotated genes (%)79.72

To assess the quality of the P. knowltonii assembly, we performed a Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis. Given that the embryophyta_odb10 database significantly underestimates BUSCO completeness in bryophytes (Bi et al. 2024), we used the viridiplantae_odb10 database instead. The results indicated that 92.20% of the gene space was captured in the P. knowltonii assembly, with 84.90% of the genes being complete and single-copy and 7.30% being complete and duplicated (Table 1 and supplementary fig. S3, Supplementary Material online).

Repeat Annotation

We identified 229.89 Mb of repetitive sequences, representing 56.24% of the total genome length of P. knowltonii. The most abundant repeat elements were transposable elements (TEs), which accounted for 53.98% of the genome (Table 1). Among these, terminal inverted repeat transposons were the dominant type, comprising 29.13% of the genome, with CACTA elements contributing the largest proportion (supplementary table S1, Supplementary Material online). In contrast, the two long terminal repeat superfamilies, Copia and Gypsy, which are commonly abundant in angiosperms, were present at much lower levels in the P. knowltonii genome. Specifically, Copia elements accounted for 6.46 Mb (1.58% of the genome), while Gypsy elements contributed 26.20 Mb (6.41% of the genome) (supplementary table S1, Supplementary Material online).

Gene Annotation and Functional Annotation

To predict protein-coding genes (PCGs) in P. knowltonii, we utilized two methods: BRAKER3 (Gabriel et al. 2024) and Helixer (Holst et al. 2023). The former predicted 15,718 genes, while the latter predicted 28,434 genes. Additionally, we incorporated gene predictions from homology alignments using miniprot (Li, 2023) and from transcriptome-based predictions using TransDecoder. These various sources of evidence were integrated using EVidenceModeler (Haas et al. 2008), resulting in a final set of 28,014 PCGs for P. knowltonii (Table 1), with an average gene length of 2,485 bp and an average of 5.71 exons per gene. However, the relatively small median gene length (1,781 bp) and intron length (185 bp) suggest that a subset of genes with very long introns has skewed the average gene length.

We evaluated the quality of the PCG set using the BUSCO protein model, which indicated a completeness of 96.20%. This is comparable to P. nutans (96.70%), higher than B. argenteum (94.80%), and slightly lower than the model bryophyte Physcomitrium patens (97.40%) (supplementary fig. S4, Supplementary Material online). Functional annotation assigned potential putative functions to 22,333 of the 28,014 PCGs in the P. knowltonii genome, representing 79.72% of the total gene set. The remaining 20.28% of the predicted genes lack functional annotations and may represent candidates for novel or previously uncharacterized functions.

Sex Chromosome Characteristics

In the Hi-C heatmap of P. knowltonii, we observed that two chromosomes (chr1 and chr5) had lower interaction frequencies with the other 10 chromosomes (Fig. 1a). These chromosomes also showed lower gene density, higher GC content, and higher Gypsy density (Fig. 1b). This pattern suggests that the monoecious P. knowltonii, like P. nutans, may retain both male-specific (V) and female-specific (U) chromosomes in a single plant (Liu et al. 2022; Gao et al. 2023; Zhou et al. 2023). However, the evolution from U/V sex chromosome systems to co-sexuality may differ between these species. In P. nutans (22 chromosomes), this could be due to unreduced UV diploids, whereas in P. knowltonii (11 chromosomes), it may result from aneuploid UV haploids (Ekwealor and Roy 2023). Overall, the chromosome-level genome assembly of P. knowltonii may offer a valuable new model for studying sex system transitions and sex chromosome evolution in bryophytes.

Materials and Methods

Plant Materials, and DNA, RNA Extraction

Wild gametophytes of P. knowltonii were collected from Ny-Ålesund, Svalbard, Norway. The voucher specimen (collection number: Yao-002) has been deposited at the Herbarium of Fairy Lake Botanical Garden, Shenzhen, China. This plant material was regenerated in the lab by cultivating it in a low-temperature incubator for one month. Then, fresh shoots were cut, cleaned, and sterilized with a 2% NaClO solution for 10 s for producing tissue culture. The gem-free cultures were used for DNA extraction using the FastPure Plant DNA Isolation Mini Kit (Vazyme, Nanjing) and RNA-easy™ Isolation Reagent (Vazyme, Nanjing), respectively. DNA and RNA quantification and qualification was performed using 1% agarose gel electrophoresis, a Qubit fluorometer (Thermo Fisher Scientific, USA), and a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, USA).

Genome Assembly

We followed the method of Zhou et al. (2023) for constructing and sequencing Oxford Nanopore, Illumina, Hi-C, and RNA libraries. Low-quality sequences and adapter contaminations in the whole-genome sequencing data were filtered using Trimmomatic v.0.39 (Bolger et al. 2014). Based on the clean reads, we generated a K-mer depth distribution using Jellyfish v.2.3.1 (Marçais and Kingsford 2011) and estimated the genome size using GenomeScope v.2.0 (Ranallo-Benavidez et al. 2020). The de novo assembly of the P. knowltonii was carried out using Nextdenovo v.2.5.0 (Hu et al. 2024), and then the raw assembly was polished three times with NextPolish v.1.3.1 (Hu et al. 2019), using both the Nanopore long reads and cleaned Illumina short reads. Redundant contigs were removed with purge_dups v.1.2.6 (Guan et al. 2020), based on the Nanopore long reads and cleaned Illumina reads. To enhance the assembly’s contiguity, we employed the 3D-DNA pipeline (Dudchenko et al. 2017) to anchor the contigs to 12 pseudochromosomes. The anchored assemblies were then manually refined using Juicebox v.1.11.08 (Durand et al. 2016). For contigs that cannot be anchored to the 12 pseudochromosomes, we followed the process of Zhou et al. (2023) to remove contaminating contigs and organelle fragments.

Repeat Annotation and Gene Annotation

Tandem Repeats Finder (TRF v.4.07) (Benson 1999) was used to identify tandem repeat sequences across the genome. For TEs, we applied the extensive de novo TE Annotator (EDTA v.2.0.1) pipeline (Ou et al. 2019), in combination with RepeatModeler2 (Flynn et al. 2020). To predict PCGs of P. knowltonii, we integrated three lines of evidence. Homology-based protein evidence was gathered from the proteome sequences of P. patens (Bi et al. 2024), Marchantia polymorpha (https://marchantia.info/), Arabidopsis thaliana (https://www.arabidopsis.org/), and Swiss-Prot (https://www.uniprot.org/). For transcriptome-based evidence, clean reads were aligned to the reference genome using HISAT v.2.2.0 (Kim et al. 2019) to generate BAM files. Gene model predictions were then made based on the soft-masked genome, protein evidence, and transcriptome data using the BRAKER3 pipeline v.3.0.6 (Gabriel et al. 2024), which incorporates AUGUSTUS (Stanke et al. 2006) and GeneMark (Brůna et al. 2020). We also performed de novo gene model predictions using Helixer v.0.3.3 (Holst et al. 2023), which leverages deep learning and hidden Markov models.

Additionally, transcriptome-based gene models were predicted using StringTie v.2.2.1 (Pertea et al. 2015) and TransDecoder v.5.5.0 (https://github.com/TransDecoder/TransDecoder). For additional homology-based evidence, we employed miniprot v.0.12-r237 (Li, 2023) to align the homology protein set to the genome. All these evidences were integrated using EVidenceModeler v.2.1.0 (Haas et al. 2008). Finally, Trinity v.2.8.4 (Haas et al. 2013) was employed for de novo assembly of transcripts; the gene models were updated and added untranslated regions and alternative splicing variants using the PASA pipeline v.2.5.3 (Haas et al. 2003). To assess the completeness of the gene space, the BUSCO v.3.1.0-based viridiplantae_odb10 (Manni et al. 2021) was employed. For gene function annotation, the annotated protein sequences were blasted against the UniProt (SwissProt and TrEMBL) and TAIR (https://www.arabidopsis.org/) mdatabases with an E-value cutoff of <1 × 10−5. The gene ontology annotations combine the results of eggNOG-mapper v.2 (Cantalapiedra et al. 2021), InterProScan v.5.51–85.0 (Jones et al. 2014), and PANNZER2 (Törönen et al. 2018), while the KEGG annotations combine the results of eggNOG-mapper v.2 (Cantalapiedra et al. 2021) and KofamKOALA (Aramaki et al. 2020).

Supplementary Material

Supplementary material is available at Genome Biology and Evolution online.

Funding

This study was supported by the Scientific Foundation of the Urban Management Bureau of Shenzhen (202302 to S.D. and No. 202403 to Y.L.).

Data Availability

The raw sequencing data have been deposited in NGDC data center (https://ngdc.cncb.ac.cn/) under the project number PRJCA030700. Genome assemblies and annotations have been deposited in figshare repository at https://doi.org/10.6084/m9.figshare.27186099.

Literature Cited

Aramaki
 
T
,
Blanc-Mathieu
 
R
,
Endo
 
H
,
Ohkubo
 
K
,
Kanehisa
 
M
,
Goto
 
S
,
Ogata
 
H
.
KofamKOALA: Kegg ortholog assignment based on profile HMM and adaptive score threshold
.
Bioinformatics
.
2020
:
36
(
7
):
2251
2252
. .

Benson
 
G
.
Tandem repeats finder: a program to analyze DNA sequences
.
Nucleic Acids Res
.
1999
:
27
(
2
):
573
580
. .

Bi
 
G
,
Zhao
 
S
,
Yao
 
J
,
Wang
 
H
,
Zhao
 
M
,
Sun
 
Y
,
Hou
 
X
,
Haas
 
FB
,
Varshney
 
D
,
Prigge
 
M
, et al.  
Near telomere-to-telomere genome of the model plant Physcomitrium patens
.
Nat Plants.
 
2024
:
10
(
2
):
327
343
. .

Bolger
 
AM
,
Lohse
 
M
,
Usadel
 
B
.
Trimmomatic: a flexible trimmer for illumina sequence data
.
Bioinformatics
.
2014
:
30
(
15
):
2114
2120
. .

Brůna
 
T
,
Lomsadze
 
A
,
Borodovsky
 
M
.
GeneMark-EP+: eukaryotic gene prediction with self-training in the space of genes and proteins
.
NAR Genom Bioinform
.
2020
:
2
(
2
):
lqaa026
. .

Byun
 
MY
,
Seo
 
S
,
Lee
 
J
,
Yoo
 
Y-H
,
Lee
 
H
.
Transfection of Arctic Bryum sp. KMR5045 as a model for genetic engineering of cold-tolerant mosses
.
Front Plant Sci
.
2021
:
11
:
609847
. .

Cantalapiedra
 
CP
,
Hernández-Plaza
 
A
,
Letunic
 
I
,
Bork
 
P
,
Huerta-Cepas
 
J
.
eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale
.
Mol Biol Evol
.
2021
:
38
(
12
):
5825
5829
. .

Christenhusz
 
MJM
,
Byng
 
JW
.
The number of known plants species in the world and its annual increase
.
Phytotaxa
.
2016
:
261
(
3
):
201
217
. .

Degola
 
F
,
Sanità di Toppi
 
L
,
Petraglia
 
A.
 
2022
.
Bryophytes: how to conquer an alien planet and live happily (ever after)
.
J Exp Bot.
 
73
(
13
):
4267
4272
. .

Dudchenko
 
O
,
Batra
 
SS
,
Omer
 
AD
,
Nyquist
 
SK
,
Hoeger
 
M
,
Durand
 
NC
,
Shamim
 
MS
,
Machol
 
I
,
Lander
 
ES
,
Aiden
 
AP
, et al.  
De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds
.
Science
.
2017
:
356
(
6333
):
92
95
. .

Durand
 
NC
,
Robinson
 
JT
,
Shamim
 
MS
,
Machol
 
I
,
Mesirov
 
JP
,
Lander
 
ES
,
Aiden
 
EL
.
Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom
.
Cell Syst
.
2016
:
3
(
1
):
99
101
. .

Ekwealor
 
JTB
,
Roy
 
SW
.
Sex chromosomes: how to make a hermaphrodite
.
Curr Biol.
 
2023
:
33
(
21
):
R1150
R1152
. .

Flynn
 
JM
,
Hubley
 
R
,
Goubert
 
C
,
Rosen
 
J
,
Clark
 
AG
,
Feschotte
 
C
,
Smit
 
AF
.
RepeatModeler2 for automated genomic discovery of transposable element families
.
Proc Natl Acad Sci U S A
.
2020
:
117
(
17
):
9451
9457
. .

Gabriel
 
L
,
Brůna
 
T
,
Hoff
 
KJ
,
Ebel
 
M
,
Lomsadze
 
A
,
Borodovsky
 
M
,
Stanke
 
M
.
BRAKER3: fully automated genome annotation using RNA-seq and protein evidence with GeneMark-ETP, AUGUSTUS and TSEBRA
.
Genome Res
.
2024
:
34
(
5
):
769
777
. .

Gao
 
B
,
Li
 
X
,
Liang
 
Y
,
Zhang
 
J
,
Oliver
 
M
,
Zhang
 
D
.
Characterization of UV sex chromosomes and synteny-guided phylogenomic resolution of subgenomes in Bryopsida mosses
.
Research Square
. 2023. .

Guan
 
D
,
McCarthy
 
SA
,
Wood
 
J
,
Howe
 
K
,
Wang
 
Y
,
Durbin
 
R
.
Identifying and removing haplotypic duplication in primary genome assemblies
.
Bioinformatics
.
2020
:
36
(
9
):
2896
2898
. .

Haas
 
BJ
,
Delcher
 
AL
,
Mount
 
SM
,
Wortman
 
JR
,
Smith
 
RK
 Jr.
,
Hannick
 
LI
,
Maiti
 
R
,
Ronning
 
CM
,
Rusch
 
DB
,
Town
 
CD
, et al.  
Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies
.
Nucleic Acids Res
.
2003
:
31
(
19
):
5654
5666
. .

Haas
 
BJ
,
Papanicolaou
 
A
,
Yassour
 
M
,
Grabherr
 
M
,
Blood
 
PD
,
Bowden
 
J
,
Couger
 
MB
,
Eccles
 
D
,
Li
 
B
,
Lieber
 
M
, et al.  
De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis
.
Nat Protoc
.
2013
:
8
(
8
):
1494
1512
. .

Haas
 
BJ
,
Salzberg
 
SL
,
Zhu
 
W
,
Pertea
 
M
,
Allen
 
JE
,
Orvis
 
J
,
White
 
O
,
Buell
 
CR
,
Wortman
 
JR
.
Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments
.
Genome Biol
.
2008
:
9
(
1
):
R7
. .

Holst
 
F
,
Bolger
 
A
,
Günther
 
C
,
Maß
 
J
,
Triesch
 
S
,
Kindel
 
F
,
Kiel
 
N
,
Saadat
 
N
,
Ebenhöh
 
O
,
Usadel
 
B
, et al.  
Helixer–de novo prediction of primary eukaryotic gene models combining deep learning and a hidden markov model
.
bioRxiv 527280
. ,
6 February 2023
, preprint: not peer reviewed.

Hu
 
J
,
Fan
 
J
,
Sun
 
Z
,
Liu
 
S
.
NextPolish: a fast and efficient genome polishing tool for long-read assembly
.
Bioinformatics
.
2019
:
36
(
7
):
2253
2255
. .

Hu
 
J
,
Wang
 
Z
,
Sun
 
Z
,
Hu
 
B
,
Ayoola
 
AO
,
Liang
 
F
,
Li
 
J
,
Sandoval
 
JR
,
Cooper
 
DN
,
Ye
 
K
, et al.  
NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads
.
Genome Biol
.
2024
:
25
(
1
):
107
. .

Jones
 
P
,
Binns
 
D
,
Chang
 
H-Y
,
Fraser
 
M
,
Li
 
W
,
McAnulla
 
C
,
McWilliam
 
H
,
Maslen
 
J
,
Mitchell
 
A
,
Nuka
 
G
, et al.  
InterProScan 5: genome-scale protein function classification
.
Bioinformatics
.
2014
:
30
(
9
):
1236
1240
. .

Kim
 
D
,
Paggi
 
JM
,
Park
 
C
,
Bennett
 
C
,
Salzberg
 
SL
.
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype
.
Nat Biotechnol
.
2019
:
37
(
8
):
907
915
. .

Kim
 
YJ
,
Laffly
 
D
,
Kim
 
S-E
,
Nilsen
 
L
,
Chi
 
J
,
Nam
 
S
,
Lee
 
YB
,
Jeong
 
S
,
Mishra
 
U
,
Lee
 
YK
, et al.  
Chronological changes in soil biogeochemical properties of the glacier foreland of Midtre Lovénbreen, Svalbard, attributed to soil-forming factors
.
Geoderma
.
2022
:
415
:
115777
. .

Lee
 
Y-K
.
Arctic plants of Svalbard: what we learn from the green in the treeless white world
.
Switzerland
:
Springer Cham
;
2020
.

Li
 
H
.
Protein-to-genome alignment with miniprot
.
Bioinformatics
.
2023
:
39
(
1
):
btad014
. .

Ligrone
 
R
,
Duckett
 
JG
,
Renzaglia
 
KS
.
Major transitions in the evolution of early land plants: a bryological perspective
.
Ann Bot.
 
2012
:
109
(
5
):
851
871
. .

Liu
 
S
,
Fang
 
S
,
Cong
 
B
,
Li
 
T
,
Yi
 
D
,
Zhang
 
Z
,
Zhao
 
L
,
Zhang
 
P
.
The Antarctic moss Pohlia nutans genome provides insights into the evolution of bryophytes and the adaptation to extreme terrestrial habitats
.
Front Plant Sci.
 
2022
:
13
:
920138
. .

Manni
 
M
,
Berkeley
 
MR
,
Seppey
 
M
,
Simão
 
FA
,
Zdobnov
 
EM
.
BUSCO update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes
.
Mol Biol Evol
.
2021
:
38
(
10
):
4647
4654
. .

Marçais
 
G
,
Kingsford
 
C
.
A fast, lock-free approach for efficient parallel counting of occurrences of k-mers
.
Bioinformatics
.
2011
:
27
(
6
):
764
770
. .

Ou
 
S
,
Su
 
W
,
Liao
 
Y
,
Chougule
 
K
,
Agda
 
JRA
,
Hellinga
 
AJ
,
Lugo
 
CSB
,
Elliott
 
TA
,
Ware
 
D
,
Peterson
 
T
, et al.  
Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline
.
Genome Biol
.
2019
:
20
(
1
):
275
. .

Pertea
 
M
,
Pertea
 
GM
,
Antonescu
 
CM
,
Chang
 
T-C
,
Mendell
 
JT
,
Salzberg
 
SL
.
StringTie enables improved reconstruction of a transcriptome from RNA-seq reads
.
Nat Biotechnol.
 
2015
:
33
(
3
):
290
295
. .

Ramsay
 
HP
,
Spence
 
JR
.
New genera and combinations in the Bryaceae (Bryales, Musci) for Australia
.
Phytologia
.
2005
:
87
:
15
28
. .

Ranallo-Benavidez
 
TR
,
Jaron
 
KS
,
Schatz
 
MC
.
GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes
.
Nat Commun.
 
2020
:
11
(
1
):
1432
. .

Rolston
 
H
.
Bryum knowltonii new to the United States
.
Bryologist.
 
1982
:
85
(
4
):
420
. .

Shaw
 
J
,
Renzaglia
 
K
.
Phylogeny and diversification of bryophytes
.
Am J Bot.
 
2004
:
91
(
10
):
1557
1581
. .

Stanke
 
M
,
Keller
 
O
,
Gunduz
 
I
,
Hayes
 
A
,
Waack
 
S
,
Morgenstern
 
B
.
AUGUSTUS: ab initio prediction of alternative transcripts
.
Nucleic Acids Res
.
2006
:
34
(
Web Server
):
W435
W439
. .

Szovenyi
 
P
,
Gunadi
 
A
,
Li
 
F-W
.
Charting the genomic landscape of seed-free plants
.
Nat Plants
.
2021
:
7
(
5
):
554
565
. .

Törönen
 
P
,
Medlar
 
A
,
Holm
 
L
.
PANNZER2: a rapid functional annotation web server
.
Nucleic Acids Res
.
2018
:
46
(
W1
):
W84
W88
. .

Zhou
 
X
,
Peng
 
T
,
Zeng
 
Y
,
Cai
 
Y
,
Zuo
 
Q
,
Zhang
 
L
,
Dong
 
S
,
Liu
 
Y
.
Chromosome-level genome assembly of Niphotrichum japonicum provides new insights into heat stress responses in mosses
.
Front Plant Sci
.
2023
:
14
:
1271357
. .

Author notes

Changling Ma and Xuping Zhou contributed equally to this work.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Associate Editor: Vincent Castric
Vincent Castric
Associate Editor
Search for other works by this author on:

Supplementary data