Chromosome-level assembly of Dictyophora rubrovolvata genome using third-generation DNA sequencing and Hi-C analysis

General features of the D. rubrovolvata genome.

Feature
Genome assembly
Sequence and assembly statistic	Scaffold length (bp)	32,887,857
	Scaffold number	132
	Scaffold N50 length (bp)	2,711,256
	Scaffold N90 length (bp)	840,000
	Contig length	32,887,457
	Contigs number	136
	Contig N50 length (bp)	2,480,000
	Contig N90 length (bp)	840,000
	GC content (%)	45.16
Gene prediction
Gene	Number of protein-coding genes	9,725
	Total gene length (bp)	20,763,569
	Average gene length (bp)	2,135.07
Exon	Total exon length (bp)	16,349,310
	Average exon length (bp)	237.56
	Total exon number	68,822
	Average number/gene	7.08
CDS	Total CDS length (bp)	14,344,617
	Average CDS length (bp)	215.64
	Total CDS number	66,521
	Average number/gene	6.84
Intron	Total intron length (bp)	4,414,259
	Average intron length (bp)	74.7
	Total intron number	59,097
	Average number/gene	6.08

Feature
Genome assembly
Sequence and assembly statistic	Scaffold length (bp)	32,887,857
	Scaffold number	132
	Scaffold N50 length (bp)	2,711,256
	Scaffold N90 length (bp)	840,000
	Contig length	32,887,457
	Contigs number	136
	Contig N50 length (bp)	2,480,000
	Contig N90 length (bp)	840,000
	GC content (%)	45.16
Gene prediction
Gene	Number of protein-coding genes	9,725
	Total gene length (bp)	20,763,569
	Average gene length (bp)	2,135.07
Exon	Total exon length (bp)	16,349,310
	Average exon length (bp)	237.56
	Total exon number	68,822
	Average number/gene	7.08
CDS	Total CDS length (bp)	14,344,617
	Average CDS length (bp)	215.64
	Total CDS number	66,521
	Average number/gene	6.84
Intron	Total intron length (bp)	4,414,259
	Average intron length (bp)	74.7
	Total intron number	59,097
	Average number/gene	6.08

Table 1.

General features of the D. rubrovolvata genome.

Feature
Genome assembly
Sequence and assembly statistic	Scaffold length (bp)	32,887,857
	Scaffold number	132
	Scaffold N50 length (bp)	2,711,256
	Scaffold N90 length (bp)	840,000
	Contig length	32,887,457
	Contigs number	136
	Contig N50 length (bp)	2,480,000
	Contig N90 length (bp)	840,000
	GC content (%)	45.16
Gene prediction
Gene	Number of protein-coding genes	9,725
	Total gene length (bp)	20,763,569
	Average gene length (bp)	2,135.07
Exon	Total exon length (bp)	16,349,310
	Average exon length (bp)	237.56
	Total exon number	68,822
	Average number/gene	7.08
CDS	Total CDS length (bp)	14,344,617
	Average CDS length (bp)	215.64
	Total CDS number	66,521
	Average number/gene	6.84
Intron	Total intron length (bp)	4,414,259
	Average intron length (bp)	74.7
	Total intron number	59,097
	Average number/gene	6.08

Feature
Genome assembly
Sequence and assembly statistic	Scaffold length (bp)	32,887,857
	Scaffold number	132
	Scaffold N50 length (bp)	2,711,256
	Scaffold N90 length (bp)	840,000
	Contig length	32,887,457
	Contigs number	136
	Contig N50 length (bp)	2,480,000
	Contig N90 length (bp)	840,000
	GC content (%)	45.16
Gene prediction
Gene	Number of protein-coding genes	9,725
	Total gene length (bp)	20,763,569
	Average gene length (bp)	2,135.07
Exon	Total exon length (bp)	16,349,310
	Average exon length (bp)	237.56
	Total exon number	68,822
	Average number/gene	7.08
CDS	Total CDS length (bp)	14,344,617
	Average CDS length (bp)	215.64
	Total CDS number	66,521
	Average number/gene	6.84
Intron	Total intron length (bp)	4,414,259
	Average intron length (bp)	74.7
	Total intron number	59,097
	Average number/gene	6.08

Hi-C

Hi-C has been widely used to map chromatin interactions within regions of interest and across the genome. In total, 20.1 million read pairs (6.03 Gb clean data) were generated from the Hi-C library, and the GC content and Q30 ratio (the percentage of clean reads more than 30 bp) were 43.25 and 94.03%, respectively (Supplementary Table 3).

The Hi-C library quality was assessed based on the ratio of mapped reads and the proportions of valid interaction pairs and invalid interaction pairs. Only valid interaction pairs can provide effective information for genome assembly. Invalid interaction pairs mainly consist of self-circle ligation, dangling ends, re-ligation, and dumped pairs. The mapped reads ratio was 95.19% (Supplementary Table 4). Of the unique mapped read pairs, 74.34% were the valid interaction pairs (12.12 million), which were used for the next Hi-C assembly (Supplementary Table 5).

Overall, we constructed a chromosomal-level assembly of D. rubrovolvata with 11 pseudo-chromosomes with lengths ranging from 1.77 to 3.37 Mb (Table 2). Hi-C assembly incorporated 28,241,566 bp genomic sequences, accounting for 85.87% of the total sequence length on the chromosomes. The detailed distribution of each chromosome sequence is summarized in Table 2.

Table 2.

Detailed results of chromosome-level scaffolding using Hi-C technology.

Chromosome	Sequence number	Sequence length (bp)
Chr1	2	3,366,512
Chr2	1	3,149,261
Chr3	3	3,126,176
Chr4	1	2,788,612
Chr5	1	2,754,995
Chr6	1	2,711,256
Chr7	1	2,430,384
Chr8	2	2,398,327
Chr9	1	1,936,385
Chr10	1	1,814,427
Chr11	1	1,765,231
Total sequences clustered (ratio %)	15 (11.03)	28,241,566 (85.87)
Total sequences ordered and oriented (ratio %)	15 (100)	28,241,566 (100)

Chromosome	Sequence number	Sequence length (bp)
Chr1	2	3,366,512
Chr2	1	3,149,261
Chr3	3	3,126,176
Chr4	1	2,788,612
Chr5	1	2,754,995
Chr6	1	2,711,256
Chr7	1	2,430,384
Chr8	2	2,398,327
Chr9	1	1,936,385
Chr10	1	1,814,427
Chr11	1	1,765,231
Total sequences clustered (ratio %)	15 (11.03)	28,241,566 (85.87)
Total sequences ordered and oriented (ratio %)	15 (100)	28,241,566 (100)

Table 2.

Detailed results of chromosome-level scaffolding using Hi-C technology.

Chromosome	Sequence number	Sequence length (bp)
Chr1	2	3,366,512
Chr2	1	3,149,261
Chr3	3	3,126,176
Chr4	1	2,788,612
Chr5	1	2,754,995
Chr6	1	2,711,256
Chr7	1	2,430,384
Chr8	2	2,398,327
Chr9	1	1,936,385
Chr10	1	1,814,427
Chr11	1	1,765,231
Total sequences clustered (ratio %)	15 (11.03)	28,241,566 (85.87)
Total sequences ordered and oriented (ratio %)	15 (100)	28,241,566 (100)

Chromosome	Sequence number	Sequence length (bp)
Chr1	2	3,366,512
Chr2	1	3,149,261
Chr3	3	3,126,176
Chr4	1	2,788,612
Chr5	1	2,754,995
Chr6	1	2,711,256
Chr7	1	2,430,384
Chr8	2	2,398,327
Chr9	1	1,936,385
Chr10	1	1,814,427
Chr11	1	1,765,231
Total sequences clustered (ratio %)	15 (11.03)	28,241,566 (85.87)
Total sequences ordered and oriented (ratio %)	15 (100)	28,241,566 (100)

For the Hi-C assembled chromosomes, the genome was cut into 20 kb bins of equal length. The number of Hi-C read pairs covered between any 2 bins was then used as the intensity signal of the interaction between the bins to construct a heat map (Yang et al. 2021). The heat map demonstrated that the 11 chromosome groups can be clearly distinguished (Fig. 1). Within each group, the intensity of interaction in the diagonal position was higher than that in the off-diagonal position, indicating that the intensity of adjacent sequences (diagonal position) interaction in the Hi-C assembly was high, while the intensity of nonadjacent sequences (off-diagonal position) interaction was weak. The heat map of the Hi-C assembly interaction bins was consistent with a genome assembly of excellent quality.

Fig. 1.

Hi-C assembly of a chromosome interactive heat map. Lachesis Group (LG) means chromosome. LG01–LG10 are the abbreviations of 11 chromosomes. The abscissa and ordinate represent the order of each bin on the corresponding chromosome group.

Repeat sequence

The total length of the repeat sequence was 3,243,445 bp, which accounted for 9.86% of the D. rubrovolvata genome length. It was subdivided into 5 major types: retrotransposon, transposon, potential host gene, simple sequence repeat (SSR), and unknown duplications. A total of 2,428 retrotransposon, 2,141,399 bp in length, accounted for 6.51% of the genome length. In retrotransposon, the long terminal repeat-retrotransposons Copia (LTR/Copia) and long terminal repeat-retrotransposons Gypsy (LTR/Gypsy) accounted for 0.49 and 2.68% of the assembled genome, respectively. Transposon represented 0.71% of the assembled genomes. The Helitron transposable element, miniature inverted repeat transposable element, and terminal inverted repeat transposable element accounted for 0.16, 0.09, and 0.38% of the assembled genome, respectively (Table 3).

Table 3.

Statistical results of genomic repeat sequencing.

Type	Number	Length (bp)	Percentage
Retrotransposon	2,428	2,141,399	6.51
Retrotransposon/DIRS	1	53	0.00
Retrotransposon/LINE	35	2,635	0.01
Retrotransposon/LTR/Copia	264	159,709	0.49
Retrotransposon/LTR/Gypsy	1,303	880,309	2.68
Retrotransposon/PLE\|LARD	824	1,125,981	3.42
Retrotransposon/unknown	1	68	0.00
Transposon	526	233,980	0.71
Transposon/Helitron	127	51,911	0.16
Transposon/MITE	147	30,361	0.09
Transposon/TIR	224	125,860	0.38
Transposon/unknown	28	26,234	0.08
Potential host gene	226	210,983	0.64
SSR	11	1,006	0.00
Unknown	1,772	740,122	2.25

Type	Number	Length (bp)	Percentage
Retrotransposon	2,428	2,141,399	6.51
Retrotransposon/DIRS	1	53	0.00
Retrotransposon/LINE	35	2,635	0.01
Retrotransposon/LTR/Copia	264	159,709	0.49
Retrotransposon/LTR/Gypsy	1,303	880,309	2.68
Retrotransposon/PLE\|LARD	824	1,125,981	3.42
Retrotransposon/unknown	1	68	0.00
Transposon	526	233,980	0.71
Transposon/Helitron	127	51,911	0.16
Transposon/MITE	147	30,361	0.09
Transposon/TIR	224	125,860	0.38
Transposon/unknown	28	26,234	0.08
Potential host gene	226	210,983	0.64
SSR	11	1,006	0.00
Unknown	1,772	740,122	2.25

Table 3.

Statistical results of genomic repeat sequencing.

Type	Number	Length (bp)	Percentage
Retrotransposon	2,428	2,141,399	6.51
Retrotransposon/DIRS	1	53	0.00
Retrotransposon/LINE	35	2,635	0.01
Retrotransposon/LTR/Copia	264	159,709	0.49
Retrotransposon/LTR/Gypsy	1,303	880,309	2.68
Retrotransposon/PLE\|LARD	824	1,125,981	3.42
Retrotransposon/unknown	1	68	0.00
Transposon	526	233,980	0.71
Transposon/Helitron	127	51,911	0.16
Transposon/MITE	147	30,361	0.09
Transposon/TIR	224	125,860	0.38
Transposon/unknown	28	26,234	0.08
Potential host gene	226	210,983	0.64
SSR	11	1,006	0.00
Unknown	1,772	740,122	2.25

Type	Number	Length (bp)	Percentage
Retrotransposon	2,428	2,141,399	6.51
Retrotransposon/DIRS	1	53	0.00
Retrotransposon/LINE	35	2,635	0.01
Retrotransposon/LTR/Copia	264	159,709	0.49
Retrotransposon/LTR/Gypsy	1,303	880,309	2.68
Retrotransposon/PLE\|LARD	824	1,125,981	3.42
Retrotransposon/unknown	1	68	0.00
Transposon	526	233,980	0.71
Transposon/Helitron	127	51,911	0.16
Transposon/MITE	147	30,361	0.09
Transposon/TIR	224	125,860	0.38
Transposon/unknown	28	26,234	0.08
Potential host gene	226	210,983	0.64
SSR	11	1,006	0.00
Unknown	1,772	740,122	2.25

Noncoding RNA

The results of noncoding RNAs in the D. rubrovolvata Di001 genome were shown in Table 4. With regard to RNA, 329 rRNAs, 150 tRNAs, and 29 other ncRNAs were predicted. Among the rRNAs, there were 79 5S_rRNA, 82 5.8S_rRNA, 88 18S_rRNA, and 80 28S_rRNA.

Table 4.

Statistical results of noncoding RNAs.

Type		Number
tRNA		150
rRNA	5S_rRNA	79
	5.8S_rRNA	82
	18S_rRNA	88
	28S_rRNA	80
Other ncRNA	U2	6
	RNaseP_nuc	2
	U4	2
	U5	3
	U6	4
	RNase_MRP	1
	Bacteria_small_SRP	1
	Cobalamin	1
	snosnR60_Z15	2
	snoZ13_snr52	1
	snosnR61	1
	Glycine	1
	suhB	1
	snR75	1
	snR56	1
	5_ureB_sRNA	1

Type		Number
tRNA		150
rRNA	5S_rRNA	79
	5.8S_rRNA	82
	18S_rRNA	88
	28S_rRNA	80
Other ncRNA	U2	6
	RNaseP_nuc	2
	U4	2
	U5	3
	U6	4
	RNase_MRP	1
	Bacteria_small_SRP	1
	Cobalamin	1
	snosnR60_Z15	2
	snoZ13_snr52	1
	snosnR61	1
	Glycine	1
	suhB	1
	snR75	1
	snR56	1
	5_ureB_sRNA	1

Table 4.

Statistical results of noncoding RNAs.

Type		Number
tRNA		150
rRNA	5S_rRNA	79
	5.8S_rRNA	82
	18S_rRNA	88
	28S_rRNA	80
Other ncRNA	U2	6
	RNaseP_nuc	2
	U4	2
	U5	3
	U6	4
	RNase_MRP	1
	Bacteria_small_SRP	1
	Cobalamin	1
	snosnR60_Z15	2
	snoZ13_snr52	1
	snosnR61	1
	Glycine	1
	suhB	1
	snR75	1
	snR56	1
	5_ureB_sRNA	1

Type		Number
tRNA		150
rRNA	5S_rRNA	79
	5.8S_rRNA	82
	18S_rRNA	88
	28S_rRNA	80
Other ncRNA	U2	6
	RNaseP_nuc	2
	U4	2
	U5	3
	U6	4
	RNase_MRP	1
	Bacteria_small_SRP	1
	Cobalamin	1
	snosnR60_Z15	2
	snoZ13_snr52	1
	snosnR61	1
	Glycine	1
	suhB	1
	snR75	1
	snR56	1
	5_ureB_sRNA	1

Gene prediction and genome comparisons

A total of 9,725 genes were predicted in the D. rubrovolvata genome (Supplementary Table 6), among which there were 8,830 homology-predicted genes or RNA-seq-predicted genes (90.79%; Fig. 2), indicating high reliability of the prediction. The total length of the encoded genes was 20.76 Mb, accounting for 63.1% of the whole genome, and the average length of each gene was 2,135.07 bp. The average exon and intron numbers were 7.08 and 6.08, respectively (Table 1).

Fig. 2.

The integrated genes were derived from the statistical plots of 3 predictive methods.

Gene function annotation

To predict the protein sequences, a similarity analysis of 9,725 nonredundant genes in multiple public databases (GO, KEGG, KOG, NR, Pfam, CAZy, Swiss-Prot, and TrEMBL) identified 8,727 genes that were functionally annotated, which accounted for 89.74% of the assembled genome. Most genes were matched using the Nr (8,671 genes) database, followed by TrEMBL (8,298 genes) and Pfam (6,492 genes) database (Supplementary Table 7).

KOG annotations

A statistical map of annotated genes in the KOG database is shown in Fig. 3. A total of 4,899 genes were assigned to 25 categories of KOG, of which the top 5 were “General function prediction only” (850, 15.32%), “Posttranslational modification, protein turnover, chaperones” (536, 9.66%), “Signal transduction mechanisms” (392, 7.07%), “Translation, ribosomal structure and biogenesis” (308, 5.55%), and “Function unknown” (308, 5.55%; Supplementary Table 8).

Fig. 3.

The KOG function classification of proteins.

GO annotations

In GO database, 3 independent ontologies including biological process, cellular component, and molecular function were used to describe gene products according to their functional annotations. A total of 4,001 genes were assigned to 3 major categories: biological processes (18 branches), cellular components (15 branches), and molecular functions (14 branches). These were mainly distributed in 5 functional entries, “catalytic activity,” “metabolic process,” “cellular process,” “cell part,” and “cell,” of which the number of annotated genes was 2,058, 1,873, 1,777, 1,722, and 1,702, respectively (Fig. 4). Dictyophora rubrovolvata had more genes in the common subcategories of “metabolic process” and “cellular process” within the biological process and “catalytic activity” within the molecular function categories (Supplementary Table 9).

Fig. 4.

The GO function annotation.

KEGG annotations

To further systematically analyze the metabolic pathways of gene products in cells and the functions of these gene products, the KEGG database was used to annotate the gene functions of D. rubrovolvata. A statistical map of the number of annotated genes in the KEGG database is shown in Fig. 5. The 3,046 genes were assigned into 4 categories in KEGG: metabolism (90 branches), genetic information processing (15 branches), cellular processes (5 branches), and environmental information processing (1 branches). Of these, 1,863 genes were assigned to the “metabolism” category. Within metabolism, the biosynthesis of unsaturated fatty acids possesses 111 genes, followed by carbon metabolism (96), amino sugar and nucleotide sugar metabolism (47), and glutathione metabolism (46). A total of 817 genes were assigned to the “genetic information processing” functional category, including nucleocytoplasmic transport (100), ribosome (99), protein processing in the endoplasmic reticulum (85), and spliceosome (79). For cellular processes (265 genes), the cell cycle was the most involved (74). In addition to the above 3 major categories, only 28 genes were assigned to the “environmental information processing” category (Supplementary Table 10).

Fig. 5.

The KEGG function annotation.

CAZymes

Fungi secrete an array of CAZymes (carbohydrate-active enzymes) and lignin-degrading enzymes for the degradation of lignocellulose. In this study, the CAZymes of D. rubrovolvata and 18 other fungi were analyzed (Fig. 6 and Supplementary Table 10). A total of 360 genes were annotated, including 162 glycoside hydrolases (GHs), 77 glycosyl transferases, 12 polysaccharide lyases, 20 carbohydrate esterases (CEs), 7 carbohydrate-binding modules, and 82 auxiliary activities (AAs; Fig. 5 and Supplementary Table 11).

Fig. 6.

The number of CAZymes genes in D. rubrovolvata and the other 18 fungi.

10.1186/s13100-015-0041-9

Dictyophora rubrovolvata genome had 82 AAs, which more than A. cinnamomea (44), B. edulis (54), C. militaris (55), L. bicolor (47), O. sinensis (36), S. latifolia (38), T. matsutake (66), and W. cocos (43). Proteins in the AA category were mainly distributed in AA3 (28), AA7 (16), AA1 (11), AA2 (7), AA9 (7), and AA5 (5). GHs accounted for 45% of the total identified CAZymes in D. rubrovolvata. 28 genes (18 in GH5 and 10 in GH3) were identified in D. rubrovolvata, and these genes were related to cellulose digestion. Twenty-five genes (18 in GH16, 4 in GH43, and 3 in GH10) were also identified, and these genes were involved in hemicellulose digestion. Proteins in the CE category were mainly distributed in CE16 (6, 30%), CE17 (4, 20%), CE4 (3, 15%), and CE8 (2, 10%).

The CYPs family

The CYP superfamily is a diverse group of enzymes involved in various physiological processes, including detoxification, degradation of xenobiotics, and the biosynthesis of secondary metabolites (Yap et al. 2014). Dictyophora rubrovolvata had a total of 425 CYP genes, which can be classified into 41 families according to Nelson's nomenclature (Fischer et al. 2007). The CYP51 family, which may play a role in demethylation, was found to have the greatest number of genes (118 genes, 27.8%), followed by CYP 620 family (52 genes, 12.2%), CYP53 family (43 genes, 10.1%), and CYP 504 family (27 genes, 6.4%). Proteins in the CYP51 family were mainly consisted of CYP51_422 (33 genes, 28.0%), CYP51_6 (31 genes, 26.3%), and CYP51_11 (14, 11.9%; Supplementary Table 12).

Conclusion

In this study, we report a highly accurate chromosome-level genome assembly of D. rubrovolvata based on the PacBio SMRT and Hi-C technologies. The final genome size was 32.89 Mb. A total of 9,725 protein-coding genes were predicted using the strategy of multievidence combination, and 8,727 genes were functionally annotated. To the best of our knowledge, this genome-wide assembly and annotation data represent the first genome scale assembly of D. rubrovolvata. The genome data created in this study will serve as valuable resources for fungal diversity research and breeding of D. rubrovolvata and will further provide essential genomic information for understanding the molecular mechanism in its fruiting body formation during morphological development and facilitate the exploitation of medicinal compounds produced by this mushroom.

Data availability

Genome sequencing of D. rubrovolvata Di001 generated for this study has been submitted to the NCBI (BioProject: PRJNA908074 and BioSample: SAMN32024313).

Supplemental material available at G3 online.

Funding

This work was supported by the Natural Science Foundation of Fujian province of China (2021J01504), the Special Fund for Scientific Research in the Public Interest of Fujian Province (2022R1035005), the Science and Technology Innovations Program of Fujian Academy of Agricultural Science (CXTD2021016-2), and the guiding scientific and technological innovation projects of Fujian Academy of Agricultural Science (YDXM202209).

Author contributions

L.M. and C.Y. conceived and designed the project. L.M and C.Y. contributed equally to this work. D.X. performed the experiments. X.L. and X.J. contributed reagents and materials. L.M., C.Y., and Z.Y. analyzed the data. H.L. did the review and editing. L.M., C.Y., and Y.L. wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Literature cited

Alioto

Blanco

Parra

Guigó

Using geneid to identify genes

Curr Protoc Bioinformatics

2018

;

(

e56

. doi:

Bao

Kojima

Kohany

Repbase update, a database of repetitive elements in eukaryotic genomes

Mob DNA

2015

;

(

. doi:

Belton

McCord

Gibcus

Naumova

Zhan

Dekker

Hi-C: a comprehensive technique to capture the conformation of genomes

Methods

2012

;

(

268

–

276

. doi:

10.1016/j.ymeth.2012.05.001

Berlin

Koren

Chin

Drake

Landolin

Phillippy

Assembling large genomes with single-molecule sequencing and locality-sensitive hashing

Nat Biotechnol

2015

;

(

623

–

630

. doi:

Burge

Karlin

Prediction of complete gene structures in human genomic DNA

J Mol Biol

1997

;

268

(

–

. doi:

10.1006/jmbi.1997.0951

Burton

Adey

Patwardhan

Qiu

Kitzman

Shendure

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions

Nat Biotechnol

2013

;

(

1119

–

1125

. doi:

Campbell

Haas

Hamilton

Mount

Buell

Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis

BMC Genomics

2006

;

(

327

. doi:

10.1186/1471-2164-7-327

Cheng

Concepcion

Feng

Zhang

Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm

Nat Methods

2021

;

(

170

–

175

. doi:

10.1038/s41592-020-01056-5

Dai

Sun

Yin

Gao

Zhao

Jia

Yuan

Pleurotus eryngii genomes reveal evolution and adaptation to the Gobi Desert environment

Front Microbiol

2019

;

2024

–

2035

. doi:

10.3389/fmicb.2019.02024

Deng

Shang

Chen

Liu

Chen

Mechanism of the immunostimulatory activity by a polysaccharide from Dictyophora indusiata

Int J Biol Macromol

2016

;

752

–

759

. doi:

10.1016/j.ijbiomac.2016.06.024

Edgar

Myers

PILER: identification and classification of genomic repeats

Bioinformatics

2005

;

(

Suppl. 1

i152

–

i158

. doi:

10.1093/bioinformatics/bti1003

Fischer

Knoll

Sirim

Wagner

Funke

Pleiss

The cytochrome P450 engineering database: a navigation and prediction tool for the cytochrome P450 protein family

Bioinformatics

2007

;

(

2015

–

2017

. doi:

10.1093/bioinformatics/btm268

Lin

Wei

Zhou

Zhao

Zhang

Lin

Liu

Chen

, et al.

Quantitative evaluation of ultrasound-assisted extraction of 1,3-β-glucans from Dictyophora indusiata using an improved fluorometric assay

Polymers (Basel).

2019

;

(

864

. doi:

10.3390/polym11050864

Gao

Yan

Song

Fan

Wang

Liu

Huang

Rong

Guo

Zhao

, et al.

Haplotype-resolved genome analyses reveal genetically distinct nuclei within a commercial cultivar of Lentinula edodes

J Fungi (Basel)

2022

;

(

167

. doi:

Haas

Salzberg

Zhu

Pertea

Allen

Orvis

White

Buell

Wortman

Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments

Genome Biol

2008

;

(

. doi:

10.1186/gb-2008-9-1-r7

Han

Wessler

MITE-hunter: a program for discovering miniature inverted-repeat transposable elements from genomic sequences

Nucleic Acids Res

2010

;

(

e199

. doi:

Hang

Zou

Tian

Sun

Chen

Analysis of volatile components from Dictyophora rubrovolota Zang, Ji et liou

Procedia Eng

2012

;

240

–

249

. doi:

10.1016/j.proeng.2012.04.234

10.1038/s41597-022-01601-1

Zhao

Yuan

Canario

Liu

Chen

Guo

Luo

Yan

Zhang

, et al.

Chromosome-level genome assembly of largemouth bass (Micropterus salmoides) using PacBio and Hi-C technologies

Sci Data

2022

;

(

482

. doi:

Jiang

Zhou

The first whole genome sequencing of Sanghuangporus sanghuang provides insights into its medicinal application and evolution

J Fungi (Basel)

2021

;

(

787

. doi:

Jin Jing

Ang

Hui

Ming Wen

Liang

Ming Jie

Hong

Zhi Yong

Transcriptome analysis and its application in identifying genes associated with fruiting body development in basidiomycete Hypsizygus marmoreus

PLoS One

2015

;

(

e0123025

. doi:

10.1371/journal.pone.0123025

10.1093/bioinformatics/bth315

Keilwagen

Wenk

Erickson

Schattat

Grau

Hartung

Using intron position conservation for homology-based gene prediction

Nucleic Acids Res

2016

;

(

e89

. doi:

Korf

Gene finding in novel genomes

BMC Bioinformatics

2004

;

. doi:

10.1186/1471-2105-5-59

Zhao

Mao

Guo

Whole genome sequence of an edible mushroom Stropharia rugosoannulata (Daqiugaigu)

J Fungi (Basel)

2022

;

(

. doi:

Liao

Luo

Liu

Ning

Yang

Ren

Structure characterization of a novel polysaccharide from Dictyophora indusiata and its macrophage immunomodulatory activities

J Agric Food Chem

2015

;

(

535

–

544

. doi:

Majoros

Pertea

Salzberg

Tigrscan and GlimmerHMM: two open source ab initio eukaryotic gene-finders

Bioinformatics

2004

;

(

2878

–

2879

. doi:

Meng

Zhang

Yang

Wang

Dong

Yao

A preliminary study of chromosome number in Ophiocordyceps sinensis

Chin J Appl Entomol

2021

;

(

1330

–

1338

. doi:

10.7679/j.issn.2095-1353.2021.133

10.1371/journal.pone.0093560

Morin

Kohler

Baker

Foulongne-Oriol

Lombard

Nagy

Ohm

Patyshakuliyeva

Brun

Aerts

, et al.

Genome sequence of the button mushroom Agaricus bisporus reveals mechanisms governing adaptation to a humic-rich ecological niche

Proc Natl Acad Sci U S A

2012

;

109

(

17501

–

17506

. doi:

10.1073/pnas.1206847109

Park

Baek

Lee

Kim

Rhee

Kim

Seo

Park

Yoon

Nam

, et al.

Whole genome and global gene expression analyses of the model mushroom Flammulina velutipes reveal a high capacity for lignocellulose degradation

PLoS One

2014

;

(

e93560

. doi:

10.1093/bioinformatics/bti1018

Pertea

Kim

Pertea

Leek

Salzberg

Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown

Nat Protoc

2016

;

(

1650

–

1667

. doi:

10.1038/nprot.2016.095

Price

Jones

Pevzner

De novo identification of repeat families in large genomes

Bioinformatics

2005

;

(

Suppl. 1

i351

–

i358

. doi:

Servant

Varoquaux

Lajoie

Viara

Chen

Vert

Heard

Dekker

Barillot

HiC-Pro: an optimized and flexible pipeline for Hi-C data processing

Genome Biol

2015

;

259

. doi:

10.1186/s13059-015-0831-x

Shim

Park

Kim

Bae

Lee

Kim

Ryoo

Rhee

, et al.

Whole genome de novo sequencing and genome annotation of the world popular cultivated edible mushroom, Lentinula edodes

J Biotechnol

2016

;

223

–

. doi:

10.1016/j.jbiotec.2016.02.032

Stanke

Waack

Gene prediction with a hidden Markov model and a new intron submodel

Bioinformatics

2003

;

(

Suppl. 2

ii215

–

ii225

. doi:

10.1093/bioinformatics/btg1080

Sun

Zhang

Jiang

Yang

Wang

Lei

Qiu

, et al.

Whole genome sequencing and annotation of Naematelia aurantialba (Basidiomycota, edible-medicinal fungi)

J Fungi (Basel)

2021

;

(

. doi:

Tarailo-Graovac

Chen

Using RepeatMasker to identify repetitive elements in genomic sequences

Curr Protoc Bioinformatics

2009

;

Chapter 4

4.10.1

–

4.10.14

. doi:

10.1002/0471250953.bi0410s25

Walker

Abeel

Shea

Priest

Earl

Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement

PLoS One

2014

;

(

e112963

. doi:

10.1371/journal.pone.0112963

10.1016/j.ijbiomac.2019.09.210

Wang

Wen

Yang

Liu

Geng

De novo transcriptome and proteome analysis of Dictyophora indusiata fruiting bodies provides insights into the changes during morphological development

Int J Biol Macromol

2020

;

146

875

–

886

. doi:

Wicker

Sabot

Van

Bennetzen

Capy

Chalhoub

Flavell

Leroy

Morgante

Panaud

, et al.

A unified classification system for eukaryotic transposable elements

Nat Rev Genet

2007

;

(

973

–

982

. doi:

Cai

Jiao

Zhang

Liu

Chen

Xiao

Gao

, et al.

Whole-genome sequencing and transcriptome analysis of Ganoderma lucidum strain Yw-1-5 provides new insights into the enhanced effect of tween80 on exopolysaccharide production

J Fungi (Basel)

2022

;

(

1081

. doi:

Xiao

Yang

Ying

Jiang

Lin

De novo sequencing of a Sparassis latifolia genome and its associated comparative analyses

Can J Infect Dis Med Microbiol

2018

;

2018

1857170

. doi:

10.1155/2018/1857170

10.1093/g3journal/jkab173

Wang

LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons

Nucleic Acids Res

2007

;

(

Web Server issue

W265

–

W268

. doi:

Yang

Xiao

Liu

Jiang

Ying

Lin

Chromosome-scale assembly of the Sparassis latifolia genome obtained using long-read and Hi-C sequencing

G3 (Bethesda)

2021

;

(

jkab173

. doi:

10.16333/j.1001-6880.2016.3.017

Yap

H-YY

Chooi

Firdaus-Raih

Fung

Tan

The genome of the tiger milk mushroom, Lignosus rhinocerotis, provides insights into the genetic basis of its medicinal properties

BMC Genomics

2014

;

(

635

. doi:

10.1186/1471-2164-15-635

Wen

Huan

Xie

Effects of Dictyophora rubrovalvata polysaccharide on anti-fatigue and hypoxia endurance in mice

Nat Prod Res Dev

2016

;

416

–

419

. doi:

10.13386/j.issn1002-0306.2016.07.057

Wen

Peng

Zhang

Effects of Dictyophora rubrovalvata polysaccharide on anti-aging and hypoglycemic in mice

Sci Technol Food Industry

2016

;

(

343

–

345

. doi:

10.1186/s12864-022-08325-x

Zhang

Shang

Peng

Xiao

Tan

Chromosomal genome and population genetic analyses to reveal genetic architecture, breeding history and genes related to cadmium accumulation in Lentinula edodes

BMC Genomics

2022

;

(

120

. doi:

Zang

Notes on Phallaceae from the eastern Himalayan region of China

Acta Mycologica Sinica

1985

;

(

109

–

117

. doi:

10.13346/j.mycosystema.1985.02.008