Table 1
L. kohalensis sequencing, assembly, and gene space evaluation statistics
Sequencing statisticsRaw dataProcessed data
LibrarySize (Gb)CoverageaSize (Gb)Coveragea
Paired-end 0.2-kb inserts28.91526.114
Paired-end 0.5-kb inserts63.13359.831
Mate-pair 2-kb inserts36.21931.817
Mate-pair 5-Kb inserts34.31827.814
Total162.585145.576
Sequencing statisticsRaw dataProcessed data
LibrarySize (Gb)CoverageaSize (Gb)Coveragea
Paired-end 0.2-kb inserts28.91526.114
Paired-end 0.5-kb inserts63.13359.831
Mate-pair 2-kb inserts36.21931.817
Mate-pair 5-Kb inserts34.31827.814
Total162.585145.576
Assembly statisticsContigsScaffolds
Total assembly size (Gb)1.61.6
Total assembled sequences219,073148,874
Longest sequence length (kb)4654,541
Average sequence length (kb)7.210.7
N90 indexb40,9263,505
N90 length (kb)7.767.7
N50 index9,917756
N50 length (kb)43.6583
GC content (%)34.934.9
Assembly statisticsContigsScaffolds
Total assembly size (Gb)1.61.6
Total assembled sequences219,073148,874
Longest sequence length (kb)4654,541
Average sequence length (kb)7.210.7
N90 indexb40,9263,505
N90 length (kb)7.767.7
N50 index9,917756
N50 length (kb)43.6583
GC content (%)34.934.9
Gene space statisticsMapping percentage
Laupala unigenes from the Gene Index 95
Laupala RNA-seq reads92
Gene space statisticsMapping percentage
Laupala unigenes from the Gene Index 95
Laupala RNA-seq reads92
BUSCO databaseComplete (%)Single copy (%)Duplicated (%)Fragmented (%)Missing (%)Total
Eukaryota_odb998.793.75.00.31.0303
Arthropoda_odb999.396.82.50.10.61066
BUSCO databaseComplete (%)Single copy (%)Duplicated (%)Fragmented (%)Missing (%)Total
Eukaryota_odb998.793.75.00.31.0303
Arthropoda_odb999.396.82.50.10.61066
a

Coverage is based on an estimated genome size of 1.91 Gb (Petrov et al. 2000).

b

When ordering all contigs (or scaffolds) by size, the N50 or N90 index indicates the number of the longest sequences (contigs or scaffolds) that contain 50 or 90% of the total assembled sequence, respectively. The N50 and N90 length indicate the length of the shortest sequence in the set of the largest contigs (or scaffolds) that contain 50 or 90% of all the sequence in the assembly, respectively.

Table 1
L. kohalensis sequencing, assembly, and gene space evaluation statistics
Sequencing statisticsRaw dataProcessed data
LibrarySize (Gb)CoverageaSize (Gb)Coveragea
Paired-end 0.2-kb inserts28.91526.114
Paired-end 0.5-kb inserts63.13359.831
Mate-pair 2-kb inserts36.21931.817
Mate-pair 5-Kb inserts34.31827.814
Total162.585145.576
Sequencing statisticsRaw dataProcessed data
LibrarySize (Gb)CoverageaSize (Gb)Coveragea
Paired-end 0.2-kb inserts28.91526.114
Paired-end 0.5-kb inserts63.13359.831
Mate-pair 2-kb inserts36.21931.817
Mate-pair 5-Kb inserts34.31827.814
Total162.585145.576
Assembly statisticsContigsScaffolds
Total assembly size (Gb)1.61.6
Total assembled sequences219,073148,874
Longest sequence length (kb)4654,541
Average sequence length (kb)7.210.7
N90 indexb40,9263,505
N90 length (kb)7.767.7
N50 index9,917756
N50 length (kb)43.6583
GC content (%)34.934.9
Assembly statisticsContigsScaffolds
Total assembly size (Gb)1.61.6
Total assembled sequences219,073148,874
Longest sequence length (kb)4654,541
Average sequence length (kb)7.210.7
N90 indexb40,9263,505
N90 length (kb)7.767.7
N50 index9,917756
N50 length (kb)43.6583
GC content (%)34.934.9
Gene space statisticsMapping percentage
Laupala unigenes from the Gene Index 95
Laupala RNA-seq reads92
Gene space statisticsMapping percentage
Laupala unigenes from the Gene Index 95
Laupala RNA-seq reads92
BUSCO databaseComplete (%)Single copy (%)Duplicated (%)Fragmented (%)Missing (%)Total
Eukaryota_odb998.793.75.00.31.0303
Arthropoda_odb999.396.82.50.10.61066
BUSCO databaseComplete (%)Single copy (%)Duplicated (%)Fragmented (%)Missing (%)Total
Eukaryota_odb998.793.75.00.31.0303
Arthropoda_odb999.396.82.50.10.61066
a

Coverage is based on an estimated genome size of 1.91 Gb (Petrov et al. 2000).

b

When ordering all contigs (or scaffolds) by size, the N50 or N90 index indicates the number of the longest sequences (contigs or scaffolds) that contain 50 or 90% of the total assembled sequence, respectively. The N50 and N90 length indicate the length of the shortest sequence in the set of the largest contigs (or scaffolds) that contain 50 or 90% of all the sequence in the assembly, respectively.

Close
This Feature Is Available To Subscribers Only

Sign In or Create an Account

Close

This PDF is available to Subscribers Only

View Article Abstract & Purchase Options

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

Close