Table 2

Comparisons of the performance of Red, Tallymer, WindowMasker, phRAIDER, GRF and RepLoc in de novo repeat detection

MethodSECRs (%)SESDs (%)SEARs (%)DEL (bp)URL (bp)PRL (bp)FDR (%)SPNumTime (min)Mem (GB)
Arabidopsis thaliana
RepLoc75.6462.6866.9727 792 2735 325 8831 412 75414.0877611.620.73
Red76.9956.2965.4148 266 60026 325 011800 32752.8827282.780.91
Tallymer63.7938.1449.1321 002 1774 520 842503 48019.1380391.770.94
GRF55.5835.5243.2915 443 068921 123447 9353.067728.852.40
WinMask38.0723.6030.4825 335 17115 108 496128 50659.1371770.600.32
phRAIDER7.823.315.361 810 76811 10853510.321301.073.40
Drosophila melanogaster
RepLoc86.0595.1686.1130 406 8253 624 759286 47610.9819 4111.750.73
Red83.0283.8080.4030 727 8185 722 087157 73018.1112 1603.281.00
Tallymer82.1182.7679.4028 758 8534 063 921163 67113.5618 8122.251.08
GRF70.0279.4768.1821 619 040412 980148 7661.2217208.882.80
WinMask49.5139.7247.0429 686 52315 055 68232 44450.6119 3810.680.32
phRAIDER37.2344.8136.7411 491 24564 10746 1700.1610963.383.90
Oryza sativa
RepLoc85.6977.2182.34185 011 07112 032 7314 111 0084.2877 4622.601.50
Tallymer77.1346.6368.42149 613 4475 873 8201 435 1822.9766 5769.323.00
Red73.0541.4964.08137 634 4182 999 8451 071 0691.4055 5369.182.40
GRF72.9550.7166.15143 570 2754 592 3641 807 7961.9422 61311.928.30
WinMask58.0728.4150.40120 138 22614 243 639209 24011.6838 2613.001.10
phRAIDER34.4214.8129.3161 742 598160 94976 3770.14753510.085.30
Danio rerio
RepLoc88.3291.4187.98814 834 12260 910 4508 919 5966.38416 85913.486.20
Tallymer87.2784.1286.33802 714 83062 891 3116 284 0387.05373 03844.6711.50
Red74.6275.2473.60644 011 24213 254 5723 338 0951.54334 42438.686.40
GRF73.4678.9672.91648 858 01924 069 7904 677 0472.99130 31965.5837.60
WinMask67.4152.1765.75646 304 38882 840 5101 870 44612.53203 96515.521.30
phRAIDER38.7051.6838.70335 782 2834 172 8591 320 2760.8588 89839.628.60
Zea mays
RepLoc97.8093.0894.361 842 724 70323 764 99314 207 7960.5259 01928.906.20
Tallymer97.1389.0091.561 784 439 41319 397 98110 361 8760.5155 60096.1019.30
GRF95.3288.6290.551 763 611 50618 136 51510 059 9940.4622 173147.4857.80
Red93.6682.4185.891 662 061 4916 330 7155 000 2580.0836 21940.479.10
WinMask82.3568.2772.801 408 960 3795 668 126971 8540.3315 56322.821.50
phRAIDER84.6670.3474.981 447 164 0021 868 0481 396 7250.0313 123115.9521.40
Mus musculus
RepLoc71.5789.4472.421 034 179 39685 509 55717 027 0746.62613 28123.8811.70
Tallymer71.0482.0670.471 015 198 89792 063 88912 607 0927.83565 20975.3224.00
Red66.6076.7365.57918 070 89159 121 01510 459 4665.30527 81778.538.80
GRF60.8179.9461.38900 984 31948 454 91410 742 4244.42270 293149.2264.00
WinMask62.4561.7560.03949 478 121163 016 9456 364 91016.50433 33028.931.40
phRAIDER43.7661.2644.04591 931 80214 963 2776 768 8161.38295 62182.3021.80
Homo sapiens
RepLoc67.3085.5767.891 190 032 803102 056 42828 757 0566.16716 46324.2211.70
Tallymer65.9363.7464.851 144 451 245105 103 16016 578 9237.74648 553149.4725.70
Red65.5457.2463.951 134 956 885110 011 69912 935 4728.55571 44886.409.30
GRF59.2067.6658.941 008 246 53163 625 90416 546 6264.67519 015177.80247.70
WinMask58.2938.5056.021 104 198 589206 414 0023 464 95818.38569 38532.471.50
phRAIDER25.2921.0424.34396 065 1815 948 7712 324 1050.92264 12860.0327.20
MethodSECRs (%)SESDs (%)SEARs (%)DEL (bp)URL (bp)PRL (bp)FDR (%)SPNumTime (min)Mem (GB)
Arabidopsis thaliana
RepLoc75.6462.6866.9727 792 2735 325 8831 412 75414.0877611.620.73
Red76.9956.2965.4148 266 60026 325 011800 32752.8827282.780.91
Tallymer63.7938.1449.1321 002 1774 520 842503 48019.1380391.770.94
GRF55.5835.5243.2915 443 068921 123447 9353.067728.852.40
WinMask38.0723.6030.4825 335 17115 108 496128 50659.1371770.600.32
phRAIDER7.823.315.361 810 76811 10853510.321301.073.40
Drosophila melanogaster
RepLoc86.0595.1686.1130 406 8253 624 759286 47610.9819 4111.750.73
Red83.0283.8080.4030 727 8185 722 087157 73018.1112 1603.281.00
Tallymer82.1182.7679.4028 758 8534 063 921163 67113.5618 8122.251.08
GRF70.0279.4768.1821 619 040412 980148 7661.2217208.882.80
WinMask49.5139.7247.0429 686 52315 055 68232 44450.6119 3810.680.32
phRAIDER37.2344.8136.7411 491 24564 10746 1700.1610963.383.90
Oryza sativa
RepLoc85.6977.2182.34185 011 07112 032 7314 111 0084.2877 4622.601.50
Tallymer77.1346.6368.42149 613 4475 873 8201 435 1822.9766 5769.323.00
Red73.0541.4964.08137 634 4182 999 8451 071 0691.4055 5369.182.40
GRF72.9550.7166.15143 570 2754 592 3641 807 7961.9422 61311.928.30
WinMask58.0728.4150.40120 138 22614 243 639209 24011.6838 2613.001.10
phRAIDER34.4214.8129.3161 742 598160 94976 3770.14753510.085.30
Danio rerio
RepLoc88.3291.4187.98814 834 12260 910 4508 919 5966.38416 85913.486.20
Tallymer87.2784.1286.33802 714 83062 891 3116 284 0387.05373 03844.6711.50
Red74.6275.2473.60644 011 24213 254 5723 338 0951.54334 42438.686.40
GRF73.4678.9672.91648 858 01924 069 7904 677 0472.99130 31965.5837.60
WinMask67.4152.1765.75646 304 38882 840 5101 870 44612.53203 96515.521.30
phRAIDER38.7051.6838.70335 782 2834 172 8591 320 2760.8588 89839.628.60
Zea mays
RepLoc97.8093.0894.361 842 724 70323 764 99314 207 7960.5259 01928.906.20
Tallymer97.1389.0091.561 784 439 41319 397 98110 361 8760.5155 60096.1019.30
GRF95.3288.6290.551 763 611 50618 136 51510 059 9940.4622 173147.4857.80
Red93.6682.4185.891 662 061 4916 330 7155 000 2580.0836 21940.479.10
WinMask82.3568.2772.801 408 960 3795 668 126971 8540.3315 56322.821.50
phRAIDER84.6670.3474.981 447 164 0021 868 0481 396 7250.0313 123115.9521.40
Mus musculus
RepLoc71.5789.4472.421 034 179 39685 509 55717 027 0746.62613 28123.8811.70
Tallymer71.0482.0670.471 015 198 89792 063 88912 607 0927.83565 20975.3224.00
Red66.6076.7365.57918 070 89159 121 01510 459 4665.30527 81778.538.80
GRF60.8179.9461.38900 984 31948 454 91410 742 4244.42270 293149.2264.00
WinMask62.4561.7560.03949 478 121163 016 9456 364 91016.50433 33028.931.40
phRAIDER43.7661.2644.04591 931 80214 963 2776 768 8161.38295 62182.3021.80
Homo sapiens
RepLoc67.3085.5767.891 190 032 803102 056 42828 757 0566.16716 46324.2211.70
Tallymer65.9363.7464.851 144 451 245105 103 16016 578 9237.74648 553149.4725.70
Red65.5457.2463.951 134 956 885110 011 69912 935 4728.55571 44886.409.30
GRF59.2067.6658.941 008 246 53163 625 90416 546 6264.67519 015177.80247.70
WinMask58.2938.5056.021 104 198 589206 414 0023 464 95818.38569 38532.471.50
phRAIDER25.2921.0424.34396 065 1815 948 7712 324 1050.92264 12860.0327.20

SECRs is the sensitivity to common repeats annotated by RepeatMasker. SESDs is the sensitivity to segmental duplications identified by SEDEF. SEARs is the sensitivity to all repeats merged from CRs and SDs. DEL is the total length of repeats detected by every tool in the table. URL is the total length of repeats detected by de novo tools but are unannotated by RepeatMasker or SEDEF. PRL is the total length of PRs that are not annotated but validated by BLAST. FDR is the false discovery rate. SPNum is the number of specifically detected repeats. WinMask is short for WindowMasker. The symbol ‘bp’ means base pair. ‘min’ is minute. ‘GB’ is gigabyte. The CPU cores used by RepLoc and GRF for A. thaliana, D. melanogaster, O. sativa, D. rerio, Z. mays, M. musculus and H. sapiens are 5, 7, 12, 20, 10, 20 and 20, respectively.

Table 2

Comparisons of the performance of Red, Tallymer, WindowMasker, phRAIDER, GRF and RepLoc in de novo repeat detection

MethodSECRs (%)SESDs (%)SEARs (%)DEL (bp)URL (bp)PRL (bp)FDR (%)SPNumTime (min)Mem (GB)
Arabidopsis thaliana
RepLoc75.6462.6866.9727 792 2735 325 8831 412 75414.0877611.620.73
Red76.9956.2965.4148 266 60026 325 011800 32752.8827282.780.91
Tallymer63.7938.1449.1321 002 1774 520 842503 48019.1380391.770.94
GRF55.5835.5243.2915 443 068921 123447 9353.067728.852.40
WinMask38.0723.6030.4825 335 17115 108 496128 50659.1371770.600.32
phRAIDER7.823.315.361 810 76811 10853510.321301.073.40
Drosophila melanogaster
RepLoc86.0595.1686.1130 406 8253 624 759286 47610.9819 4111.750.73
Red83.0283.8080.4030 727 8185 722 087157 73018.1112 1603.281.00
Tallymer82.1182.7679.4028 758 8534 063 921163 67113.5618 8122.251.08
GRF70.0279.4768.1821 619 040412 980148 7661.2217208.882.80
WinMask49.5139.7247.0429 686 52315 055 68232 44450.6119 3810.680.32
phRAIDER37.2344.8136.7411 491 24564 10746 1700.1610963.383.90
Oryza sativa
RepLoc85.6977.2182.34185 011 07112 032 7314 111 0084.2877 4622.601.50
Tallymer77.1346.6368.42149 613 4475 873 8201 435 1822.9766 5769.323.00
Red73.0541.4964.08137 634 4182 999 8451 071 0691.4055 5369.182.40
GRF72.9550.7166.15143 570 2754 592 3641 807 7961.9422 61311.928.30
WinMask58.0728.4150.40120 138 22614 243 639209 24011.6838 2613.001.10
phRAIDER34.4214.8129.3161 742 598160 94976 3770.14753510.085.30
Danio rerio
RepLoc88.3291.4187.98814 834 12260 910 4508 919 5966.38416 85913.486.20
Tallymer87.2784.1286.33802 714 83062 891 3116 284 0387.05373 03844.6711.50
Red74.6275.2473.60644 011 24213 254 5723 338 0951.54334 42438.686.40
GRF73.4678.9672.91648 858 01924 069 7904 677 0472.99130 31965.5837.60
WinMask67.4152.1765.75646 304 38882 840 5101 870 44612.53203 96515.521.30
phRAIDER38.7051.6838.70335 782 2834 172 8591 320 2760.8588 89839.628.60
Zea mays
RepLoc97.8093.0894.361 842 724 70323 764 99314 207 7960.5259 01928.906.20
Tallymer97.1389.0091.561 784 439 41319 397 98110 361 8760.5155 60096.1019.30
GRF95.3288.6290.551 763 611 50618 136 51510 059 9940.4622 173147.4857.80
Red93.6682.4185.891 662 061 4916 330 7155 000 2580.0836 21940.479.10
WinMask82.3568.2772.801 408 960 3795 668 126971 8540.3315 56322.821.50
phRAIDER84.6670.3474.981 447 164 0021 868 0481 396 7250.0313 123115.9521.40
Mus musculus
RepLoc71.5789.4472.421 034 179 39685 509 55717 027 0746.62613 28123.8811.70
Tallymer71.0482.0670.471 015 198 89792 063 88912 607 0927.83565 20975.3224.00
Red66.6076.7365.57918 070 89159 121 01510 459 4665.30527 81778.538.80
GRF60.8179.9461.38900 984 31948 454 91410 742 4244.42270 293149.2264.00
WinMask62.4561.7560.03949 478 121163 016 9456 364 91016.50433 33028.931.40
phRAIDER43.7661.2644.04591 931 80214 963 2776 768 8161.38295 62182.3021.80
Homo sapiens
RepLoc67.3085.5767.891 190 032 803102 056 42828 757 0566.16716 46324.2211.70
Tallymer65.9363.7464.851 144 451 245105 103 16016 578 9237.74648 553149.4725.70
Red65.5457.2463.951 134 956 885110 011 69912 935 4728.55571 44886.409.30
GRF59.2067.6658.941 008 246 53163 625 90416 546 6264.67519 015177.80247.70
WinMask58.2938.5056.021 104 198 589206 414 0023 464 95818.38569 38532.471.50
phRAIDER25.2921.0424.34396 065 1815 948 7712 324 1050.92264 12860.0327.20
MethodSECRs (%)SESDs (%)SEARs (%)DEL (bp)URL (bp)PRL (bp)FDR (%)SPNumTime (min)Mem (GB)
Arabidopsis thaliana
RepLoc75.6462.6866.9727 792 2735 325 8831 412 75414.0877611.620.73
Red76.9956.2965.4148 266 60026 325 011800 32752.8827282.780.91
Tallymer63.7938.1449.1321 002 1774 520 842503 48019.1380391.770.94
GRF55.5835.5243.2915 443 068921 123447 9353.067728.852.40
WinMask38.0723.6030.4825 335 17115 108 496128 50659.1371770.600.32
phRAIDER7.823.315.361 810 76811 10853510.321301.073.40
Drosophila melanogaster
RepLoc86.0595.1686.1130 406 8253 624 759286 47610.9819 4111.750.73
Red83.0283.8080.4030 727 8185 722 087157 73018.1112 1603.281.00
Tallymer82.1182.7679.4028 758 8534 063 921163 67113.5618 8122.251.08
GRF70.0279.4768.1821 619 040412 980148 7661.2217208.882.80
WinMask49.5139.7247.0429 686 52315 055 68232 44450.6119 3810.680.32
phRAIDER37.2344.8136.7411 491 24564 10746 1700.1610963.383.90
Oryza sativa
RepLoc85.6977.2182.34185 011 07112 032 7314 111 0084.2877 4622.601.50
Tallymer77.1346.6368.42149 613 4475 873 8201 435 1822.9766 5769.323.00
Red73.0541.4964.08137 634 4182 999 8451 071 0691.4055 5369.182.40
GRF72.9550.7166.15143 570 2754 592 3641 807 7961.9422 61311.928.30
WinMask58.0728.4150.40120 138 22614 243 639209 24011.6838 2613.001.10
phRAIDER34.4214.8129.3161 742 598160 94976 3770.14753510.085.30
Danio rerio
RepLoc88.3291.4187.98814 834 12260 910 4508 919 5966.38416 85913.486.20
Tallymer87.2784.1286.33802 714 83062 891 3116 284 0387.05373 03844.6711.50
Red74.6275.2473.60644 011 24213 254 5723 338 0951.54334 42438.686.40
GRF73.4678.9672.91648 858 01924 069 7904 677 0472.99130 31965.5837.60
WinMask67.4152.1765.75646 304 38882 840 5101 870 44612.53203 96515.521.30
phRAIDER38.7051.6838.70335 782 2834 172 8591 320 2760.8588 89839.628.60
Zea mays
RepLoc97.8093.0894.361 842 724 70323 764 99314 207 7960.5259 01928.906.20
Tallymer97.1389.0091.561 784 439 41319 397 98110 361 8760.5155 60096.1019.30
GRF95.3288.6290.551 763 611 50618 136 51510 059 9940.4622 173147.4857.80
Red93.6682.4185.891 662 061 4916 330 7155 000 2580.0836 21940.479.10
WinMask82.3568.2772.801 408 960 3795 668 126971 8540.3315 56322.821.50
phRAIDER84.6670.3474.981 447 164 0021 868 0481 396 7250.0313 123115.9521.40
Mus musculus
RepLoc71.5789.4472.421 034 179 39685 509 55717 027 0746.62613 28123.8811.70
Tallymer71.0482.0670.471 015 198 89792 063 88912 607 0927.83565 20975.3224.00
Red66.6076.7365.57918 070 89159 121 01510 459 4665.30527 81778.538.80
GRF60.8179.9461.38900 984 31948 454 91410 742 4244.42270 293149.2264.00
WinMask62.4561.7560.03949 478 121163 016 9456 364 91016.50433 33028.931.40
phRAIDER43.7661.2644.04591 931 80214 963 2776 768 8161.38295 62182.3021.80
Homo sapiens
RepLoc67.3085.5767.891 190 032 803102 056 42828 757 0566.16716 46324.2211.70
Tallymer65.9363.7464.851 144 451 245105 103 16016 578 9237.74648 553149.4725.70
Red65.5457.2463.951 134 956 885110 011 69912 935 4728.55571 44886.409.30
GRF59.2067.6658.941 008 246 53163 625 90416 546 6264.67519 015177.80247.70
WinMask58.2938.5056.021 104 198 589206 414 0023 464 95818.38569 38532.471.50
phRAIDER25.2921.0424.34396 065 1815 948 7712 324 1050.92264 12860.0327.20

SECRs is the sensitivity to common repeats annotated by RepeatMasker. SESDs is the sensitivity to segmental duplications identified by SEDEF. SEARs is the sensitivity to all repeats merged from CRs and SDs. DEL is the total length of repeats detected by every tool in the table. URL is the total length of repeats detected by de novo tools but are unannotated by RepeatMasker or SEDEF. PRL is the total length of PRs that are not annotated but validated by BLAST. FDR is the false discovery rate. SPNum is the number of specifically detected repeats. WinMask is short for WindowMasker. The symbol ‘bp’ means base pair. ‘min’ is minute. ‘GB’ is gigabyte. The CPU cores used by RepLoc and GRF for A. thaliana, D. melanogaster, O. sativa, D. rerio, Z. mays, M. musculus and H. sapiens are 5, 7, 12, 20, 10, 20 and 20, respectively.

Close
This Feature Is Available To Subscribers Only

Sign In or Create an Account

Close

This PDF is available to Subscribers Only

View Article Abstract & Purchase Options

For full access to this pdf, sign in to an existing account, or purchase an annual subscription.

Close