We read with interest the recent article by Wang et al.1 regarding the potential role of variants in the gene UNC13B encoding the unc-13 homolog B protein in focal epilepsies, including one patient with benign occipital epilepsy and two with benign childhood epilepsy with centrotemporal spikes (BECTS). We are concerned that the published work1 contains methodological issues relating to restricted selection of controls from public databases and interpretation of splicing variants.

Essentially all variants of major effect described in epilepsies are ultra-rare (<0.0005 minor allele frequency), as defined by the Epi4K consortium, and typically absent from large databases (for gnomAD v2.1.1 currently >140 000 subjects).2–4 Inspection of the nine allele variants that the authors’ state in their abstract are ‘potentially associated with epilepsy’ reveals seven are not novel and are in fact present in the gnomAD (https://gnomad.broadinstitute.org/)3 and dbSNP (https://ncbi.nlm.nih.gov/snp/)5 databases (Table 1). We were surprised that the authors detected only one of these variants (c.2644G > T; p.Gly882Trp) in the sample of 10 640 Han Chinese individuals given the frequency of the respective variants within East Asian populations as reported in gnomAD.1,3 Furthermore, the only nonsense variant p.Trp45X, and the p.Gly882Trp missense variant, are both present in gnomAD (see Table 1) despite the authors reporting in their results that they are absent.1 These discrepancies arose because the authors restricted their UNC13B variant analysis to those identified in their patients with focal epilepsy to ‘Control-only’ samples in gnomAD. Since these ‘Control-only’ samples represent only 60 146 of 141 456 individuals on gnomAD (v2.1.1), we suggest that the additional gnomAD subsets of ‘Non-TOPMed’, ‘Non-cancer’ and ‘Non-neuro’, should also have been included in their analyses as these are also appropriate controls for a neurological disorder, as is done in conventional analyses. In general, whilst we acknowledge the gene burden analysis undertaken by the authors, we suggest greater rigor in design. The use of publicly available control databases in gene-set burden analyses, as opposed to utilizing matched control sets from other studies, without controlling for variability in sequencing platforms, analysis pipelines, or population ancestry and substructure, can lead to erroneous findings and interpretations, as has been described for previous studies.6

Table 1

Presence and frequency of UNC13B variants in gnomAD, dbSNP and other databases

dbSNPTranscriptaProteingnomADbgnomADcOther Databases
‘Control-only’ alleles (frequency)All alleles (frequency)Alleles (frequency) database
All populationsEast Asian subjectsAll populationsEast Asian subjects
rs1564081215c.135G > Ap.Trp45X001 (A = 0.000003986)00
rs780641411c.4008+1G > T2 (T = 0.00001828)2 (T = 0.0002211)4 (T = 0.00001591)4 (T = 0.0002175)2 (T = 0.000016) ExAC
N/Ac.4330+7G > A00000
rs140811719c.308C > Tp.Thr103Met15 (T = 0.0001248)11 (T = 0.001106)30 (T = 0.0001062)22 (T = 0.001104)18 (T = 0.000068) TOPMED2 (T = 0.0004) 1000Gh
16 (T = 0.000132) ExAC1 (T = 0.0003) TWINSUKi
18 (T = 0.00023) PAGEd11 (T = 0.0038) KOREAN
19 (T = 0.00113) KJPNe3 (T = 0.0016) Korea1Kj
1 (T = 0.00007) ALFAf1 (T = 0.005) Vietnamese
1 (T = 0.00008) GO-ESPg
rs373846094c.662G > Ap.Arg221Gln5 (A = 0.00004571)1 (A = 0.0001106)7 (A = 0.00002784)1 (A = 0.00005439)8 (A = 0.00048) KJPNe
3 (A = 0.0010) KOREAN
rs770750153c.1190C > Tp.Ser397Phe2 (T = 0.00001663)2 (T = 0.0002008)3 (T = 0.00001061)3 (T = 0.0001504)2 (T = 0.0000008) TOPMED
1 (T = 0.0000008) ExAC
rs371965791c.1981C > Tp.Arg661Cys5 (T = 0.00004571)2 (T = 0.0002211)13 (T = 0.00005174)8 (T = 0.0004351)8 (T = 0.000066) ExAC
2 (T = 0.00015) GO-ESPg
1 (T = 0.00006) KJPNe
1 (T = 0.00003) ALFAf
N/Ac.2381G > Ap.Gly794Asp00000
rs1037446397c.2644G > Tp.Gly882Trp001 (T = 0.00003184)1 (T = 0.0006410)1 (T = 0.000004) TOPMED
Total alleles: 59Total alleles: 39
dbSNPTranscriptaProteingnomADbgnomADcOther Databases
‘Control-only’ alleles (frequency)All alleles (frequency)Alleles (frequency) database
All populationsEast Asian subjectsAll populationsEast Asian subjects
rs1564081215c.135G > Ap.Trp45X001 (A = 0.000003986)00
rs780641411c.4008+1G > T2 (T = 0.00001828)2 (T = 0.0002211)4 (T = 0.00001591)4 (T = 0.0002175)2 (T = 0.000016) ExAC
N/Ac.4330+7G > A00000
rs140811719c.308C > Tp.Thr103Met15 (T = 0.0001248)11 (T = 0.001106)30 (T = 0.0001062)22 (T = 0.001104)18 (T = 0.000068) TOPMED2 (T = 0.0004) 1000Gh
16 (T = 0.000132) ExAC1 (T = 0.0003) TWINSUKi
18 (T = 0.00023) PAGEd11 (T = 0.0038) KOREAN
19 (T = 0.00113) KJPNe3 (T = 0.0016) Korea1Kj
1 (T = 0.00007) ALFAf1 (T = 0.005) Vietnamese
1 (T = 0.00008) GO-ESPg
rs373846094c.662G > Ap.Arg221Gln5 (A = 0.00004571)1 (A = 0.0001106)7 (A = 0.00002784)1 (A = 0.00005439)8 (A = 0.00048) KJPNe
3 (A = 0.0010) KOREAN
rs770750153c.1190C > Tp.Ser397Phe2 (T = 0.00001663)2 (T = 0.0002008)3 (T = 0.00001061)3 (T = 0.0001504)2 (T = 0.0000008) TOPMED
1 (T = 0.0000008) ExAC
rs371965791c.1981C > Tp.Arg661Cys5 (T = 0.00004571)2 (T = 0.0002211)13 (T = 0.00005174)8 (T = 0.0004351)8 (T = 0.000066) ExAC
2 (T = 0.00015) GO-ESPg
1 (T = 0.00006) KJPNe
1 (T = 0.00003) ALFAf
N/Ac.2381G > Ap.Gly794Asp00000
rs1037446397c.2644G > Tp.Gly882Trp001 (T = 0.00003184)1 (T = 0.0006410)1 (T = 0.000004) TOPMED
Total alleles: 59Total alleles: 39

N/A = not applicable.

a

Canonical transcript: ENST00000378495.7 (CCDS6579).

b

gnomAD v2.1.1 ‘Control-only’ subjects (n = 60 146)—analysed by Wang et al.1

c

all gnomAD v2.1.1 subjects (n = 141 456).

d

The Population Architecture using Genomics and Epidemiology (PAGE) study.

e

ToMMo 8.3KJPN Japanese Allele Frequency Panel.

f

NCBI Allele Frequency Aggregator.

g

NHBLI GO Exome Sequencing Project.

h

1000 Genomes Project.

i

TwinsUK Cohort.

j

Korean Genome Project.

Table 1

Presence and frequency of UNC13B variants in gnomAD, dbSNP and other databases

dbSNPTranscriptaProteingnomADbgnomADcOther Databases
‘Control-only’ alleles (frequency)All alleles (frequency)Alleles (frequency) database
All populationsEast Asian subjectsAll populationsEast Asian subjects
rs1564081215c.135G > Ap.Trp45X001 (A = 0.000003986)00
rs780641411c.4008+1G > T2 (T = 0.00001828)2 (T = 0.0002211)4 (T = 0.00001591)4 (T = 0.0002175)2 (T = 0.000016) ExAC
N/Ac.4330+7G > A00000
rs140811719c.308C > Tp.Thr103Met15 (T = 0.0001248)11 (T = 0.001106)30 (T = 0.0001062)22 (T = 0.001104)18 (T = 0.000068) TOPMED2 (T = 0.0004) 1000Gh
16 (T = 0.000132) ExAC1 (T = 0.0003) TWINSUKi
18 (T = 0.00023) PAGEd11 (T = 0.0038) KOREAN
19 (T = 0.00113) KJPNe3 (T = 0.0016) Korea1Kj
1 (T = 0.00007) ALFAf1 (T = 0.005) Vietnamese
1 (T = 0.00008) GO-ESPg
rs373846094c.662G > Ap.Arg221Gln5 (A = 0.00004571)1 (A = 0.0001106)7 (A = 0.00002784)1 (A = 0.00005439)8 (A = 0.00048) KJPNe
3 (A = 0.0010) KOREAN
rs770750153c.1190C > Tp.Ser397Phe2 (T = 0.00001663)2 (T = 0.0002008)3 (T = 0.00001061)3 (T = 0.0001504)2 (T = 0.0000008) TOPMED
1 (T = 0.0000008) ExAC
rs371965791c.1981C > Tp.Arg661Cys5 (T = 0.00004571)2 (T = 0.0002211)13 (T = 0.00005174)8 (T = 0.0004351)8 (T = 0.000066) ExAC
2 (T = 0.00015) GO-ESPg
1 (T = 0.00006) KJPNe
1 (T = 0.00003) ALFAf
N/Ac.2381G > Ap.Gly794Asp00000
rs1037446397c.2644G > Tp.Gly882Trp001 (T = 0.00003184)1 (T = 0.0006410)1 (T = 0.000004) TOPMED
Total alleles: 59Total alleles: 39
dbSNPTranscriptaProteingnomADbgnomADcOther Databases
‘Control-only’ alleles (frequency)All alleles (frequency)Alleles (frequency) database
All populationsEast Asian subjectsAll populationsEast Asian subjects
rs1564081215c.135G > Ap.Trp45X001 (A = 0.000003986)00
rs780641411c.4008+1G > T2 (T = 0.00001828)2 (T = 0.0002211)4 (T = 0.00001591)4 (T = 0.0002175)2 (T = 0.000016) ExAC
N/Ac.4330+7G > A00000
rs140811719c.308C > Tp.Thr103Met15 (T = 0.0001248)11 (T = 0.001106)30 (T = 0.0001062)22 (T = 0.001104)18 (T = 0.000068) TOPMED2 (T = 0.0004) 1000Gh
16 (T = 0.000132) ExAC1 (T = 0.0003) TWINSUKi
18 (T = 0.00023) PAGEd11 (T = 0.0038) KOREAN
19 (T = 0.00113) KJPNe3 (T = 0.0016) Korea1Kj
1 (T = 0.00007) ALFAf1 (T = 0.005) Vietnamese
1 (T = 0.00008) GO-ESPg
rs373846094c.662G > Ap.Arg221Gln5 (A = 0.00004571)1 (A = 0.0001106)7 (A = 0.00002784)1 (A = 0.00005439)8 (A = 0.00048) KJPNe
3 (A = 0.0010) KOREAN
rs770750153c.1190C > Tp.Ser397Phe2 (T = 0.00001663)2 (T = 0.0002008)3 (T = 0.00001061)3 (T = 0.0001504)2 (T = 0.0000008) TOPMED
1 (T = 0.0000008) ExAC
rs371965791c.1981C > Tp.Arg661Cys5 (T = 0.00004571)2 (T = 0.0002211)13 (T = 0.00005174)8 (T = 0.0004351)8 (T = 0.000066) ExAC
2 (T = 0.00015) GO-ESPg
1 (T = 0.00006) KJPNe
1 (T = 0.00003) ALFAf
N/Ac.2381G > Ap.Gly794Asp00000
rs1037446397c.2644G > Tp.Gly882Trp001 (T = 0.00003184)1 (T = 0.0006410)1 (T = 0.000004) TOPMED
Total alleles: 59Total alleles: 39

N/A = not applicable.

a

Canonical transcript: ENST00000378495.7 (CCDS6579).

b

gnomAD v2.1.1 ‘Control-only’ subjects (n = 60 146)—analysed by Wang et al.1

c

all gnomAD v2.1.1 subjects (n = 141 456).

d

The Population Architecture using Genomics and Epidemiology (PAGE) study.

e

ToMMo 8.3KJPN Japanese Allele Frequency Panel.

f

NCBI Allele Frequency Aggregator.

g

NHBLI GO Exome Sequencing Project.

h

1000 Genomes Project.

i

TwinsUK Cohort.

j

Korean Genome Project.

There are also concerns with the interpretation of some variants as pathogenic by the authors without supporting functional analysis.1 The c.4008+1G > T variant does affect an invariant splice site position but is observed four times in East Asian subjects on gnomAD (Table 1), and is inherited from an unaffected mother as noted by the authors.1 The other splice site variant (c.4330+7G > A) does not affect a residue within the consensus donor splice site sequence, and occurs at a site not typically associated with disease-causing variation.7

While we cannot exclude that UNC13B variants may be population-specific risk factors for focal epilepsies, the presence of the reported variants in East Asian individuals at frequencies higher than would be expected for rare dominant alleles known to cause epilepsies2,4 draws the authors conclusions into question. Further analyses of larger cohorts with careful genotype-phenotype correlation are required to confirm any potential disease associations of UNC13B with seizure disorders.

Data availability

Data sharing is not applicable to this article as no new data were created or analysed in this study.

Competing interests

The authors report no competing interests.

References

1

Wang
J
,
Qiao
J-D
,
Liu
XR
, et al.
UNC13B variants associated with partial epilepsy with favourable outcome
.
Brain.
2021
;
144
(
10
):
3050
3060
.

2

Epi4K Consortium
.
Ultra-rare genetic variation in common epilepsies: a case-control sequencing study
.
Lancet Neurol.
2017
;
16
:
135
143
.

3

Karczewski
KJ
,
Francioli
LC
,
Tiao
G
, et al.
The mutational constraint spectrum quantified from variation in 141,456 humans
.
Nature.
2020
;
581
:
434
443
.

4

Bennett
CA
,
Petrovski
S
,
Oliver
KL
,
Berkovic
SF
.
ExACtly zero or once: A clinically helpful guide to assessing genetic variants in mild epilepsies
.
Neurol. Genet.
2017
;
3
:
e163
.

5

Sherry
ST
,
Ward
M
,
Sirotkin
K
.
dbSNP–Database for single nucleotide polymorphisms and other classes of minor genetic variation
.
Genome Res.
1999
;
9
:
677
679
.

6

Guo
MH
,
Plummer
L
,
Chan
Y-M
,
Hirschhorn
JN
,
Lippincott
MF
.
Burden testing of rare variants Identified through exome sequencing via publicly available control data
.
Am J Hum Genet.
2018
;
103
:
522
534
.

7

Shapiro
MB
,
Senapathy
P
.
RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression
.
Nucleic Acids Res.
1987
;
15
:
7155
7174
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)