Abstract

The process of domestication has led to dramatic differences in behavioral traits between domestic dogs and gray wolves. Whole-genome research found that a class of putative positively selected genes were related to various aspects of learning and memory, such as long-term potentiation and long-term depression. In this study, we constructed a single-nucleus transcriptomic atlas of the dog hippocampus to illustrate its cell types, cell lineage and molecular features. Using the transcriptomes of 105 057 nuclei from the hippocampus of a Beagle dog, we identified 26 cell clusters and a putative trajectory of oligodendrocyte development. Comparative analysis revealed a significant convergence between dog differentially expressed genes (DEGs) and putative positively selected genes (PSGs). Forty putative PSGs were DEGs in glutamatergic neurons, especially in Cluster 14, which is related to the regulation of nervous system development. In summary, this study provides a blueprint to understand the cellular mechanism of dog domestication.

INTRODUCTION

The domestication process has led to dramatic differences in behavioral and morphological traits between domesticated animals and their wild ancestors [1–3]. Domestic dogs have undergone parallel evolution and convergent evolution with humans in the history of early hunting–gathering and the recent change of living environments from agrarian societies to modern cities [4–8]; in this process, dogs have undergone strong artificial selection, resulting in ∼450 globally recognized breeds, which makes them the most variable mammalian species on Earth [9–11]. Genome-wide scans for positive selection revealed that the behavioral and neurological traits likely changed subsequently with the process of domestication [12]. For instance, putative positively selected genes (PSGs) in dogs were reported to be linked to neural crest and central nervous system (CNS) development [13]. Gene expression in brains showed a strong difference in putative PSGs in neurons on learning, memory and behavior between domestic animals and their wild relatives [14,15].

The hippocampus is an important part of the limbic system in the brain that has an essential role in memory [16]. A class of domesticated genes is related to hippocampal synaptic long-term potentiation and long-term depression [5]. Single-cell RNA sequencing (scRNA-seq) has been used to identify the cell-type atlas [17–19], reveal cellular and molecular dynamics in neurogenesis [20] and decode hippocampal development and evolution [21]. It is worth noting that dozens of putative PSGs in dogs were related to cell types of both excitatory and inhibitory neurons and contributed to the connectivity and development of synapses and dendrites [13,15].

In the present study, we collected single nuclei from the hippocampus of a Beagle dog and performed a comprehensive analysis of the transcriptomic profiles of 105 057 nuclei. We defined 26 cell clusters and identified a set of marker genes for each cell type. Cross-species comparative analysis showed that the major cell types in the hippocampus were highly conserved between dogs and humans. More interestingly, 86 of the differentially expressed genes (DEGs) were putative PSGs during dog domestication. The comprehensive cell landscape of the dog hippocampus could help us establish correspondence between cell types in the nervous system and putative PSGs in dogs and facilitate the understanding of the molecular features of cells during domestication.

RESULTS

Single-nucleus transcriptomics of the dog hippocampus constructed using SPLiT-seq

A standardized single-nucleus RNA-sequencing (snRNA-seq) pipeline was built using SPLiT-seq [22]. The snRNA-seq data of single nuclei from the hippocampus of a 5-month-old Beagle dog were obtained with random and oligo(dT) primers. After detailed preprocessing and filtering (see ‘Methods’ section), we created a digital expression matrix of 105 057 single nuclei with a median of 804 genes and 1109 counts per nucleus. To remove the potential batch effects, partial principal component analysis (partial-PCA, see ‘Methods’ section) was performed instead of the classical PCA and then the uniform manifold approximation and projection method (UMAP) was applied to project these two batches of transcripts into the common comparable 2D space (Supplementary Fig. S1A). Furthermore, the Leiden community detection algorithm was employed to group these cells into 26 cell clusters (Fig. 1A).

Single-nucleus transcriptomic landscape of the dog hippocampus. (A) Clustering and visualization of transcriptomes of 105 057 single cells by UMAP. In this article, ‘cluster’ means a group of cells and is related to the cluster analysis result. ‘Cell type’ is a group of clusters and we defined these clusters as the same cell types. (B) Dot plot showing the averaged DEG expression (z-cores) in different clusters (Supplementary Table S12). (C) Visualization of CCBE1 by UMAP. (D) CCBE1 expressed in the dog hippocampus revealed by immunohistochemical staining. (E) Visualization of GAD2 by UMAP. (F) GAD2 expression in the dog hippocampus revealed by immunofluorescent staining. Red: GAD2; blue: DAPI; white square indicates region boxed had higher-magnification view. (G) Visualization of NDNF by UMAP. (H) NDNF expression in the dog hippocampus revealed by immunohistochemical staining. Red points indicate cells that were found expressed DEGs. DG, dentate gyrus; GCL, granule cell layer, SGZ, subgranular zone; ML, molecular layer; CA, cornu ammonis; SO, stratum oriens; SP, stratum pyramidale; SR, stratum radiatum; SLM, stratum lacunosum-moleculare; Sub, subiculum; HF, hippocampus fissure; A, alveus.
Figure 1.

Single-nucleus transcriptomic landscape of the dog hippocampus. (A) Clustering and visualization of transcriptomes of 105 057 single cells by UMAP. In this article, ‘cluster’ means a group of cells and is related to the cluster analysis result. ‘Cell type’ is a group of clusters and we defined these clusters as the same cell types. (B) Dot plot showing the averaged DEG expression (z-cores) in different clusters (Supplementary Table S12). (C) Visualization of CCBE1 by UMAP. (D) CCBE1 expressed in the dog hippocampus revealed by immunohistochemical staining. (E) Visualization of GAD2 by UMAP. (F) GAD2 expression in the dog hippocampus revealed by immunofluorescent staining. Red: GAD2; blue: DAPI; white square indicates region boxed had higher-magnification view. (G) Visualization of NDNF by UMAP. (H) NDNF expression in the dog hippocampus revealed by immunohistochemical staining. Red points indicate cells that were found expressed DEGs. DG, dentate gyrus; GCL, granule cell layer, SGZ, subgranular zone; ML, molecular layer; CA, cornu ammonis; SO, stratum oriens; SP, stratum pyramidale; SR, stratum radiatum; SLM, stratum lacunosum-moleculare; Sub, subiculum; HF, hippocampus fissure; A, alveus.

The DEGs (P-value < 0.001 and log2(fold change) > 0.25) of each cell cluster were used to assign most of the cells to specific cell types based on the known markers in mice and humans [17,22–25] (Supplementary Table S1 and Supplementary Fig. S1B). Glutamatergic neurons, GABAergic neurons, Cajal-Retzius cells, oligodendrocyte precursor cells (OPCs), myelinating oligodendrocytes, endothelial cells, astrocytes and microglia cells were detected in the hippocampus of the Beagle at proportions of 55.82%, 12.80%, 0.14%, 3.14%, 11.68%, 0.30%, 11.82% and 0.30%, respectively (Fig. 1B and Supplementary Fig. S1B–D).

Among the clusters, 12 clusters (0, 3, 5, 6, 7, 10, 11, 13, 14, 15, 17 and 20) were identified as the glutamatergic neurons, which produce the most common excitatory neurotransmitter in the CNS [26]. Although Clusters 3, 5 and 6 had a similar expression pattern, there were unique DEGs expressed in each cluster. For instance, both Clusters 3 and 6 had high expression of GRM7, TENM2 and CACNA1E, while Cluster 3 had no expression of STMN2 or NEFL (Fig. 1B). CCBE1 is a specific marker gene specifically in Cluster 0 (Fig. 1C). It marked the canine hippocampus cornu ammonis 3 (CA3) and stratum pyramidale (SP) (Fig. 1D) as the same in the mouse [27]. The GO enrichment analysis showed that Clusters 7 and 13 were enriched in the GO term cognition (GO : 0050890, Supplementary Table S2), Clusters 11 and 20 were enriched in behavior (GO : 0007610, Supplementary Table S2) and Cluster 13 was enriched in adult locomotory behavior (GO : 0008344, Supplementary Table S2).

Four clusters (4, 8, 12 and 21) are GABAergic neurons related to a kind of inhibitory neurotransmitter in the CNS, indicated by the common marker gene GAD2 [23] (Fig. 1E). Immunofluorescence (IF) analysis showed that GAD2 was highly expressed in the granule cell layer (GCL) and molecular layer (ML) of the dentate gyrus (DG) area and in the stratum lacunosum-moleculare (SLM) and SP of the CA (cornu ammonis) area (Fig. 1F). The GO enrichment analysis illustrated that four GABAergic neuron clusters were enriched in behavior (GO : 0007610, Supplementary Table S3). Cluster 25 was identified as the Cajal-Retzius cell, which plays a major role in cortical development [28,29]. The marker gene of the cell type was NDNF (Fig. 1G and H), which was also expressed in Cajal-Retzius cells in humans and mice [22,30]. Beyond the neuron cell clusters described above, another five clusters represented non-neuronal cells. Clusters 19 and 22 were unidentified types. Clusters 1, 23 and 24 were identified as astrocytes, endothelial cells and microglia cells, respectively.

Cross-species comparison between dog and human hippocampus

To validate the cell-type annotations and explore the conservation of the hippocampus, we performed a cross-species comparison between dog and human hippocampus using human hippocampus scRNA-seq data from Zhong et al. [21]. We used Harmony [31] to integrate these data sets based on the one-to-one homologous genes between dogs and humans. We found that most of the homologous major cell types were consistent and located nearby on the UMAP plots (Fig. 2A–C). The homologous DEG overlaps between each pair of major dog and human types also revealed a similar correspondence (Fig. 2D). In particular, five typical markers are shared between dog and humans (Supplementary Table S4). GAD1-expressing inhibitory neurons are associated with human developmental and epileptic encephalopathy [32]. MBP marks oligodendrocytes and encodes the protein of the myelin membrane in the CNS [33] and several studies have shown that MBP is a biomarker for multiple sclerosis [34]. AQP4, an astrocyte marker, plays an important role in brain water homeostasis [35]. SPARC inhibits mitogenic effects in microvascular endothelial cells [36]. PTPRC (also known as CD45) marks microglia cells [37]. In addition, we applied the cross-species cell-typing and integrative tool CAME [38] to predict the major cell types of dog hippocampal cells (the query) with human snRNA-seq data as the reference (Fig. 2E). We also performed the reverse prediction, i.e. querying the human cells using dog data as the reference (Fig. 2F). We found that the cross-species cell-type predictions of CAME were consistent with the results of Harmony integration and the DEG comparisons. These results suggest that the cell-type annotations of the dog hippocampus are reasonable and that the major types in the hippocampus are highly conserved between dogs and humans.

Cross-species comparison between dog and human hippocampal cells. (A)–(C) The UMAP visualization of dog and human hippocampal cells in the integrated space (by Harmony), colored by species (A), the major-type annotations of dog (B) and human (C). (D) The one-to-one homologous overlaps between the DEG sets of dogs and humans. (E) The predicted cell-type probabilities of each cell in the dog hippocampus, output by CAME, using the human data as the reference. (F) The predicted cell-type probabilities of each cell in the human hippocampus, output by CAME, using the dog data as the reference. Oligo-Astro (mix): a mix of oligodendrocytes and astrocytes; Oligo (mix): a mixing of oligodendrocytes; Oligo: oligodendrocytes.
Figure 2.

Cross-species comparison between dog and human hippocampal cells. (A)–(C) The UMAP visualization of dog and human hippocampal cells in the integrated space (by Harmony), colored by species (A), the major-type annotations of dog (B) and human (C). (D) The one-to-one homologous overlaps between the DEG sets of dogs and humans. (E) The predicted cell-type probabilities of each cell in the dog hippocampus, output by CAME, using the human data as the reference. (F) The predicted cell-type probabilities of each cell in the human hippocampus, output by CAME, using the dog data as the reference. Oligo-Astro (mix): a mix of oligodendrocytes and astrocytes; Oligo (mix): a mixing of oligodendrocytes; Oligo: oligodendrocytes.

Putative trajectory analysis reveals the conserved oligodendrocyte development trajectory

With the defined cell-type markers, Clusters 2 and 9 were inferred to be myelinating oligodendrocytes and OPCs, respectively (Fig. 3A). Consistently with this classification, CNP, a myelin-related marker gene of Cluster 2 [39], was highly expressed in the hilus, ML, SLM, F and A areas (Fig. 3B). AGAP1 was a DEG in Cluster 9 and could be used as a new marker of OPCs (Fig. 3B). Cluster 16 was linked to Clusters 2 and 9 (Fig. 1A), with few specifically expressed genes to assign its identity. Therefore, we inferred that Clusters 2, 9 and 16 might form a development trajectory from OPCs to myelinating oligodendrocytes, which was also found in the mouse hippocampus [22].

Putative trajectory analysis and cross-species transcriptome comparison between dog and mouse oligodendrocyte. (A) UMAP embedding of oligodendrocyte subclusters. Red indicates cells of Clusters 2, 16 and 9. (B) Staining of AGAP1 and CNP of dog brain at hippocampus. Red: AGAP1 (top) and CNP (bottom), blue: DAPI; white square indicates region boxed had higher-magnification view. (C) UMAP embedding of OPCs putative trajectory. Cells are colored by pseudotime, with dark colors representing mature cell stages and light colors representing immature cell stages. (D) GO enrichment analysis of OPCs differentiation clusters. The color scale represents adjusted P-value; dot size represents the gene ratio. (E) UMAP visualization of 20 989 single cells (15 920 from dog and 5069 from mouse) analysed by LIGER, color-coded by cell types.
Figure 3.

Putative trajectory analysis and cross-species transcriptome comparison between dog and mouse oligodendrocyte. (A) UMAP embedding of oligodendrocyte subclusters. Red indicates cells of Clusters 2, 16 and 9. (B) Staining of AGAP1 and CNP of dog brain at hippocampus. Red: AGAP1 (top) and CNP (bottom), blue: DAPI; white square indicates region boxed had higher-magnification view. (C) UMAP embedding of OPCs putative trajectory. Cells are colored by pseudotime, with dark colors representing mature cell stages and light colors representing immature cell stages. (D) GO enrichment analysis of OPCs differentiation clusters. The color scale represents adjusted P-value; dot size represents the gene ratio. (E) UMAP visualization of 20 989 single cells (15 920 from dog and 5069 from mouse) analysed by LIGER, color-coded by cell types.

To validate this hypothesis, we constructed the single-nucleus trajectories using Slingshot [40] based on the recalculated UMAP embedding and the original cluster labels (Fig. 3C). As a result, PTPRZ1, the marker gene for precursor cells that maintained OPCs in an undifferentiated state [41], was expressed in Cluster 9; SOX6 was expressed in Cluster 16 and repressed oligodendrocyte differentiation [42]; PLP1 was another myelin-related gene expressed in Cluster 2 (Supplementary Fig. S2A). The GO enrichment analysis showed that oligodendrocytes (Clusters 2, 9 and 16) were involved in different biological pathways (Fig. 3D). The DEGs in Cluster 9 (OPCs) were related to neuron differentiation (GO : 0030182, Supplementary Table S5) and regulation of neurogenesis (GO : 0050767, Supplementary Table S5). Those in Cluster 16 were related to cell projection morphogenesis (GO : 0048858, Supplementary Table S5) and neuron projection development (GO : 0031175, Supplementary Table S5). Those in Cluster 2 were enriched in ensheathment of neurons (GO : 0007272, Supplementary Table S5) and axon ensheathment (GO : 0008366, Supplementary Table S5). The DEGs of the putative trajectory were consistent with the differentiation process of oligodendrocyte development and we also verified this result in mouse data (Fig. 3E and Supplementary Fig. S2B–E). Additionally, we combined the data with single-cell transcriptomes from the human hippocampus for comparative analysis and validation [21]. The results showed that the hippocampus followed a developmental trajectory from OPCs to oligodendrocytes in humans and dog (Fig. 2A–C). All these results indicate that humans, dog and mice have a similar trajectory of oligodendrocyte development.

Significant convergence between DEGs and putative PSGs in domestication

Previous studies showed that dog putative PSGs were enriched in neurological processes such as learning and memory [5,43] and expressed specifically in brain tissues [14]. To identify which types of cells play an important role during domestication, we gathered the DEGs in the 26 clusters (with P-value < 0.001 and log2(fold change) > 0.25) and compared them with the putative PSGs from published studies [5,13,43–45]. The genes expressed in more than three cells in the dog hippocampus were used as the background (see ‘Methods’ section). The results showed that 630 of 841 putative PSGs were detected in the hippocampus single-nucleus transcriptomes (Supplementary Table S6 and Supplementary Fig. S3). Eighty-six of 630 detected putative PSGs occurred in the 1628 DEGs with a statistical significance with a P-value of 5.10E-09 (Table 1). However, the 1000 random gene sets with equal sizes were not significant, with a P-value of 0.47. Since Freedman et al. (2016) performed a statistical analysis to determine the likelihood that specific genes were under selection, we used their putative PSGs to verify the statistical significance. As a result, 16 genes occurred in both DEGs and putative PSGs with a P-value of 6.99E-03. Furthermore, there was also a significant overlap between DEGs and the putative PSGs reported in at least two published studies (P-value = 2.45E-07, Supplementary Table S7).

Table 1.

Overlapped genes between putative PSGs and DEGs.

Cell typeClusterGene
Glutamatergic neurons0, 3, 5, 6, 7, 10, 11, 13, 14, 15, 17, 20ADCY8, ARHGAP32, ASAP1a, CACNA1A, CADM2a, CADPS2a, CCBE1, CNTN5, CUX2a, DGKI, EEF1A1a, EML6, FOXP2a, FSTL4, GLIS1, GRIK3, HAPLN4, KCNJ3, KCNMA1a, KLF12a, LRRTM3a, NRXN3a, NTNG1, PIK3R1, PPM1E, PRKCAa, PRRC2B, RFX3a, RIMS2a, RYR3, SATB2, SBF2, SCN2A, SYN2a, TCF4a, TIAM1, TMEM132Da, TMEM59L, YWHAH, ZMAT4a
GABAergic neuron4, 8, 12, 21CADPS2, CUX2, GABRA4, GAD2, KCNC2a, NRXN3, PRKCA, ROR1, SYN2, TCF4, ZMAT4
Oligodendrocytes2, 9, 16, 18AGAP1, ANKRD44a, ASAP1, BAZ2Ba, CADM2, CCSER2, CLIC4, COL11A1, ENSCAFG00000031463, FAM107B, HIPK2a, ITCH, KLF12, LIMCH1, LRRTM3, MAP3K1, MBP, PLXDC2a, PRKCA, RALGAPA2a, REEP3, RIMS2, SAMD12, TMEM132D
Microglia24ANKRD44, BAZ2B, DOCK2, ENTPD1, IKZF1, KLF12, MRC2, PLXDC2, SHB, SLCO2B1a, SSH2, TBXAS1
Other1, 19, 22, 23, 25AHCYL2, ATXN2, CAPS2, CUX2, DNAH3, EEF1A1, FHOD3, FOXP2, GNA12, HIPK2, HYDIN, ITGB1, KCNC2, KCNMA1, KIAA1328, MYOF, PDE7B, PPM1H, PSD2, RALGAPA2, RBPMS, RFTN2, RFX3, SLCO2B1, TOM1L2, WDR66
Cell typeClusterGene
Glutamatergic neurons0, 3, 5, 6, 7, 10, 11, 13, 14, 15, 17, 20ADCY8, ARHGAP32, ASAP1a, CACNA1A, CADM2a, CADPS2a, CCBE1, CNTN5, CUX2a, DGKI, EEF1A1a, EML6, FOXP2a, FSTL4, GLIS1, GRIK3, HAPLN4, KCNJ3, KCNMA1a, KLF12a, LRRTM3a, NRXN3a, NTNG1, PIK3R1, PPM1E, PRKCAa, PRRC2B, RFX3a, RIMS2a, RYR3, SATB2, SBF2, SCN2A, SYN2a, TCF4a, TIAM1, TMEM132Da, TMEM59L, YWHAH, ZMAT4a
GABAergic neuron4, 8, 12, 21CADPS2, CUX2, GABRA4, GAD2, KCNC2a, NRXN3, PRKCA, ROR1, SYN2, TCF4, ZMAT4
Oligodendrocytes2, 9, 16, 18AGAP1, ANKRD44a, ASAP1, BAZ2Ba, CADM2, CCSER2, CLIC4, COL11A1, ENSCAFG00000031463, FAM107B, HIPK2a, ITCH, KLF12, LIMCH1, LRRTM3, MAP3K1, MBP, PLXDC2a, PRKCA, RALGAPA2a, REEP3, RIMS2, SAMD12, TMEM132D
Microglia24ANKRD44, BAZ2B, DOCK2, ENTPD1, IKZF1, KLF12, MRC2, PLXDC2, SHB, SLCO2B1a, SSH2, TBXAS1
Other1, 19, 22, 23, 25AHCYL2, ATXN2, CAPS2, CUX2, DNAH3, EEF1A1, FHOD3, FOXP2, GNA12, HIPK2, HYDIN, ITGB1, KCNC2, KCNMA1, KIAA1328, MYOF, PDE7B, PPM1H, PSD2, RALGAPA2, RBPMS, RFTN2, RFX3, SLCO2B1, TOM1L2, WDR66

aIndicates this gene appears in different cell types, i.e. CUX2 in Glutamatergic neurons, GABAergic neurons and non-neurons.

Table 1.

Overlapped genes between putative PSGs and DEGs.

Cell typeClusterGene
Glutamatergic neurons0, 3, 5, 6, 7, 10, 11, 13, 14, 15, 17, 20ADCY8, ARHGAP32, ASAP1a, CACNA1A, CADM2a, CADPS2a, CCBE1, CNTN5, CUX2a, DGKI, EEF1A1a, EML6, FOXP2a, FSTL4, GLIS1, GRIK3, HAPLN4, KCNJ3, KCNMA1a, KLF12a, LRRTM3a, NRXN3a, NTNG1, PIK3R1, PPM1E, PRKCAa, PRRC2B, RFX3a, RIMS2a, RYR3, SATB2, SBF2, SCN2A, SYN2a, TCF4a, TIAM1, TMEM132Da, TMEM59L, YWHAH, ZMAT4a
GABAergic neuron4, 8, 12, 21CADPS2, CUX2, GABRA4, GAD2, KCNC2a, NRXN3, PRKCA, ROR1, SYN2, TCF4, ZMAT4
Oligodendrocytes2, 9, 16, 18AGAP1, ANKRD44a, ASAP1, BAZ2Ba, CADM2, CCSER2, CLIC4, COL11A1, ENSCAFG00000031463, FAM107B, HIPK2a, ITCH, KLF12, LIMCH1, LRRTM3, MAP3K1, MBP, PLXDC2a, PRKCA, RALGAPA2a, REEP3, RIMS2, SAMD12, TMEM132D
Microglia24ANKRD44, BAZ2B, DOCK2, ENTPD1, IKZF1, KLF12, MRC2, PLXDC2, SHB, SLCO2B1a, SSH2, TBXAS1
Other1, 19, 22, 23, 25AHCYL2, ATXN2, CAPS2, CUX2, DNAH3, EEF1A1, FHOD3, FOXP2, GNA12, HIPK2, HYDIN, ITGB1, KCNC2, KCNMA1, KIAA1328, MYOF, PDE7B, PPM1H, PSD2, RALGAPA2, RBPMS, RFTN2, RFX3, SLCO2B1, TOM1L2, WDR66
Cell typeClusterGene
Glutamatergic neurons0, 3, 5, 6, 7, 10, 11, 13, 14, 15, 17, 20ADCY8, ARHGAP32, ASAP1a, CACNA1A, CADM2a, CADPS2a, CCBE1, CNTN5, CUX2a, DGKI, EEF1A1a, EML6, FOXP2a, FSTL4, GLIS1, GRIK3, HAPLN4, KCNJ3, KCNMA1a, KLF12a, LRRTM3a, NRXN3a, NTNG1, PIK3R1, PPM1E, PRKCAa, PRRC2B, RFX3a, RIMS2a, RYR3, SATB2, SBF2, SCN2A, SYN2a, TCF4a, TIAM1, TMEM132Da, TMEM59L, YWHAH, ZMAT4a
GABAergic neuron4, 8, 12, 21CADPS2, CUX2, GABRA4, GAD2, KCNC2a, NRXN3, PRKCA, ROR1, SYN2, TCF4, ZMAT4
Oligodendrocytes2, 9, 16, 18AGAP1, ANKRD44a, ASAP1, BAZ2Ba, CADM2, CCSER2, CLIC4, COL11A1, ENSCAFG00000031463, FAM107B, HIPK2a, ITCH, KLF12, LIMCH1, LRRTM3, MAP3K1, MBP, PLXDC2a, PRKCA, RALGAPA2a, REEP3, RIMS2, SAMD12, TMEM132D
Microglia24ANKRD44, BAZ2B, DOCK2, ENTPD1, IKZF1, KLF12, MRC2, PLXDC2, SHB, SLCO2B1a, SSH2, TBXAS1
Other1, 19, 22, 23, 25AHCYL2, ATXN2, CAPS2, CUX2, DNAH3, EEF1A1, FHOD3, FOXP2, GNA12, HIPK2, HYDIN, ITGB1, KCNC2, KCNMA1, KIAA1328, MYOF, PDE7B, PPM1H, PSD2, RALGAPA2, RBPMS, RFTN2, RFX3, SLCO2B1, TOM1L2, WDR66

aIndicates this gene appears in different cell types, i.e. CUX2 in Glutamatergic neurons, GABAergic neurons and non-neurons.

To explore whether putative PSGs have higher or lower specificity across clusters or cell types, we calculated the entropy of putative PSGs. The higher entropy-specificity score for a gene implies its uniqueness in specific clusters or cell types [46]. We found that the putative PSGs that occurred in more than one study have significantly higher entropy specificity across both clusters and cell types compared with the background (Table 2). To determine which cluster was highly enriched with putative PSGs, we performed enrichment analysis for each cluster-specific in the putative PSG set (Table 3). Four clusters (Clusters 2, 9, 14 and 24) were significantly enriched in three putative PSG sets and six clusters (Clusters 0, 8, 11, 16, 17 and 20) were enriched in two putative PSG sets (with P-value < 0.05).

Table 2.

The entropy of putative PSGs in different cell types and cluster.

Gene-specificity based on the average expressions of clusterGene-specificity based on the average expressions in a cell type
Proportion-cutPSG-841PSG-78PSG-145PSG-841PSG-78PSG-145
PSG01.57E-011.46E-011.85E-012.11E-011.94E-012.36E-01
bg02.47E-012.45E-012.45E-013.31E-013.28E-013.28E-01
PSG0.17.96E-021.26E-011.05E-011.01E-011.52E-011.27E-01
bg0.19.36E-029.28E-029.29E-021.20E-011.19E-011.19E-01
PSG0.27.43E-021.23E-018.97E-029.21E-021.52E-011.10E-01
bg0.28.97E-028.87E-028.89E-021.14E-011.13E-011.13E-01
PSG0.257.00E-021.07E-019.32E-028.68E-021.34E-011.15E-01
bg0.259.00E-028.88E-028.89E-021.14E-011.13E-011.13E-01
Gene-specificity based on the average expressions of clusterGene-specificity based on the average expressions in a cell type
Proportion-cutPSG-841PSG-78PSG-145PSG-841PSG-78PSG-145
PSG01.57E-011.46E-011.85E-012.11E-011.94E-012.36E-01
bg02.47E-012.45E-012.45E-013.31E-013.28E-013.28E-01
PSG0.17.96E-021.26E-011.05E-011.01E-011.52E-011.27E-01
bg0.19.36E-029.28E-029.29E-021.20E-011.19E-011.19E-01
PSG0.27.43E-021.23E-018.97E-029.21E-021.52E-011.10E-01
bg0.28.97E-028.87E-028.89E-021.14E-011.13E-011.13E-01
PSG0.257.00E-021.07E-019.32E-028.68E-021.34E-011.15E-01
bg0.259.00E-028.88E-028.89E-021.14E-011.13E-011.13E-01

bg, background genes that were expressed by at least a proportion of ‘proportion-cut’ in any cell cluster to filter out those genes having low expression frequencies; PSG-841, putative PSGs in five articles; PSG-78, putative PSGs occurred in no fewer than two articles; PS-145, putative PSGs occurred in Freedman et al. 2016.

Table 2.

The entropy of putative PSGs in different cell types and cluster.

Gene-specificity based on the average expressions of clusterGene-specificity based on the average expressions in a cell type
Proportion-cutPSG-841PSG-78PSG-145PSG-841PSG-78PSG-145
PSG01.57E-011.46E-011.85E-012.11E-011.94E-012.36E-01
bg02.47E-012.45E-012.45E-013.31E-013.28E-013.28E-01
PSG0.17.96E-021.26E-011.05E-011.01E-011.52E-011.27E-01
bg0.19.36E-029.28E-029.29E-021.20E-011.19E-011.19E-01
PSG0.27.43E-021.23E-018.97E-029.21E-021.52E-011.10E-01
bg0.28.97E-028.87E-028.89E-021.14E-011.13E-011.13E-01
PSG0.257.00E-021.07E-019.32E-028.68E-021.34E-011.15E-01
bg0.259.00E-028.88E-028.89E-021.14E-011.13E-011.13E-01
Gene-specificity based on the average expressions of clusterGene-specificity based on the average expressions in a cell type
Proportion-cutPSG-841PSG-78PSG-145PSG-841PSG-78PSG-145
PSG01.57E-011.46E-011.85E-012.11E-011.94E-012.36E-01
bg02.47E-012.45E-012.45E-013.31E-013.28E-013.28E-01
PSG0.17.96E-021.26E-011.05E-011.01E-011.52E-011.27E-01
bg0.19.36E-029.28E-029.29E-021.20E-011.19E-011.19E-01
PSG0.27.43E-021.23E-018.97E-029.21E-021.52E-011.10E-01
bg0.28.97E-028.87E-028.89E-021.14E-011.13E-011.13E-01
PSG0.257.00E-021.07E-019.32E-028.68E-021.34E-011.15E-01
bg0.259.00E-028.88E-028.89E-021.14E-011.13E-011.13E-01

bg, background genes that were expressed by at least a proportion of ‘proportion-cut’ in any cell cluster to filter out those genes having low expression frequencies; PSG-841, putative PSGs in five articles; PSG-78, putative PSGs occurred in no fewer than two articles; PS-145, putative PSGs occurred in Freedman et al. 2016.

Table 3.

Enrichment analysis for each cluster-specific genes in putative PSG sets.

ClusterPSG-841PSG-78PSG-145
0 Glutamatergic neurons4.06E-056.18E-031.42E-01
0 Glutamatergic neurons (random)4.33E-012.32E-013.00E-01
3 Glutamatergic neurons2.38E-035.35E-024.86E-01
3 Glutamatergic neurons (random)4.30E-012.25E-013.04E-01
5 Glutamatergic neurons4.37E-013.76E-012.04E-01
5 Glutamatergic neurons (random)4.27E-012.63E-013.26E-01
6 Glutamatergic neurons9.51E-022.31E-013.75E-01
6 Glutamatergic neurons (random)4.01E-011.87E-012.63E-01
7 Glutamatergic neurons1.38E-012.57E-014.11E-01
7 Glutamatergic neurons (random)4.06E-011.94E-012.66E-01
10 Glutamatergic neurons1.02E-022.77E-014.39E-01
10 Glutamatergic neurons (random)4.10E-012.14E-012.89E-01
11 Glutamatergic neurons7.28E-052.43E-026.88E-02
11 Glutamatergic neurons (random)4.22E-011.74E-012.43E-01
13 Glutamatergic neurons1.83E-012.15E-013.50E-01
13 Glutamatergic neurons (random)4.08E-011.74E-012.52E-01
14 Glutamatergic neurons1.77E-022.04E-041.51E-02
14 Glutamatergic neurons (random)4.04E-011.99E-012.69E-01
15 Glutamatergic neurons5.16E-022.52E-014.05E-01
15 Glutamatergic neurons (random)4.11E-011.95E-012.79E-01
17 Glutamatergic neurons1.54E-033.47E-024.08E-01
17 Glutamatergic neurons (random)4.05E-011.98E-012.81E-01
20 Glutamatergic neurons8.16E-024.23E-022.06E-02
20 Glutamatergic neurons (random)4.33E-012.12E-012.79E-01
4 GABAergic neurons7.88E-024.18E-031.13E-01
4 GABAergic neurons (random)4.05E-012.14E-012.91E-01
8 GABAergic neurons2.32E-032.09E-033.64E-01
8 GABAergic neurons (random)3.99E-011.80E-012.53E-01
12 GABAergic neurons2.08E-014.87E-034.58E-01
12 GABAergic neurons (random)4.18E-012.21E-012.99E-01
ClusterPSG-841PSG-78PSG-145
21 GABAergic neurons1.57E-015.90E-025.06E-01
21 GABAergic neurons (random)4.27E-012.39E-013.05E-01
25 Cajal-Retzius2.69E-011.78E-012.95E-01
25 Cajal-Retzius (random)3.79E-011.48E-012.19E-01
1 Astrocytes1.46E-011.07E-012.62E-01
1 Astrocytes (random)4.26E-012.81E-013.50E-01
24 Microglia1.09E-053.34E-043.11E-04
24 Microglia (random)4.15E-012.13E-012.82E-01
9 Oligodendrocyte precursor cells2.38E-035.30E-045.45E-04
9 Oligodendrocyte precursor cells (random)4.42E-012.21E-013.07E-01
16 Unknown8.92E-022.72E-033.84E-03
16 Unknown (random)4.42E-012.84E-013.55E-01
2 Myelinating oligodendrocytes6.22E-032.57E-025.62E-03
2 Myelinating oligodendrocytes (random)4.42E-012.96E-013.53E-01
18 Myelinating oligodendrocytes3.81E-014.52E-021.22E-01
18 Myelinating oligodendrocytes (random)4.05E-012.18E-012.88E-01
19 Non-neuron1.46E-012.37E-032.62E-01
19 Non-neuron (random)4.45E-012.87E-013.46E-01
22 Non-neuron2.12E-011.59E-017.21E-01
22 Non-neuron (random)4.43E-013.11E-013.71E-01
23 Endothelial cells2.28E-011.01E-017.12E-02
23 Endothelial cells (random)4.26E-012.77E-013.45E-01
ClusterPSG-841PSG-78PSG-145
0 Glutamatergic neurons4.06E-056.18E-031.42E-01
0 Glutamatergic neurons (random)4.33E-012.32E-013.00E-01
3 Glutamatergic neurons2.38E-035.35E-024.86E-01
3 Glutamatergic neurons (random)4.30E-012.25E-013.04E-01
5 Glutamatergic neurons4.37E-013.76E-012.04E-01
5 Glutamatergic neurons (random)4.27E-012.63E-013.26E-01
6 Glutamatergic neurons9.51E-022.31E-013.75E-01
6 Glutamatergic neurons (random)4.01E-011.87E-012.63E-01
7 Glutamatergic neurons1.38E-012.57E-014.11E-01
7 Glutamatergic neurons (random)4.06E-011.94E-012.66E-01
10 Glutamatergic neurons1.02E-022.77E-014.39E-01
10 Glutamatergic neurons (random)4.10E-012.14E-012.89E-01
11 Glutamatergic neurons7.28E-052.43E-026.88E-02
11 Glutamatergic neurons (random)4.22E-011.74E-012.43E-01
13 Glutamatergic neurons1.83E-012.15E-013.50E-01
13 Glutamatergic neurons (random)4.08E-011.74E-012.52E-01
14 Glutamatergic neurons1.77E-022.04E-041.51E-02
14 Glutamatergic neurons (random)4.04E-011.99E-012.69E-01
15 Glutamatergic neurons5.16E-022.52E-014.05E-01
15 Glutamatergic neurons (random)4.11E-011.95E-012.79E-01
17 Glutamatergic neurons1.54E-033.47E-024.08E-01
17 Glutamatergic neurons (random)4.05E-011.98E-012.81E-01
20 Glutamatergic neurons8.16E-024.23E-022.06E-02
20 Glutamatergic neurons (random)4.33E-012.12E-012.79E-01
4 GABAergic neurons7.88E-024.18E-031.13E-01
4 GABAergic neurons (random)4.05E-012.14E-012.91E-01
8 GABAergic neurons2.32E-032.09E-033.64E-01
8 GABAergic neurons (random)3.99E-011.80E-012.53E-01
12 GABAergic neurons2.08E-014.87E-034.58E-01
12 GABAergic neurons (random)4.18E-012.21E-012.99E-01
ClusterPSG-841PSG-78PSG-145
21 GABAergic neurons1.57E-015.90E-025.06E-01
21 GABAergic neurons (random)4.27E-012.39E-013.05E-01
25 Cajal-Retzius2.69E-011.78E-012.95E-01
25 Cajal-Retzius (random)3.79E-011.48E-012.19E-01
1 Astrocytes1.46E-011.07E-012.62E-01
1 Astrocytes (random)4.26E-012.81E-013.50E-01
24 Microglia1.09E-053.34E-043.11E-04
24 Microglia (random)4.15E-012.13E-012.82E-01
9 Oligodendrocyte precursor cells2.38E-035.30E-045.45E-04
9 Oligodendrocyte precursor cells (random)4.42E-012.21E-013.07E-01
16 Unknown8.92E-022.72E-033.84E-03
16 Unknown (random)4.42E-012.84E-013.55E-01
2 Myelinating oligodendrocytes6.22E-032.57E-025.62E-03
2 Myelinating oligodendrocytes (random)4.42E-012.96E-013.53E-01
18 Myelinating oligodendrocytes3.81E-014.52E-021.22E-01
18 Myelinating oligodendrocytes (random)4.05E-012.18E-012.88E-01
19 Non-neuron1.46E-012.37E-032.62E-01
19 Non-neuron (random)4.45E-012.87E-013.46E-01
22 Non-neuron2.12E-011.59E-017.21E-01
22 Non-neuron (random)4.43E-013.11E-013.71E-01
23 Endothelial cells2.28E-011.01E-017.12E-02
23 Endothelial cells (random)4.26E-012.77E-013.45E-01

PSG-841, putative PSGs in five articles; PSG-78, putative PSGs occurred in no fewer than two articles; PSG-145, putative PSGs occurred in Freedman et al. 2016.

Table 3.

Enrichment analysis for each cluster-specific genes in putative PSG sets.

ClusterPSG-841PSG-78PSG-145
0 Glutamatergic neurons4.06E-056.18E-031.42E-01
0 Glutamatergic neurons (random)4.33E-012.32E-013.00E-01
3 Glutamatergic neurons2.38E-035.35E-024.86E-01
3 Glutamatergic neurons (random)4.30E-012.25E-013.04E-01
5 Glutamatergic neurons4.37E-013.76E-012.04E-01
5 Glutamatergic neurons (random)4.27E-012.63E-013.26E-01
6 Glutamatergic neurons9.51E-022.31E-013.75E-01
6 Glutamatergic neurons (random)4.01E-011.87E-012.63E-01
7 Glutamatergic neurons1.38E-012.57E-014.11E-01
7 Glutamatergic neurons (random)4.06E-011.94E-012.66E-01
10 Glutamatergic neurons1.02E-022.77E-014.39E-01
10 Glutamatergic neurons (random)4.10E-012.14E-012.89E-01
11 Glutamatergic neurons7.28E-052.43E-026.88E-02
11 Glutamatergic neurons (random)4.22E-011.74E-012.43E-01
13 Glutamatergic neurons1.83E-012.15E-013.50E-01
13 Glutamatergic neurons (random)4.08E-011.74E-012.52E-01
14 Glutamatergic neurons1.77E-022.04E-041.51E-02
14 Glutamatergic neurons (random)4.04E-011.99E-012.69E-01
15 Glutamatergic neurons5.16E-022.52E-014.05E-01
15 Glutamatergic neurons (random)4.11E-011.95E-012.79E-01
17 Glutamatergic neurons1.54E-033.47E-024.08E-01
17 Glutamatergic neurons (random)4.05E-011.98E-012.81E-01
20 Glutamatergic neurons8.16E-024.23E-022.06E-02
20 Glutamatergic neurons (random)4.33E-012.12E-012.79E-01
4 GABAergic neurons7.88E-024.18E-031.13E-01
4 GABAergic neurons (random)4.05E-012.14E-012.91E-01
8 GABAergic neurons2.32E-032.09E-033.64E-01
8 GABAergic neurons (random)3.99E-011.80E-012.53E-01
12 GABAergic neurons2.08E-014.87E-034.58E-01
12 GABAergic neurons (random)4.18E-012.21E-012.99E-01
ClusterPSG-841PSG-78PSG-145
21 GABAergic neurons1.57E-015.90E-025.06E-01
21 GABAergic neurons (random)4.27E-012.39E-013.05E-01
25 Cajal-Retzius2.69E-011.78E-012.95E-01
25 Cajal-Retzius (random)3.79E-011.48E-012.19E-01
1 Astrocytes1.46E-011.07E-012.62E-01
1 Astrocytes (random)4.26E-012.81E-013.50E-01
24 Microglia1.09E-053.34E-043.11E-04
24 Microglia (random)4.15E-012.13E-012.82E-01
9 Oligodendrocyte precursor cells2.38E-035.30E-045.45E-04
9 Oligodendrocyte precursor cells (random)4.42E-012.21E-013.07E-01
16 Unknown8.92E-022.72E-033.84E-03
16 Unknown (random)4.42E-012.84E-013.55E-01
2 Myelinating oligodendrocytes6.22E-032.57E-025.62E-03
2 Myelinating oligodendrocytes (random)4.42E-012.96E-013.53E-01
18 Myelinating oligodendrocytes3.81E-014.52E-021.22E-01
18 Myelinating oligodendrocytes (random)4.05E-012.18E-012.88E-01
19 Non-neuron1.46E-012.37E-032.62E-01
19 Non-neuron (random)4.45E-012.87E-013.46E-01
22 Non-neuron2.12E-011.59E-017.21E-01
22 Non-neuron (random)4.43E-013.11E-013.71E-01
23 Endothelial cells2.28E-011.01E-017.12E-02
23 Endothelial cells (random)4.26E-012.77E-013.45E-01
ClusterPSG-841PSG-78PSG-145
0 Glutamatergic neurons4.06E-056.18E-031.42E-01
0 Glutamatergic neurons (random)4.33E-012.32E-013.00E-01
3 Glutamatergic neurons2.38E-035.35E-024.86E-01
3 Glutamatergic neurons (random)4.30E-012.25E-013.04E-01
5 Glutamatergic neurons4.37E-013.76E-012.04E-01
5 Glutamatergic neurons (random)4.27E-012.63E-013.26E-01
6 Glutamatergic neurons9.51E-022.31E-013.75E-01
6 Glutamatergic neurons (random)4.01E-011.87E-012.63E-01
7 Glutamatergic neurons1.38E-012.57E-014.11E-01
7 Glutamatergic neurons (random)4.06E-011.94E-012.66E-01
10 Glutamatergic neurons1.02E-022.77E-014.39E-01
10 Glutamatergic neurons (random)4.10E-012.14E-012.89E-01
11 Glutamatergic neurons7.28E-052.43E-026.88E-02
11 Glutamatergic neurons (random)4.22E-011.74E-012.43E-01
13 Glutamatergic neurons1.83E-012.15E-013.50E-01
13 Glutamatergic neurons (random)4.08E-011.74E-012.52E-01
14 Glutamatergic neurons1.77E-022.04E-041.51E-02
14 Glutamatergic neurons (random)4.04E-011.99E-012.69E-01
15 Glutamatergic neurons5.16E-022.52E-014.05E-01
15 Glutamatergic neurons (random)4.11E-011.95E-012.79E-01
17 Glutamatergic neurons1.54E-033.47E-024.08E-01
17 Glutamatergic neurons (random)4.05E-011.98E-012.81E-01
20 Glutamatergic neurons8.16E-024.23E-022.06E-02
20 Glutamatergic neurons (random)4.33E-012.12E-012.79E-01
4 GABAergic neurons7.88E-024.18E-031.13E-01
4 GABAergic neurons (random)4.05E-012.14E-012.91E-01
8 GABAergic neurons2.32E-032.09E-033.64E-01
8 GABAergic neurons (random)3.99E-011.80E-012.53E-01
12 GABAergic neurons2.08E-014.87E-034.58E-01
12 GABAergic neurons (random)4.18E-012.21E-012.99E-01
ClusterPSG-841PSG-78PSG-145
21 GABAergic neurons1.57E-015.90E-025.06E-01
21 GABAergic neurons (random)4.27E-012.39E-013.05E-01
25 Cajal-Retzius2.69E-011.78E-012.95E-01
25 Cajal-Retzius (random)3.79E-011.48E-012.19E-01
1 Astrocytes1.46E-011.07E-012.62E-01
1 Astrocytes (random)4.26E-012.81E-013.50E-01
24 Microglia1.09E-053.34E-043.11E-04
24 Microglia (random)4.15E-012.13E-012.82E-01
9 Oligodendrocyte precursor cells2.38E-035.30E-045.45E-04
9 Oligodendrocyte precursor cells (random)4.42E-012.21E-013.07E-01
16 Unknown8.92E-022.72E-033.84E-03
16 Unknown (random)4.42E-012.84E-013.55E-01
2 Myelinating oligodendrocytes6.22E-032.57E-025.62E-03
2 Myelinating oligodendrocytes (random)4.42E-012.96E-013.53E-01
18 Myelinating oligodendrocytes3.81E-014.52E-021.22E-01
18 Myelinating oligodendrocytes (random)4.05E-012.18E-012.88E-01
19 Non-neuron1.46E-012.37E-032.62E-01
19 Non-neuron (random)4.45E-012.87E-013.46E-01
22 Non-neuron2.12E-011.59E-017.21E-01
22 Non-neuron (random)4.43E-013.11E-013.71E-01
23 Endothelial cells2.28E-011.01E-017.12E-02
23 Endothelial cells (random)4.26E-012.77E-013.45E-01

PSG-841, putative PSGs in five articles; PSG-78, putative PSGs occurred in no fewer than two articles; PSG-145, putative PSGs occurred in Freedman et al. 2016.

Glutamatergic receptors constitute a major excitatory transmitter system, and dog excitatory synaptic plasticity increased in the domestication process [15]. Forty of 86 putative PSGs were expressed in glutamatergic neurons and 28 genes were expressed in Clusters 0, 11, 14, 17 and 20. The GO enrichment analysis showed that DEGs of Cluster 0 were enriched in the regulation of cell migration (GO : 0030334, Supplementary Table S2) and regulation of cell motility (GO : 2000145, Supplementary Table S2), those in Clusters 11 and 14 were involved in neuron differentiation (GO : 0030182, Supplementary Table S2) and those in Clusters 17 and 20 were enriched in axon guidance (GO : 0007411, Supplementary Table S2) and neuron projection guidance (GO : 0097485, Supplementary Table S2). A putative PSG CUX2 was highly expressed in DG granular cells as glutamatergic neuron DEGs in Clusters 11 and 20, and it was also expressed in CA granular cells, as the same expression pattern in mice (Fig. 4A and B and Supplementary Fig. S4B) [47]. Another putative PSG, GRIK3, in Cluster 14 was detected mainly in the GCL of the DG and SP of the CA area (Fig. 4A and B). It is worth noting that the putative PSGs were significantly enriched in Cluster 14 (Table 3), and the DEGs in Cluster 14 were enriched in synapse organization (GO : 0050808, Supplementary Table S2), neuron differentiation (GO : 0030182, Supplementary Table S2), synaptic signaling (GO : 0099536, Supplementary Table S2) and synapse assembly (GO : 0007416, Supplementary Table S2).

The mean expression of DEGs in different cell types. (A) The values of each gene (row) were its original values divided by its maximum (Supplementary Table S13). (B) Six genes in Fig. 4A expression in the dog hippocampus revealed by immunohistochemical and immunofluorescent staining. Six genes on the right of the picture were differentially expressed in different cell types and the GO enrichment analysis showed their relevant gene functions. It is worth noting that CUX2 and GRIK3 are DEGs in glutamatergic neurons, although compared to other cells they are highly expressed in non-neurons and GABAergic neurons, respectively. High-definition immunohistochemical and immunofluorescent staining is shown in Supplementary Fig. S4A.
Figure 4.

The mean expression of DEGs in different cell types. (A) The values of each gene (row) were its original values divided by its maximum (Supplementary Table S13). (B) Six genes in Fig. 4A expression in the dog hippocampus revealed by immunohistochemical and immunofluorescent staining. Six genes on the right of the picture were differentially expressed in different cell types and the GO enrichment analysis showed their relevant gene functions. It is worth noting that CUX2 and GRIK3 are DEGs in glutamatergic neurons, although compared to other cells they are highly expressed in non-neurons and GABAergic neurons, respectively. High-definition immunohistochemical and immunofluorescent staining is shown in Supplementary Fig. S4A.

Eleven putative PSGs were expressed in GABAergic neurons, seven of them belonged to Cluster 8. GO enrichment analysis suggested that DEGs in Cluster 8 were involved in behavior (GO : 0007610, Supplementary Table S3), forebrain development (GO : 0030900, Supplementary Table S3) and adult locomotory behavior (GO : 0008344, Supplementary Table S3). GAD2 was expressed in Clusters 4, 8, 12 and 21 (Fig. 1E) and it has been taken as a candidate gene to regulate brain development and behavior during domestication (Fig. 4A and B) [13]. Moreover, DEGs in Clusters 4, 8, 12 and 21 were enriched with behavior (GO : 0007610, Supplementary Table S3) and anterograde transsynaptic signaling (GO : 0098916, Supplementary Table S3). Previous studies suggested that ablation of GAD2 in mice reduced freezing and increased flight and escape behavior [48]. NRXN3 (Clusters 4 and 8) encodes a receptor and cell adhesion molecule in the nervous system, and it was highly expressed in the SP of the hippocampal CA area (Fig. 4A and B). GO enrichment analysis showed that DEGs in Clusters 4 and 8 were relevant to behavior (GO : 0007610, Supplementary Table S3), regulation of synapse organization (GO : 0050807, Supplementary Table S3) and adult locomotory behavior (GO : 0008344, Supplementary Table S3). A previous study also confirmed that putative PSGs in Chinese native dogs were enriched in locomotory behavior [14].

Twenty-four putative PSGs occurred in oligodendrocytes, including Clusters 2, 9, 16 and 18. Three of them are related to OPC differentiation. Oligodendrocytes are a kind of cell that primarily forms CNS myelin [49]. One of the overlapping genes, COL11A1 (Clusters 9 and 16), marked OPCs and was expressed in the partial area of the hippocampal CA (Fig. 4A and B). Previous studies have shown that OPCs are important for human white matter expansion and myelination [50]. Myelinating oligodendrocytes were the result of OPC differentiation. MBP was a marker gene that was expressed in Clusters 2, 16 and 18, and highly expressed in the hilus, ML, SLM and other hippocampal regions (Fig. 4A and B). As a domestication gene, MBP encodes myelin basic protein. Domestication influenced myelination and/or axonal diameter in rabbits [51] and affected neurological function in dogs [43]. The relevant functions in these clusters were ensheathment of neurons (Clusters 2 and 16, GO : 0007272, Supplementary Table S8) and axon ensheathment (Clusters 2 and 16, GO : 0008366, Supplementary Table S8).

Reconstructed gene regulatory networks of the dog hippocampus

We reconstructed dog hippocampal gene regulatory networks using GENIE3 software [52]. To attenuate the effects of noise and outliers, we used 4523 genes and 4281 pseudo cells (see ‘Methods’ section about Weighted correlation network analysis, WGCNA), which contained all the cell types in this study. More than 2 million interactions were found between the putative regulatory genes and target genes, with 2239 putative regulatory genes in the top 1% (Supplementary Table S9). The transcription factors in dogs (Canis familiaris) from AnimalTFDB3.0 [53] were used to verify these putative regulatory genes. As the results showed, there were 118 putative regulatory genes, 11 of which were both PSGs and DEGs, including Cut Like Homeobox 2 (CUX2) [13,43] and Regulatory Factor X3 (RFX3) [5], in glutamatergic neurons. GO enrichment analysis showed that the target genes of CUX2 were related to glutamatergic synaptic transmission (GO : 0035249, Supplementary Table S10). The transcription factor CUX2 is involved in early neocortical circuits, cellular fate selection and mechanosensation development [54,55]. Another regulatory gene, RFX3, is expressed in the DG in mice [56] and its target gene was enriched in neurogenesis (GO : 0022008, Supplementary Table S11), neuron development (GO : 0048666, Supplementary Table S11) and locomotion (GO : 0040011, Supplementary Table S11). Taken together, the results above implied that glutamatergic neurons may be involved in the adaptive evolution of dog behaviors.

DISCUSSION

In this study, we presented single-nucleus transcriptomics of the dog hippocampus and identified 26 cell clusters based on 105 057 single-nucleus transcriptomes, including glutamatergic neurons, GABAergic neurons, Cajal-Retzius cells, OPCs, myelinating oligodendrocytes, endothelial cells, astrocytes, microglia cells and so on. In addition, our study demonstrated the trajectory of oligodendrocyte differentiation based on the recalculated UMAP embedding analysis. The cross-species analysis revealed that eight major cell types were shared between the human and dog hippocampus [21], suggesting that dogs have potential as a model organism for human mental illness.

Gene expression has spatial heterogeneity, which could reveal the relationship between hippocampal subareas and cell clusters. The hippocampus is divided into two main parts, namely the cornu ammonis and the DG, which contain pyramidal neurons and granule cells, respectively [57]. Clusters 19 (unidentified), 20 (glutamatergic neurons), 25 (Cajal-Retzius cells) and 16 (oligo-mix) might be distributed in the DG because these clusters had the highest expression proportion and average expression of CUX2, CXCR4 and SEMA5A, respectively (Supplementary Table S12). Immunohistochemical showed that CUX2 was highly expressed in the DG (Fig. 4B and Supplementary Fig. S4A), and CXCR4 and SEMA5A are also expressed in the DG [21,58]. Clusters 7 (glutamatergic neurons) and 20 (glutamatergic neurons) might be located in CA3 since these two clusters had the highest expression proportion and average expression in NRIP3 and SULF2, respectively. These two genes are marker genes for CA3 [21]. Cluster 19 might also be located in CA1 because PID1, a marker gene for CA1, had the highest expression proportion and average expression [21]. Clusters 6 (glutamatergic neurons) and 13 (glutamatergic neurons) had the highest expression proportion and average expression in RGS14, which is a CA2 marker; therefore, we inferred that these two clusters might be located in CA2 [59].

Reduced fear and aggression are important traits selected by humans in the first step of animal domestication, which help animals not only live commensally with humans, but also stay in a crowded environment [60,61]. As a subtype of inhibitory neurons, GABAergic neurons have been linked to the response to learning and fear memory [62,63]. Domestication may reduce myelination levels, compromise neural conduction, also change the size of brain structures relevant for memory, reflexes and fear processing [51]. The lower reactive level is a component of domestication syndrome, which is a collection of common traits in domestic animals [64]. In our data, the putative PSGs in domestication were involved in the development of oligodendrocytes and might be involved in myelination levels during nervous system development.

Glutamate receptors play an important role in the CNS and respond to basal excitatory synaptic transmission; some of them are involved in learning and memory, and there were great changes in glutamate receptors in animal domestication and modern human evolution [65]. Furthermore, Cluster 14 consisted of glutamatergic neurons, and DEGs in the cluster were enriched in the regulation of synapse structure or activity (GO : 0050803, Supplementary Table S2) and regulation of nervous system development (GO : 0051960, Supplementary Table S2). These findings reveal that cells in Cluster 14 may play an important role in a decreased stress response not only in domesticated animals, but also in humans [65]. These results could verify the hypothesis that glutamatergic neurons changed their behavior during domestication and reduced the fear response in dogs by regulating cells in the hippocampus, and Cluster 14 is probably an important cluster in dog domestication.

In summary, 105 057 single-nucleus transcriptomes were classified into 26 clusters in this study and were defined with distinct identities. Further exploration revealed 86 genes that overlapped between putative PSGs and DEGs. In addition, we illustrated the OPC differentiation trajectory based on the UMAP embedding. Our results contributed to defining the cell types and revealing the development of oligodendrocytes in the dog hippocampus, illustrating the difference between the subcell types and connecting the gap in our understanding between the molecular and cellular mechanisms of animal domestication.

MATERIALS AND METHODS

Information on the materials used to conduct the research and descriptions of all methods used in the analysis are available in the Supplementary information. A 5-month-old Beagle was used in this work; the dog was provided by the Department of Laboratory Animal Science, Kunming Medical University. All animal processing procedures and experiments performed in the present study were approved by the Animal Ethics Committee of Kunming Institute of Zoology, Chinese Academy of Sciences (SMKX-20 160 301-01).

DATA AVAILABILITY

The data underlying this article are available in the Genome Sequence Archive in the BIG Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences, and can be accessed with PRJCA004294. We also constructed the dog hippocampus atlas website at https://ngdc.cncb.ac.cn/idog/ (single-cell module).

FUNDING

This work was supported by the National Key R&D Program of China (2019YFA0707101, 2021ZD0203903 and 2019YFA0709501), the Key Research Program of Frontier Sciences of the Chinese Academy of Sciences (CAS) (ZDBS-LY-SM011 and QYZDB-SSW-SYS008), the National Natural Science Foundation of China (61621003), the National Ten Thousand Talent Program for Young Top-notch Talents, and Innovative Research Team (in Science and Technology) of Yunnan Province (202005AE160012). G.D.W. is supported by the Youth Innovation Promotion Association of CAS and Spring City Plan: the High-level Talent Promotion and Training Project of Kunming. This work was also supported by the Animal Branch of the Germplasm Bank of Wild Species, Chinese Academy of Sciences (the Large Research Infrastructure Funding).

AUTHOR CONTRIBUTIONS

G.-D.W., S.Z. and B.M. conceived of the study; Q.-J.Z., X.L. and R.W. analysed the data; Q.-J.Z. and L.Z. collected the data; L.Z. performed the experiment; X.L., G.L., Y.H., Z.D. and P.M. performed the scRNA-seq library; T.Y. helped with the anatomy experiment; Z.W. revised the manuscript. All authors wrote, revised and approved the manuscript.

Conflict of interest statement. None declared.

REFERENCES

1.

Wang
MS
,
Thakur
M
,
Peng
MS
et al.
863 genomes reveal the origin and domestication of chicken
.
Cell Res
2020
;
30
:
693
701
.

2.

Lu
J
,
Tang
T
,
Tang
H
et al.
The accumulation of deleterious mutations in rice genomes: a hypothesis on the cost of domestication
.
Trends Genet
2006
;
22
:
126
31
.

3.

Wang
XF
,
He
ZW
,
Shi
SH
et al.
Genes and speciation: is it time to abandon the biological species concept?
Natl Sci Rev
2020
;
7
:
1387
97
.

4.

Wang
GD
,
Zhai
WW
,
Yang
HC
et al.
The genomics of selection in dogs and the parallel evolution between dogs and humans
.
Nat Commun
2013
;
4
:
2814
.

5.

Wang
GD
,
Zhai
WW
,
Yang
HC
et al.
Out of southern East Asia: the natural history of domestic dogs across the world
.
Cell Res
2016
;
26
:
21
33
.

6.

Cao
X
,
Liu
WP
,
Cheng
LG
et al.
Whole genome analyses reveal significant convergence in obsessive-compulsive disorder between humans and dogs
.
Sci Bull
2021
;
66
:
187
96
.

7.

Liu
YH
,
Wang
L
,
Zhang
ZG
et al.
Whole-genome sequencing reveals lactase persistence adaptation in European logs
.
Mol Biol Evol
2021
;
38
:
4884
90
.

8.

Wu
CI
,
Wang
GD
,
Xu
SH
.
Convergent adaptive evolution: how common, or how rare?
Natl Sci Rev
2020
;
7
:
945
6
.

9.

Hajeski
NJ
.
Every Dog: The Ultimate Guide to Over 450 Dog Breeds
.
Buffalo, NY
:
Firefly Books
,
2016
.

10.

Wang
GD
,
Larson
G
,
Kidd
JM
et al.
Dog10K: the International Consortium of Canine Genome Sequencing
.
Natl Sci Rev
2019
;
6
:
611
3
.

11.

Ostrander
EA
,
Wang
GD
,
Larson
G
et al.
Dog10K: an international sequencing effort to advance studies of canine domestication, phenotypes and health
.
Natl Sci Rev
2019
;
6
:
810
24
.

12.

Wang
GD
,
Shao
XJ
,
Bai
B
et al.
Structural variation during dog domestication: insights from gray wolf and whole genomes
.
Natl Sci Rev
2019
;
6
:
110
22
.

13.

Pendleton
AL
,
Shen
FC
,
Taravella
AM
et al.
Comparison of village dog and wolf genomes highlights the role of the neural crest in dog domestication
.
BMC Biol
2018
;
16
:
64
.

14.

Li
Y
,
vonHoldt
BM
,
Reynolds
A
et al.
Artificial selection on brain-expressed genes during the domestication of dog
.
Mol Biol Evol
2013
;
30
:
1867
76
.

15.

Li
Y
,
Wang
GD
,
Wang
MS
et al.
Domestication of the dog from the wolf was promoted by enhanced excitatory synaptic plasticity: a hypothesis
.
Genome Biol Evol
2014
;
6
:
3115
21
.

16.

Nadel
L
,
Moscovitch
M
.
Memory consolidation, retrograde amnesia and the hippocampal complex
.
Curr Opin Neurobiol
1997
;
7
:
217
27
.

17.

Zeisel
A
,
Munoz-Manchado
AB
,
Codeluppi
S
et al.
Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq
.
Science
2015
;
347
:
1138
42
.

18.

Yi
WY
,
Lu
YF
,
Zhong
SJ
et al.
A single-cell transcriptome atlas of the aging human and macaque retina
.
Natl Sci Rev
2021
;
8
:
nwaa179

19.

Li
JY
,
Zheng
YX
,
Yan
PZ
et al.
A single-cell transcriptomic atlas of primate pancreatic islet aging
.
Natl Sci Rev
2021
;
8
:
nwaa127
.

20.

Hochgerner
H
,
Zeisel
A
,
Lobnnerberg
P
et al.
Conserved properties of dentate gyrus neurogenesis across postnatal development revealed by single-cell RNA sequencing
.
Nat Neurosci
2018
;
21
:
290
9
.

21.

Zhong
SJ
,
Ding
WY
,
Sun
L
et al.
Decoding the development of the human hippocampus
.
Nature
2020
;
577
:
531
6
.

22.

Rosenberg
AB
,
Roco
CM
,
Muscat
RA
et al.
Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding
.
Science
2018
;
360
:
176
82
.

23.

Zhang
XX
,
Lan
YJ
,
Xu
JY
et al.
CellMarker: a manually curated resource of cell markers in human and mouse
.
Nucleic Acids Res
2019
;
47
:
D721
8
.

24.

2015 Allen Institute for Brain Science.
(December 2019, date last accessed).

25.

2015 Allen Institute for Brain Science.
(December 2019, date last accessed).

26.

Baude
A
,
Strube
C
,
Tell
F
et al.
Glutamatergic neurotransmission in the nucleus tractus solitarii: structural and functional characteristics
.
J Chem Neuroanat
2009
;
38
:
145
53
.

27.

2020 Allen Institute for Brain Science.
Allen Brain map. http://mouse.brain-map.org/experiment/show/74513995 (January 2020, date last accessed)
.

28.

Villar-Cervino
V
,
Marin
O
.
Cajal-Retzius cells
.
Curr Biol
2012
;
22
:
R179
.

29.

Soriano
E
,
del Rio
JA
.
The cells of Cajal-Retzius: still a mystery one century after
.
Neuron
2005
;
46
:
389
94
.

30.

Fan
XY
,
Dong
J
,
Zhong
SJ
et al.
Spatial transcriptomic survey of human embryonic cerebral cortex by single-cell RNA-seq analysis
.
Cell Res
2018
;
28
:
730
45
.

31.

Korsunsky
I
,
Millard
N
,
Fan
J
et al.
Fast, sensitive and accurate integration of single-cell data with harmony
.
Nat Methods
2019
;
16
:
1289
96
.

32.

Chatron
N
,
Becker
F
,
Morsy
H
et al.
Bi-allelic GAD1 variants cause a neonatal onset syndromic developmental and epileptic encephalopathy
.
Brain
2020
;
143
:
1447
61
.

33.

Kamholz
J
,
de Ferra
F
,
Puckett
C
et al.
Identification of three forms of human myelin basic protein by cDNA cloning
.
Proc Natl Acad Sci USA
1986
;
83
:
4962
6
.

34.

Martinsen
V
,
Kursula
P
.
Multiple sclerosis and myelin basic protein: insights into protein disorder and disease
.
Amino Acids
2022
;
54
:
99
109
.

35.

Nagelhus
EA
,
Ottersen
OP
.
Physiological roles of aquaporin-4 in brain
.
Physiol Rev
2013
;
93
:
1543
62
.

36.

Kupprion
C
,
Motamed
K
,
Sage
EH
.
SPARC (BM-40, osteonectin) inhibits the mitogenic effect of vascular endothelial growth factor on microvascular endothelial cells
.
J Biol Chem
1998
;
273
:
29635
40
.

37.

Trowbridge
IS
,
Thomas
ML
.
Cd45—an Emerging role as a protein-tyrosine-phosphatase required for lymphocyte-activation and development
.
Annu Rev Immunol
1994
;
12
:
85
116
.

38.

Liu
X
,
Shen
Q
,
Zhang
S
.
Cross-species cell-type assignment of single-cell RNA-seq by a heterogeneous graph neural network
.
bioRxiv
2021
. doi: https://doi.org/10.1101/2021.09.25.461790.

39.

Darmanis
S
,
Sloan
SA
,
Zhang
Y
et al.
A survey of human brain transcriptome diversity at the single cell level
.
Proc Natl Acad Sci USA
2015
;
112
:
7285
90
.

40.

Street
K
,
Risso
D
,
Fletcher
RB
et al.
Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics
.
BMC Genomics
2018
;
19
:
477
.

41.

Kuboyama
K
,
Fujikawa
A
,
Masumura
M
et al.
Protein tyrosine phosphatase receptor type Z negatively regulates oligodendrocyte differentiation and myelination
.
PLoS One
2012
;
7
:
e48797
.

42.

Stolt
CC
,
Schlierf
A
,
Lommes
P
et al.
SoxD proteins influence multiple stages of oligodendrocyte development and modulate SoxE protein function
.
Dev Cell
2006
;
11
:
697
709
.

43.

Freedman
AH
,
Schweizer
RM
,
Ortega-Del Vecchyo
D
et al.
Demographically-Based evaluation of genomic regions under selection in domestic dogs
.
PLoS Genet
2016
;
12
:
e1005851
.

44.

Axelsson
E
,
Ratnakumar
A
,
Arendt
ML
et al.
The genomic signature of dog domestication reveals adaptation to a starch-rich diet
.
Nature
2013
;
495
:
360
4
.

45.

vonHoldt
BM
,
Pollinger
JP
,
Lohmueller
KE
et al.
Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication
.
Nature
2010
;
464
:
898
902
.

46.

Martinez
O
,
Reyes-Valdes
MH
.
Defining diversity, specialization, and gene specificity in transcriptomes through information theory
.
Proc Natl Acad Sci USA
2008
;
105
:
9709
14
.

47.

Yamada
M
,
Clark
J
,
McClelland
C
et al.
Cux2 activity defines a subpopulation of perinatal neurogenic progenitors in the hippocampus
.
Hippocampus
2015
;
25
:
253
67
.

48.

Stork
O
,
Yamanaka
H
,
Stork
S
et al.
Altered conditioned fear behavior in glutamate decarboxylase 65 null mutant mice
.
Genes Brain Behav
2003
;
2
:
65
70
.

49.

Fancy
SPJ
,
Miller
RH
.
Signaling pathways that regulate glial development and early migration—oligodendrocytes
. In:
Rubenstein
J
,
Rakic
P
,
Chen
B
et al (eds).
Patterning and Cell Type Specification in the Developing CNS and PNS
, 2nd edn.
Cambridge, MA: Academic Press
,
2020
,
867
90
.

50.

Huang
W
,
Bhaduri
A
,
Velmeshev
D
et al.
Origins and proliferative states of human oligodendrocyte precursor cells
.
Cell
2020
;
182
:
594
608
.

51.

Brusini
I
,
Carneiro
M
,
Wang
CL
et al.
Changes in brain architecture are consistent with altered fear processing in domestic rabbits
.
Proc Natl Acad Sci USA
2018
;
115
:
7380
5
.

52.

Huynh-Thu
VA
,
Irrthum
A
,
Wehenkel
L
et al.
Inferring regulatory networks from expression data using tree-based methods
.
PLoS One
2010
;
5
. doi: 10.1371/journal.pone.0012776.

53.

Hu
H
,
Miao
YR
,
Jia
LH
et al.
AnimalTFDB 3.0: a comprehensive resource for annotation and prediction of animal transcription factors
.
Nucleic Acids Res
2019
;
47
:
33
8
.

54.

Miskic
T
,
Kostovic
I
,
Rasin
MR
et al.
Adult upper cortical layer specific transcription factor CUX2 is expressed in transient subplate and marginal zone neurons of the developing human brain
.
Cells
2021
;
10
. doi: 10.3390/cells10020415.

55.

Bachy
I
,
Franck
MCM
,
Li
LL
et al.
The transcription factor CUX2 marks development of an A-delta sublineage of TrkA sensory neurons
.
Dev Biol
2011
;
360
:
77
86
.

56.

Allen Institute for Brain Science. https://mouse.brain-map.org/experiment/show/638719 (July 2022, date last accessed)
.

57.

Sarkar
A
,
Marchetto
MC
,
Gage
FH
.
Neural induction of embryonic stem/induced pluripotent stem cells
. In:
Rubenstein
J
,
Rakic
P
,
Chen
B
et al (eds).
Patterning and Cell Type Specification in the Developing CNS and PNS
, 2nd edn.
Cambridge, MA: Academic
Press
,
2020
,
185
203
.

58.

Yao
ZZ
,
van Velthoven
CTJ
,
Nguyen
TN
et al.
A taxonomy of transcriptomic cell types across the isocortex and hippocampal formation
.
Cell
2021
;
184
:
3222
41
.

59.

Lee
SE
,
Simons
SB
,
Heldt
SA
et al.
RGS14 is a natural suppressor of both synaptic plasticity in CA2 neurons and hippocampal-based learning and memory
.
Proc Natl Acad Sci USA
2010
;
107
:
16994
8
.

60.

de Kloet
ER
,
Joels
M
,
Holsboer
F
.
Stress and the brain: from adaptation to disease
.
Nat Rev Neurosci
2005
;
6
:
463
75
.

61.

Price
EO
.
Principles and Applications of Domestic Animal Behavior
.
Wallingford: CABI Publishing
,
2008
.

62.

Harris
JA
,
Westbrook
RF
.
Evidence that GABA transmission mediates context specific extinction of learned fear
.
Psychopharmacology (Berl)
1998
;
140
:
105
15
.

63.

Stork
O
,
Ji
FY
,
Obata
K
.
Reduction of extracellular GABA in the mouse amygdala during and following confrontation with a conditioned fear stimulus
.
Neurosci Lett
2002
;
327
:
138
42
.

64.

Wilkins
AS
,
Wrangham
RW
,
Fitch
WT
.
The ‘Domestication syndrome’ in mammals: a unified explanation based on neural crest cell behavior and genetics
.
Genetics
2014
;
197
:
795
808
.

65.

O’Rourke
T
,
Boeckx
C
.
Glutamate receptors in domestication and modern human evolution
.
Neurosci Biobehav Rev
2020
;
108
:
341
57
.

Author notes

Lead contact.

Equally contributed to this work.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data