Abstract

Human leukocyte antigen (HLA) genes play a crucial role in the adaptation of human populations to the dynamic pathogenic environment. Despite their significance, investigating the pathogen-driven evolution of HLAs and the implications for autoimmune diseases presents considerable challenges. Here, we genotyped over twenty HLA genes at 3-field resolution in 8278 individuals from diverse ethnic backgrounds, including 4013 unrelated Han Chinese. We focused on the adaptation of HLAs in the Han Chinese by analyzing their binding affinity for various pathogens, and explored the potential correlations between pathogen adaptation and autoimmune diseases. Our findings reveal that specific HLA alleles like HLA-DRB1*07:01 and HLA-DQB1*06:01 confer strong pathogen adaptability at the sequence level, notably for Corynebacterium diphtheriae and Bordetella pertussis. Additionally, alleles like HLA-C*03:02 demonstrate adaptive selection against pathogens like Mycobacterium tuberculosis and coronavirus at the gene expression level. Simultaneously, the aforementioned HLA alleles are closely related to some autoimmune diseases such as multiple sclerosis (MS). These exploratory discoveries shed light on the intricate coevolutionary relationships between pathogen adaptation and autoimmune diseases in the human population. These efforts led to an HLA database at http://bigdata.ibp.ac.cn/HLAtyping, aiding searches for HLA allele frequencies across populations.

Information Accepted manuscripts
Accepted manuscripts are PDF versions of the author’s final manuscript, as accepted for publication by the journal but prior to copyediting or typesetting. They can be cited using the author(s), article title, journal title, year of online publication, and DOI. They will be replaced by the final typeset articles, which may therefore contain changes. The DOI will remain the same throughout.
This content is only available as a PDF.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data