Shandong Provincial Hospital for Skin Diseases & Shandong Provincial Institute of Dermatology and Venereology, Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, Shandong, China
Filaggrin, encoded by the FLG gene, plays a crucial role in the barrier function of epidermis, but the association between FLG loss-of-function mutations and infectious skin diseases has not been systematically studied. FLG coding sequences from 945 patients with leprosy and 916 healthy controls were captured and enriched using an array-based high-throughput system, and subjected to next-generation sequencing. The loss-of-function mutations found were further validated by Sanger sequencing. A total of 21 loss-of-function mutations were found in 945 patients with leprosy, with a carrier rate of 17.53%, while the prevalence of these mutations in 916 healthy controls was 14.77%, which was significantly lower than in patients. Two individual FLG loss-of-function mutations (K4022X and Q1790X) were found to be significantly associated with leprosy. These results suggest a possible role for filaggrin in defending against leprosy pathogens.
Key words: leprosy; filaggrin; FLG; loss-of-function mutation; association study.
Accepted Oct 7, 2020; Epub ahead of print Oct 13, 2020
Acta Derm Venereol 2020; 100: adv00299.
Corr: Zihao Mi and Hong Liu, Shandong Provincial Hospital for Skin Diseases & Shandong Provincial Institute of Dermatology and Venereology, Shandong First Medical University & Shandong Academy of Medical Sciences, 27397 Jingshi Lu, Jinan 250022, Shandong Province, China. E-mails: email@example.com; firstname.lastname@example.org
Leprosy is a chronic infectious skin disease caused by the pathogen Mycobacterium leprae, which shows strong genetic predisposition. This study sequenced the FLG gene coding sequence in a large cohort of patients with leprosy and matched healthy controls. The results show that the carrier rate of FLG loss-of-function mutations in patients with leprosy was significantly higher than that in healthy controls. Skin barrier deficiency, caused by FLG loss-of-function mutations, may contribute to the pathogenesis of leprosy by increasing the risk of infection through impaired skin. These results suggest a possible role for filaggrin in defending against leprosy pathogens.
The FLG gene encodes filaggrin, which is the main constituent of keratohyalin granules and is indispensable for proper functioning of the epidermal barrier. Loss-of-function (LoF) mutations in the FLG gene, which cause pre-stop and frameshift of coding, are the cause of the Mendelian disease ichthyosis vulgaris (IV) (1). A strong association between FLG LoF mutations and atopic dermatitis (AD) has also been shown (2–4). Based on the speculation that an impaired skin barrier may facilitate the invasion of microbes, the association between FLG mutations and skin infection has been studied in patients with AD, and filaggrin deficiency has been shown to increase patients’ susceptibility to bacterial infection (5). Nevertheless, the association between FLG LoF mutations and infectious skin disease has been little studied. In 2017, the low-frequency FLG mutation K4022X was shown to increase susceptibility to leprosy, in a genome-wide analysis of protein-coding variants in a Chinese population (6), highlighting the involvement of FLG in infectious diseases per se.
Leprosy is a chronic infectious disease caused by Mycobacterium leprae or Mycobacterium lepromatosis, which impairs the skin and peripheral nerves, leading to irreversible disabilities (7). Although infection with the pathogen is needed for the onset of leprosy, genetic factors in the host have long been regarded as a major contributor, with estimated heritability of up to 57% (8–12). Currently, more than 30 susceptibility genes have been identified as associated with leprosy, most of which are immune-related genes involved in pathogen recognition and the subsequent immune response (6). However, in the case of FLG, Gene Ontology analysis showed that this gene was not included in any immune networks with other reported leprosy-associated genes, indicating its possible barrier role in defending against infection with Mycobacterium leprae (6). Given that a series of low-frequency and rare LoF mutations in FLG have been reported in AD and IV (13–16), the finding of only one leprosy-associated low-frequency (3.54%) LoF mutation (K4022X) in FLG may be not adequately convincing of the potential role of FLG in leprosy (6).
The aim of the current study was to systematically investigate the association between leprosy and FLG LoF mutations through deep sequencing, and to preliminarily evaluate the effect of barrier deficiency caused by FLG LoF mutations on the onset of infectious skin diseases.
Study subjects and genomic DNA preparation
All patients with leprosy and healthy controls were recruited using uniform criteria from Shandong Provincial Institute of Dermatology and Venereology, as described previously (6). The study was approved by the institutional review board and all subjects gave written informed consent. Leprosy was diagnosed by at least 2 dermatologists based on clinical manifestations and histopathological examination. Blood samples were taken at the time of patient diagnosis. Genomic DNA was extracted using QuickGene DNA Whole Blood Kit L (AADS Biotec, Neyagawa, Japan) following the manufacturer’s instructions.
High-throughput sequencing and Sanger sequencing
A newly reported array-based high-throughput sequencing method was used to sequence the FLG coding sequence (reference gene sequence, NG_016190.1) (16). Briefly, the whole FLG coding sequences of FLG were simultaneously amplified by the Access Array system (Fluidigm, San Francisco, CA, USA) followed by sequencing of pooled amplicons using the Miseq system (Illumina, San Diego, CA, USA). The basic quality of sequencing reads was checked by FastQC v0.11.8, and the raw sequencing reads were trimmed and filtered with Trimmonmatic v0.38 (17). The clean reads obtained were then aligned with hg19 using BWA v0.7.17 (18). Variants were called using GATK v184.108.40.206 following GATK Best Practices. --filterExpression “QD < 2.0 || FS > 60.0 || MQ < 40.0 || MQRankSum < –12.5 || ReadPosRankSum < –8.0” were used for sequencing quality control. The sequencing depth for each fragment was larger than 50×. All identified LoF mutations were further validated by Sanger sequencing. Before Sanger sequencing, FastStart™ High Fidelity PCR System (Roche Applied Science, Mannheim, Germany) was used to amplify target sequences. Sanger sequencing was performed using Applied Biosystems™ 3500xL Dx Genetic Analyzer (Life Technologies, NY, USA). All primers used have been described previously (16).
Fisher’s exact test was used to compare the allele frequencies between patients and controls. The odds ratios (ORs) of the significant mutation and the combined FLG LoF mutation genotype were estimated by logistic regression analysis.
All patients and controls were of Chinese Han ethnicity and were enrolled from Shandong province, China. The mean ± standard deviation (SD) age of the 945 patients with leprosy was 63.31 ± 8.52 years (range 23–80 years) and 54.6% (516) of patients were male. The mean ± SD age of the 916 controls was 63.61 ± 11.48 years (range 31–90 years), and 53.60% (491) of controls were male. No sex or age bias was observed between patients and controls.
In total, 21 LoF mutations, including 11 nonsense and 10 frameshift mutations, were discovered and validated in 945 Chinese Han patients with leprosy and 916 healthy controls (Table I). Among them, 9 mutations had not previously been reported, namely c.12159_12162del:p.Q4053Hfs*18, c.C11227T:p.R3743X, c.G5841A:p.W1947X, c.4774dupT:p.S1592Ffs*3, c.C2185T:p.Q729X, c.G622T:p.E208X, c.428delG:p.S143Tfs*50, c.A418T:p.R140X and c.A388T:p.R130X (Fig. 1). The other 12 mutations were previously reported, of which 2 were low-frequency variants (1% < minor allele frequency < 5%) and the remaining 10 were rare variants (minor allele frequency < 1%). The frequencies of these 12 known mutations in the healthy controls showed similar frequencies in public databases (Table SI).
Table I. Case-control analysis of 21 FLG mutations
Fig. 1. Nine novel mutations discovered in patients with leprosy.
The overall carrier rate of 21 mutations in patients with leprosy was 17.53%, which was significantly higher than the 14.77% carrier rate in healthy controls (p = 0.041, OR 1.31, 95% confidence interval (CI) 1.04–1.66, Table I). Two individual mutations (K4022X and Q1790X) were found to be significantly associated with leprosy (p < 0.05). Consistent with the results of our previous study, a significant association between mutation c.A12064T:p.K4022X and leprosy was observed (10.10% carrier rate in patients vs 6.89% in controls, p = 0.016, OR 1.48, 95% CI 1.08–2.04, Table I). The other leprosy-associated rare mutation was c.C5368T:p.Q1790X (0.95% carrier rate in patients vs 0.11% in controls, p = 0.021, OR 8.80, 95% CI 1.11–69.58, Table I). Among the remaining 19 mutations, 14 showed higher frequency in patients with leprosy than in healthy controls, but with insignificant p-values (Fig. 2, Table I). In contrast, only 2 individual mutations showed the same frequency in patients with leprosy and healthy controls, and 3 individual mutations showed insignificant higher frequencies in healthy controls (Fig. 2, Table I).
Regardless of mutation frequency in patients, the distribution of the 21 mutations found across the coding sequence of FLG showed no obvious hot spots. Nevertheless, 17 of 21 mutations (80.95%) were located before the 7th repeat of exon 3 (Fig. 2).
Fig. 2. The distribution of 21 discovered mutations across the coding sequencing of gene FLG. Mutations labelled above the schematic diagram of the gene had higher frequencies in patients with leprosy than in healthy controls. Mutations labelled below the schematic diagram of the gene had the same or lower frequencies in patients with leprosy compared with healthy controls. Nine novel discovered mutations are indicated in red.
By employing an array-based high-throughput deep sequencing method, the coding sequence of the filaggrin gene in 945 patients with leprosy and 916 controls was finely mapped. The overall carrier rate of 21 FLG LoF mutations was found to be significantly higher in patients with leprosy than in healthy controls. Two individual mutations (K4022X and Q1790X) were shown to be significantly associated with leprosy. It is well known that the FLG LoF mutations, which can cause pre-stop and frameshift of coding leading to a deficiency of filaggrin, can be directly treated as causal variants without further functional analysis (14). Thus, our findings are the first, to our knowledge, to indicate a role of the FLG gene in the pathogenesis of leprosy.
FLG sequencing remains a challenge because exon 3 of FLG is very large (>12 kb) and highly repetitive (2), which makes direct sequencing of FLG using the Sanger sequencing method difficult, costly and time-consuming. Hence, most previous studies focused only on the genotyping of known predominant LoF mutations in patients, and few studies of the sequencing of the entire FLG gene were reported in large cohorts. In recent years, efforts have been made to sequence FLG using target-region enrichment based on the next-generation sequencing (NGS) method. In 2017, a probe capture enrichment-based NGS method (Roche NimbleGen) was first used to sequence FLG in 54 Chinese Han IV patients (13). Another probe capture system-based NGS method (Agilent SureDesign, Agilent Technologies) was also developed and used for FLG sequencing in 741 children with AD (15, 19). In addition to these 2 probe capture-based enrichment methods, a high-throughput PCR enrichment method (Access Array, Fluidigm), followed by NGS, was also developed and used to sequence FLG in 334 Singaporean patients with IV or AD (16). Those studies not only confirmed the known FLG LoF mutations, but also found several novel LoF mutations. Herein, using the high-throughput PCR enrichment method, NGS of the FLG coding sequence was performed in 945 patients with leprosy and 916 healthy controls. In total, we identified 21 FLG LoF mutations and, among them, 9 were newly reported, highlighting again the efficacy of the NGS method in FLG sequencing.
Although M. leprae, the pathogenic agent of leprosy, was discovered more than 100 years ago, the transmission patterns of M. leprae are still not fully understood (20). The most widely accepted transmission pathway is the upper respiratory tract (21), but several studies have reported M. leprae infection following trauma (20), such as tattoos (22–24), injury with a glass bangle (25) and knee injury (26), which suggests the possibility of infection through impaired skin. A previous study found that FLG LoF mutations could increase the risk of recurrent skin bacterial infections in patients with AD (5). Moreover, knockdown of FLG was shown to increase the colonization of Staphylococcus aureus in an epidermal model (27). The current study found that the carrier rate of overall FLG LoF mutations and 2 individual mutations were significantly higher in patients with leprosy than in healthy controls, indicating the possible pathogenic role of the FLG gene in leprosy, which suggested that an impaired skin barrier may facilitate the transmission of M. leprae through direct skin contact.
In addition to barrier function, filaggrin has also been demonstrated to have immune functions. Filaggrin deficiencies were reported to cause immunological dysregulation, such as increased CD11c expression in Langerhans cells, increased systemic Th17 response and decreased expression of IFN-γ in keratinocytes (28–30). Considering that Langerhans cells and keratinocytes play roles in the pathogenesis of leprosy, and that Th17 cells are also an important T cell subset involved in the immune response to leprosy (31, 32), it is possible that immune dysregulation caused by FLG LoF mutations also contribute to the pathogenesis of leprosy. Further research, using animal or human epidermis models, will be needed to confirm this proposal.
Similar to other studies on FLG in AD or IV, our studies have focused only on the variants that cause pre-stop or frameshift of coding, and other types of mutations, such as missense mutations, were not included (13, 15, 16). This was based on the conventional opinion that truncated filaggrin proteins cannot function normally, as with the homozygous R501X, which has been shown to cause the loss of keratohyalin granules in the epidermis granular layer (1). In the current study, all 21 mutations could cause stop-gain and frameshift of coding, generating truncated protein. In particular, 17 of 21 (80.95%) LoF mutations located before the seventh repeat of exon 3 could cause at least a 40% loss in exon 3, with severe effects on filaggrin function. Another limitation of this study is that the p-value (p = 0.041) was not corrected for multiple tests, which made it necessary to verify our discovery in other populations and samples with larger sample size.
In summary, a comprehensive analysis of FLG and leprosy using NGS sequencing was performed in 945 patients with leprosy and 916 healthy controls, revealing a significant association between FLG LoF mutations and leprosy. Further investigations will be required to reveal the potential barrier and immunological roles of filaggrin in leprosy pathogenesis.
The authors thank all the participants involved in this study. We thank XFW Wong and JC Common from the Common of Institute of Medical Biology, A*STAR, Singapore, for their technical support.
This work was funded by the National Key Research and Development Program of China (2016YFE0201500), the Academic Promotion Programme of Shandong First Medical University (2019LJ002, 2019RC007), the National Natural Science Foundation of China (81822038), the Natural Science Foundation of Shandong Province (ZR2018BC020), and the Shandong Province Taishan Scholar Project (tspd20150214, tsqn201812124).
The authors have no conflicts of interest to declare.