Abstract
Objective
To identify potential diagnostic markers for vitiligo and determine the significance of immune cell infiltration in pathology.
Methods
Three publicly available gene expression profiles (GSE53146, GSE75819 and GSE65127 datasets) from human vitiligo and control samples were downloaded from the GEO database. Differentially expressed genes (DEGs) were screened between 20 vitiligo and 20 control samples. Logical regression of the selection operator (LASSO) model and support vector machine recursive feature elimination (SVM-RFE) analysis were performed to identify candidate biomarkers. The area under the receiver operating characteristic curve (AUC) value was obtained and was used to evaluate the discriminatory ability. The expression level and diagnostic value of the biomarkers in vitiligo were further validated in the GSE65127 dataset (10 vitiligo patients and 10 healthy controls). Finally, the immune cell infiltration of vitiligo was evaluated by CIBERSORT, and the correlation between biomarkers and infiltrating immune cells was analyzed. The compositional patterns of the 22 types of immune cell fractions in vitiligo were estimated from the pooled cohorts using CIBERSORT. In addition, we established a mouse model of vitiligo with monobenzone and validated the screened biomarkers.
Results
A total of 23 associated DEGs were identified, including 9 up-regulated and 14 down-regulated genes. Subsequently, 17 genes meeting prognostic criteria and 2 common genes (DCT and KIF1A) were obtained by SVM and Venn diagram screening. Immunodifferential analysis showed that microenvironment of vitiligo patients was altered. Finally, the different expression was verified by polymerase chain reaction (PCR).
Conclusion
Biomarkers associated with vitiligo can be screened by comprehensive strategies, and immune cell infiltration plays a key role in the development of vitiligo.
Introduction
Considerable progress has been made in understanding of the pathogenesis of vitiligo, which is now classified as an autoimmune disease and is associated with genetic and environmental factors together as well as abnormalities in metabolism, oxidative stress and cell desquamation.Citation1 Histological examination, immunohistochemical studies and biopsy specimens with a wide panel of antibodies generally show an absence or reduction of melanocytes and the presence of a lymphocytic infiltrate in the lesional skin. In addition, large-scale genome-wide association studies performed in Caucasians of European descent and in Chinese have identified nearly 50 different genetic loci that confer a risk of vitiligo.Citation2
As vitiligo is an autoimmune disease with a genetic background, no cure or way of limiting the spread of vitiligo have been reported. Current treatment outcomes vary between individuals and are often unsatisfactory.Citation3 In general, treatment is more effective in newly developed lesions than in older lesions, arguing for early therapeutic intervention.Citation4 In this study, the GEO database was used to screen for biomarkers associated with vitiligo and to analyze the pathological role of immune cell infiltration in the disease, in order to better understand the molecular immune mechanisms involved in vitiligo, and the role of these biomarkers was verified by animal models of vitiligo.
Materials and Methods
Data Acquisition and Preprocessing
Based on the platform of GPL14951 (Illumina Human HT-12 WG-DASL V4.0 R2 expression bead chip), GPL6884 (Illumina Human WG-6 v3.0 expression bead chip) and the GPL570 (Affymetrix Human Genome U133 Plus 2.0 Array), the series of matrix files of the GSE53146, GSE75819 and GSE65127 datasets were obtained from the Gene Expression Omnibus (GEO) database. The GSE53146, GSE75819 and GSE65127 included skin samples from 5 vitiligo patients (VP) and 5 healthy volunteers (HV) in the USA, from 15 VP in India, and from 10 VP and 10 HV in the USA, respectively. The probes in each dataset were converted to gene symbols according to their probe annotation files. For multiple probes corresponding to the same gene symbol, their average was calculated as the final value of the corresponding gene. In addition, the datasets of GSE53146 and GSE75819 were merged into a metadata cohort, as they are significant for combining data from different datasets. The combat function of the “SVA” package of the R software was used to remove the batch effect.
Differentially Expressed Analysis
The limma package of R (http://www.bioconductor.org/) was used for background correction, normalization between arrays, and differential expression analysis in samples of 30 VP and 30 HV. Genes with P-value < 0.05 and absolute log2FC > 0.585 were considered as differentially expressed genes (DEGs). The volcano plots and heat maps were generated using the “heatmap” and “ggplot2” packages of the R software.
Gene Set Enrichment Analysis
Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was performed using the cluster Profiler package in R to identify the biological functions of DEGs. Disease Ontology (DO) enrichment analyses were performed on the DEGs using the “cluster Profiler” and DOSE packages in R. Gene set enrichment analysis (GSEA) was used to identify the most significant functional differences between the two groups of samples. Gene clusters were considered significant if P<0.05 or the false discovery rate was less than 0.025.
Candidate Diagnostic Biomarker Screening
Three machine learning methods were used to predict patients’ conditions by identifying key prognostic variables. The Least Absolute Compression and Selection Operation (LASSO) is a new method to improve prediction accuracy through regularization methods. In R, the LASSO regression algorithm uses “glmnet” grouping to identify genetic factors that are significantly associated with vitiligo and common samples. Support vector machine (SVM) is one of the most widely used supervised machine learning methods. RFE methods were used to optimize the metadata sequences to avoid repetitive screening. For this purpose, suitable features were screened using SVM- RFE (SVM-RFE) to identify the set of gene banks with the highest recognition capability. The random tree algorithm was implemented using the R software “randomForest”. Finally, the intersection points were obtained using the Venn diagram package.
Dataset Validation
The receiver operating characteristic curve is often used to diagnose disease, predict tumor recurrence and predict treatment response. By comparing the area under the curve (AUC), the best diagnostic value can be determined. In this study, the hub genes were verified using the GSE65127 dataset. The pROC package of R was used for calculation and mapping, and genes with an AUC > 0.8 were selected.
Discovery of Immune Cell Subtypes
To quantify the invasive rate of immune cells in vitiligo, a bioinformatics algorithm called CIBERSORT (https://cibersortx.stanford.edu/) was used. The number of immune cells was estimated using a reference system of 22 isoforms (LM22) for 1000 permutations. Twenty-two infiltrating immune cells were correlated using the R software “corrplot”. The graphs show the infiltration of immune cells in vitiligo and control samples.
Analysis of the Correlation Between Identified Genes and Infiltrating Immune Cells
The R software Spearman’s hierarchical correlation was used to investigate the relationship between identified genetic markers and invading immune cells. Correlations were generated using the graphical technique in the “ggplot2” suite.
Establishment of Mouse Vitiligo Model and Experimental Design
Male C57 mice, 22–24 g, 4 weeks old, were purchased from Ltd. in Jiangsu, China. After 1 week of acclimatization, they were randomly divided into experimental and control groups (n= 6 per group). To induce vitiligo models, 40% monobenzyl ether of hydroquinone (MBEH) cream was topically applied to the shaved dorsal skin of the mice, while the control group was treated with petrolatum. The backs of the mice were shaved every 2–3 days so that MBEH was applied to skin other than the fur. Most of the mice in the experimental group began to develop depigmentation from day 20, and all developed overt vitiligo by 30 days.
HE Staining
Mouse skin tissues were fixed in 4% methylal solution overnight, then embedded in paraffin and cut into approximately 3 mm thick slices, which subjected to dewaxing, dehydration, HE staining, dehydration, transparency and sealing with neutral gum. Inflammatory cell infiltration in the skin tissues was observed and recorded under light l microscopy.
Real-Time Quantitative PCR (qPCR)
Total RNA from skin tissues of vitiligo lesions in mice was extracted using the Trizol (Sigma-Aldrich) method, reverse transcribed into cDNA using the Takara kit, and fluorescence quantitative detection of related gene expression was performed using the Roche LightCycler 480qPCR system. The forward and reverse primer sequences were as follows: DCT forward sequence: GCAAGATTGCCTGTCTCTCCAG, DCT Reverse Sequence: CTTGAGAGTCCAGTGTTCCGTC; KIF1A Forward Sequence: CGCTGACATCTTCTGCCAGTTC, KIF1A Reverse Sequence: CGATGAAGGACTTGGTCACCTC. Data analysis was calculated using the 2−∆∆Ct method.
Statistical Analysis
All values are presented as mean ± standard error and analyzed by Student’s t-test and one-way Wilcoxon rank sum test. All experiments were performed at least three times.
Results
Identification of DEGs in Vitiligo
Data from 30 VP and 30 HV from three GEO datasets (GSE53146, GSE75819, andGSE65127) were analyzed. DEGs from the metadata were analyzed using the limma package after removing batch effects. A total of 23 DEGs were obtained, including 9 up-regulated and 14 down-regulated genes. ().
Functional Correlation Analysis
Vitiligo-related DEGs (VRDEGs) were first analyzed by GO enrichment, KEGG and DO pathway enrichment. As a result, 205 biological processes, 10 signalling pathways, and 127 diseases were significantly enriched. The biological processes were enriched for proteasomal protein catabolic process, ribonucleoprotein complex biogenesis and viral process. The matrix, inner membrane, protein-containing complex of mitochondria and ribosome were enriched. In addition, significant enrichment was observed in cadherin binding, ribosome structural components and transcriptional co-regulator activity in molecular function (). KEGG analysis revealed enrichment of pathways in neurodegenerative multiple diseases, Alzheimer’s disease and amyotrophic lateral sclerosis (). Vitiligo, female reproductive organ carcinoma, hepatitis and renal cell carcinoma were significantly enriched in DO analysis (). In addition, the enrichment difference between the vitiligo and control groups was further investigated using gene set enrichment analysis (GSEA). Cell cycle, oxidative phosphorylation, parkinsons disease, proteasome and ribosome were significantly enriched in the vitiligo group. Acute myeloid leukaemia, axon guidance, basal cell carcinoma, lysosome and melanogenesis were significantly enriched in the control group ( and ).
Identification and Validation of Diagnostic Feature Biomarkers
Two methods were used to identify possible biological markers of vitiligo. LASSO regression was used to study the application of VRDEGs, and 17 biochemical indicators that could be used in the diagnosis of vitiligo were identified (). VRDEGs were identified using the SVM-RFE method for four feature points (). Finally, an intersection of the two algorithms and two related metalloproteinases (), DCT and KIF1A were obtained. We then evaluated the accuracy of the related genes selected by machine learning as disease diagnostic genes by ROC curve ( and ). The AUC values of the two related genes were 0.998 (95% CI 0.985–1.000), indicating high sensitivity and specificity.
Identification and Validation of Diagnostic Biomarkers
The GSE65127 dataset was used to evaluate the accuracy and reliability of the generated results, which showed that the expression levels of DCT and KIF1A in vitiligo tissues were significantly higher than those in the control group ( and ; all P < 0.05). The AUC representing diagnostic ability of the two biomarkers combined, was 1.000 (95% CI 1.000–1.000; and ), suggesting that DCT and KIF1A had a high value for diagnosis of vitiligo.
Immune Cell Infiltration
The analyzed results were mapped to show the distribution of immune cells in the tissues () and correlation analysis was performed to investigate the correlation between the immune cells and vitiligo (). The proportions of regulatory T cells (Tregs) (P < 0.001), macrophages (P = 0.045) and activated dendritic cells (P = 0.011) were significantly lower, and of CD4 memory T cells (P < 0.001), γδ T cells (P = 0.011), macrophages M1 (P < 0.001), mastocytes (P = 0.006) were significantly higher in the vitiligo tissues than those in the normal tissues ().
Correlation Analysis Between Identified Genes and Infiltrating Immune Cells
The correlation between the two biological markers and each immune cell was further investigated (). That all significantly correlated positively with T cells regulatory and activated dendritic cells, and negatively with macrophages M1, T cells CD4 memory activated and T cells gamma delta ( and ).
Vitiligo Model Evaluation and Histological Identification
After topical application of monobenzone, white spots appeared first at the drug exposure site, mainly manifested as white hairs, and the white patches gradually expanded with time. Finally, white spots were observed in some non-drug sites of the mice (). The results of HE staining under light microscope showed that the epidermis of the monobenzone-induced vitiligo model group of mice showed local epidermis thickening, more inflammatory cell infiltration in the superficial dermis, some hair follicles were abnormal and dysplasia, and the number of melanin-containing hair follicles was significantly reduced (). These results demonstrate that we have successfully established a monobenzone-induced vitiligo mouse model.
Validation of Diagnostic Feature Biomarkers
We also identified changes in the levels of KIF1A and DCT genes in vitiligo models. Compared with the control group, the protein expression levels of KIF1A and DCT were significantly increased in the monobenzone group, suggesting that these KIF1A and DCT play a role in the construction of the vitiligo mouse model by monobenzone ().
Discussion
Vitiligo is a common immune-mediated depigmented disease, characterized by chalky-white macules or patches.Citation5,Citation6 In clinical practice, vitiligo can be classified as active or stable and is divided into three types based on the distribution of the lesions, namely generalized, segmental and localized patterns, and it is easy to diagnose but with a long treatment cycle and unpredictable course.Citation1,Citation3 Although vitiligo has little impact on physical health, it seriously affects the quality of life of individuals life in terms of physical, social, psychological and occupational aspects. Therefore, it is necessary to identify new and accurate biomarkers of vitiligo. To achieve this goal, this study used bioinformatics to scientifically analyze the sequencing information of patients, screen out the core genes, identify markers that are predictive of disease onset and progression, and provide a theoretical basis for understanding pathogenesis and molecular targeted therapies.
Dopachrome tautomerase (DCT), another name for TRRP2, is part of the tyrosinase complex, and mutations in TRP-2 cause changes in melanin.Citation7 Much literature has shown that DCT plays an important role in altering the structure and composition of melanin.Citation8 Kinesin, the smallest molecular motor known, catalyzes the hydrolysis of adenosine triphosphate (ATP), efficiently converting the chemical energy carried by ATP molecules into mechanical energy, resulting in a continuous step along the microtubules and the transport of the organelles and intracellular material they carry.Citation9 Kinesins are a large superfamily of motor proteins (KIF superfamily). Kinesin family member 1A (KIF1A) belongs to the kinesin-3 family and is commonly found as a monomer in the body, whose function is to transport different types of cargo from the neuronal soma to the synapses, moving them in an anterograde direction along microtubules.Citation10 KIF1A is a potential diagnostic marker for Hereditary spastic paraplegia (HSP), head and neck squamous cell carcinoma (HNSCC), oral squamous cell carcinomas (OSCC), breast cancer (BC), nasopharyngeal carcinoma (NPC), etc.Citation11–13 However, the function of KIF1A has not been established in vitiligo.
In this study, we used LASSO regression and SVM-RFE method to identify possible biological markers, 17 vitiligo-related predictors from VRDEGs were detected by LASSO regression and 4 biochemical indicators were identified by SVM-RFE method. The intersection of the genes obtained by the two methods was then performed to obtain DCT and KIF1A. The ROC curve and AUC area further indicated their high reliability as biomarkers.
For this reason, the CIBERSORT program was used in the current study to analyze the immune infiltration process that occurs in vitiligo and to study the correlation between these two biometric factors and immune cells. The results showed that macrophage infiltration is present in vitiligo lesions and surrounding skin, and that macrophages are involved in melanocyte clearance. In addition, Treg cells belong to a subgroup of CD4+ T cells, and the impaired number and function of Treg cells are closely related to the immune tolerance of vitiligo patients.Citation4,Citation14
Local inflammation in vitiligo lesions can activate NK cells and drive an adaptive immune response by releasing pro-inflammatory cytokines. NK cells are cytotoxic and can affect the antigen presentation of dendritic cells and promote their function and maturation.Citation15 In addition, the results showed that DCT and KIF1A were significantly correlated with a variety of changes in immune cell infiltration, and the correlation trend of DCT and KIF1A with these immune cells was the same. This led us to speculate that DCT and KIF1A may be involved in the development of vitiligo through an immune response. Although KIF1A has not been extensively studies in vitiligo, it has the potential to serve as an effective new diagnostic indicator, predictor or therapeutic target after further validation.
However, there are still some limitations to this study. The sample size we used was small and the results may have limitations, although we used another dataset for validation. Secondly, this study is based on bioinformatics, the CIBERSORT analysis is based on limited genetic data, and the analysis of immune cell infiltration is still limited. Although the results of some previous studies are consistent with our analysis, the reliability of the research results needs to be verified by further experiments.
Conclusion
In summary, our investigation has identified DCT and KIF1A as biomarkers associated with vitiligo in the cohort studied. In addition, we have identified a correlation between the manifestation of vitiligo and immune cell types including CD4 T cells, CD8 T cells, Tregs, NK cells, dendritic cells, and macrophages. These immune cell populations appear to have a notable influence on the progression of vitiligo. However, it is important to acknowledge the limitations of our study, which are largely due to the bioinformatics methodology and the limited genetic data included in the CIBERSORT analysis. Consequently, the conclusions drawn require careful interpretation, and future investigations should delve deeper into unraveling the intricate interactions between infiltrating immune cells. This more in-depth investigation promises to make significant contribution to delineating precise immunotherapy targets for vitiligo and improving immunomodulatory strategies for people living with this condition.
Data Sharing Statement
The datasets presented in this study can be found in online repositories.
Ethics Statement
The study was approved by the Ethics Committee of the Third Affiliated Hospital of Soochow University and followed the Soochow University Experimental Animal Management Guidelines (Approval No. 2023-046).
Author Contributions
All authors have made a substantial contribution to the work reported, be it in conception, design, conduct, acquisition of data, analysis and interpretation, or all of these; have been involved in drafting, revising, or critically reviewing the article; have given final approval for the version to be published; have agreed on the journal to which the article will be submitted; and agree to accept responsibility for all aspects of the work.
Disclosure
The authors declare that the research was conducted in the absence of any commercial or financial relationship that could be construed as a potential conflict of interest.
Acknowledgments
We are grateful to the GEO database for providing its platforms and to contributors for uploading their powerful datasets.
Additional information
Funding
References
- Bergqvist C, Ezzedine K. Vitiligo: a Review. Dermatology. 2020;236(6):571–592. doi:10.1159/000506103
- Bibeau K, Pandya AG, Ezzedine K, et al. Vitiligo prevalence and quality of life among adults in Europe, Japan and the USA. J Eur Acad Dermatol Venereol. 2022;36(10):1831–1844. doi:10.1111/jdv.18257
- Ezzedine K, Eleftheriadou V, Whitton M, van Geel N. Vitiligo. Lancet. 2015;386(9988):74–84. doi:10.1016/S0140-6736(14)60763-7
- Frisoli ML, Essien K, Harris JE. Vitiligo: mechanisms of Pathogenesis and Treatment. Annu Rev Immunol. 2020;38(1):621–648. doi:10.1146/annurev-immunol-100919-023531
- Hamzavi IH, Bibeau K, Grimes P, et al. Exploring the natural and treatment history of vitiligo: patient and healthcare professional perceptions from the global VALIANT study. Br J Dermatol. 2023;189(5):569–577. doi:10.1093/bjd/ljad245
- LeWitt TM, Kundu RV. Vitiligo. JAMA Dermatol. 2021;157(9):1136. doi:10.1001/jamadermatol.2021.1688
- Avogadri F, Gnjatic S, Tassello J, et al. Protein Expression Analysis of Melanocyte Differentiation Antigen TRP-2. Am J Dermatopathol. 2016;38(3):201–207. doi:10.1097/DAD.0000000000000362
- Salinas-Santander M, Trevino V, De la Rosa-Moreno E, et al. CAPN3, DCT, MLANA and TYRP1 are overexpressed in skin of vitiligo vulgaris Mexican patients. Exp Ther Med. 2018;15(3):2804–2811. doi:10.3892/etm.2018.5764
- Lo KY, Kuzmin A, Unger SM, Petersen JD, Silverman MA. KIF1A is the primary anterograde motor protein required for the axonal transport of dense-core vesicles in cultured hippocampal neurons. Neurosci Lett. 2011;491(3):168–173. doi:10.1016/j.neulet.2011.01.018
- Hammond JW, Cai D, Blasius TL, et al. Mammalian Kinesin-3 motors are dimeric in vivo and move by processive motility upon release of autoinhibition. PLoS Biol. 2009;7(3):e1000072. doi:10.1371/journal.pbio.1000072
- Lu X, Li G, Liu S, Wang H, Zhang Z, Chen B. Bioinformatics Analysis of KIF1A Expression and Gene Regulation Network in Ovarian Carcinoma. Int J Gen Med. 2021;14:3707–3717. doi:10.2147/IJGM.S323591
- Vale RD. The molecular motor toolbox for intracellular transport. Cell. 2003;112(4):467–480. doi:10.1016/S0092-8674(03)00111-9
- Montenegro-Garreaud X, Hansen AW, Khayat MM, et al. Phenotypic expansion in KIF1A-related dominant disorders: a description of novel variants and review of published cases. Hum Mutat. 2020;41(12):2094–2104. doi:10.1002/humu.24118
- Bergqvist C, Ezzedine K. Vitiligo: a focus on pathogenesis and its therapeutic implications. J Dermatol. 2021;48(3):252–270. doi:10.1111/1346-8138.15743
- Zloza A, Lyons GE, Chlewicki LK, et al. Engagement of NK receptor NKG2D, but not 2B4, results in self-reactive CD8+ T cells and autoimmune vitiligo. Autoimmunity. 2011;44(8):599–606. doi:10.3109/08916934.2011.593599