515
Views
0
CrossRef citations to date
0
Altmetric
Animal Genetics and Breeding

Identifying key DNA methylation sites and their cis-methylation quantitative loci for intramuscular fatty acid traits using genome and methylome data in Yorkshire pigs

, , , , , , , , , , & show all
Pages 829-840 | Received 12 Mar 2023, Accepted 07 Aug 2023, Published online: 25 Aug 2023

Abstract

Single-nucleotide polymorphisms (SNPs) that are located near CpG sites have the potential to influence DNA methylation levels, leading to distinct patterns of DNA methylation. These methylation quantitative trait loci (meQTLs) can influence methylation levels across the entire genome, thereby impacting phenotypic traits. In this study, we utilised reduced representation bisulphite sequencing (RRBS) to analyse DNA methylation in muscle tissue samples collected from 140 Yorkshire pigs. Additionally, we assessed SNPs using the Porcine 50K chip, examined 17 fatty acid traits, and conducted epigenome-wide association studies (EWAS) and genome-wide association studies (GWAS). Our investigation revealed numerous associations between DNA methylation and fatty acid traits, including C16:0, C18:0, C20:3n-3, and C22:6n-3. Notably, a substantial portion of these associations was specifically identified by EWAS and not GWAS. Subsequently, we performed GWAS to identify meQTLs using methylation levels of significant CpGs from the EWAS as traits. The results revealed the predominance of trans-meQTLs; however, we also discovered a cis-meQTL on SSC12. Finally, we reported a crucial CpG site (SSC12:3735523) and its cis-meQTL on SSC12, which are located in close proximity to the FASN gene. Our findings contribute to the identification of CpG sites and their cis-meQTLs associated with fatty acid traits, utilising comprehensive genome and methylome data. Moreover, our study provides novel insights into the molecular mechanisms underlying fatty acid traits in pigs.

Introduction

The intramuscular fatty acid composition, serving as the fundamental energy source and a crucial regulator of cellular metabolism in animals, holds significant importance in terms of its nutritional properties and impact on human health (Wood et al. Citation2004). Fatty acids encompass various forms, including saturated fatty acids (SFAs), trans-fatty acids (TFAs), monounsaturated fatty acids (MUFAs), and polyunsaturated fatty acids (PUFAs) (Kremmyda et al. Citation2011; Tvrzicka et al. Citation2011). Additionally, the fatty acid composition substantially influences meat quality traits. Elevated levels of PUFAs have been linked to diminished oxidative stability and flavour (Wood et al. Citation2008), while augmenting the content of MUFAs has shown potential in enhancing organoleptic and technological qualities, as well as nutritional properties (Cameron et al. Citation2000). Therefore, acquiring a comprehensive understanding of the underlying mechanisms that govern pig fatty acid composition is crucial for enhancing the quality of pork and optimising production efficiency.

In the past few decades, extensive genome-wide association studies (GWAS) conducted in diverse pig populations have successfully identified numerous genetic loci associated with fatty acid traits (Nii et al. Citation2006; Guo et al. Citation2009; Muñoz et al. Citation2013; Ros-Freixedes et al. Citation2016; Zhang et al. Citation2016). Additionally, candidate genes involved in lipid metabolism, such as ELOVL fatty acid elongase 6 and 7 (ELOVL6 and ELOVL7) (Zhang et al. Citation2016), fatty acid synthase (FASN) (Sato et al. Citation2017), fatty acid desaturase 1 and 2 (FADS1 and FADS2) (He et al. Citation2018), and stearoyl-CoA desaturase (SCD) (Viterbo et al. Citation2018) have been discovered for fatty acid traits in adipose and muscle tissue. Similarly, (Orozco et al. Citation2014) highlighted the inter-individual variability exhibited by epigenetic modifications, much like genetic variations. Methylation quantitative trait locus (meQTL) studies in human whole blood have identified approximately 4.7 million cis-meQTLs and 630 thousand trans-meQTLs targeting over 120 thousand CpG sites (Huan et al. Citation2019). These findings are supported by (Verma Citation2012; Xu et al. Citation2018; Gomez-Alonso et al. Citation2021). (Schulz et al. Citation2017) analysed 110 human hippocampal tissues, and they discovered that DNA methylation in hippocampal tissues is linked to various complex traits and diseases, resulting in the identification of cis-meQTLs at 14,118 CpG loci. They further integrated these findings with gene expression data to construct a comprehensive map of meQTLs and expression quantitative trait loci (eQTLs) in the human hippocampus. In a separate study, Lin et al.(Lin et al. Citation2018) explored meQTLs in 344 human post-mortem brain tissue samples, constructing a genetic regulatory map of DNA methylation in the human brain genome. Their study revealed that approximately 55% of CpG loci across the genome exhibited meQTL associations, indicating the widespread presence of meQTLs throughout the genome. Furthermore, (Gong et al. Citation2019) in their study on various cancer tissue samples, observed a higher proportion of trans-meQTLs. They identified trans-meQTLs in more than 10% of the total CpG loci analysed in relation to cancer. Collectively, these studies provide further evidence supporting the importance of identifying meQTLs in enhancing our understanding of the impact of SNPs on CpG loci methylation.

In light of the hypothesis that genetic variation can influence DNA methylation and subsequently impact fatty acid traits in pigs, our study aimed to elucidate the molecular mechanisms underlying the genetic regulation of porcine fatty acid traits. We employed an integrated approach that combined genomic and methylomic data to achieve this. By doing so, we aimed to identify crucial DNA methylation sites and their corresponding cis-meQTLs associated with fatty acid traits in pigs.

In our investigation, we conducted epigenome-wide association studies (EWAS) to elucidate the intricate relationship between DNA methylation levels and 17 fatty acid traits in a carefully selected cohort of 140 Yorkshire pigs. The primary objective of our study was to determine the extent to which DNA methylation contributes to the observed variation in fatty acid traits. To accomplish this, we integrated SNP data with reduced representation bisulphite sequencing (RRBS) data, enabling us to reveal a substantial number of associations between DNA methylation patterns and fatty acid traits. Notably, a significant proportion of these associations remained undetected when employing conventional genome-wide association studies (GWAS) methods. Furthermore, to gain further insights into the genetic underpinnings of the observed associations, we conducted GWAS using the significant CpG sites identified from the EWAS as phenotypes and the SNPs as predictors. Through this comprehensive analysis, we successfully identified a cis-meQTL associated with a key CpG site situated at position SSC12:3735523 on the pig genome. Remarkably, the FASN gene, which plays a crucial role in fatty acid metabolism, was found to be in close proximity to the identified key CpG site and its corresponding cis-meQTL. These compelling findings provide compelling evidence of the involvement of DNA methylation in modulating fatty acid traits in pigs and offer valuable insights into the molecular mechanisms underlying the role of the FASN gene in fatty acid metabolism.

Material and methods

Ethics declarations

All experimental procedures were conducted in strict compliance with the ethical guidelines and regulations set forth by the Institutional Review Board (IRB14044) and the Institutional Animal Care and Use Committee of the Sichuan Agricultural University, with complete adherence to permit number DKY-B20140302. These measures ensured the ethical treatment and welfare of the animals involved in the study.

Animals and DNA sample

A total of 140 Yorkshire pigs consisting of 51 males and 89 females, were utilised to extract genomic DNA from muscular tissue. Each pig was considered to be an individual experimental unit. The pigs were raised under the same recommended environment at the conservation farm of Mingxing Agricultural Science and Technology Development Co., Ltd. The pigs with an initial weight of about 30 kg were group-housed in cement-floor pens with 20 pigs per pen. The pigs were fed twice daily with the same diet, and water was provided ad libitum using nipple drinkers. The diet composition was formulated using corn and soybean meal, ensuring that it contained adequate concentrations of crude protein, trace minerals, and vitamins that met or exceeded the recommendations set by the National Research Council (NRC, 1998) for different stages of growth (Luo et al. Citation2021). All individuals were raised until they reached an average weight of 111.71 kg (±12.97 kg) before being transported to the slaughterhouse. A 24-h fasting period was observed prior to slaughter determination. Once the carcase composition traits were determined, samples were taken from the region between the fourth and fifth last rib for fatty acid composition analysis. Samples from the muscle tissue were snap-frozen in liquid nitrogen and stored at −80 °C until analysis. The fatty acid composition was analysed using a GC-14C gas chromatograph (Shimadzu, Kyoto, Japan) following a previously published method (Qin et al. Citation2007). The fatty acid components in the test sample were determined by comparing the peak times of each component in the fatty acid standard solution using gas chromatography. The percentage content of each fatty acid component was calculated using the peak area normalisation method.

Library construction

Genomic DNA was extracted from flash-frozen muscular tissue samples. Subsequently, RRBS libraries were constructed, and paired-end sequencing was conducted using the Illumina HiSeq analyser at Novogene Technology Co., Ltd. (Beijing, China). The raw sequencing data underwent processing using the Illumina base-calling pipeline. The genomic DNA was first digested with the MspI enzyme at 37 °C for 16 h. The resulting DNA fragments were repaired at the ends, and sequencing adapters with all cytosines methylated were ligated. DNA fragments with a length ranging from 40 to 220 bp were selected for size selection. Following this, bisulphite conversion was performed. During bisulphite conversion, unmethylated cytosines were converted to uracils (which were subsequently converted to thymine during PCR amplification), while methylated cytosines remained unchanged. Finally, PCR amplification was conducted to obtain the final DNA library. The Trimmomatic software (Bolger et al. Citation2014) was used to obtain clean reads from the raw data. The following quality control criteria were applied: (1) removal of low-quality reads using a sliding window method (SLIDINGWINDOW:4:15); (2) removal of reads containing adapter sequences (ILLUMINACLIP:adapter.fa:2:30:7:1:true); and (3) removal of reads with a trailing quality score lower than 3 or with unknown bases (TRAILING:3).

SNP data

The genotyping of pigs was performed using the Neogen GeneSeek Porcine50k chip, which is based on the Illumina platform (Illumina Inc., San Diego, USA). Genotypes were subjected to a series of filtering criteria. Loci were filtered to ensure they met the following criteria: Hardy-Weinberg equilibrium P>1.0×106, minor allele frequency (MAF) > 0.05, and call rate > 0.9. After applying these quality control measures, the final dataset used for subsequent analyses comprised a total of 36,163 loci.

Data analysis

The clean reads obtained from sequencing were aligned to the pig reference genome (Sscrofa11.1) using the Bismark software (Krueger and Andrews Citation2011) after removing adaptor sequences. Subsequently, the methylation levels of CpG sites were measured using the bismark_methylation_extractor program. To ensure high-quality data, we filtered the methylation level data by retaining only CpG cytosines that were present in all samples and had coverage in at least 30 samples. Furthermore, we applied an additional filter to include CpG sites with a minimum coverage of 10 reads within each sample.

The distribution of CpG sites was analysed using the R package Rldeogram v0.2.2 (Hao et al. Citation2020). The R package genomation v1.20 (Akalin et al. Citation2015) was used to annotate all the analysed CpGs. The annotations were based on the porcine RefSeq and CpG island databases (Sscrofa11.1/susScr11), obtained from the UCSC website (http://genome.ucsc.edu/cgi-bin/hgTables). In our analysis, the gene body region was defined as the region from the transcription start site (TSS) to the transcription termination site, while the promoter region was defined as the 2-kb region upstream of the TSS.

Epigenome-wide association study

The R package CpGassoc v2.60 was used to assess the association between the methylation level of CpG sites and 17 fatty acid traits. To perform this analysis, we employed a linear mixed-model approach, considering the methylation level of CpG as the outcome variable. The model was adjusted for covariates, including sex, farm, and days of age. The specific model applied was as follows: (1) y=μ+xβ+u+e(1) where y is the vector of the phenotypes, μ is the mean, x  is the vector of CpG, β  is the CpG effect, u represents random effects, e is the vector of random residuals.

We corrected for multiple comparisons using a Bonferroni correction for 0.05/4863340 test setting a significant threshold P value to 1.0821 ×108. Subsequently, we constructed Manhattan plots to present the results of epigenome-wide association analysis. The Manhattan plots and quantile-quantile plots (Q-Q plots) were performed using the R package qqman v0.1.4 (Turner Citation2014).

Candidate genes and annotation

To identify candidate genes associated with the significant CpG sites, we utilised the Ensembl Biomart database (http://www.biomart.org). For intragenic associations, the candidate gene corresponds directly to the gene itself. For intergenic associations, we considered the two nearest flanking genes based on their genomic distance from the CpG site. The distances between the site and each flanking gene were recorded for intergenic associations.

To further investigate the functional implications of these candidate genes, we performed enrichment analysis using the Database for Annotation, Visualisation, and Integrated Discovery (DAVID) v6.8 (http://david.ncifcrf.gov/). The gene lists obtained from the previous step were submitted to DAVID for analysis. Significant Gene Ontology (GO) terms and Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathways were identified, filtering the results with a significance threshold of p < 0.01.

Genome-wide association study for fatty acid traits

GWAS analyses were conducted independently for each of the 17 fatty acid traits using the GEMMA software (DePristo et al. Citation2011). The association analysis was performed using a mixed linear model, which accounted for the complex structure of the data. In this model, the phenotype traits were considered as the dependent variable, and adjustments were made for potential confounding factors such as sex, farm, and days of age. The specific model used for the analysis can be described as follows: (2) y=Xm+Wa+u+e(2) where y is the vector of the phenotypes; m is the SNP effects; a is the vector of residual polygene effects; u represent the random effects; X and W are the incidence matrixes for m and a, respectively; e is the vector of random residuals. SNP associations that achieved the Bonferroni threshold of P<1.38×106 were considered as significant associations.

Overlap of EWAS and GWAS

A proximity criterion was applied to determine the overlap between associations obtained from the EWAS and GWAS analyses. Specifically, associations were considered to be overlapping if they were located within a genomic distance of 2 megabases (2 Mb) from each other (Battram et al. Citation2022). This approach allowed for the identification of potential shared genetic variants or genomic regions that contribute to both DNA methylation patterns and fatty acid traits.

Identification of methylation quantitative trait loci

The methylation levels of CpG loci that remained significant in the previous conditional association analysis were used as dependent variables with SNP as the independent variable and subjected to association analysis to identify methylated quantitative trait loci (meQTL). A mixed linear model was used in the association analysis as follows: y=Xm+Wa+e where y is the CpG methylation level; m is the SNP effect; a is the residual polygenic effect; e is the residual; and X and W are the association matrix of m and a, respectively. SNP associations achieve the Bonferroni threshold of P<4.56×109 were considered as significant associations.

Identification of meQTL by methylation GWAS

GWAS was conducted to identify methylation quantitative trait loci (meQTLs) using the significant CpG sites identified from the EWAS as traits and single nucleotide polymorphisms (SNPs) as predictors. The GEMMA software (DePristo et al. Citation2011) was utilised for the association analysis, employing a mixed linear model. It is important to note that there is a distinction between the GWAS model described previously, which focused on fatty acid traits with the vector of phenotypes denoted as y, and the methylation GWAS linear mixed model where the vector of methylation levels is represented as y. This difference reflects the specific nature of the analysis, which aims to elucidate the genetic factors influencing DNA methylation patterns.

Results

Phenotype statistics

This study conducted a comprehensive analysis of fatty acid composition on muscle samples obtained from 140 pigs. presents the results, revealing the abundance of 17 different fatty acid traits in the samples. Among them, oleic acid (C18:1n-9) was found to be the most predominant fatty acid, constituting over 30% of the total fatty acid content. Following oleic acid, palmitic acid (C16:0) and linoleic acid (C18:2n-6) were observed as the next most abundant fatty acids. Notably, these 17 fatty acid traits collectively account for more than 95% of the total fatty acid content in the muscle samples.

Table 1. Summary of fatty acid Profile of the Yorkshire pigs (% of total fatty acids).

Summary of RRBS data

RRBS libraries were constructed from muscular tissue samples obtained from 140 Yorkshire pigs (), and the sequencing was performed using the Illumina HiSeq platform. On average, each sample yielded approximately 14.02 ± 2.45 Gb of raw data. After quality control, an average of 11.13 ± 2.14 Gb of clean data per sample was obtained. The bisulphite conversion efficiency of the Yorkshire pigs exceeded 99%, indicating a robust conversion of unmethylated cytosines to uracils. Furthermore, more than 60% of the reads from the Yorkshire pigs were successfully mapped to the porcine reference genome.

Table 2. Mapping results of reduced representation bisulphite sequencing (RRBS) data in the Yorkshire pigs.

For further analysis, the methylation data were filtered to include only CpG sites with a minimum coverage of 10× and present in at least 30 samples. This filtering resulted in a total of 4,863,340 CpG sites for subsequent analysis (). The distribution of these CpG sites across different genomic regions was as follows: 3.73% in the promoter region, 4.67% in exons, 38.41% in introns, and 53.19% in intergenic regions. Additionally, CpG islands (CGIs) are generically equipped to influence local chromatin structure and simplify the regulation of gene activity. Furthermore, there is a strong correlation between CGIs and transcription initiation. The percentages of these CpG sites annotated within CpG islands, CpG island shores, and other regions were 3.20%, 11.54%, and 85.26%, respectively.

Table 3. Annotation of Global CpG sites and significant CpG sites identified for EWAS.

EWAS identifies methylation signatures significantly associated with fat acid traits

We conducted an Epigenome-Wide Association Study (EWAS) to investigate the relationship between CpG methylation levels and 17 fatty acid traits. The results of the EWAS analysis are presented in .

Figure 1. Manhattan plots of fatty acid traits in Epigenome-Wide Association Study (EWAS). (A) C10:0; (B) C12:0; (C) C14:0; (D) C16:0; (E) C16:1; (F) C18:0; (G) C18:1n-9; (H) C18:2n-6; (I) C18:3n-3; (J) C18:3n-6; (K) C20:0; (L) C20:1n-9; (M) C20:2; (N) C20:3n-3; (O) C20:3n-6; (P) C20:4n-6; (Q) C22:6n-3.

Note: The x-axis represents the chromosome and the y-axis represents the significant of all CpG sites, represented by -log10(P-value). The red line represents the genome level significance threshold based on 0.05/N, where N is the number of CpG sites.

Figure 1. Manhattan plots of fatty acid traits in Epigenome-Wide Association Study (EWAS). (A) C10:0; (B) C12:0; (C) C14:0; (D) C16:0; (E) C16:1; (F) C18:0; (G) C18:1n-9; (H) C18:2n-6; (I) C18:3n-3; (J) C18:3n-6; (K) C20:0; (L) C20:1n-9; (M) C20:2; (N) C20:3n-3; (O) C20:3n-6; (P) C20:4n-6; (Q) C22:6n-3.Note: The x-axis represents the chromosome and the y-axis represents the significant of all CpG sites, represented by -log10(P-value). The red line represents the genome level significance threshold based on 0.05/N, where N is the number of CpG sites.

A total of 2,390 significant associations were identified between CpG sites and fatty acid traits (). Among these significant CpG sites, 5.02% were annotated within promoter regions, 5.31% were annotated within exon regions, 42.93% were annotated within intron regions, and 46.74% were annotated within intergenic regions. Regarding the distribution of these significant CpG sites, 2.59% were annotated within CpG islands, 10.33% were annotated within CpG island shores, and 87.08% were annotated within other regions ().

Table 4. Summary of Epigenome-Wide Association Study (EWAS) results and Genome-Wide Association Study (GWAS) results for 17 fatty acid traits.

Based on the 2,390 significant CpG sites, we identified a total of 763 protein-coding genes and lncRNAs (Table S1). These genes and lncRNAs were then subjected to enrichment analysis, resulting in the identification of 18 Gene Ontology (GO) terms and 7 Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathways that showed enrichment. Notable enriched terms and pathways included Insulin resistance, actin filament bundle assembly, and F-actin capping protein complex (). It is indicated that these genes and lncRNAs may affect meat quality traits by regulating biological pathways related to insulin and actin composition.

Figure 2. Bubble diagram of gene Ontology (GO) terms and Kyoto encyclopaedia of genes and genomes (KEGG) pathway terms for significant associations in Epigenome-Wide Association Study (EWAS).

The X-axis represents the P value of genes enriched in the corresponding GO and KEGG pathway terms. The Y-axis represents the GO and KEGG pathway terms to which the genes are enriched. The bubble shape represents the classification of GO and KEGG pathway terms. The colour of the bubble represents the log transformation of the P value.

Figure 2. Bubble diagram of gene Ontology (GO) terms and Kyoto encyclopaedia of genes and genomes (KEGG) pathway terms for significant associations in Epigenome-Wide Association Study (EWAS).The X-axis represents the P value of genes enriched in the corresponding GO and KEGG pathway terms. The Y-axis represents the GO and KEGG pathway terms to which the genes are enriched. The bubble shape represents the classification of GO and KEGG pathway terms. The colour of the bubble represents the log transformation of the P value.

Additionally, we used the inflation factor (λ), which represents the regression of observed values on expected values, to assess population stratification. An average inflation factor of 0.95 indicated that there was no inflation in the EWAS results, suggesting that the influence of population stratification was effectively controlled for in the analysis.

Genome-wide association analyses for fatty acid traits

After quality control, a total of 36,163 SNPs were retained for GWAS analysis of the 17 fatty acid traits (). Among these SNPs, we identified 16 significant associations (P<1.38×106)with the fatty acid traits. Specifically, there were 2 significant SNPs associated with C12:0, 4 significant SNPs associated with C20:3n-3, 1 significant SNP associated with C20:4n-6, and 9 significant SNPs associated with C22:6n-3. For the C12:0 trait, the two significant SNPs were located on SSC2 at positions 28,709,127 bp and 28,740,723 bp. Regarding the C20:3n-3 trait, the four significant SNPs were located on SSC12 at positions 25,708,148 bp and 43,997,932 bp, as well as on SSC16 at positions 75,205,458 bp and 75,391,932 bp. The significant SNP associated with C20:4n-6 was located on SSC14 at position 12,857,576 bp. Lastly, for the C22:6n-3 trait, the nine significant SNPs were located on SSC4 at positions 106,382,235 bp, 106,485,236 bp, 109,093,503 bp, 109,885,605 bp, and 110,552,282 bp, as well as on SSC12 at positions 60,260,816 bp and 60,372,315 bp, and on SSC13 at positions 94,960,811 bp and 95,481,059 bp.

Identification of the overlap between EWAS and GWAS

In our study, we identified a significant number of associations between fatty acid traits and CpG methylation using EWAS. Importantly, some of these associations were not identified using traditional GWAS methods. To assess the overlap between EWAS and GWAS, we considered associations to be overlapping if the associated loci were within a 2 Mb distance from each other. Among the fatty acid traits studied, we identified a total of 2,390 significant associations through EWAS and 16 significant associations through GWAS. Out of these, 9 associations were found to be overlapping (), where the associated loci were within 2 Mb of each other (as indicated in Table S2). This suggests that there are shared genetic and epigenetic factors contributing to the observed associations between CpG methylation and fatty acid traits in our study population.

Figure 3. venn plot showing the overlap between Epigenome-Wide Association Study (EWAS) significant associations and Genome-Wide Association Study (GWAS) significant associations.

The red circle represents the EWAS significant associations, the green circle represents the GWAS significant associations.

Figure 3. venn plot showing the overlap between Epigenome-Wide Association Study (EWAS) significant associations and Genome-Wide Association Study (GWAS) significant associations.The red circle represents the EWAS significant associations, the green circle represents the GWAS significant associations.

Conditional association studies

We conducted conditional association studies to investigate whether the associations identified through EWAS and GWAS were attributable to the same underlying signal or represented co-localizing but independent signals. To achieve this, we performed EWAS for the overlapping associations while considering the SNP genotype as a covariate. In this analytical approach, the disappearance of an EWAS association when incorporating the SNP genotype as a covariate indicates that the overlapping association between EWAS and GWAS resulted from the same underlying signal. Conversely, if the EWAS association remains statistically significant even after adjusting for the SNP genotype, it suggests that the overlapping association is independent but co-localizing.

Among the nine overlapping associations observed (as presented in Table S3), none reached statistical significance at the Bonferroni-corrected threshold (P<5.56×103). These findings imply that the same signal likely drove the associations we identified using both EWAS and GWAS at the associated locus.

Out of the 9 overlapping associations (Table ), none were significant at the Bonferroni threshold (P<5.56×103). The result suggests that the associations we found using both EWAS and GWAS likely arose from the same signal at the associated locus.

Identification of meQTLs

GWAS was conducted to identify methylation quantitative trait loci (meQTLs) using the significant CpG sites obtained from the epigenome-wide association studies (EWAS) as traits. A total of 2,390 significant associations were identified, corresponding to 2,310 unique CpG sites. To ensure a stringent control for false positive associations, we applied a Bonferroni threshold (P<1.38×106) for calling significant associations. As a result, we identified 235 associations that surpassed this threshold, involving 52 unique CpGs and 233 unique SNPs (as shown in Table S4).

Among the identified meQTLs, a substantial proportion represented trans-meQTLs, indicating associations between CpG sites and genetic variants located at a distance. Additionally, we observed one cis-meQTL on chromosome SSC12 (). Specifically, for the cis-meQTL-CpG pair (as illustrated in ), the SNP (WU_10.2_12_3868856) was found to be approximately 133 Kb away from the CpG site (SSC12:3735523), indicating their close proximity to the FASN gene.

Figure 4. Boxplot of the cis-meQTL (WU_10.2_12_3868856- SSC12:3735523).

The boxes indicate the interquartile range (IQR) of values between the 75% (Q3) and 25% (Q1). The centre lines indicate the median value. The bars below and above each box indicate the data in Q1-1.5 x IQR and Q3 + 1.5 x IQR, respectively. The y-axis shows CpG methylation levels, and x-axis shows SNP genotypes.

Figure 4. Boxplot of the cis-meQTL (WU_10.2_12_3868856- SSC12:3735523).The boxes indicate the interquartile range (IQR) of values between the 75% (Q3) and 25% (Q1). The centre lines indicate the median value. The bars below and above each box indicate the data in Q1-1.5 x IQR and Q3 + 1.5 x IQR, respectively. The y-axis shows CpG methylation levels, and x-axis shows SNP genotypes.

Table 5. Summary of meQTL identified by significant CpG sites in this study.

Discussion

Based on genome and methylome data, we identified 2390 associations between DNA methylation and 17 fatty acid traits using EWAS. Notably, a substantial proportion of these associations were unique to the EWAS and were not detected by Genome-Wide Association Study (GWAS) methods. Furthermore, by taking methylation of significant CpGs from EWAS as phenotypes, we performed meQTL analysis using GWAS data, resulting in the identification of 235 meQTL-CpG associations. Most of these meQTLs were found to be trans-acting, indicating that genetic variations located at distant genomic regions can influence DNA methylation patterns. Interestingly, we also observed a noteworthy cis-meQTL-CpG pair in close proximity to the FASN gene, suggesting that genetic variations near this gene may impact fatty acid metabolism through the regulation of DNA methylation.

This study provided valuable insights into the associations between DNA methylation and fatty acid traits and elucidated the genetic influence on DNA methylation patterns. Additionally, our findings highlight the importance of considering both EWAS and GWAS approaches to capture the comprehensive landscape of molecular associations.

We identified numerous associations between DNA methylation levels and fatty acid traits (). It is worth noting that the overlap between EWAS and GWAS associations was relatively low, with only 0.38% of the EWAS associations for fatty acid traits being replicated by GWAS in the same panel of pigs. This limited overlap could be attributed to the challenges in detecting associations with small effects using GWAS, as well as the potential impact of low-density SNP coverage, which may result in the failure to detect certain associations. To further investigate the nature of the overlapping associations, we conducted conditional association studies, revealing that the associations identified by both EWAS and GWAS were likely driven by the same underlying signal at the associated locus.

Previous studies have shown that SNPs near the CpG site may alter methylation levels and result in different DNA methylation patterns (Shoemaker et al. Citation2010; Zhi et al. Citation2013). These methylation quantitative trait loci (meQTLs) exert their effects on DNA methylation across the entire genome, thereby potentially impacting phenotypic traits. Uncovering the complex gene network underlying genetic variant-trait associations through meQTL analysis holds significant scientific importance. In our study, we conducted a Genome-Wide Association Study (GWAS) to identify meQTLs by treating the significant CpG sites obtained from the EWAS as phenotypes and the SNPs as predictors. Interestingly, we observed that only 2.25% of the significant CpGs were found to be significantly associated with SNPs. This relatively low overlap could be attributed to our focus on using only the significant CpGs from the EWAS as phenotypes. Notably, previous human meQTL studies have reported associations between genetic variation and 28% of variable CpGs in adipose tissue (Grundberg et al. Citation2013) and 33% in whole blood (Huan et al. Citation2019), further supporting the link between DNA methylation levels and genetic variation. Although most (99.57%) of the CpG-SNP associations we identified were trans-acting, we found a cis association on SSC12 near the FASN gene, a gene directly involved in fatty acid metabolism (Zhang et al. Citation2019). This finding suggests that genetic variation may impact the DNA methylation of the FASN gene, thereby influencing fatty acid traits in pigs.

The growing body of evidence highlights the role of epigenetic modifications, particularly DNA methylation, in contributing to phenotypic variation in livestock species (Triantaphyllopoulos et al. Citation2016; Wang and Ibeagha-Awemu Citation2020) and supports the potential application of epigenetic biomarkers, particularly DNA methylation in livestock breeding programs. Moreover, a previous study has reported the association between SNPs and differential DNA methylation (Imgenberg-Kreuz et al. Citation2018). This provided one possible mechanism that SNPs impact gene expression by altering DNA methylation, thereby suggesting the possible application of epigenetic biomarkers in livestock improvement breeding (Maldonado et al. Citation2019). Thus, in developing new breeding methods, the relationship between DNA methylation biomarkers identified by EWAS and production traits can be considered to quantify the epigenetic contribution to breeding value prediction. (Wang and Ibeagha-Awemu Citation2020). However, the application of epigenetic biomarkers in pig breeding still poses challenges. There is a need for comprehensive data on the contribution of epigenetic changes to pig production traits, as well as the development of statistical methods and software to support the quantification of the epigenetic biomarkers’ contribution to phenotypic variation. Therefore, further studies are warranted to better understand the epigenetic mechanisms underlying phenotypic variation in pig production.

In our study, we identified a crucial DNA methylation site and its cis-meQTL on SSC12 near the FASN gene by integrating EWAS and GWAS. This finding holds promise as a potential epigenetic biomarker for future pig breeding programs. However, it is important to acknowledge that our study’s small sample size poses a limitation, which may have influenced our ability to detect significant loci to some extent. Although we have made efforts to increase the sample size to a certain degree, the challenges associated with measuring and recording traits relevant to fatty acids in actual production settings, along with the high cost of Reduced Representation Bisulphite Sequencing (RRBS) for large sample sizes, have constrained the size of our study population.

Authors’ contributions

G.T., Q.S., and K.W. designed the experiments. Q.S., K.W., and S.W. analyzed data. K.W., D.C., X.J., Q.S., and Y.Y. wrote and edited the manuscript. Y.J., X.X., and X.Q. provided the resources. W.X. and A.J. supervised the research. All authors have read and approved the manuscript. Thanks to K.W. for assistance with the experiments and to G.T. for valuable discussion.

Supplemental material

Supplemental Material

Download MS Excel (20.2 KB)

Supplemental Material

Download MS Excel (19.4 KB)

Supplemental Material

Download MS Excel (11.9 KB)

Supplemental Material

Download MS Excel (73 KB)

Disclosure statement

The authors declare that they have no competing interests.

Data availability statement

The data supporting this study’s findings are available from the corresponding author upon reasonable request.

Additional information

Funding

This study was supported by High-performance Computing Platform of Sichuan Agricultural University. This study was also supported by grants from the Sichuan Science and Technology Program (2020YFN0024), the earmarked fund for the China Agriculture Research System (CARS-35-01A), the National Key R&D Program of China (2018YFD0501204), the National Centre of Technology Innovation for Pigs (NCTIP-XD/B01), the National Natural Science Foundation of China (C170102) and the Sichuan Innovation Team of Pig (sccxtd-2021-08).

References

  • Akalin A, Franke V, Vlahoviček K, Mason CE, Schübeler D. 2015. Genomation: a toolkit to summarize, annotate and visualize genomic intervals. Bioinformatics. 31(7):1127–1129. doi: 10.1093/bioinformatics/btu775.
  • Battram T, Gaunt TR, Relton CL, Timpson NJ, Hemani G. 2022. A comparison of the genes and genesets identified by GWAS and EWAS of fifteen complex traits. Nat Commun. 13(1):7816. doi: 10.1038/s41467-022-35037-3.
  • Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 30(15):2114–2120. doi: 10.1093/bioinformatics/btu170.
  • Cameron ND, Enser M, Nute GR, Whittington FM, Penman JC, Fisken AC, Perry AM, Wood JD. 2000. Genotype with nutrition interaction on fatty acid composition of intramuscular fat and the relationship with flavour of pig meat. Meat Sci. 55(2):187–195. doi: 10.1016/s0309-1740(99)00142-4.
  • DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, Philippakis AA, del Angel G, Rivas MA, Hanna M, et al. 2011. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 43(5):491–498. doi: 10.1038/ng.806.
  • Gomez-Alonso MDC, Kretschmer A, Wilson R, Pfeiffer L, Karhunen V, Seppälä I, Zhang W, Mittelstraß K, Wahl S, Matias-Garcia PR, et al. 2021. DNA methylation and lipid metabolism: an EWAS of 226 metabolic measures. Clin Epigenetics. 13(1):7. doi: 10.1186/s13148-020-00957-8.
  • Gong J, Wan H, Mei S, Ruan H, Zhang Z, Liu C, Guo AY, Diao L, Miao X, Han L. 2019. Pancan-meQTL: a database to systematically evaluate the effects of genetic variants on methylation in human cancer. Nucleic Acids Res. 47(D1):D1066–d1072. doi: 10.1093/nar/gky814.
  • Grundberg E, Meduri E, Sandling JK, Hedman AK, Keildson S, Buil A, Busche S, Yuan W, Nisbet J, Sekowska M, et al. 2013. Global analysis of DNA methylation variation in adipose tissue from twins reveals links to disease-associated variants in distal regulatory elements. Am J Hum Genet. 93(5):876–890. doi: 10.1016/j.ajhg.2013.10.004.
  • Guo T, Ren J, Yang K, Ma J, Zhang Z, Huang L. 2009. Quantitative trait loci for fatty acid composition in longissimus dorsi and abdominal fat: results from a White Duroc × Erhualian intercross F2 population. Anim Genet. 40(2):185–191. doi: 10.1111/j.1365-2052.2008.01819.x.
  • Hao Z, Lv D, Ge Y, Shi J, Weijers D, Yu G, Chen J. 2020. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput Sci. 6:e251. doi: 10.7717/peerj-cs.251.
  • He Z, Zhang R, Jiang F, Zhang H, Zhao A, Xu B, Jin L, Wang T, Jia W, Jia W, et al. 2018. FADS1-FADS2 genetic polymorphisms are associated with fatty acid metabolism through changes in DNA methylation and gene expression. Clin Epigenetics. 10(1):113. doi: 10.1186/s13148-018-0545-5.
  • Huan T, Joehanes R, Song C, Peng F, Guo Y, Mendelson M, Yao C, Liu C, Ma J, Richard M, et al. 2019. Genome-wide identification of DNA methylation QTLs in whole blood highlights pathways for cardiovascular disease. Nat Commun. 10(1):4267. doi: 10.1038/s41467-019-12228-z.
  • Imgenberg-Kreuz J, Carlsson Almlöf J, Leonard D, Alexsson A, Nordmark G, Eloranta ML, Rantapää-Dahlqvist S, Bengtsson AA, Jönsen A, Padyukov L, et al. 2018. DNA methylation mapping identifies gene regulatory effects in patients with systemic lupus erythematosus. Ann Rheum Dis. 77(5):736–743. doi: 10.1136/annrheumdis-2017-212379.
  • Kremmyda LS, Tvrzicka E, Stankova B, Zak A. 2011. Fatty acids as biocompounds: their role in human metabolism, health and disease: a review. part 2: fatty acid physiological roles and applications in human health and disease. Biomed Pap Med Fac Univ Palacky Olomouc Czech Repub. 155(3):195–218. doi: 10.5507/bp.2011.052.
  • Krueger F, Andrews SR 2011. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics (Oxford, England) 27:1571–1572.doi:10.1093/bioinformatics/btr167.
  • Lin D, Chen J, Perrone-Bizzozero N, Bustillo JR, Du Y, Calhoun VD, Liu J. 2018. Characterization of cross-tissue genetic-epigenetic effects and their patterns in schizophrenia. Genome Med. 10(1):13. doi: 10.1186/s13073-018-0519-4.
  • Luo J, Shen L, Gan M, Jiang A, Chen L, Ma J, Jin L, Liu Y, Tang G, Jiang Y, et al. 2021. Profiling of skeletal muscle tissue for long non-coding RNAs related to muscle metabolism in the QingYu pig at the growth inflection point. Anim Biosci. 34(8):1309–1320. doi: 10.5713/ajas.20.0429.
  • Maldonado MBC, de Rezende Neto NB, Nagamatsu ST, Carazzolle MF, Hoff JL, Whitacre LK, Schnabel RD, Behura SK, McKay SD, Taylor JF, et al. 2019. Identification of bovine CpG SNPs as potential targets for epigenetic regulation via DNA methylation. PLoS One. 14(9):e0222329. doi: 10.1371/journal.pone.0222329.
  • Muñoz M, Rodríguez MC, Alves E, Folch JM, Ibañez-Escriche N, Silió L, Fernández AI. 2013. Genome-wide analysis of porcine backfat and intramuscular fat fatty acid composition using high-density genotyping and expression data. BMC Genomics. 14(1):845. doi: 10.1186/1471-2164-14-845.
  • Nii M, Hayashi T, Tani F, Niki A, Mori N, Fujishima‐Kanaya N, Komatsu M, Aikawa K, Awata T, Mikawa S. 2006. Quantitative trait loci mapping for fatty acid composition traits in perirenal and back fat using a Japanese wild boar × Large White intercross. Anim Genet. 37(4):342–347. doi: 10.1111/j.1365-2052.2006.01485.x.
  • Orozco LD, Rubbi L, Martin LJ, Fang F, Hormozdiari F, Che N, Smith AD, Lusis AJ, Pellegrini M. 2014. Intergenerational genomic DNA methylation patterns in mouse hybrid strains. Genome Biol. 15(5):R68. doi: 10.1186/gb-2014-15-5-r68.
  • Qin YM, Hu CY, Pang Y, Kastaniotis AJ, Hiltunen JK, Zhu YX. 2007. Saturated very-long-chain fatty acids promote cotton fiber and Arabidopsis cell elongation by activating ethylene biosynthesis. Plant Cell. 19(11):3692–3704. doi: 10.1105/tpc.107.054437.
  • Ros-Freixedes R, Gol S, Pena RN, Tor M, Ibáñez-Escriche N, Dekkers JC, Estany J. 2016. Genome-wide association study singles out SCD and LEPR as the two main loci influencing intramuscular fat content and fatty acid composition in Duroc pigs. PLoS One. 11(3):e0152496. doi: 10.1371/journal.pone.0152496.
  • Sato S, Uemoto Y, Kikuchi T, Egawa S, Kohira K, Saito T, Sakuma H, Miyashita S, Arata S, Suzuki K. 2017. Genome-wide association studies reveal additional related loci for fatty acid composition in a Duroc pig multigenerational population. Anim Sci J. 88(10):1482–1490. doi: 10.1111/asj.12793.
  • Schulz H, Ruppert AK, Herms S, Wolf C, Mirza-Schreiber N, Stegle O, Czamara D, Forstner AJ, Sivalingam S, Schoch S, et al. 2017. Genome-wide mapping of genetic determinants influencing DNA methylation and gene expression in human hippocampus. Nat Commun. 8(1):1511. doi: 10.1038/s41467-017-01818-4.
  • Shoemaker R, Deng J, Wang W, Zhang K. 2010. Allele-specific methylation is prevalent and is contributed by CpG-SNPs in the human genome. Genome Res. 20(7):883–889. doi: 10.1101/gr.104695.109.
  • Triantaphyllopoulos KA, Ikonomopoulos I, Bannister AJ. 2016. Epigenetics and inheritance of phenotype variation in livestock. Epigen Chrom. 9:31. doi: 10.1186/s13072-016-0081-5.
  • Turner SDJB. 2014. qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots.
  • Tvrzicka E, Kremmyda LS, Stankova B, Zak A. 2011. Fatty acids as biocompounds: their role in human metabolism, health and disease–a review. Part 1: classification, dietary sources and biological functions. Biomed Pap Med Fac Univ Palacky Olomouc Czech Repub. 155(2):117–130. doi: 10.5507/bp.2011.038.
  • Verma M. 2012. Epigenome-Wide Association Studies (EWAS) in Cancer. Curr Genomics. 13(4):308–313. doi: 10.2174/138920212800793294.
  • Viterbo VS, Lopez BIM, Kang H, Kim H, Song CW, Seo KS. 2018. Genome wide association study of fatty acid composition in Duroc swine. Asian-Australas J Anim Sci. 31(8):1127–1133. doi: 10.5713/ajas.17.0779.
  • Wang M, Ibeagha-Awemu EM. 2020. Impacts of epigenetic processes on the health and productivity of livestock. Front Genet. 11:613636. doi: 10.3389/fgene.2020.613636.
  • Wood JD, Enser M, Fisher AV, Nute GR, Sheard PR, Richardson RI, Hughes SI, Whittington FM. 2008. Fat deposition, fatty acid composition and meat quality: a review. Meat Sci. 78(4):343–358. doi: 10.1016/j.meatsci.2007.07.019.
  • Wood JD, Richardson RI, Nute GR, Fisher AV, Campo MM, Kasapidou E, Sheard PR, Enser M. 2004. Effects of fatty acids on meat quality: a review. Meat Sci. 66(1):21–32. doi: 10.1016/S0309-1740(03)00022-6.
  • Xu CJ, Söderhäll C, Bustamante M, Baïz N, Gruzieva O, Gehring U, Mason D, Chatzi L, Basterrechea M, Llop S, et al. 2018. DNA methylation in childhood asthma: an epigenome-wide meta-analysis. Lancet Respir Med. 6(5):379–388. doi: 10.1016/S2213-2600(18)30052-3.
  • Zhang W, Zhang J, Cui L, Ma J, Chen C, Ai H, Xie X, Li L, Xiao S, Huang L, et al. 2016. Genetic architecture of fatty acid composition in the longissimus dorsi muscle revealed by genome-wide association studies on diverse pig populations. Genet Sel Evol. 48:5. doi: 10.1186/s12711-016-0184-2.
  • Zhang Y, Zhang J, Gong H, Cui L, Zhang W, Ma J, Chen C, Ai H, Xiao S, Huang L, et al. 2019. Genetic correlation of fatty acid composition with growth, carcass, fat deposition and meat quality traits based on GWAS data in six pig populations. Meat Sci. 150:47–55. doi: 10.1016/j.meatsci.2018.12.008.
  • Zhi D, Aslibekyan S, Irvin MR, Claas SA, Borecki IB, Ordovas JM, Absher DM, Arnett DK. 2013. SNPs located at CpG sites modulate genome-epigenome interaction. Epigenetics. 8(8):802–806. doi: 10.4161/epi.25501.