- Research
- Open access
- Published:
Genome-wide association study of the fatty liver index in the Taiwanese population reveals shared and population-specific genetic risk factors across ethnicities
Cell & Bioscience volume 15, Article number: 19 (2025)
Abstract
Background and objectives
Although the incidence of fatty liver disease (FLD) is increasing worldwide, the genetic basis of this disease is not fully understood. This study uses the fatty liver index (FLI) to identify and compare genetic variants associated with FLD in Taiwanese and European populations.
Results
In this study, a total of 145,356 Taiwan Biobank participants were included in the discovery analysis. Subjects with elevated FLI were found to have a significantly greater risk of developing FLD, as confirmed by imaging data (OR: 4.43; 95% CI: 3.88–5.06). Through genome-wide association studies (GWAS), we identified 6 variants previously associated with nonalcoholic fatty liver disease (NAFLD) and validated 50 shared risk variants located in ZPR1 and FTO between the Taiwanese and European populations. Conditional analysis of 423 significant variants from FLI-defined FLD further revealed 16 independent variants within 14 genes. Pathway analysis of GWAS significant genes revealed that lipid metabolism and the peroxisome proliferator-activated receptor (PPAR) signaling pathway are causes of hepatic fat accumulation.
Conclusion
This study identified six independent NAFLD-associated variants in GCKR, LPL, TRIB1AL, and FTO and emphasized ZPR1 and FTO as shared risk genes for FLI-defined FLD in both Taiwanese and European populations. These findings support the utility of the FLI for FLD prediction, provide new genetic insights, and reveal the common genetic pathways of FLD across two ethnic groups. This research offers a valuable framework for advancing personalized medicine and therapeutic strategies for FLD.
Background
Fatty (steatotic) liver disease (FLD) is characterized by the accumulation of fat in the liver. This disease can be further classified etiologically into alcoholic-related liver disease (ALD), metabolic dysfunction-associated steatotic liver disease (MASLD), and metabolic dysfunction and alcohol-associated liver disease (MetALD) [1, 2]. MASLD has replaced the historical terms nonalcoholic fatty liver disease (NAFLD) and metabolic dysfunction-associated fatty liver disease (MAFLD) to reclassify FLD with inclusive metabolic criteria [1]. Without detection and management, FLD may gradually progress to hepatic fibrosis or cirrhosis, ultimately resulting in irreversible damage to the liver [3]. The global prevalence of MAFLD is reported to be 50.7% in overweight or obese individuals, with males exhibiting a higher prevalence than females at 59% versus 47.5% globally [4]. In Taiwan, 40.2% of the studied population is reported to have MAFLD, as reported with MAFLD in health examinations [5].
International experts suggest the diagnosis of FLD with blood biomarkers, radiologic imaging, or liver biopsy [6]. Among various imaging tests, ultrasound has limited sensitivity in detecting steatosis below 20% and hence is less accurate for individuals with a body mass index (BMI) greater than 40 kg/m2 [6]. Instead, for moderate to severe cases, computed tomography (CT) and magnetic resonance imaging (MRI) are preferred, although their high cost and need for specialized software limit their widespread application [6]. Moreover, liver biopsy, an invasive procedure, is reserved for patients with uninformative imaging and laboratory data [7]. Bedogni et al. proposed the fatty liver index (FLI) as an inexpensive and noninvasive surrogate to evaluate FLD through incorporating risk factors for FLD, such as waist circumference (WC), triglyceride (TG) level, BMI, and gamma-glutamyl transferase (GGT) level [8]. An FLI of 60 or higher is a reliable indicator of FLD, with a positive likelihood ratio of 4.3 [8].
In addition to alcohol consumption and metabolic factors, several genetic features attributable to FLD have been reported in previous studies [9]. After accounting for both genetic and environmental factors, the heritability estimates of NAFLD range from 20–50% [10]. Furthermore, Fairfield et al. performed a genome-wide association study (GWAS) using data from the UK Biobank and identified genes associated with NAFLD, such as PNPLA3, TM6SF2, and GCKR [9]. Furthermore, a recent study reported MAFLD-associated variants and the PNPLA3 and GATAD2A genes in an East Asian population [11]. Similarly, a study conducted by Lin et al. on 904 lean individuals of Han Chinese descent revealed that rs738409 in PNPLA3 is associated with fatty liver [12].
In this study, we conducted a GWAS using a large population-based cohort using the FLI as a biomarker to identify individuals with FLI-defined fatty liver disease (FLI-defined FLD). Our results support the accuracy of FLI in detecting FLD and identifying the genetic risk factors associated with FLI-defined FLD in Taiwan Han Chinese and European populations (Fig. 1).
Methods
Study populations
We obtained our discovery cohort from the Taiwan Biobank (TWB), which collects genetic, clinical and lifestyle data from over 200,000 healthy individuals in Taiwan. Prior to their inclusion, participants provided informed consent, and approval was secured from the TWB, Academia Sinica, and the Institutional Review Board of Taipei Medical University (TMU-JIRB no. N201905005) for the execution of the study.
The TWB dataset includes various health information and testing results to assess various health conditions, including but not limited to participants’ self-reported questionnaires, physical examinations, genetic data, blood and urine samples, abdominal ultrasounds, bone mass density scans, and electrocardiograms [13]. Patients provide information on their clinical conditions, family history, dietary preferences, and alcohol consumption (defined as a minimum intake of 150 cc per week for 6 consecutive months).
Data for the European population were collected from the UMCG Genetics Lifelines Initiative (UGLI) cohort, an extension of the Lifelines Cohort Study, recruiting participants from the northern part of the Netherlands (https://www.lifelines-biobank.com/). The UGLI cohort released the genotyping data for approximately 38,500 (UGLI-1) and 28,000 (UGLI-2) participants; the data were generated via the Infinium Global Screening Array (GSA) chip (Illumina, CA, USA) and the FinnGen array (Thermo Fisher Scientific, MA, USA). The FLI GWAS was conducted with 10,398 independent European participants in UGLI-1 with complete laboratory test data for FLI calculation [14]. The summary statistics of the FLI GWAS can be obtained from Harvard Dataverse (https://doiorg.publicaciones.saludcastillayleon.es/10.7910/DVN/4YM1BG).
Fatty liver characterization
The FLI score, which varies between 0 and 100, is used to assess the risk of developing FLD, with FLI ≥ 60 indicating a high likelihood of having a fatty liver condition. The FLI was calculated for each participant in the TWB cohort using the following formula [8]:
where
TG = triglyceride (mg/dL).
BMI = body mass index (kg/m2).
GGT = gamma-glutamyl transferase (U/L).
WC = waist circumference (cm).
Individuals with missing data on body weight, height, WC, TG, or GGT were excluded from the analysis.
A subset of TWB participants eligible for FLI calculation also underwent abdominal ultrasound examinations. To assess the accuracy and reliability of the FLI as a diagnostic instrument for FLD, we conducted a chi-square test on the participants’ FLI values. They were then categorized based on the results of imaging evaluations completed by physicians. The control group included 9,978 individuals whose imaging results were indicative of “normal” or “no significant difference”, and the case group included 3,894 individuals with “mild,” “moderate,” or “severe” fatty liver conditions.
Genotyping and quality control
The genomic DNA extracted from blood samples collected from the participants was genotyped by the National Centre for Genome Medicine (NCGM) at Academia Sinica using the TWB 2.0 chip. The dataset for whole-genome sequencing included 1,451 TWB subjects and 504 individuals from the East Asian (EAS) panel of the 1000 Genome Project, which served as a reference for imputation. Imputation procedures were conducted with IMPUTE (v2.3.1) software tools [15]. Quality control measures were applied using PLINK v2.0 [16] with specific criteria, including a call rate of over 98%, a minor allele frequency (MAF) greater than 0.05, adherence to Hardy‒Weinberg equilibrium (HWE) with a p-value exceeding 10‒6, and the exclusion of insertions and deletions.
Participants who exhibited discrepancies in sex between recorded and genotyped data possessed a call rate lower than 98% or who were identified as related (identity by descent (IBD) > 0.1875) were excluded from the research. Moreover, individuals whose heterozygosity levels surpassed the mean ± 3 standard deviations were also omitted to reduce the potential influence of DNA contamination or consanguinity. After implementing a series of quality control procedures and removing individuals with missing data for FLI calculation, a total of 145,356 participants were included in the GWAS analysis.
Genome-wide association analysis
Logistic regression analysis was performed using the PLINK 2.0 software [16], with age, age2, sex, alcohol consumption and the first 10 principal components (PCs) included as covariates. A significance threshold of P < 5 × 10− 8 was utilized for genome-wide significance, whereas P < 10− 5 was considered indicative of genome-wide suggestive associations. Data visualization was achieved through the generation of Manhattan plots and quantile‒quantile (Q‒Q) plots via the CMplot R package. In this study, we utilized the Genome Reference Consortium Human Build 38 (GRCh38) to define single-nucleotide polymorphism (SNP) positions.
Conditional analysis
To identify independent variants in our GWAS, we performed conditional analysis via GCTA-COJO with a stepwise model selection method on all significant SNPs from the GWAS results to isolate independent signals [17]. This approach identifies causal variants by isolating those that contribute independently to the observed associations. Variants were defined as independent if they met the following criteria: they were located within ± 10 Mb of the targeted region, showed low collinearity (cutoff of 0.9), and achieved genome-wide significance (P < 5 × 10− 8).
Pathway enrichment analysis
For pathway enrichment analysis (PEA), we utilized all genes mapped from the significant SNPs identified in the GWAS. Significant SNPs were annotated via the Ensembl Variant Effect Predictor (VEP) [18] to map them to relevant genes, followed by cross-referencing with the GWAS catalog [19] to identify loci previously associated with NAFLD, total cholesterol (TC), TG, LDL-C, high-density lipoprotein cholesterol (HDL-C), metabolic traits, and BMI. We conducted PEA using WebGestalt [20], which focuses on Gene Ontology (GO) biological processes (BP) [21], Kyoto Encyclopedia of Genes and Genomes (KEGG) [22], Reactome [23], and WikiPathways [24]. Pathway significance was assessed using a false discovery rate (FDR) ≤ 0.05, and visualizations were generated via the ggplot2 package.
Results
Baseline characteristics of the participants
In this study, a total of 145,356 TWB participants were included in the discovery analysis (Fig. 1). The discovery cohort included 52,655 males and 92,701 females, with an overall average age of 49.42 years (Table 1). The case group included a higher proportion of male subjects (63.02%) compared to the control group (32.32%), aligned with the global prevalence of FLD by sex. TWB participants in the control group presented a healthier metabolic profile than those in the case group. The control group had a lower mean BMI (23.40 kg/m² vs. 30.23 kg/m²), mean TG level (99.71 mg/dL vs. 225.98 mg/dL), and mean LDL-C level (120.26 mg/dL vs. 125.15 mg/dL) than the case group. Additionally, the levels of liver enzymes, including serum glutamic-oxaloacetic transaminase (SGOT) (31.87 U/L vs. 23.51 U/L) and serum glutamic-pyruvic transaminase (SGPT) (43.03 U/L vs. 21.31 U/L), were elevated in the case group, suggesting potential liver inflammation or damage.
Among the study participants, 6.08% (8,840 individuals) reported alcohol consumption, with 27.8% belonging to the case group and 72.2% to the control group. Conversely, 93.92% (136,401 individuals) reported no alcohol consumption, with 11.7% in the case group and 88.3% in the control group.
A total of 145,356 TWB participants were genotyped and passed the quality control (QC) steps described in the Methods section, resulting in 18,479 cases and 126,877 controls according to the FLI cutoff. A subset of 13,872 participants with both FLI and ultrasound data were selected to evaluate the clinical accuracy of the FLI in detecting FLD. A GWAS was performed under an additive model via logistic regression adjusted for age, age2, sex, alcohol consumption, and the top 10 PCs. PEA was then conducted to explore the biological functions of the identified genes. An additional cohort with European samples was used to identify shared genetic determinants for FLI-defined FLD across different populations.
Among the discovery cohort, we found that individuals in the case group had a significantly increased risk of image confirmed FLD (OR = 4.43, P < 2.2 × 10⁻16). Additionally, participants with higher FLIs in the case group had an even greater risk of developing moderate to severe fatty liver (OR = 8.85, P < 2.2 × 10⁻16), indicating a strong association between the FLI and the severity of fatty liver.
GWAS for FLI-defined fatty liver disease identifies novel and known loci
Data from the TWB were used to conduct a GWAS to investigate the genetic factors influencing FLI-defined FLD (Fig. 2). The variants that reached genome-wide significance (P < 5 × 10− 8) and suggestive significance (P < 1 × 10− 5) are listed in Additional Files 1 and 2, respectively. We identified 423 significant SNPs, with the most prominent peak observed on chromosomes 11q23.3 and 16q12.2. Most of the significant SNPs were located within intronic regions. Among the significant variants, 417 variants were novel, whereas 6 variants were reported by previous NAFLD GWAS according to the GWAS Catalog (assessed date 1/8/2024) (Table 2). Among the 31 genes annotated from these significant risk variants, eight were reported to be associated with NAFLD, including BPTF, BUD13, FTO, GCKR, LPL, SIK3, TRIB1AL, and ZNF512. In addition, we identified FLI-defined FLD risk variants that have previously been associated with lipid profiles that are highly correlated with fatty liver progression (Additional File 3–9).
To refine the identified genetic associations, we performed conditional and joint analysis on GWAS-significant SNPs, and mapping identified 16 independent loci mapped to 14 genes (Table 3). The associations align with metabolic traits and diseases previously reported in the GWAS Catalog; among the identified variants, rs6547692, rs662799, rs3779273, rs326, rs438811, and rs5751901 were associated with traits linked to lipid regulation, whereas rs61010704 in MLXIPL showed a notable association with metabolic syndrome and HDL. rs7193144 and rs13130484 were associated with obesity-related traits such as BMI, obesity, and related phenotypes. Another notable variant, rs2980888 in TRIB1AL, was strongly associated with NAFLD, corroborating its role in hepatic lipid metabolism.
The novel variants are as follows: the RHPN1 intron variant rs569067863, the LMX1B intron variant rs7469554, the variant rs73537313 on chromosome 16, and the BPTF intron variant rs6504543. These novel loci present new avenues for research, as they lack previous associations in the existing databases, potentially offering insights into the underlying biological mechanisms and novel therapeutic targets for FLD.
Manhattan plot of FLI-defined FLD in a Taiwanese population revealed 423 genome-wide loci. The red line corresponds to a genome-wide significance threshold of 5 × 10− 8. The variant indices in blue represent the independent risk variants of FLI-defined FLD; the variant indices in red represent reported variants associated with NAFLD; and the variants in yellow represent independent risk variants that were reported to be associated with NAFLD. The genomic inflation factor (λGC) of the GWAS for FLI-defined FLD is 1.08
Shared risk variants for FLI-defined fatty liver disease among Taiwanese and European populations
To explore the shared genetic determinants of FLI-defined fatty liver across populations, we compared our GWAS-significant SNPs to those identified in the UGLI cohort. All the 50 shared significant risk variants (Table 4) were located on FTO and ZPR1 gene.
Pathway enrichment analysis for significant variants associated with fatty liver
A total of 31 FLI-defined FLD risk genes were included in the enrichment analysis, and the top 10 pathways from each database are shown in Fig. 3. The majority of the enriched GOBP pathways are related to fatty liver formation, such as TG homeostasis, acylglycerol homeostasis, and lipoprotein remodeling (Fig. 3A). The results from the Wikipathway analysis revealed that the FLI-defined FLD risk genes are involved in various types of familial hyperlipidemia, obesity, and cholesterol metabolism (Fig. 3B). KEGG pathway analysis revealed pathways related to NAFLD, cholesterol metabolism, and insulin resistance (Fig. 3C). The peroxisome proliferator-activated receptor (PPAR) signaling pathway was identified via both Wikipathway and KEGG analyses. Reactome analysis revealed pathways involved in TG synthesis, such as plasma lipoprotein assembly, remodeling, and clearance, as well as chylomicron remodeling and the assembly of active LPL and LIPC lipase complexes (Fig. 3D).
Discussion
In this study, we conducted a GWAS for FLI-defined FLD using data from 145,356 Taiwanese individuals. Our GWAS identified 423 significant risk variants located in 31 genes, including 6 variants previously reported to be associated with NAFLD located on GCKR, LPL, TRIB1AL, and FTO. The primary aim of our discovery study was to identify genetic variants associated with FLD in a Taiwanese cohort using FLI as a diagnostic tool. The ultrasound results and evaluation verify the reliability of the FLI as an indicator of fatty liver, especially in moderate to severe cases. Although studies have shown that ultrasound has limitations in detecting mild fatty liver cases (< 20%) and individuals with a BMI over 40 kg/m2 [6], our study cohort, with a mean BMI of 24.27 kg/m2, aligns well within the ultrasound target range. This supports the utility of FLI as an alternative marker for FLD.
Our analysis strongly revealed that FTO and ZPR1 variants are associated with FLD, which is consistent with prior studies linking these genes to metabolic traits and NAFLD. Variants in ZPR1 have been shown to influence circulating TG and HDL-C levels through pathways related to lipid processing [25, 26]. As elevated TG levels are a direct contributor to FLD, ZPR1 can be linked to NAFLD risk. Similarly, FTO is associated with hypercholesterolemia, which contributes to cardiovascular risk and is a well-established regulator of body weight, fat storage, and lipid metabolism, whereas variants in FTO have been associated with increased BMI and increased susceptibility to lipid and glucose dysregulation [27,28,29,30].
In addition to ZPR1 and FTO, other genes identified in our GWAS were related to FLD. For example, GCKR variants are known to influence glucose and lipid metabolism, with elevated glucokinase activity promoting de novo hepatic lipogenesis and inhibiting fatty acid oxidation, mechanisms implicated in NAFLD development [31, 32]. Similarly, LPL, a key regulator of lipid metabolism, plays a critical role in TG hydrolysis and energy balance [33]. Dysregulation of these genes contributes to lipid accumulation and metabolic dysfunction, reinforcing their importance in fatty liver pathogenesis.
To validate these findings of genetic variants associated with FLD, we conducted a replication study in an external cohort of 10,398 subjects from the UGLI cohort in Europe and found FLD risk variants shared between Taiwanese and European populations in terms of FTO and ZPR1. These associations were validated by another study consisting of a European cohort. However, population-specific differences were observed; for example, the strongest association in Europeans was with rs55872725, whereas in Taiwanese participants, it was with rs7193144. Despite these differences, both variants demonstrated a positive correlation with FLD, supporting their potential role in the development of the condition. Our validation study emphasized the population-specific and common genetic factors that contribute to FLD across different ancestries.
In addition to identifying individual gene variants, our PEA highlighted the importance of the PPAR signaling pathway and lipid metabolic pathways involving LDL-C, HDL-C, and TG in the progression of FLD. Although the results from the Wikipathway analysis and KEGG pathway analysis did not reach statistical significance, the identification of NAFLD substantiated the risk genes as strong signals for FLD. PPAR signaling regulates key processes such as adipogenesis, insulin resistance, and inflammation, which are central to NAFLD pathogenesis, and has been implicated in metabolic syndrome in previous GWASs of the Taiwanese population [34, 35]. These findings provide valuable insights into the genetic architecture of NAFLD and offer potential targets for further research and therapeutic interventions aimed at mitigating FLD and its metabolic complications.
Our findings also revealed enrichment in pathways related to lipid homeostasis and metabolism. TG accumulation, a hallmark of NAFLD, is believed to contribute to hepatocyte injury through mechanisms involving insulin resistance and disrupted glucose homeostasis [36]. Lipoprotein remodeling and the clearance of plasma lipoproteins, along with chylomicron remodeling, appear to play a role in lipid transport and processing, which can exacerbate fatty liver when disrupted [37]. Additionally, with dyslipidemia being a key factor in hepatic lipid accumulation, pathways linked to cholesterol metabolism being enriched underscore their involvement in the progression from simple FLD to more severe forms of NAFLD [29]. Regarding disease severity, we compared the genetic variants between fatty liver and liver fibrosis, but none of the variants were identical [38].
This study has several limitations. First, the detailed clinical information, such as diagnostic codes, liver biopsy results, or MRI data, is lacking, which may limit the ability to accurately reflect the real-world progression of FLD. Second, the FLI calculation was performed based on the timing of the blood samples collection, which may not fully account for individual variations. Relying on one-time measurements could therefore lead to misleading conclusions. Longitudinal data or more detailed clinical information would be necessary to gain a more comprehensive understanding of an individual’s health status.
Conclusion
Our study highlights the genetic heterogeneity underlying FLI-defined FLD and emphasizes the relevance of key genetic variants across different populations. We not only confirmed previously identified genetic risk factors but also discovered novel variants associated with FLD. The findings of shared risk variants in the FTO and ZPR1 genes suggest that common genetic mechanisms may influence FLD susceptibility. Moreover, the involvement of PPAR signaling and the plasma lipoprotein pathway in fatty liver progression highlights promising targets for therapeutic intervention. These findings also support the potential of the FLI as a reliable clinical tool for diagnosing FLD, but further research is still needed to clarify the role of these pathways in disease development and to explore the utility of the FLI in broader clinical applications.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request. The raw TWB dataset is available at https://www.twbiobank.org.tw/. The summary statistics of the FLI GWAS using the UGLI cohort can be accessed at https://doiorg.publicaciones.saludcastillayleon.es/10.7910/DVN/4YM1BG.
Abbreviations
- ALD:
-
Alcoholic-related liver disease
- BMI:
-
Body mass index
- CT:
-
Computed tomography
- FDR:
-
False discovery rate
- FLD:
-
Fatty (steatotic) liver disease
- FLI:
-
Fatty liver index
- GGT:
-
Gamma-glutamyl transferase
- GWAS:
-
Genome-wide association study
- HDL-C:
-
High-density lipoprotein cholesterol
- KEGG:
-
Kyoto Encyclopedia of Genes and Genomes
- LDL-C:
-
Low-density lipoprotein cholesterol
- MAF:
-
Minor allele frequency
- MAFLD:
-
Metabolic dysfunction-associated fatty liver disease
- MASLD:
-
Metabolic dysfunction-associated steatotic liver disease
- MetALD:
-
Metabolic dysfunction-and alcohol-associated liver disease
- MRI:
-
Magnetic resonance imaging
- NAFLD:
-
Metabolic dysfunction-and alcohol-associated liver disease
- PEA:
-
Pathway enrichment analysis
- PPAR:
-
Peroxisome proliferator-activated receptor
- SGOT:
-
Serum glutamic-oxaloacetic transaminase
- SGPT:
-
Serum glutamic-pyruvic transaminase
- SNP:
-
Single-nucleotide polymorphism
- T2DM:
-
Type 2 diabetes mellitus
- TC:
-
Total cholesterol
- TG:
-
Triglyceride
- TWB:
-
Taiwan Biobank
- WC:
-
Waist circumference
References
Rinella ME, Lazarus JV, Ratziu V, Francque SM, Sanyal AJ, Kanwal F, et al. A multisociety Delphi consensus statement on new fatty liver disease nomenclature. Hepatology. 2023;78(6):1966–86.
EASL-EASD-EASO Clinical. Practice guidelines on the management of metabolic dysfunction-associated steatotic liver disease (MASLD). J Hepatol. 2024;81(3):492–542.
Pan X, Lv J, Liu M, Li Y, Zhang Y, Zhang R, et al. Chronic systemic inflammation predicts long-term mortality among patients with fatty liver disease: data from the National Health and Nutrition Examination Survey 2007–2018. PLoS ONE. 2024;19(11):e0312877.
Liu J, Ayada I, Zhang X, Wang L, Li Y, Wen T, et al. Estimating Global Prevalence of Metabolic Dysfunction-Associated fatty liver disease in overweight or obese adults. Clin Gastroenterol Hepatol. 2022;20(3):e573–82.
Cheng WC, Chen HF, Cheng HC, Li CY. Comparison of all-cause mortality associated with nonalcoholic fatty liver disease and metabolic dysfunction-associated fatty liver disease in Taiwan MJ cohort. Epidemiol Health. 2024;46:e2024024.
Eslam M, Newsome PN, Sarin SK, Anstee QM, Targher G, Romero-Gomez M, et al. A new definition for metabolic dysfunction-associated fatty liver disease: an international expert consensus statement. J Hepatol. 2020;73(1):202–9.
Khalifa A, Rockey DC. The utility of liver biopsy in 2020. Curr Opin Gastroenterol. 2020;36(3):184–91.
Bedogni G, Bellentani S, Miglioli L, Masutti F, Passalacqua M, Castiglione A, et al. The fatty liver index: a simple and accurate predictor of hepatic steatosis in the general population. BMC Gastroenterol. 2006;6:33.
Fairfield CJ, Drake TM, Pius R, Bretherick AD, Campbell A, Clark DW, et al. Genome-Wide Association Study of NAFLD Using Electronic Health Records. Hepatol Commun. 2022;6(2):297–308.
Sookoian S, Pirola CJ. Genetic predisposition in nonalcoholic fatty liver disease. Clin Mol Hepatol. 2017;23(1):1–12.
Lee Y, Cho EJ, Choe EK, Kwak MS, Yang JI, Oh SW, et al. Genome-wide association study of metabolic dysfunction-associated fatty liver disease in a Korean population. Sci Rep. 2024;14(1):9753.
Lin H, Wong GL, Whatling C, Chan AW, Leung HH, Tse CH, et al. Association of genetic variations with NAFLD in lean individuals. Liver Int. 2022;42(1):149–60.
Feng YA, Chen CY, Chen TT, Kuo PH, Hsu YH, Yang HI, et al. Taiwan Biobank: a rich biomedical research database of the Taiwanese population. Cell Genom. 2022;2(11):100197.
Li Y, van den Berg EH, Kurilshikov A, Zhernakova DV, Gacesa R, Hu S et al. Genome-wide studies reveal genetic risk factors for hepatic Fat Content. Genomics Proteom Bioinf. 2024;22(2).
Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5(6):e1000529.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75.
Yang J, Ferreira T, Morris AP, Medland SE, Madden PA, Heath AC, et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat Genet. 2012;44(4):369–75. s1-3.
Martin FJ, Amode MR, Aneja A, Austine-Orimoloye O, Azov AG, Barnes I, et al. Ensembl 2023. Nucleic Acids Res. 2023;51(D1):D933–41.
Sollis E, Mosaku A, Abid A, Buniello A, Cerezo M, Gil L, et al. The NHGRI-EBI GWAS catalog: knowledgebase and deposition resource. Nucleic Acids Res. 2023;51(D1):D977–85.
Liao Y, Wang J, Jaehnig EJ, Shi Z, Zhang B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res. 2019;47(W1):W199–205.
The Gene Ontology Resource. 20 years and still GOing strong. Nucleic Acids Res. 2019;47(D1):D330–8.
Ogata H, Goto S, Fujibuchi W, Kanehisa M. Computation with the KEGG pathway database. BioSystems. 1998;47(1–2):119–28.
Joshi-Tope G, Gillespie M, Vastrik I, D’Eustachio P, Schmidt E, de Bono B, et al. Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 2005;33(Database issue):D428–32.
Slenter DN, Kutmon M, Hanspers K, Riutta A, Windsor J, Nunes N, et al. WikiPathways: a multifaceted pathway database bridging metabolomics to other omics research. Nucleic Acids Res. 2018;46(D1):D661–7.
van de Woestijne AP, van der Graaf Y, de Bakker PI, Asselbergs FW, Spiering W, Visseren FL. Rs964184 (APOA5-A4-C3-A1) is related to elevated plasma triglyceride levels, but not to an increased risk for vascular events in patients with clinically manifest vascular disease. PLoS ONE. 2014;9(6):e101082.
Esteve-Luque V, Padró-Miquel A, Fanlo-Maresma M, Corbella E, Corbella X, Pintó X et al. Implication between genetic variants from APOA5 and ZPR1 and NAFLD severity in patients with Hypertriglyceridemia. Nutrients. 2021;13(2).
Yin D, Li Y, Liao X, Tian D, Xu Y, Zhou C, et al. FTO: a critical role in obesity and obesity-related diseases. Br J Nutr. 2023;130(10):1657–64.
Fan R, Wang J, Du J. Association between body mass index and fatty liver risk: a dose–response analysis. Sci Rep. 2018;8(1):15273.
Deprince A, Haas JT, Staels B. Dysregulated lipid metabolism links NAFLD to cardiovascular disease. Mol Metab. 2020;42:101092.
Sierra-Ruelas E, Campos-Pérez W, Torres-Castillo N, García-Solís P, Vizmanos B, Martínez-López E. The rs9939609 variant in FTO increases the risk of Hypercholesterolemia in metabolically healthy subjects with excess weight.
Peter A, Stefan N, Cegan A, Walenta M, Wagner S, Königsrainer A, et al. Hepatic glucokinase expression is associated with lipogenesis and fatty liver in humans. J Clin Endocrinol Metab. 2011;96(7):E1126–30.
Bechmann LP, Gastaldelli A, Vetter D, Patman GL, Pascoe L, Hannivoort RA, et al. Glucokinase links Krüppel-like factor 6 to the regulation of hepatic insulin sensitivity in nonalcoholic fatty liver disease. Hepatology. 2012;55(4):1083–93.
Teratani T, Tomita K, Furuhashi H, Sugihara N, Higashiyama M, Nishikawa M, et al. Lipoprotein Lipase Upregulation in hepatic stellate cells exacerbates Liver Fibrosis in Nonalcoholic Steatohepatitis in mice. Hepatol Commun. 2019;3(8):1098–112.
Liss KH, Finck BN. PPARs and nonalcoholic fatty liver disease. Biochimie. 2017;136:65–74.
Ho CY, Lee JI, Huang SP, Chen SC, Geng JH. A genome-wide Association study of metabolic syndrome in the Taiwanese Population. Nutrients. 2023;16(1).
Semova I, Biddinger SB. Triglycerides in nonalcoholic fatty liver disease: Guilty until Proven innocent. Trends Pharmacol Sci. 2021;42(3):183–90.
Xie Y, Newberry EP, Brunt EM, Ballentine SJ, Soleymanjahi S, Molitor EA, et al. Inhibition of Chylomicron assembly leads to dissociation of hepatic steatosis from inflammation and fibrosis. J Lipid Res. 2021;62:100123.
Palupi PD, Wei CY, Chou WH, Lin MR, Wan YY, Chang WC. Dietary contributions in the genetic variation of liver fibrosis: a genome-wide association study of fibrosis-4 index in the liver fibrosis development. Cell Biosci. 2024;14(1):141.
Acknowledgements
We acknowledge the staff at the Taiwan Biobank for their hard work in collecting and distributing the data. We would like to extend our sincere gratitude to Poppy Diah Palupi and Dr. Hsuan-Chao Lin for the vital suggestions and insightful contributions to this work. We would also like to extend our sincere gratitude to Sabrina Tsai for revising the manuscript. Their guidance has profoundly enhanced the rigor and scope of our research.
Funding
This work was supported by grants from the National Science and Technology Council in Taiwan (NSTC113-2327-B-038-001; NSTC112-2320-B-038-026-MY3), the Ministry of Education (MOE) in Taiwan (DP2-TMU-113-O-13), and Taipei Medical University [112-5812-004-400 (Yusuke Nakamura Chair Professorship)].
Author information
Authors and Affiliations
Contributions
LPP and WCC conceptualized the study; CYW and MRL were responsible for checking the methodology and data results; LPP wrote the original draft; CYW, MRL, WHC, YJW, and WCC reviewed and edited the manuscript. All the authors contributed to the article and approved the submitted version.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The study received ethical approval from the Institutional Review Board (IRB no. N201905005) of Taipei Medical University prior to its commencement.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Additional File 1.
Genome-wide significant SNPs from FLI GWASs in the Taiwanese population. Additional File 2. Genome-wide suggestive SNPs from FLI GWAS in the Taiwanese population. Additional File 3. Genome-wide significant SNPs from the FLI GWAS previously reported for BMI in the GWAS Catalog. Additional File 4. Genome-wide significant SNPs from the FLI GWAS previously reported for HDL-C in the GWAS Catalog. Additional File 5. Genome-wide significant SNPs from the FLI GWAS previously reported for LDL-C in the GWAS Catalog. Additional File 6. Genome-wide significant SNPs from the FLI GWAS previously reported for metabolic disease in the GWAS Catalog. Additional File 7. Genome-wide significant SNPs from FLI GWASs previously reported for TC in the GWAS Catalog. Additional File 8. Genome-wide significant SNPs from the FLI GWAS previously reported for TG in the GWAS Catalog. Additional File 9. Genome-wide significant SNPs from the FLI GWAS previously reported for WC in the GWAS Catalog
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Lau, P.P., Wei, CY., Lin, MR. et al. Genome-wide association study of the fatty liver index in the Taiwanese population reveals shared and population-specific genetic risk factors across ethnicities. Cell Biosci 15, 19 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13578-025-01346-5
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13578-025-01346-5