ecancermedicalscience

Research

Clustering-based identification of immune-related gene signatures in hepatocellular carcinoma

Jyoti Brahmaiah1, Usha Adiga2, Alfred J Augustine3 and Sampara Vasishta2

1Department of Pathology, Apollo Institute of Medical Sciences and Research Chittoor, Murukambattu, Chittoor 517127, Andhra Pradesh, India

2Department of Biochemistry, Apollo Institute of Medical Sciences and Research Chittoor, Murukambattu, Chittoor 517127, Andhra Pradesh, India

3Department of Surgery, Apollo Institute of Medical Sciences and Research Chittoor, Murukambattu, Chittoor 517127, Andhra Pradesh, India


Abstract

Background: Hepatocellular carcinoma (HCC) is a complex malignancy influenced by genetic, epigenetic and immune-related factors. The tumour immune microenvironment plays a critical role in HCC progression and response to immunotherapy. Identifying key immune-related gene signatures through clustering techniques can provide insights into tumour biology and therapeutic targets.

Methods: We employed K-means, Markov Clustering Algorithm (MCL) and density-based spatial clustering of applications with noise (DBSCAN) to analyse immune-related genes in HCC. Functional enrichment analysis was conducted using Gene Ontology (GO) biological process, cellular component and molecular function categories, along with pathway analysis from Kyoto encyclopedia of genes and genomes (KEGG) and Reactome databases. Additionally, protein–protein interaction (PPI) hub analysis and microRNAs (miRNA) target predictions were integrated to understand the regulatory networks.

Results: K-means clustering segregated immune genes into three clusters, with major histocompatibility complex (MHC) class II genes forming a distinct cluster. MCL and DBSCAN identified a more unified immune cluster incorporating both MHC class I and II molecules, suggesting their coordinated role in antigen presentation. GO analysis revealed enrichment in antigen processing and presentation pathways, immunoglobulin-mediated responses and glutamate receptor signaling. KEGG pathway analysis highlighted associations with autoimmune diseases and viral infections. PPI hub analysis identified CD4, CD74 and HLA-DQA1 as central nodes, while miRNA analysis suggested regulatory interactions affecting immune gene expression.

Conclusion: Our clustering analysis highlights distinct immune-related gene signatures in HCC, emphasising the role of antigen presentation and immune modulation in tumour progression. The findings provide a foundation for further investigation into immunotherapeutic strategies targeting key immune pathways in HCC.

Keywords: hepatocellular carcinoma, immune clustering, MHC genes, antigen presentation, immunotherapy

Correspondence to: Sampara Vasishta
Email: vasishta_s@aimsrchittoor.edu.in

Published: 17/10/2025
Received: 02/04/2025

Publication costs for this article were supported by ecancer (UK Charity number 1176307).

Copyright: © the authors; licensee ecancermedicalscience. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


Introduction

Hepatocellular carcinoma (HCC) is the most prevalent form of primary liver cancer and represents a significant global health burden. It is strongly associated with chronic liver diseases, including hepatitis B virus (HBV) and hepatitis C virus (HCV) infections, Metabolic dysfunction-associated steatotic liver disease (MASLD) and alcohol-related liver disease [1] Although the present incidence (2.15 per 100,000), prevalence (2.27 per 100,000) and mortality (2.21 per 100,000) rates of HCC in India are lower than the global figures, the yearly rates of change in these measures are greater in India [2].

Epidemiology and risk factors

The incidence of HCC varies globally, with higher rates observed in regions with endemic HBV and HCV infections, such as East Asia and sub-Saharan Africa [3]. Chronic HBV infection remains the leading cause of HCC, contributing to approximately 50% of cases worldwide, while chronic HCV infection accounts for 25%–30% of cases [4]. In recent years, MASLD has emerged as a critical risk factor, particularly in Western countries, where obesity and metabolic syndrome are prevalent [5]. Other risk factors include excessive alcohol consumption, aflatoxin exposure, diabetes and genetic predisposition [6].

Molecular pathogenesis of HCC

HCC arises through a multistep process involving genetic and epigenetic alterations that drive malignant transformation. Key molecular pathways implicated in HCC development include Wnt/β-catenin signaling, TP53 mutations, oxidative stress and inflammation-mediated hepatocarcinogenesis [7]. The integration of HBV DNA into the host genome is a well-established oncogenic driver that promotes genomic instability and tumour progression [8]. Additionally, mutations in genes such as TERT, CTNNB1 and TP53 are frequently observed in HCC tumours, further highlighting the heterogeneous nature of the disease [9].

Role of genetic variants in HCC development

Genome-wide association studies have identified several genetic variants associated with increased susceptibility to HCC. The PNPLA3 rs738409 C>G polymorphism has been strongly linked to MASLD-associated HCC, with studies demonstrating that carriers of the G allele exhibit increased hepatic fat accumulation and fibrosis progression [10]. Similarly, TM6SF2 rs58542926 has been associated with hepatic steatosis and an elevated risk of fibrosis, contributing to HCC development in patients with MASLD [11]. Other significant variants, such as MBOAT7 rs641738 and HSD17B13 rs72613567, have been implicated in modifying liver disease progression and influencing HCC susceptibility [12,13].

Objectives

  • To analyse the biological processes (BPs), cellular components (CCs) and molecular functions (MFs) of clustered immune genes, focusing on antigen presentation, major histocompatibility complex (MHC) class I and II interactions and immune response pathways in HCC.
  • To explore key immune pathways enriched in HCC, including PD-1 signaling, antigen processing and presentation, interferon gamma signaling and glutamate receptor signaling, to assess their role in immune evasion and tumour progression.
  • To investigate the involvement of microRNAs (miRNAs) in HCC immune regulation, identify specific miRNAs that target immune-related genes and contribute to immune modulation in the tumour microenvironment.
  • To identify protein–protein interaction (PPI) hub proteins linked to immune regulation in HCC, highlighting key molecules that may serve as central regulators of immune response and potential therapeutic targets.
  • To compare the clustering effectiveness of K-means, Markov Clustering Algorithm (MCL) and density-based spatial clustering of applications with noise (DBSCAN) in classifying immune-related genes in HCC, and evaluate the similarities and differences between these approaches in capturing biologically relevant gene interactions.

Methodology

Study design and data acquisition

This study employed a bioinformatics-driven approach to analyse HCC data, focusing on gene expression, pathway enrichment and protein interaction networks. The dataset was obtained from publicly available genomic repositories [14], including gene expression Omnibus and the cancer genome atlas. The selection criteria included datasets with well-defined sample classifications, including HCC tumour tissues and adjacent normal liver tissues. The inclusion of high-throughput sequencing and microarray data allowed for a comprehensive assessment of gene expression variations, molecular interactions and pathway deregulations.

Differential gene expression analysis

Differential expression analysis was conducted using DESeq2 for RNA-Seq data and limma for microarray data. Genes with an adjusted p-value (false discovery rate, FDR) of <0.05 and a log2 fold change of >1 or <−1 were considered significantly differentially expressed. The lists of upregulated and downregulated genes were further validated by cross-referencing with existing literature on HCC.

Functional enrichment analysis

Gene ontology (GO) enrichment and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis were performed using ClusterProfiler in R. The GO analysis categorised the significantly expressed genes into three domains: BP, CC and MF. The KEGG analysis identified significantly enriched pathways related to oncogenesis, immune response and metabolic dysregulation in HCC.

For immune-related processes, GO terms such as ‘Antigen Processing and Presentation via MHC Class II’ were analysed to determine the role of HLA-DQA1 in tumour immune evasion. Additionally, pathways related to neurotransmitter signaling, such as ‘Glutamate Receptor Signaling’, were assessed for their involvement in HCC progression.

PPI network construction

To identify key hub proteins and their potential role in hepatocarcinogenesis, a PPI network was constructed using STRING and visualised using Cytoscape. The MCODE algorithm was applied to detect highly interconnected clusters of proteins. Key hub proteins such as FLNA, replication timing regulatory factor 1 (RIF1) and DLG4 were identified based on their high degree of connectivity.

miRNA target prediction and regulatory network analysis

To assess post-transcriptional regulation in HCC, miRNA enrichment analysis was conducted using miRTarBase 2017 and TargetScan databases. Significant interactions were identified between HLA-DQA1 and miRNAs such as hsa-miR-6798-3p and hsa-miR-4645-5p, which have been implicated in immune modulation. Similarly, hsa-miR-122-5p, a liver-specific miRNA, was examined for its role in tumour progression.

The regulatory network was constructed by integrating mRNA, miRNA and PPI interactions, providing insights into how miRNAs influence oncogenic pathways in HCC.

Statistical analysis

All statistical analyses were performed using R (version 4.1.2) and Python (version 3.9). A p-value of <0.05 was considered statistically significant. Multiple testing correction was applied using the Benjamini–Hochberg (BH) method to control for false discovery rates.

In this study, we applied three clustering techniques—K-means, MCL and DBSCAN—to classify immune-related genes associated with HCC.

Ethical considerations

This is not a clinical case report but a bioinformatics-based analysis using publicly available datasets from repositories. No direct patient recruitment, intervention or identifiable clinical data were involved; therefore, informed consent and institutional ethical approval were not required. All datasets used were generated under the ethical guidelines of their respective repositories.


Results

GO BP analysis

GO enrichment analysis identified several BP significantly associated with the dataset. The highest combined scores were observed for MHC Class II protein complex assembly and peptide antigen assembly with MHC Class II protein complex, both showing strong enrichment and relatively high overlap ratios. Other notable processes included peptide antigen assembly with MHC protein complex and immunoglobulin production involved in the immunoglobulin-mediated immune response, indicating a clear link to antigen presentation and immune activation pathways. Processes related to antigen processing and presentation of exogenous peptide antigen via MHC Class II, synaptic transmission (glutamatergic) and glutamate receptor signaling pathway were also enriched, though with comparatively lower combined scores and overlap ratios. Overall, the enrichment profile highlights a predominance of immune-related pathways, particularly those linked to MHC Class II-mediated antigen processing and presentation. This suggests that the dataset is functionally enriched for processes central to adaptive immune responses (Figure 1).

Figure 1. GO BP enrichment analysis. Bubble plot showing the top enriched GO BP terms. The x-axis represents the combined enrichment score, and the y-axis lists the GO terms. Bubble size corresponds to the overlap ratio (proportion of query genes overlapping with genes in each GO term), and bubble color intensity reflects the statistical significance as –log₁₀ (adjusted p-value).

GO CC analysis

The enrichment analysis for CCs indicated the strongest associations with MHC Class II protein complex and MHC protein complex, both showing the highest combined scores and overlap ratios. Additional enriched components included the lumenal side of the endoplasmic reticulum membrane and the ionotropic glutamate receptor complex, suggesting a combination of immune-related and neuronal membrane-associated elements.

Other terms with notable enrichment were postsynaptic density membrane and postsynaptic specialisation membrane, reflecting potential involvement of synaptic structures. Lower combined score terms such as coated vesicle membrane, ER to Golgi transport vesicle membrane, transport vesicle membrane and clathrin-coated endocytic vesicle membrane indicate processes linked to intracellular transport and vesicular trafficking. The dataset is enriched for membrane-associated complexes, with predominant representation of MHC-related immune structures and secondary enrichment in synaptic and vesicular transport components. (Figure 2).

GO MF analysis

The MF analysis identified MHC Class II receptor activity as the most enriched term, with the highest combined score and overlap ratio. This was followed by ionotropic glutamate receptor activity and neurotransmitter receptor activity involved in the regulation of postsynaptic membrane potential, indicating functional links to both immune recognition and neuronal signaling. Other enriched terms included MHC Class II protein complex binding and transmitter-gated monoatomic ion channel activity involved in the regulation of postsynaptic membrane potential, further highlighting the interplay between antigen presentation and synaptic signaling. Lower combined score functions, such as sodium channel activity, ligand-gated monoatomic cation channel activity and potassium channel activity, point toward additional ion transport processes. The dataset is enriched for functions related to MHC Class II-mediated antigen recognition and binding, with secondary enrichment in neurotransmitter receptor activity and ion channel regulation (Figure 3).

Figure 2. GO CC enrichment analysis. Bubble plot showing the top enriched GO BP terms. The x-axis represents the combined enrichment score, and the y-axis lists the GO terms. Bubble size corresponds to the overlap ratio (proportion of query genes overlapping with genes in each GO term), and bubble color intensity reflects the statistical significance as –log₁₀ (adjusted p-value).

KEGG pathway analysis

KEGG pathway analysis revealed significant enrichment in immune-related diseases, with the highest combined score observed for asthma, followed by allograft rejection and graft-versus-host disease. Other notable enriched pathways included type I diabetes mellitus, intestinal immune network for IgA production and autoimmune thyroid disease, highlighting strong associations with autoimmune and hypersensitivity conditions. Lower-ranked enriched terms comprised viral myocarditis, inflammatory bowel disease, leishmaniasis and antigen processing and presentation, indicating a broader involvement of the dataset in infectious and inflammatory immune pathways. The dataset shows predominant enrichment for KEGG pathways related to autoimmune, inflammatory and immune-mediated diseases, suggesting a central role in immune dysregulation and pathogen response (Figure 4).

Figure 3. GO MF enrichment analysis. Bubble plot showing the top enriched GO BP terms. The x-axis represents the combined enrichment score, and the y-axis lists the GO terms. Bubble size corresponds to the overlap ratio (proportion of query genes overlapping with genes in each GO term), and bubble color intensity reflects the statistical significance as –log₁₀ (adjusted p-value).

Figure 4. KEGG pathway enrichment analysis. Bubble plot showing the top enriched GO BP terms. The x-axis represents the combined enrichment score, and the y-axis lists the GO terms. Bubble size corresponds to the overlap ratio (proportion of query genes overlapping with genes in each GO term), and bubble color intensity reflects the statistical significance as –log₁₀ (adjusted p-value).

miRTarBase 2017 analysis

The enriched miRNAs are represented in Table 1.

PPI hub proteins

PPI network analysis identified key hub proteins, with FLNA, RIF1 and DLG4 showing significant interactions with GRIK1 and HLA-DQA1. FLNA (Filamin A) plays a role in cytoskeletal organisation, while RIF1 is involved in DNA repair. These findings, represented in Table 2, suggest that alterations in these proteins could contribute to tumour progression and immune evasion in HCC. Tables 35 represent the various clustering methods, such as k-means, MCL and DBSCAN, respectively.

This bioinformatics analysis of HCC data provides a comprehensive understanding of the immune and neurological pathways implicated in disease progression. The enrichment of HLA-DQA1 in antigen processing pathways underscores its role in immune modulation, while GRIK1-associated neurotransmitter signaling highlights a potential novel mechanism in hepatocarcinogenesis. The identified miRNAs and hub proteins suggest additional regulatory layers influencing HCC pathophysiology. These findings provide valuable insights for future research, particularly in developing immunotherapy strategies and targeted treatments for HCC.

Table 1. Enrichment analysis of miRNAs. The table shows significant miRNAs identified through enrichment analysis. Columns represent the miRNA term, overlap of genes in the input list with the miRNA target set (input genes/miRNA target set size), raw p-value, adjusted p-value (BH correction), old p-value and old adjusted p-value from prior analysis, odds ratio (enrichment strength), combined score (integrating the p-value and odds ratio) and the specific overlapping gene(s).

Table 2. Enrichment analysis of protein targets associated with identified genes. The table lists significantly enriched protein targets predicted to interact with the query genes. Columns indicate the protein target (Term), overlap of genes in the input set with the target’s associated genes (input genes/total target-associated genes), raw p-value, adjusted p-value (BH correction), old p-value and old adjusted p-value from previous analysis, odds ratio (measure of enrichment strength), combined score (integration of p-value and odds ratio) and the specific overlapping gene(s).

Table 3. K-means clustering of proteins based on expression or interaction profiles. Proteins were grouped into clusters using the K-means method. Each entry lists the clustering method, assigned cluster number, number of genes in the cluster, protein name and full protein description. Cluster 1 contains nine proteins, predominantly HLA class II molecules, while clusters 2 and 3 each contain a single protein.

Table 4. MCL analysis of proteins based on network connectivity. Proteins were grouped into clusters using the MCL method. Each entry lists the clustering method, cluster number, number of genes in the cluster, protein name and a brief functional description. Cluster 1 contains 11 proteins, with the majority representing HLA class II histocompatibility antigens, alongside CD4 and the HLA class I molecule HLA-B.

Table 5. DBSCAN analysis of proteins. Proteins were grouped using the DBSCAN clustering method based on network proximity and density criteria. The table lists the clustering method, cluster number, gene count per cluster, protein name and functional description. Cluster 1 comprises 11 proteins, predominantly HLA class II histocompatibility antigens, alongside CD4 and the HLA class I molecule HLA-B.


Discussion

Our analysis of HCC using various bioinformatics tools provides critical insights into the immune-related molecular mechanisms contributing to disease progression. The GO terms, KEGG pathway analysis, PPI hub proteins and miRNA regulatory network collectively highlight key immune system components involved in HCC pathogenesis.

The KEGG pathway results further support the notion that autoimmune and inflammatory pathways are closely tied to HCC development. Additionally, miRNA regulation of key immune genes may serve as a novel mechanism for immune evasion, offering potential therapeutic targets. The clustering analysis revealed three major insights:

Class II Predominance in K-means: The K-means approach isolated MHC class II genes into a single cluster, indicating their distinct expression pattern in HCC. This highlights their role in adaptive immunity and tumour immune escape.

Integration of class I and II in MCL and DBSCAN: Unlike K-means, both MCL and DBSCAN clustered MHC class I and II genes together. This suggests functional cross-talk between these molecules in shaping the tumour immune microenvironment (TIME).

CD4 and CD74 as key regulators: The consistent clustering of CD4 and CD74 with MHC genes underscores their pivotal role in antigen presentation and T-cell activation in HCC.

Differential clustering of MHC class I and II genes, the clustering methods employed demonstrated distinct grouping tendencies of MHC genes. K-means clustering separated MHC class II molecules into a single group, while MCL and DBSCAN clustered both class I and II genes together. This distinction suggests that while class II molecules may exhibit distinct transcriptional or post-transcriptional regulation in HCC, their interaction with class I molecules remains essential in immune recognition and tumour evasion. The presence of class I molecules such as HLA-B within the same cluster as class II genes in MCL and DBSCAN points to functional convergence in antigen processing and immune activation.

CD4 and CD74 as central players in immune modulation, the consistent clustering of CD4 and CD74 with MHC class II genes underscores their critical role in antigen presentation and T-cell activation. CD4, a key co-receptor for MHC class II molecules, is central to T-helper cell function, supporting immune surveillance and cytokine-mediated responses. Similarly, CD74, which acts as an MHC class II chaperone, facilitates antigen processing and presentation. Their co-clustering with HLA genes reinforces their role in orchestrating the immune response against HCC cells, potentially influencing patient prognosis and treatment response.

Potential implications for tumour immune microenvironment, the clustering of immune-related genes reveals patterns that may reflect the tumour microenvironment's immunological landscape. The integration of MHC class I and II genes, as seen in MCL and DBSCAN clustering, indicates a coordinated immune response that may be influenced by tumour-induced immune suppression. The immune escape mechanisms in HCC often involve the downregulation of MHC molecules, allowing tumour cells to evade cytotoxic T-cell recognition. Understanding these clustering patterns could help identify novel biomarkers and therapeutic targets to counteract immune evasion strategies in HCC.

Comparative effectiveness of clustering techniques, the similarities between MCL and DBSCAN suggest these methods are robust for detecting biologically relevant interactions among immune genes. Both approaches grouped class I and II molecules together, capturing the functional interplay between these components. In contrast, K-means identified a distinct cluster for class II molecules, possibly due to differences in transcriptional regulation or specific cellular pathways unique to antigen-presenting cells. These variations highlight the importance of method selection when analysing immune-related gene expression in cancer research.

The TIME plays a crucial role in HCC progression and response to therapy. HCC tumours exhibit immune suppression through multiple mechanisms, including the upregulation of immune checkpoint molecules such as PD-1, CTLA-4 and TIM-3, which contribute to T-cell exhaustion [15]. Additionally, alterations in antigen presentation pathways, such as downregulation of MHC class I and II molecules, allow tumour cells to evade immune surveillance [16]. Understanding the immune landscape of HCC is essential for the development of immunotherapeutic strategies.

Recent advancements in computational biology have enabled the clustering of immune-related genes in HCC, shedding light on distinct immune subtypes with potential therapeutic implications. Clustering approaches such as K-means, MCL and DBSCAN have been utilised to classify immune-related genes based on their expression profiles [17]. These techniques have identified key immune clusters involving MHC class I and II molecules, CD4 and CD74, which are integral to antigen presentation and T-cell activation [18].

The identification of immune-related gene clusters has significant implications for personalised therapy in HCC. Immunotherapy, particularly immune checkpoint inhibitors (ICIs) targeting PD-1 and CTLA-4, has revolutionised HCC treatment [19]. The combination of ICIs, such as tremelimumab with durvalumab, has demonstrated promising efficacy in patients with unresectable HCC [20]. Additionally, therapies targeting the Wnt/β-catenin pathway and tumour-associated macrophages are being explored as potential strategies to enhance antitumour immunity [21].


Limitations of the study

The findings presented in this study are derived entirely from in silico analyses, which, while powerful for identifying potential biological associations, are inherently hypothesis-generating in nature. As such, these results should be interpreted with caution, as they require experimental validation to confirm their biological relevance. Further studies using cellular and rodent models are essential to elucidate the underlying mechanisms and to assess causality. Moreover, validation in appropriate clinical cohorts will be necessary to determine the translational significance of these findings in human disease contexts.


Conclusion

This study underscores the intricate interplay between the immune system and HCC, with a strong emphasis on antigen presentation, immune evasion and neuroimmune interactions. The identification of key regulatory miRNAs and hub proteins provides promising avenues for further research into targeted therapies. Future investigations should focus on validating these findings through experimental and clinical studies to improve our understanding of HCC pathogenesis and treatment strategies.

Our study demonstrates the utility of clustering algorithms in identifying immune gene interactions in HCC. The integration of MHC class I and II molecules in MCL and DBSCAN suggests coordinated immune regulation, while K-means highlights distinct expression patterns. These insights contribute to understanding immune modulation in HCC and may guide future immunotherapeutic strategies.


Acknowledgments

Authors thank Central Research Laboratory for Molecular Genetics, Bioinformatics and Machine Learning at Apollo Institute of Medical Sciences and Research Chittoor Murukamabttu 571727, Andhra Pradesh, India for the infrastructure.


Conflicts of interest

The authors declare no conflicts of interest related to this work.


Funding

This research did not receive any specific grant from funding agencies in the public, commercial or not-for-profit sectors.


Author contributions

Jyoti Brahmaiah – conceptualisation, data curation, formal analysis, original draft preparation; Usha Adiga – statistical analysis, methodology refinement, project administration, critical manuscript revision; Alfred J Augustine – supervision, validation, interpretation of results, critical feedback; Sampara Vasishta – literature review, bioinformatics analysis, visualisation, writing (review and editing), correspondence. All authors reviewed and approved the final manuscript.


Declaration of generative AI and AI-assisted technologies in the writing process

During the preparation of the articles, the authors used AI tools to reformulate some sentences, after which the authors reviewed and edited the content as needed and take full responsibility of the content in the article.


References

1. Abdelhamed W and El-Kassas M (2024) Hepatitis B virus as a risk factor for hepatocellular carcinoma: there is still much work to do Liver Res 8(2) 83–90 https://doi.org/10.1016/j.livres.2024.05.004

2. Giri S and Singh A (2024) Epidemiology of hepatocellular carcinoma in India - an updated review for 2024 J Clin Exp Hepatol 14(6) 101447 https://doi.org/10.1016/j.jceh.2024.101447 PMID: 38957612 PMCID: 11215952

3. Trépo E, Nahon P, and Bontempi G, et al (2014) Association between the PNPLA3 (rs738409 C>G) variant and hepatocellular carcinoma: evidence from a meta-analysis of individual participant data Hepatology 59 2170–2177 https://doi.org/10.1002/hep.26767

4. Burza MA, Pirazzi C, and Maglio C, et al (2012) PNPLA3 I148M (rs738409) genetic variant is associated with hepatocellular carcinoma in obese individuals Dig Liver Dis 44 1037–1041 https://doi.org/10.1016/j.dld.2012.05.006 PMID: 22704398

5. Liu YL, Patman GL, and Leathart JBS, et al (2014) Carriage of the PNPLA3 rs738409 C>G polymorphism confers an increased risk of non-alcoholic fatty liver disease-associated hepatocellular carcinoma J Hepatol 61 75–81 https://doi.org/10.1016/j.jhep.2014.02.030 PMID: 24607626

6. Seko Y, Sumida Y, and Tanaka S, et al (2017) Development of hepatocellular carcinoma in Japanese patients with biopsy-proven non-alcoholic fatty liver disease: association between PNPLA3 genotype and hepatocarcinogenesis/fibrosis progression Hepatol Res 47 1083–1092 https://doi.org/10.1111/hepr.12840

7. Newberry EP, Hall Z, and Xie Y, et al (2021) Liver-specific deletion of mouse Tm6sf2 promotes steatosis, fibrosis, and hepatocellular cancer Hepatology 74 1203–1219 https://doi.org/10.1002/hep.31771 PMID: 33638902 PMCID: 8390580

8. Liu YL, Reeves HL, and Burt AD, et al (2014) TM6SF2 rs58542926 influences hepatic fibrosis progression in patients with non-alcoholic fatty liver disease Nat Commun 5 4309 https://doi.org/10.1038/ncomms5309 PMID: 24978903 PMCID: 4279183

9. Yang J, Trépo E, and Nahon P, et al (2019) PNPLA3 and TM6SF2 variants as risk factors of hepatocellular carcinoma across various etiologies and severity of underlying liver diseases Int J Cancer 144 533–544 https://doi.org/10.1002/ijc.31910

10. Donati B, Dongiovanni P, and Romeo S, et al (2017) MBOAT7 rs641738 variant and hepatocellular carcinoma in non-cirrhotic individuals Sci Rep 7 4492 https://doi.org/10.1038/s41598-017-04991-0 PMID: 28674415 PMCID: 5495751

11. Yang J, Trépo E, and Nahon P, et al (2019) A 17-beta-hydroxysteroid dehydrogenase 13 variant protects from hepatocellular carcinoma development in alcoholic liver disease Hepatology 70 231–240 https://doi.org/10.1002/hep.30623 PMID: 30908678

12. Abul-Husn NS, Cheng X, and Li AH, et al (2018) A protein-truncating HSD17B13 variant and protection from chronic liver disease N Engl J Med 378 1096–1106 https://doi.org/10.1056/NEJMoa1712191 PMID: 29562163 PMCID: 6668033

13. Trépo E, Caruso S, and Yang J, et al (2022) Common genetic variation in alcohol-related hepatocellular carcinoma: a case-control genome-wide association study Lancet Oncol 23 161–171 https://doi.org/10.1016/S1470-2045(21)00603-3

14. Li S, Qian J, and Yang Y, et al (2012) GWAS identifies novel susceptibility loci on 6p21.32 and 21q21.3 for hepatocellular carcinoma in chronic hepatitis B virus carriers PLoS Genet 8(7) e1002791 https://doi.org/10.1371/journal.pgen.1002791 PMID: 22807686 PMCID: 3395595

15. Siegel RL, Miller KD, and Fuchs HE, et al (2022) Cancer statistics, 2022 CA Cancer J Clin 72 7–33 PMID: 35020204

16. Galle PR, Forner A, and Llovet JM, et al (2018) EASL clinical practice guidelines: management of hepatocellular carcinoma J Hepatol 69 182–236 https://doi.org/10.1016/j.jhep.2018.03.019

17. Omata M, Cheng AL, and Kokudo N, et al (2017) Asia–Pacific clinical practice guidelines on the management of hepatocellular carcinoma: a 2017 update Hepatol Int 11 317–370 https://doi.org/10.1007/s12072-017-9799-9 PMID: 28620797 PMCID: 5491694

18. Marrero JA, Kulik LM, and Sirlin CB, et al (2018) Diagnosis, staging, and management of hepatocellular carcinoma: 2018 practice guidance by the American Association for the study of liver diseases Hepatology 68 723–750 https://doi.org/10.1002/hep.29913 PMID: 29624699

19. U.S. Food and Drug Administration FDA Approves Tremelimumab in Combination with Durvalumab for Unresectable Hepatocellular Carcinoma [https://www.fda.gov/drugs/resources-information-approved-drugs/fda-approves-tremelimumab-combination-durvalumab-unresectable-hepatocellular-carcinoma] Date accessed: 26/10/22

20. Reig M, Forner A, and Rimola J, et al (2022) BCLC strategy for prognosis prediction and treatment recommendation: the 2022 update J Hepatol 76 681–693 https://doi.org/10.1016/j.jhep.2021.11.018 PMCID: 8866082

21. Nault J and Villanueva A (2021) Biomarkers for hepatobiliary cancers Hepatology 73 115–127 https://doi.org/10.1002/hep.31175

Related Articles

Sudarshan Govender, Tamsin K Phillips, Fiona M Walter, Sarah Day, Bothwell Guzha, Suzanne E Scott, Zvavahera M Chirenje, John E Ataguba, Nomonde Mbatani, Nazia Fakie, Jennifer Moodley
Cassio Murilo Hidalgo Filho, Mateus Marinho Nogueira Soares, Wesley Antonio Lopes de Lima, Laura Testa, Marcela Simonis Martins Ferrari, Renata Colombo Bonadio
Table of Contents
Table of Contents