ORIGINAL RESEARCH article

Front. Genet., 24 August 2022

Sec. Evolutionary, Population, and Conservation Genetics

Volume 13 - 2022 | https://doi.org/10.3389/fgene.2022.921447

Evolutionary selection identifies critical immune-relevant genes in lung cancer subtypes

  • 1. Cancer Biology and Evolution Program, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States

  • 2. Integrated Mathematical Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States

  • 3. Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, United States

  • 4. School of Biochemistry and Immunology, Trinity College Dublin, Trinity Biomedical Sciences Institute, Dublin, Ireland

  • 5. School of Medicine, Trinity College Dublin, Dublin, Ireland

Abstract

In an evolving population, proliferation is dependent on fitness so that a numerically dominant population typically possesses the most well adapted phenotype. In contrast, the evolutionary “losers” typically disappear from the population so that their genetic record is lost. Historically, cancer research has focused on observed genetic mutations in the dominant tumor cell populations which presumably increase fitness. Negative selection, i.e., removal of deleterious mutations from a population, is not observable but can provide critical information regarding genes involved in essential cellular processes. Similar to immunoediting, “evolutionary triage” eliminates mutations in tumor cells that increase susceptibility to the host immune response while mutations that shield them from immune attack increase proliferation and are readily observable (e.g., B2M mutations). These dynamics permit an “inverse problem” analysis linking the fitness consequences of a mutation to its prevalence in a tumor cohort. This is evident in “driver mutations” but, equally important, can identify essential genes in which mutations are seen significantly less than expected by chance. Here we utilized this new approach to investigate evolutionary triage in immune-related genes from TCGA lung adenocarcinoma cohorts. Negative selection differs between the two cohorts and is observed in endoplasmic reticulum aminopeptidase genes, ERAP1 and ERAP2 genes, and DNAM-1/TIGIT ligands. Targeting genes or molecular pathways under positive or negative evolutionary selection may permit new treatment options and increase the efficacy of current immunotherapy.

Introduction

Emergence of a clinical cancer population is typically a prolonged Darwinian process in which the malignant cells evolve phenotypic properties that adapt to tissue growth constraints and host immune responses. These evolutionary dynamics occur at the level of phenotypic interactions with environmental selection forces and accumulating genomic changes provide a record of this evolutionary arc. For example, during immunoediting, mutations that protect tumor cells from the immune system enhance proliferation and, therefore, have increased prevalence in the population. In contrast, mutations that increase recognition by the immune system, because they decrease fitness and proliferation, are typically lost from the population. Current molecular biology techniques prioritize the study of “driver” mutations that are observed within some large fraction of the tumor population. These provide important insights into critical pathways and potential therapeutic targets. However, prior theoretical studies have found identifying mutations subjected to negative selection can be equally informative. However, lost mutations cannot be measured directly in a tumor sample and require novel approaches.

In prior studies, the principle of “evolutionary triage (Gatenby et al., 2014)”, which links the prevalence of any gene mutation in a population with its contribution to cancer cell fitness (Gatenby and Frieden, 2002; Gatenby and Frieden, 2004). A mutation that increases fitness will also increase proliferation so that it is observed more frequently either within a tumor population or in a cohort of patients with that tumor. A mutation that does not alter fitness will not change proliferation and, therefore, be observed with a frequency that reflects the underlying mutation rate (Kimura, 1968; Gatenby and Frieden, 2002; Gatenby and Frieden, 2004). These represent the well-recognized dynamics of “driver” and “passenger” mutations. Computer simulations also demonstrated some genes must continue to function normally for optimal cancer cell fitness. In these genes, non-synonymous mutations cannot improve function and, so, will frequently decrease fitness and disappear from the population. These “essential” genes will usually have an observed prevalence of mutations that is less than expected by chance alone (i.e., less frequently than a passenger gene of comparable size) (Figure 1A).

FIGURE 1

The dynamics of negative selection are well recognized, but prior investigations across multiple cancer types found few genes are broadly conserved (Martincorena et al., 2018; Zapata et al., 2018). However, computer simulations demonstrated evolutionary conservation in cancer populations varies depending on local tissue environment selection forces and prior molecular changes in their evolutionary arc (Gatenby et al., 2014; Cunningham et al., 2015). Thus, genes conserved among cancers emerging from different tissue types and/or through different initiating driver genes are predicted to be uncommon. Recent studies of TCGA data in EGFR-mut, KRAS-mut, and wild-type (no known driver mutations) (WT) lung adenocarcinomas, confirmed this prediction as few genes were found to be conserved even among subtypes that shared a common tissue of origin (Freischel et al., 2021).

Here we investigate evolutionary triage in genes associated with tumor-immune interactions. Epithelial cells are active participants in the host immune response (Schleimer et al., 2007). Antigen presentation, immune-modulating protein expression (e.g., checkpoint ligands), as well as cytokine and chemokine secretion by epithelial cells can promote or reduce host immune functions (Schleimer et al., 2007; Larsen et al., 2020). Additionally, epithelial cells express surface and cytoplasmic pattern recognition receptors (PRR), which recognize molecular patterns derived from foreign pathogens and markers of stressed cells triggering intrinsic immune responses within epithelial cells, including release of inflammatory mediators and initiation of pyroptosis (Amarante-Mendes et al., 2018).

We hypothesized that, within these diverse and complex interactions, we could determine the most important cellular functions and molecular pathways by identifying genes under strong positive and negative evolutionary selection. Furthermore, by investigating cancers arising from the same organ but with different initiating oncogenic mutations, we could investigate divergence in immune-evasion strategies arising from different initiating genetic events.

Here we explore this hypothesis by investigating the mutational frequency of genes with known or expected functions in host-immune interactions in two cohorts of lung adenocarcinoma patients in the TCGA database, mutant KRAS (n = 163) and no known driver (WT) (n = 313). Evolutionary selection of multiple genes with well-documented roles in LUAD supports our hypothesis and encourages further investigation of other evolutionarily identified genes and molecular pathways that have not been extensively investigated.

Methods

Data collection

Mutations detected by the Multi-Center Mutation Calling in Multiple Cancers (MC3) project on TCGA lung adenocarcinoma samples were downloaded from the Genome Data commons (file: mc3.v0.2.8.PUBLIC.maf.gz, site: https://gdc.cancer.gov/about-data/publications/mc3-2017). Details of MC3 mutation identification using tumor and matched normal samples are provided here: [PMC6075717].

We identified one sample each from patients belonging to the lung adenocarcinoma cohort, and classified patients based on known driver or recurrent mutations in KRAS (G12, G13, Q61, A146), BRAF (V600, N581, G464, G466, G469, G596, D594), and EGFR (L858, S768, L861, G719, T790, indels in exons 18–21). Samples were excluded if they matched criteria for more than one of these genes. Samples that did not meet any of the mutation criteria were classified as WT (no driver mutations in EGFR, KRAS, or BRAF.) Cohorts with EGFR-mutant LUAD (n = 58) and BRAF-mutant LUAD (n = 31) had fewer samples, so we elected to focus primarily on the mKRAS (n = 163) and WT (n = 313) cohorts. Tumor and normal sequence alignment files (BAM) were downloaded from the Genome Data Commons, and gene-level depth of coverage was calculated by calculating bases covered by sequencing from the above files across each of the RefSeq coding genes (with 25 base pair flanking regions). A base was considered sufficiently covered if the depth of coverage was≥14 in tumor sample and≥8 in normal samples (as has been previously described: https://www.synapse.org/#!Synapse:syn1695394). The fraction of each gene’s protein coding bases (using the longest RefSeq transcript) covered by sufficient sequence data was calculated for each sample using the Negative Storage Model [PMC7157186].

Identification of over and under-mutated genes

To identify over- and under-mutated genes, we calculated the fraction of patients in each cohort with any protein-altering (nonsynonymous, truncating, canonical splice-site) mutations in each gene. To control for undercounting of mutations due to poor coverage, we divided the fraction patients mutated by the median fraction of coding bases with sufficient coverage. Genes with median coverage <50%, ANNOVAR annotation errors, or gene expression <2.0 log2 counts were excluded, as these genes either have artificially low mutation rates or are not likely to be essential due to low expression. Gene size was controlled by calculating a linear model between corrected mutation fraction and gene size (protein coding bases from largest RefSeq transcript.) (Figure 1B). Corrected mutation rate and gene size were both square-root transformed. The standardized residual of each gene to the regression line was determined: the most positive genes are the classical over-mutated or driver genes, and in each cohort the most highly mutated gene had the highest standardized residual (e.g., KRAS was the most over-mutated gene in the mKRAS cohort.) Genes with an SR ≥ 1.0 are over-mutated and genes with and SR ≤ -1.0 are conserved genes (Figure 1C). To minimize false positives, our analysis primarily focuses on genes with standardized residual (SR) values for conserved genes SR ≤ -2.0 to achieve p = 0.05 and SR ≤ -1.65 to achieve p = 0.1.

Curated gene lists

Literature review and gene databases including Gene Set Enrichment (Broad Institute) (GSEA) (Subramanian et al., 2005), PathCards (Belinky et al., 2015), and Genenames.org (Tweedie et al., 2021) were used to create lists of genes with known or suspected functions in epithelial cell interactions with the immune response. Genes were divided into eight functional processes: antigen presentation (Figure 2A), immune-modulating proteins (checkpoint ligands) (Figure 2C), innate immune receptor signaling (Figure 2B), cytokine signaling, chemokine signaling, Interferon signaling, complement, and programmed cell death (apoptosis, necrosis, and pyroptosis). Complete gene lists are provided (Supplementary Table S1).

FIGURE 2

Results

Gene conservation differs between the wild-type and mutant KRAS cohorts

More genes were highly conserved (SR ≤ -2.0) in mKRAS than WT cancers (160 and 248 respectively) (Supplementary Table S2). Among the immune genes examined (n = 576) 19 were conserved (SR ≤ -1.65) in WT cohort and 11 in mKRAS (Table 1). Many of the conserved genes were unique to each cohort with only one overlapping pair of genes. Complement genes C4A (WT SR = -2.863, mKRAS SR = -1.711) and C4B (WT SR = -2.864, mKRAS SR = -1.711), which encode the acidic and basic forms of complement factor 4, an activator of the classical complement pathway, were conserved in both cohorts. No genes from the list of chemokines and chemokine receptors were highly conserved in either cohort (Supplementary Figures S1A,B).

TABLE 1

FunctionGene symbolGene nameSR value wild-type
Immune Modulating ProteinsPVRL2Nectin Cell Adhesion Molecule 2−2.054
PVRPVR Cell Adhesion Molecule−1.782
MICAMHC Class I Polypeptide-Related Sequence A−1.699
Antigen ProcessingERAP1Endoplasmic Reticulum Aminopeptidase 1−2.797
TAP1Transporter 1, ATP Binding Cassette Subfamily B Member−2.566
HLA-FMajor Histocompatibility Complex, Class I, F−1.841
Innate Immune Receptor SignalingIRF3Interferon Regulatory Factor 3−1.865
Interferon SignalingIRF2BP1Interferon Regulatory Factor 2 Binding Protein 1−2.149
SOCS6Suppressor Of Cytokine Signaling 6−2.048
IRF5Interferon Regulatory Factor 5−2.003
IRF3Interferon Regulatory Factor 3−1.865
SIRT3Sirtuin 3−1.739
Cytokine SignalingIL1R1Interleukin 1 Receptor Type 1−2.118
IL17RBInterleukin 17 Receptor B−1.977
TNFRSF10ATNF Receptor Superfamily Member 10a−1.901
TNFRSF1ATNF Receptor Superfamily Member 1A−1.871
TGFB3Transforming Growth Factor Beta 3−1.770
IL6STInterleukin 6 Cytokine Family Signal Transducer−1.753
Apoptosis/necrosis/PyroptosisTNFRSF10ATNF Receptor Superfamily Member 10a-1.901
ComplementC4BComplement C4B (Chido Blood Group)−2.864
C4AComplement C4A (Rodgers Blood Group)−2.863
FunctionGene SymbolGene NameSR value mKRAS
Antigen ProcessingCIITAClass II Major Histocompatibility Complex Transactivator−2.296
ERAP2Endoplasmic Reticulum Aminopeptidase 2−2.085
Innate Immune Receptor SignalingCIITAClass II Major Histocompatibility Complex Transactivator−2.296
NLRC3NLR Family CARD Domain Containing 3−2.217
DHX58DExH-Box Helicase 58−1.687
Interferon SignalingSIRT1Sirtuin 1−1.791
Cytokine SignalingLTBP3Latent Transforming Growth Factor Beta Binding Protein 3−2.496
IL17RDInterleukin 17 Receptor D−1.779
TGFBITransforming Growth Factor Beta Induced−1.694
Apoptosis/necrosis/PyroptosisCASP8AP2Caspase 8 Associated Protein 2−3.160
ComplementC4BComplement C4B (Chido Blood Group)−1.711
C4AComplement C4A (Rodgers Blood Group)−1.711

Highly conserved genes in wild-type (top) and mKRAS (bottom) tumors. SR ≤ -1.65.

Differential conservation in antigen processing genes

Endogenous proteins are degraded into antigen precursors in the cytoplasm by the (immuno)proteasome. Transporter associated with antigen processing (TAP) transports the protein fragments into the endoplasmic reticulum (ER) for further processing by aminopeptidases and loading onto MHC-I proteins. Dysregulation of this process alters the peptide pool presented and loss of integral component results in diminished MHC-I expression. Transporter 1, ATP Binding Cassette Subfamily B Member, TAP1 was conserved in the WT cohort (SR = -2.566) but not in the mKRAS group (SR = -0.561) (Figures 3A,B). TAP1 co-localizes with TAP2 and mediates the flow of peptides from the cytosol into the endoplasmic reticulum prior to cleavage by ERAP proteins. TAP2 gene expression and conservation score could not be determined from our dataset.

FIGURE 3

The complexity of evolutionary selection is notable in the endoplasmic reticulum aminopeptidase genes, ERAP1 and ERAP2, which shape the peptidome by cleaving peptide precursors after transport into the ER (Figure 3C). ERAP1 is among the most conserved genes in WT cancers (SR = -2.797) and ERAP2 among the most conserved in mKRAS (SR = -2.085) (Figures 3A,B). Interestingly, no ERAP1 mutations were found in any mEGFR tumors or comparable TCGA data from melanoma patients (data not shown), suggesting its function is essential in multiple cancer types.

Polio virus receptor gene conservation in wild-type tumors

Tumor cells express immune modulating proteins that activate or inactivate (checkpoint proteins) immune cells. None of the immune modulating proteins were significantly conserved in the mKRAS group. Polio Virus Like Receptor 2 (PVRL2, Nectin2, CD112) and PVR (CD155), conserved in WT (SR = -2.054 and -1.782, respectively), were also conserved in mKRAS cohort although did not reach significance (SR = -1.458 and -1.234, respectively) (Figures 4A,B). The conserved tumoral PVR proteins represent known tumor evasions strategies and can activate or inhibit (Wu et al., 2021) NK cells and T cells (Figure 4C).

FIGURE 4

MHC class I polypeptide–related sequence A (MICA) a stress-induced activating ligand for Natural Killer Cell Lectin Like Receptor, KLRK1 (NKG2D) was conserved in the wild-type cohort (SR = -1.699). Ligation of NKG2D activates NK cells and gamma-delta-T cells and co-stimulates CD8 T cells (Holdenrieder et al., 2006). However, MICA can be cleaved from the cell surface and inactivate distant NK cells, NKT cells, gamma-delta T cells, and CD8 T cells (Uhlenbrock et al., 2014). Numerous proteases, including MMPs, ADAM10, and ADAM17, are involved in MICA cleavage. Additional NKG2D ligands, RAET1E (ULBP4) and ULBP2, were also conserved in the wild-type cohort but did not reach the SR cutoff of -1.65 (WT SR = -1.37 and -1.32, respectively).

Conservation of cytokine signaling genes

Malignant epithelial cells secrete and detect cytokines in the tumor microenvironment that regulate the immune response and support pro-growth signals. The potential for false positives is increased in genes with a conservation score above SR = -1.65. However, we note that both subunits of the type I IFN receptor, interferon-alpha receptor 1 (IFNAR1) and IFNAR2, are conserved in mKRAS samples (SR = -1.490 and SR = -1.418, respectively) (Figures 5D,E). Stimulation of IFNAR1/2 activates Janus kinase 1 (JAK1), (neutral in mKRAS) and tyrosine kinase 2 (TYK2) (mKRAS SR = -1.070). Type II interferon-gamma receptors IFNGR1 and INFGR2 were also conserved in mKRAS but did not meet the -1.65 threshold (SR = -1.371 and SR = -1.069, respectively) (Figures 5D,E). IFN-γ signaling in epithelial cells induces the expression of multiple genes containing γ-interferon activated sequences. MHC-I, checkpoint inhibitor PD-L1, and immunosuppressive enzyme, IDO are upregulated on cancer cells by IFN-γ treatment (21). All interferon receptor genes were neutral in the wild-type cohort.

FIGURE 5

Two of the nine interferon regulatory factor (IRF) genes were highly conserved and expressed in the wild-type cohort. IRF5, conserved in WT (SR = -2.002) and neutral in mKRAS, regulates type-I interferon genes and IFN stimulated genes (ISG) (Figures 5C,D). Similarly, activated IRF-3, conserved in WT (SR = -1.864) translocates to the nucleus following phosphorylation by TKB1 (WT SR = -1.010) and IKBKE (WT neutral) kinases and induces expression of type I interferons and ISGs. IRF3 is activated by a variety of pattern recognition receptors which are neutral in both cohorts (Figures 5A,B,F) (Zhu et al., 2010). Interleukin 1 receptor-1 (IL1R1) is the most conserved among the cytokine signalling genes in WT tumors (Supplementary Figure S1C). Following ligation by either IL1A or IL1B, IL1R1 associates with interleukin-1 rector accessory protein (IL1RAP) (WT SR = -1.116) and activates many downstream proinflammatory pathways including NF-κB and MAPK (44).

Over-mutated genes are shared between WT and mKRAS cohorts

Mutations can inhibit or enhance protein function. Positive selection occurs when the mutation increases fitness. WT and mKRAS tumors averaged 348 and 300 total mutations per tumor compared mEGFR LUADs with 86 (data not shown). Frequently mutated genes, defined as genes mutated in greater than 10% of patients in each cohort, numbered 208 and 157 in the WT and mKRAS cohorts, respectively. Genes with the highest mutation scores in our cohorts reflect known drivers and frequently mutated genes including KRAS, TP53, KEAP1, and STK11 (Supplementary Table S3) (Gong et al., 2020), demonstrating the validity of this approach. Of the immune related genes we examined (n = 576), 29 were over-mutated in WT (SR ≥ 1.65) and 26 in mKRAS with 16 genes common between the subtypes (Table 2).

TABLE 2

FunctionGene symbolGene nameSR value wild-type
Immune Modulating ProteinsCD86CD86 Molecule1.693
Antigen ProcessingCD1ECD1e Molecule2.513
CD1BCD1b Molecule1.831
B2MBeta-2-Microglobulin1.653
Innate Immune Receptor SignalingTLR4Toll Like Receptor 43.626
NLRP3NLR Family Pyrin Domain Containing 33.150
NLRP14NLR Family Pyrin Domain Containing 142.404
NLRP5NLR Family Pyrin Domain Containing 51.975
NLRP10NLR Family Pyrin Domain Containing 101.936
NLRP7NLR Family Pyrin Domain Containing 71.874
NLRP13NLR Family Pyrin Domain Containing 131.865
NLRP12NLR Family Pyrin Domain Containing 121.838
NLRP4NLR Family Pyrin Domain Containing 41.836
NLRP8NLR Family Pyrin Domain Containing 81.753
Interferon SignalingSMARCA4SWI/SNF Related, Matrix Associated, Actin Dependent Regulator Of Chromatin, Subfamily A, Member 42.854
DOCK2Dedicator Of Cytokinesis 22.378
Cytokine SignalingLTBP1Latent Transforming Growth Factor Beta Binding Protein 12.766
IL7RInterleukin 7 Receptor1.845
IL18RAPInterleukin 18 Receptor Accessory Protein1.804
IL1RAPL1Interleukin 1 Receptor Accessory Protein Like 11.758
Apoptosis/necrosis/PyroptosisSERPINB4Serpin Family B Member 42.140
ComplementCSMD3CUB And Sushi Multiple Domains 36.194
CSMD1CUB And Sushi Multiple Domains 13.701
C7Complement C72.656
ITGAXIntegrin Subunit Alpha X2.277
CSMD2CUB And Sushi Multiple Domains 22.238
CFHComplement Factor H1.940
CFHR5Complement Factor H Related 51.926
C6Complement C61.715
FunctionGene SymbolGene NameSR value mKRAS
Immune Modulating ProteinsPVRL1Nectin Cell Adhesion Molecule 11.653
Antigen ProcessingCD1ECD1e Molecule2.329
CD1ACD1a Molecule1.742
B2MBeta-2-Microglobulin1.654
Innate Immune Receptor SignalingNLRP3NLR Family Pyrin Domain Containing 33.508
NLRP5NLR Family Pyrin Domain Containing 53.427
TLR4Toll Like Receptor 43.331
NLRP14NLR Family Pyrin Domain Containing 142.856
NLRP10NLR Family Pyrin Domain Containing 102.449
BTKBruton Tyrosine Kinase2.057
NLRP12NLR Family Pyrin Domain Containing 121.888
Cytokine SignalingLTBP1Latent Transforming Growth Factor Beta Binding Protein 11.918
IL1RAPL1Interleukin 1 Receptor Accessory Protein Like 11.805
LTBP2Latent Transforming Growth Factor Beta Binding Protein 21.690
SERPINB4Serpin Family B Member 42.325
Apoptosis/necrosis/PyroptosisBIRC7Baculoviral IAP Repeat Containing 71.733
CSMD3CUB And Sushi Multiple Domains 35.517
ComplementCSMD1CUB And Sushi Multiple Domains 14.948
ITGAXIntegrin Subunit Alpha X2.765
C7Complement C72.391
MBL2Mannose Binding Lectin 22.187
FCN2Ficolin 22.101
MASP1MBL Associated Serine Protease 11.952
COLEC11Collectin Subfamily Member 111.900
CFHR4Complement Factor H Related 41.758
CFHComplement Factor H1.699

Over-mutated genes in WT (top) and mKRAS (bottom) cohorts. SR ≤ -1.65.

Mutations in driver genes with possible immune consequences

TP53, a multifunctional tumor suppressor mutated in over 50% of human cancers (Ozaki and Nakagawara, 2011), may regulate components of the innate and adaptive immune response (Hellmann et al., 2018; Shi and Jiang, 2021). TP53 mutations were found in 57% of WT patients and 35% of mKRAS patients. Mutations in liver kinase B1 (LKB1) gene, which encodes Serine/Threonine Kinase 11 (STK11), the 3rd most mutated gene in the mKRAS cohort (after KRAS and TP53), regulates metabolism, energy sensing and modulates the stimulator of interferon genes (STING) pathway (Esteve-Puig et al., 2014; Kim et al., 2019). STK11 mutations are associated with immunologically cold tumors (Schabath et al., 2016), increased neutrophil accumulation (Koyama et al., 2016), high expression of LAG3 ligand, FLG1 (Wang et al., 2019; Lehtiö et al., 2021), and poor outcomes in patients with mKRAS (Skoulidis et al., 2018). Anaplastic Lymphoma Kinase (ALK) mutations, found in 7.7% (24/313) of WT tumors, 4% (7/163) of mKRAS tumors, can generate an immunosuppressive microenvironment unresponsive to checkpoint blockade alone (Hellmann et al., 2018; Sankar et al., 2021).

Common over-mutated genes reflect previously identified pro-tumor pathways

Wild-type and mutant KRAS cohorts have many over-mutated genes in common (SR ≥ 1.65). Beta-2-Microglobulin (B2M) is a small gene (360bp) associated with resistance to checkpoint blockade. B2M was mutated in 2% of the wild-type cohort (6/313) and the mKRAS cohort (3/163) (WT SR = 1.653, mKRAS SR = 1.654). SERPINB4, (WT SR = 2.140, mKRAS SR = 2.325) is a serine protease inhibitor that inactivates granzyme M and promotes survival. Toll-like receptor 4 (TLR4) is commonly overexpressed in barrier epithelial cells and cancer (Luddy et al., 2014). TLR4 mutations were found in 12% (37/313) of the WT tumors and 11% (18/163) of the mKRAS tumors.

The NLRP family of receptors are under strong evolutionary selection in both cohorts (Figures 5A,B). NLRP3 has the highest expression level and is over-mutated in both cohorts (WT SR = 3.150, mKRAS SR = 3.508). The NLRP3 inflammasome alters the tumor microenvironment via secretion of pro-inflammatory cytokines IL1B (neutral in both cohorts) and IL18 (neutral in mKRAS, conserved in WT, SR = -1.144) (Kelley et al., 2019; Swanson et al., 2019). Similarly, the CD1 gene family is frequently mutated in both cohorts (Figures 3A,B). CD1E had the highest mutational prevalence (WT SR = 2.513, mKRAS SR = 2.329) and highest expression level (Figures 4A,B). Similar to ERAP1/2, CD1E and shapes the lipid antigen pool (Facciotti et al., 2011). CD1 molecules are mostly expressed by antigen presenting cells, however, aberrant expression of CD1 proteins by malignant cells can activate anti-tumor immune responses (Haraguchi et al., 2006; Hix et al., 2011).

Discussion

The evolutionary arc of each tumor is likely unique with variations resulting from molecular properties of the initiating cells, heterogeneity in host responses, and stochastic variations in accumulated genetic events (Toki et al., 2018; Wang et al., 2021). However, prior theoretical studies have suggested commonalities should be found among tumors originating from the same tissue and/or initiation driver genes. Here, we demonstrate mKRAS and WT lung cancers generally exhibit convergent evolutionary selection among broad mechanisms of immune interactions and some gene families, but the specific genes selected, particularly those conserved, are frequently subtype specific. Such convergence of broad (phenotypic) properties through divergent molecular pathways is observed in nature. For example, cavefish develop a characteristic phenotype (no eyes or pigment) but through diverse genetic pathways depending on the genomic state of the founder population and stochastic perturbations (Gatenby et al., 2011).

Magalhães (Magalhães, 2021), in a PubMed analysis, found virtually every human gene has been associated with cancer. Within this vast data set, how can clinically important genes be selected? Here we propose a “Darwinian road map” can identify critical genes under evolutionary selection in clinical cancer cohorts. We examine gene conservation in immune related genes. Several of the genes identified in our studies have been noted previously. Pre-clinical and clinical work targeting many of these pathways are ongoing. The divergence of evolutionary selection in mKRAS and WT suggest pre-clinical experiments and clinical investigations with targeted drugs should be carefully designed with an understanding of the Darwinian importance of specific genes in specific tumor subtypes. For example, among immune modulators, common targets for immunotherapy (e.g., CD274, CTLA4) show neutral patterns of evolutionary selection. PVR, PVRL2, and MICA, which likely regulate NK cell function, are conserved in wild-type cohort. Nine anti-TIGIT (PVR and PVRL2 receptor) antibody therapies are currently being evaluated in 43 trials of advanced solid tumors, including NSCLC (Ge et al., 2021).

Each cohort conserves only one of two ERAP genes, which modify peptides for presentation by MHC class I. Clearly, the ERAP function is critical for optimal fitness in each cohort, and we note ERAP1/2 expression was detected in all 9,125 tumor specimens in The Cancer Genome Atlas database (TCGA), and deep deletions are rare (0.6–0.8%) (Compagnone et al., 2019. Targeting ERAP has been suggested by others and ERAP modulators are under development to modulate the anti-tumor immune response (Stratikos, 2014). However, our data raise an intriguing question: why do WT tumors highly conserve ERAP1 while mKRAS tumors equally highly conserve ERAP2? IRF3, conserved in wild-type cohorts, regulates the expression of type I IFN genes and IFN stimulated genes (ISG) through binding interferon-stimulated response elements (ISRE) in their promoters. IRF3 is activated by several innate immune receptors (Figure 5F), including the cGAS/STING pathway, a novel immunotherapy target with ongoing clinical trials in cancer (Jiang et al., 2020).

Finally, in prior theoretical studies, computer simulations predicted therapies that disrupt conserved genes can have similar therapeutic efficacy to targeting driver genes (Gatenby et al., 2014). Thus, conserved genes in this study may be valuable clinical targets. Furthermore, simulations predicted combination therapy targeting a driver gene and a driver-specific conserved gene was highly lethal and often produced an evolutionary state in which no adaptive strategy was available, resulting in population extinction (Gatenby et al., 2014).

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: mc3.v0.2.8.PUBLIC.maf.gz: https://gdc.cancer.gov/about-data/publications/mc3-2017. Genome Data Commons: https://portal.gdc.cancer.gov/repository.

Author contributions

RG original concept design, data analysis, writing. KL: data analysis, concept design, writing. JT and AF performed data collection and model development. CO advisor and writing. All authors read and approved the final manuscript.

Funding

This work is supported by grants U54 CA143970, R01 CA077575, and U01CA232382 to RAG; by NCI Comprehensive Cancer Center Grant P30 CA076292, and by monies from the State of Florida to the H. Lee Moffitt Cancer Center and Research Institute. The results demonstrated here are based upon data generated by the TCGA Research Network: https://www.cancer.gov/tcga.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2022.921447/full#supplementary-material

References

  • 1

    Amarante-MendesG. P.AdjemianS.BrancoL. M.ZanettiL. C.WeinlichR.BortoluciK. R. (2018). Pattern recognition receptors and the host cell death molecular machinery. Front. Immunol.9, 2379. 10.3389/fimmu.2018.02379

  • 2

    BelinkyF.NativN.StelzerG.ZimmermanS.Iny SteinT.SafranM.et al (2015). PathCards: Multi-source consolidation of human biological pathways. Database.2015, bav006. 10.1093/database/bav006

  • 3

    CompagnoneM.CifaldiL.FruciD. (2019). Regulation of ERAP1 and ERAP2 genes and their disfunction in human cancer. Hum. Immunol.80 (5), 318324. 10.1016/j.humimm.2019.02.014

  • 4

    CunninghamJ. J.BrownJ. S.VincentT. L.GatenbyR. A. (2015). Divergent and convergent evolution in metastases suggest treatment strategies based on specific metastatic sites. Evol. Med. Public Health2015 (1), 7687. 10.1093/emph/eov006

  • 5

    Esteve-PuigR.GilR.Gonzalez-SanchezE.Bech-SerraJ. J.GruesoJ.Hernandez-LosaJ.et al (2014). A mouse model uncovers LKB1 as an UVB-induced DNA damage sensor mediating CDKN1A (p21WAF1/CIP1) degradation. PLoS Genet.10 (10), e1004721. 10.1371/journal.pgen.1004721

  • 6

    FacciottiF.CavallariM.AngenieuxC.Garcia-AllesL. F.Signorino-GeloF.AngmanL.et al (2011). Fine tuning by human CD1e of lipid-specific immune responses. Proc. Natl. Acad. Sci. U. S. A.108 (34), 1422814233. 10.1073/pnas.1108809108

  • 7

    FreischelA. R. T. J.CunninghamJ.Artzy-RandrupY.EpsteinT.TsaiK. Y.BerglundA.et al (2021). Evolutionary analysis of TCGA data using over- and under- mutated genes identify key molecular pathways and cellular functions in lung cancer subtypes. New York: In Press.

  • 8

    GatenbyR. A.CunninghamJ. J.BrownJ. S. (2014). Evolutionary triage governs fitness in driver and passenger mutations and suggests targeting never mutations. Nat. Commun.5, 5499. 10.1038/ncomms6499

  • 9

    GatenbyR. A.FriedenB. R. (2002). Application of information theory and extreme physical information to carcinogenesis. Cancer Res.62 (13), 36753684.

  • 10

    GatenbyR. A.FriedenB. R. (2004). Information dynamics in carcinogenesis and tumor growth. Mutat. Res.568 (2), 259273. 10.1016/j.mrfmmm.2004.04.018

  • 11

    GatenbyR. A.GilliesR. J.BrownJ. S. (2011). Of cancer and cave fish. Nat. Rev. Cancer11 (4), 237238. 10.1038/nrc3036

  • 12

    GeZ.PeppelenboschM. P.SprengersD.KwekkeboomJ. (2021). TIGIT, the next step towards successful combination immune checkpoint therapy in cancer. Front. Immunol.12 (2987), 699895. 10.3389/fimmu.2021.699895

  • 13

    GongM.LiY.YeX.ZhangL.WangZ.XuX.et al (2020). Loss-of-function mutations in KEAP1 drive lung cancer progression via KEAP1/NRF2 pathway activation. Cell. Commun. Signal.18 (1), 98. 10.1186/s12964-020-00568-z

  • 14

    HaraguchiK.TakahashiT.NakaharaF.MatsumotoA.KurokawaM.OgawaS.et al (2006). CD1d expression level in tumor cells is an important determinant for anti-tumor immunity by natural killer T cells. Leuk. Lymphoma47 (10), 22182223. 10.1080/10428190600682688

  • 15

    HellmannM. D.NathansonT.RizviH.CreelanB. C.Sanchez-VegaF.AhujaA.et al (2018). Genomic features of response to combination immunotherapy in patients with advanced non-small-cell lung cancer. Cancer Cell.33 (5), 843852. 10.1016/j.ccell.2018.03.018

  • 16

    HixL. M.ShiY. H.BrutkiewiczR. R.SteinP. L.WangC. R.ZhangM. (2011). CD1d-expressing breast cancer cells modulate NKT cell-mediated antitumor immunity in a murine model of breast cancer metastasis. PLoS One6 (6), e20702. 10.1371/journal.pone.0020702

  • 17

    HoldenriederS.StieberP.PeterfiA.NagelD.SteinleA.SalihH. R.et al (2006). Soluble MICA in malignant diseases. Int. J. Cancer118 (3), 684687. 10.1002/ijc.21382

  • 18

    JiangM.ChenP.WangL.LiW.ChenB.LiuY.et al (2020). cGAS-STING, an important pathway in cancer immunotherapy. J. Hematol. Oncol.13 (1), 81. 10.1186/s13045-020-00916-z

  • 19

    KelleyN.JeltemaD.DuanY.HeY. (2019). The NLRP3 inflammasome: An overview of mechanisms of activation and regulation. Int. J. Mol. Sci.20 (13), E3328. 10.3390/ijms20133328

  • 20

    KimJ.HuZ.CaiL.LiK.ChoiE.FaubertB.et al (2019). Author correction: CPS1 maintains pyrimidine pools and DNA synthesis in KRAS/LKB1-mutant lung cancer cells. Nature569 (7756), E4. 10.1038/s41586-019-1133-3

  • 21

    KimuraM. (1968). Evolutionary rate at the molecular level. Nature217 (5129), 624626. 10.1038/217624a0

  • 22

    KoyamaS.AkbayE. A.LiY. Y.ArefA. R.SkoulidisF.Herter-SprieG. S.et al (2016). STK11/LKB1 deficiency promotes neutrophil recruitment and proinflammatory cytokine production to suppress T-cell activity in the lung tumor microenvironment. Cancer Res.76 (5), 9991008. 10.1158/0008-5472.Can-15-1439

  • 23

    LarsenS. B.CowleyC. J.FuchsE. (2020). Epithelial cells: Liaisons of immunity. Curr. Opin. Immunol.62, 4553. 10.1016/j.coi.2019.11.004

  • 24

    LehtiöJ.ArslanT.SiavelisI.PanY.SocciarelliF.BerkovskaO.et al (2021). Proteogenomics of non-small cell lung cancer reveals molecular subtypes associated with specific therapeutic targets and immune-evasion mechanisms. Nat. Cancer2 (11), 12241242. 10.1038/s43018-021-00259-9

  • 25

    LuddyK. A.Robertson-TessiM.TafreshiN. K.SolimanH.MorseD. L. (2014). The role of toll-like receptors in colorectal cancer progression: Evidence for epithelial to leucocytic transition. Front. Immunol.5, 429. 10.3389/fimmu.2014.00429

  • 26

    MagalhãesJ. Pd (2021). Every gene can (and possibly will) be associated with cancer. Trends Genet.38, 216217. 10.1016/j.tig.2021.09.005

  • 27

    MartincorenaI.RaineK. M.GerstungM.DawsonK. J.HaaseK.Van LooP.et al (2018)., 173. PMC6005233, 1823. 10.1016/j.cell.2018.06.001Universal patterns of selection in cancer and somatic tissuesCell.7

  • 28

    OzakiT.NakagawaraA. (2011). Role of p53 in cell death and human cancers. Cancers (Basel)3 (1), 9941013. 10.3390/cancers3010994

  • 29

    SankarK.NagrathS.RamnathN. (2021). Immunotherapy for ALK-rearranged non-small cell lung cancer: Challenges inform promising approaches. Cancers (Basel)13 (6), 1476. 10.3390/cancers13061476

  • 30

    SchabathM. B.WelshE. A.FulpW. J.ChenL.TeerJ. K.ThompsonZ. J.et al (2016). Differential association of STK11 and TP53 with KRAS mutation-associated gene expression, proliferation and immune surveillance in lung adenocarcinoma. Oncogene35 (24), 32093216. 10.1038/onc.2015.375

  • 31

    SchleimerR. P.KatoA.KernR.KupermanD.AvilaP. C. (2007). Epithelium: At the interface of innate and adaptive immune responses. J. Allergy Clin. Immunol.120 (6), 12791284. 10.1016/j.jaci.2007.08.046PMC2810155)

  • 32

    ShiD.JiangP. (2021). A different facet of p53 function: Regulation of immunity and inflammation during tumor development. Front. Cell. Dev. Biol.9, 762651. 10.3389/fcell.2021.762651

  • 33

    SkoulidisF.GoldbergM. E.GreenawaltD. M.HellmannM. D.AwadM. M.GainorJ. F.et al (2018). STK11/LKB1 mutations and PD-1 inhibitor resistance in KRAS-mutant lung adenocarcinoma. Cancer Discov.8 (7), 822835. 10.1158/2159-8290.CD-18-0099

  • 34

    StratikosE. (2014). Modulating antigen processing for cancer immunotherapy. Oncoimmunology3 (1), e27568. 10.4161/onci.27568

  • 35

    SubramanianA.TamayoP.MoothaV. K.MukherjeeS.EbertB. L.GilletteM. A.et al (2005). Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. U. S. A.102 (43), 1554515550. 10.1073/pnas.0506580102

  • 36

    SwansonK. V.DengM.TingJ. P. (2019). The NLRP3 inflammasome: Molecular activation and regulation to therapeutics. Nat. Rev. Immunol.19 (8), 477489. 10.1038/s41577-019-0165-0

  • 37

    TokiM. I.ManiN.SmithyJ. W.LiuY.AltanM.WassermanB.et al (2018). Immune marker profiling and programmed death ligand 1 expression across NSCLC mutations. J. Thorac. Oncol.13 (12), 18841896. 10.1016/j.jtho.2018.09.012

  • 38

    TweedieS.BraschiB.GrayK.JonesT. E. M.SealR. L.YatesB.et al (2021). Genenames.org: The HGNC and VGNC resources in 2021. Nucleic Acids Res.49 (D1), D939D946. 10.1093/nar/gkaa980

  • 39

    UhlenbrockF.Hagemann-JensenM.KehletS.AndresenL.PastorekovaS.SkovS.et al (2014). The NKG2D ligand ULBP2 is specifically regulated through an invariant chain-dependent endosomal pathway. J. Immunol.193 (4), 16541665. 10.4049/jimmunol.1303275

  • 40

    WangJ.SanmamedM. F.DatarI.SuT. T.JiL.SunJ.et al (2019). Fibrinogen-like protein 1 is a major immune inhibitory ligand of LAG-3. Cell.176 (1-2), 334347. 10.1016/j.cell.2018.11.010

  • 41

    WangX.RicciutiB.NguyenT.LiX.RabinM. S.AwadM. M.et al (2021). Association between smoking history and tumor mutation burden in advanced non–small cell lung cancer. Cancer Res.81 (9), 25662573. 10.1158/0008-5472.Can-20-3991

  • 42

    WuB.ZhongC.LangQ.LiangZ.ZhangY.ZhaoX.et al (2021). Poliovirus receptor (PVR)-like protein cosignaling network: New opportunities for cancer immunotherapy. J. Exp. Clin. Cancer Res.40 (1), 267. 10.1186/s13046-021-02068-5

  • 43

    ZapataL.PichO.SerranoL.KondrashovF. A.OssowskiS.SchaeferM. H.et al (2018). Negative selection in tumor genome evolution acts on essential cellular functions and the immunopeptidome. Genome Biol.19 (1), 67. 10.1186/s13059-018-1434-0

  • 44

    ZhuJ.SmithK.HsiehP. N.MburuY. K.ChattopadhyayS.SenG. C.et al (2010). High-throughput screening for TLR3-IFN regulatory factor 3 signaling pathway modulators identifies several antipsychotic drugs as TLR inhibitors. J. Immunol.184 (10), 57685776. 10.4049/jimmunol.0903559

Summary

Keywords

cancer evolution, cancer immunology, evolutionary triage, gene conservation, immune evasion

Citation

Luddy KA, Teer JK, Freischel A, O’Farrelly C and Gatenby R (2022) Evolutionary selection identifies critical immune-relevant genes in lung cancer subtypes. Front. Genet. 13:921447. doi: 10.3389/fgene.2022.921447

Received

15 April 2022

Accepted

07 July 2022

Published

24 August 2022

Volume

13 - 2022

Edited by

Luca Ermini, Luxembourg Institute of Health, Luxembourg

Reviewed by

Nivedita Ratnam, National Cancer Institute (NIH), United States

Eszter Lakatos, Queen Mary University of London, United Kingdom

Updates

Copyright

*Correspondence: Kimberly A. Luddy, ; Robert Gatenby,

This article was submitted to Evolutionary and Population Genetics, a section of the journal Frontiers in Genetics

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics