ORIGINAL RESEARCH article
Computational Protein Phenotype Characterization of IL10RA Mutations Causative to Early Onset Inflammatory Bowel Disease (IBD)
- 1Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
- 2Princess Al-Jawhara Al-Brahim Center of Excellence in Research of Hereditary Disorders, King Abdulaziz University, Jeddah, Saudi Arabia
- 3Department of Genetic Medicine, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
- 4Department of Medical Laboratory Technology, Faculty of Applied Medical Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
The deleterious amino acid substitution mutations in IL-10 receptor alpha gene are most frequently reported in several autoimmune diseases including early onset-inflammatory bowel disease (IBD). Despite the important role of IL-10 RA in maintaining immune homeostasis, the specific structural and functional implications of these mutations on protein phenotype, stability, ligand binding and post translational characteristics is not well explored. Therefore, this study performed the multidimensional computational analysis of IL10RA missense variations causative to pediatric or early onset inflammatory bowel disease (<5 years of age). Our computational algorithmic screening identified the deleterious nature of p. W45G, p. Y57C, p. W69G, p.T84I, p.Y91C, p.R101W, p.R117C, and p.R117H, IBD causative IL10-RA mutations. The sensitivity and specificity analysis of different computational methods showed that CADD outperform SIFT, PolyPhen 2.0, FATHMM, LRT, MetaLR, MetaSVM, PROVEAN and Condel in predicting the pathogenicity of IL10RA mutations. Our three-dimensional protein modeling assays showed that the point mutations cause major drifts in the structural plasticity of IL10 RA molecule and negatively influence its stability. Findings from molecular docking analysis have shown that these point mutations decrease the binding affinity of IL10RA toward IL10 and may likely to disturb the IL10 signaling pathway. This study provides an easy frame work for phenotypic characterization of mutant IL10RA molecule in terms of structure, flexibility and stability aspects. Our approach may also add a new dimension to conventional functional biology assays in quickly studying IL10 RA mutations and also for designing and developing inhibitors for mutant IL10RA molecule.
IL10RA (Interleukin-10 receptor alpha) essentially constitutes to Interleukin-10 receptor hetero tetramer, that belongs to class II cytokine or the IFNR-like receptors (Ho et al., 1993; Tan et al., 1993; Liu et al., 1994). The two α sub-units of IL10RA interacts (Kd ~35–200 pM) with IL10 (Josephson et al., 2001) and forms a high affinity IL10-IL10RA complex. This intermediate complex forms a low affinity interaction with IL10RB, subsequently forming an active signaling complex that activates Janus Kinase 1 and Tyrosine Kinase 2 (Finbloom and Winestock, 1995), leading to nuclear translocation of Signal Transducer and Activator of Transcription 3, activation of downstream target gene transcription of anti-inflammatory effectors (Stahl et al., 1995). IL10 is a critical cytokine molecule that modulates the intestinal mucosal homeostasis. This is supported by the description that, laboratory mice deficient in IL10 prone to develop spontaneous enterocolitis (Shouval et al., 2014). In concordance to these findings mice deficient in IL 10 receptor is also seen to develop spontaneous enterocolitis (Spencer et al., 1998). In humans, both IL10 and its receptors, together contribute in controlling the intestinal mucosal immune responses. Nucleotide sequence alterations in IL10 are linked with the risk of developing inflammatory bowel disease (IBD), as reported in genome wide association studies (Mesbah-Uddin et al., 2015; Huang et al., 2017). A severe early onset IBD, which occurs in children (<5 years) is reported to occur in individuals with mutations in IL10 or its receptors (Zhu et al., 2017). Mutations in coding region of IL10 or IL10 receptor alpha or beta genes leads to defective IL10 signaling and disturb the anti-inflammatory responses in the gastro intestinal tract. The known early onset IBD causative mutations of IL10RA gene includes W45G, Y57C, W69R, T84I, Y91C, R101W, R117C, R117H, L125R, G141R, I169T, I224V, R262C, R412W, and R412Q. However, their phenotypic characterization in terms of structural plasticity, stability and IL10 binding characteristics aspects is not yet performed.
The characterization of missense mutations through conventional laboratory assays is expensive, time taking and laborious. In this regard, the recent studies indicate that computational approaches can successfully explore the deleterious effects of different mutant versions of protein molecules (Banaganapalli et al., 2016, 2017). Therefore, we aimed to understand the molecular reasons behind detrimental effects of mutant IL10RA molecules, through comprehensive computational assays covering pathogenicity predictions, structural characteristics determination and analysis of ligand binding characteristics. As an initial step, we recorded all the deleterious mutations in IL10RA from the publicly available databases and shortlisted them based on algorithmic predictions. In the next step, structural analysis was carried out, to investigate how one amino acid change can create an undulate effect throughout the protein structure and eventually affect the protein function. Based on the crystal structure (1Y6K; 22-235 residues) of native IL10RA, we generated 13 full-length (1-578 residues) template-based models (one wild-type and 12 mutant models), by using Swiss Model-Expasy and I-TASSER web servers. Consequently protein-protein interaction analysis and molecular docking of selected mutant (localized to the extracellular region) and native models of IL10RA with its ligand IL10 (highly interacting partner) was performed to check the ligand binding characteristics. Furthermore, in-silico functional analysis was conducted to identify the mutations which undergoes post-translational modifications (PTM) of IL10RA protein. By analyzing the early-onset IBD causative mutations of IL10RA gene, this study provides the computational in-sight into the structural and functional aspects of IL10RA.
The mutations of IL10RA gene (CCDS Transcript ID: ENST00000227752), their corresponding IDs and aminoacid sequences were obtained from NCBI dbSNP, UniProt, Exome Variant Server, and Ensembl databases. Later IBD causal variants were shortlisted from the pool of IL10RA non-synonymous missense mutations. In addition, recently published research articles and reviews were screened to collect the reported IBD variants. Additionally, mutations reported in the databases like “Web of Science” “PUBMED,” and “Cochrane-Library,” were also incorporated. The keywords used to search the mutations were “Inflammatory bowel disease,” “IBD,” “infantile IBD,” “early onset IBD,” “IL10RA,” “IBD,” “SNPs,” “mutation,” “Polymorphism,” and “Variation.” The data was further enriched by eliminating the redundant mutations collected different data resources.
To predict the deleterious nature of the mutants, we employed dbNSFP at Ensemble VEP (Variant Effect Predictor) (McLaren et al., 2016) which produces prediction scores for different algorithms like SIFT, PolyPhen 2.0, CADD, FATHMM, LRT, MetaLR, MetaSVM, PROVEAN, and Condel. These tools works on diverse principles like sequence homology, structural homology and difference in Gibbs free energy values etc. These tools require mutation related input entries from the user and the results revealed in different forms with specific threshold points, cut-off values and correlation.
SIFT, CADD, FATHMM, and LRT are nucleotide sequence conservation-based algorithms; PolyPhen-2.0 and PROVEAN are structural-homology based approaches while Condel, MetaLR, and MetaSVM runs on integrated algorithm. Most of these tools demand the query in the form of chromosome number and allele change whereas few other require protein sequence with mutation position.
To enhance the prediction accuracy, results from different computational tools were combined and a correlation analysis was performed. An integrated ranking code was developed, as per the following scheme: variants that are predicted deleterious by all the 9 tools were referred as rank-I, whereas, variants that were predicted to be deleterious by 5–8 tools are referred as rank-II and rank-III variants predicted to be deleterious by less than 5 tools. Sensitivity analysis was conducted to assess the heterogeneity of computational tools. Receiver Operating Characteristic (ROC) curve was initially constructed by plotting “sensitivity” and “specificity” of each tool and the curve fitting was performed by DeLong et al. (1988) model with binomial exact confidence interval (95% CI) using MedCalc Statistical Software version 16.8.4 (MedCalc Software bvba, Ostend, Belgium, Schoonjans et al., 1995). 13 benign variants were added to the test set, along with 15 disease causal variants, for accuracy analysis. Area under the curve (AUC), specificity and sensitivity values were obtained by MedCalc, whereas Accuracy and Mathews coefficient correlation (MCC) was calculated by following formula (details are provided in the supplementary document)
The MCC measures how the predictions correlate with the real target values, and their scores range from +1 (always correct) to −1 (always false), and 0 represents a completely random prediction. An MCC score of more than 0.5 was considered to be acceptable as this corresponds to more than 75% accuracy in balanced data. An AUC close to 1 will be a perfect test and an AUC close to 0.5 will be considered as poor tests. Youdin index was calculated to summarize the performance of the prediction tests. Its value ranges from −1 to 1, positive values indicate that there are no false positives or false negatives, i.e. the test is perfect and vice versa. Moreover, pair-wise combination analysis was also performed to explore which of the combinations gives best results.
IL10RA Protein Modeling and Validation
Only short-length three-dimensional (3D) structure of IL10RA protein and its homologous templates were available in PDB database (http://www.rcsb.org/); Therefore, the 3D structure of IL10RA protein has been predicted by iterative threading assembly refinement algorithm (I-TASSER) Standalone Package (Version 1.1). It is an on-line platform that implements algorithms for protein structure and function predictions (Yang et al., 2015). The initial models generated were subjected to energy minimization using the “steepest descent” technique to eliminate bad contacts between protein atoms. Stereo-chemical quality of minimized models was analyzed, using procheck tool (Laskowski et al., 2001).
Protein Stability Analysis
To predict the effects of mutations on the stability of IL10RA protein at three dimensional structure level, DUET (Pires et al., 2014), an integrated computational webserver, was used. The 4-letter code of the crystal structure (1Y6K), chain identifier as well as the residue position of mutation site, residues codes of wild-type and mutant in one-letter format were provided as an input for this server. The collective predictions of SDM (Site Directed Mutator) and mCSM (mutation Cutoff Scanning Matrix) methods are obtained in a non-linear regression fashion. Predictions reveal variation in Gibbs free energy (DDG) wherein positive values denote stabilized mutations and vice versa.
Mutant Models Construction
All nine mutated models of IL10RA were built by Swiss Model-Expasy; Wild-type IL10RA model developed with the help of I-Tasser was used as a template for all mutants. A collection of 50 models for each of nine mutated model were generated and subsequently the best model was selected based on the Procheck results.
Molecular docking of IL10RA (wildtype or mutant forms) and IL10 molecules was executed using Hex protein docking server. The IL10 ligand molecule's structure was retrieved from the protein database (PDB ID: 2ILK). Maximum rotational increments for receptor and ligand sampling were allowed by setting the angle to 180°. The steric scan (N = 20) phase of the docking calculation was performed at 53 intermolecular separations in ± steps at 0.75 Å. The final search (N = 25) phase was applied to the highest scoring scan orientations in steps of 0.75/2 Å. Among the 500 clusters, only best 2,000 orientations were retained for viewing out of 10,000 lowest docking energy scores.
The IL10RA protein sequence was searched for homologous sequences using PSI-BLAST. Then the position-specific conservation scores of amino acid variants were determined using ConSurf web server, which calculates the evolutionary conservation of amino acid positions in proteins by means of an ML algorithms empirical or Bayesian inference (Ashkenazy et al., 2010). The conservation scores are ranked by a distinct scale of nine grades for visualization from the most conserved positions colored maroon (grade 9), through intermediately conserved positions colored white (grade 5), to the most variable positions colored turquoise (grade 1).
Post Translational Modifications
NetPhos: Tyrosine Phosphorylation Site Identification
The NetPhos 2.0 server produces neural network predictions for phosphorylation sites in eukaryotic proteins. Neural networks can classify even non-linear and highly complex biological sequence patterns, wherein correlations among positions are important. The network retains the ability to generalize and recognize similar, but non-identical patterns. Artificial neural networks have been widely employed in biological sequence analysis (Blom et al., 2004). The server demands the protein sequence in FASTA format and produces the analyzed report on the cyber platform. Output score ranges from 0.000 to 1.000; scores greater than 0.500 are considered as above the threshold value. A high score indicates increased confidence of the prediction.
Gene Network Prioritization (String)
STRING is a database of predicted and known protein interactions, available at http://string-db.org/. The interactions include indirect (functional) and direct (physical) associations which are derived from high-throughput experiments, conserved co-expression, genomic context and previous knowledge. String quantitatively integrates the interaction data from these sources and transfer information between the organisms. The database currently covers 1133 organisms and 5,214,234 proteins. STRING understands a variety of protein names and accessions numbers.
Selection of Mutations and Pathogenicity Predictions
A total of 15 mutations causative to early onset IBD (<5 years) were extracted from published research reports and were further used in protein modeling, structural deviation analysis and functional investigations (Table 1). Three clinically significant point mutations, c.537G>A (Yanagi et al., 2016), c.634C>T and c.1291G>T (Lee et al., 2014; Mohammed et al., 2017) were excluded from our structural analysis; as they were reported to cause splicing aberration and pre-mature stop codons, respectively. Table 1 shows that IL10RA mutations are not just specific to one exon. However, out of total 7 exons, exon 3 mutations (6/15; 40%) have been frequently reported compared to other exons like exon 2 (2/15;13.3%), exon 4 (3/15; 20%), exon 5 (1/15; 6.66%), exon 6 (1/15; 6.66%) and exon 7 (2/15;13.3%). Exons 1–5 encodes bulk of the extracellular domain (total 233 amino acids), and exon 6 encodes extracellular domain (230 and 233 aa), transmembrane domain (in between 234 and 256 aa) and cytosolic domain region (257–270 aa) and exon 7 encodes cytosolic domain region (in between 257 and 578 aa) of IL10RA protein. Taken together, SIFT, PolyPhen 2.0, CADD, PROVEAN and Condel methods could predict 79–86% of IL10RA-IBD mutations as deleterious, while the deleterious prediction rate of FATHMM and LRT algorithms is 14 and 28% mutations, respectively. This difference in output suggests that combining scores from different tools may considerably increase the predictive accuracy for determining the functional impact of a given genetic mutation.
Concordance Analysis Report
The concordance analysis reported; p.W45G (c.133T>G) and p.Y57C (c.170A>G) were rank-I mutations revealing them to be highly deleterious. Six mutations (p.W69G, p.T84I, p.Y91C, p.R101W, p.R117C, and p.R117H) were categorized as rank-II (Deleterious). Seven mutations (p.L125R, p.G141R, p.R262C, p.I169T, p.I224V, p.R412W, and p.R412Q) were of least significance, hence graded as rank-III (Neutral). Highly significant variants in rank-I and II were shortlisted for structural analysis considering their strong association with IBD and consensus scoring scheme. The statistical analysis has revealed significant relation among different tools (Table 2). CADD outperformed Provean, PolyPhen, and Condel in most of the statistical measures namely Youden's index, MCC, AUC and specificity. Pair-wise combinations results showed improved performance with PolyPhen + CADD exhibiting optimal balance between sensitivity and specificity, due to its high accuracy (S1 Table).
Table 2. Specificity, sensitivity analysis report of IL10 RA mutation predictions by different computational methods.
Identification of Functional Mutations in Conserved Motifs
Our ConSurf results have shown that a total of 10 (out of 15) mutations corresponding to W45G, Y57C, W69R, T84I, Y91C, R101W, R117C, R117H, L125R, and G141R are located in conserved regions in IL10RA protein sequence. Of which, mutations corresponding to Arg101Trp and Arg117Cyt/His are highly conserved and surface exposed residues and may likely to have functional influence on IL10RA protein. The remaining eight mutants (from the above 10 conserved residues) Y57C, T84I, Y91C, and G141R are found to be highly conserved but in buried state in both wild type and mutant forms of the protein. The p.I169T, p.I224V, p.R262C and R412W/Q are found to be averagely conserved residues (Figure 1).
Figure 1. ConSurf Output of selected IL10RA mutants. Wild-type sequence of IL10RA with its conservation scale; highly conserved residues in red rectangle; average conserved residues in black rectangles.
Modeling and Validation
Though, the PDB sourced IL10RA (1Y6K; 22–235 residues) had sequence homology to build a 3D model, it was of partial length. For this reason we constructed 3D models of IL10RA by using I-Tasser web server (Figure 2). A total of five structures were predicted for IL10RA. Only best structures were selected based on their maximum C-score (−3.64), TM-scores (0.32 ± 0.10) which reflects that the correct topology of the model built. The selected model was further used as template to build mutant protein models using Swiss Model-Expasy. We have built three dimensional models for 12 out of 15 mutations spanning the extracellular domain region of IL10RA protein. All these protein models were saved in PDB format and visualized using by Pymol program.
Quality of the 3D Models
After the energy minimization of 3D models, stereochemical quality of the structures was validated by PROCHECK server. The results of the PROCHECK analysis indicated that a relatively low percentage of residues have phi/psi angles in the disallowed regions suggesting the acceptability of Ramachandran plots for proteins. The percentage of residues in the allowed/core region were found to be 98.9, 99.2, 98.8, 97.2, 92.5, 96.2, 98.3 and disallowed region were found to be 1.1, 0.8, 1.2, 2.8, 2.8, 7.5, and 3.8 for wild-type and mutated IL10RA respectively. The ERRAT score for wild-type and all mutant models is >50, this confirms the high quality of the model.
Structural Deviation Analysis
Superimposition of 12 mutant models and wild-type IL10RA indicates both structural similarities and differences among them. The close homology between template and targets is revealed by RMSD score of less than 0.2 Å (Figure 3). The Structural Distance Measure (SDM) score (Cut off 3.5 Å) for all models fall within the range of zero, which indicates that the sequence-derived distance of the built models is very small. The Q-score (Cut off 3Å) for all mutant IL10RA models is >1, revealing the fair superimposition of mutant models over template structure.
Figure 3. Overview of 12 mutated models with superpose center figure. The IL10RA models represented in cartoon and mutated amino acids are represented non-green colored in sticks.
Mutation Effect on Structural Stability of IL10RA
The protein stability of 12 mutations in terms of ΔΔG (Gibbs free energy change) using DUET web server was tested. Table 3 reveals that W45G, Y57C, T84I, Y91C, R101W, R117C, R117H, L125R, G141R, and R262 C mutant models have shown the ΔΔG values ranging from −0.22 to −4.017 kcal/Mol. The negative ΔΔG values suggest that the given amino acid substitutions are deleterious to the stability of IL10RA protein (Table 3).
Table 3. Structural stability prediction scores, secondary structural features and molecular docking analysis of IL10RA models.
The molecular docking procedure was performed with mutations which are mapped to extra cellular regions. The last two columns of Table 3 revealed significant changes in binding energies between mutant IL10RA and IL10 proteins. The interaction energy of wild-type IL10RA and IL10 was found to be −435.89 kcal/mol. However, the degree of affinity between IL10RA mutants and IL10 were varying; Striking differences in binding energies were witnessed for R101W (49.03 kcal/mol), Y91C (−41.88 kcal/mol), R117H (41.28 kcal/mol) and Y57C (−38.95 kcal/mol) amino acid substitution forms. This difference in interaction energy is likely to alter the efficacy of IL10 binding to IL10RA (Figures 4, 5).
Figure 5. Molecular surface docking view of selected mutant models. Mutant IL10RA-Y91C (Yellow in color) with IL10 (Blue in color); and mutant IL10RA-R101W (Green in color) with IL10 (Blue in color); Based on Highest to lowest difference in binding energies compared with wildtype docking complex.
Protein-Protein Molecular Interaction Studies
The STRING server result has revealed the direct interaction of IL10RA protein with IL10, JAK1, and IL10RB in terms of high confidence score (>0.990) based on text-mining, database and experiments. All the three proteins retain strong network edges of their predicted functional associations with IL10RA and were displayed to be involved in mediating signal transduction. From the predicted interaction network it is evident that IL10 is one of the strongest interacting partners (c score¼ > 0.999), which, in conjunction with JAK1, leads to nuclear translocation of STAT3 and gene transcription (Figure 6).
Figure 6. GeneMania network analysis of IL10RA. Showing its strong network of interactions with important genes in immune system regulation.
Post-translational Modification Report
The NetPhos 2.0 results revealed that IL10RA protein has three phosphorylation sites (threshold score of >0.500) corresponding to Tyrosine residue at 57, 91, and 157 positions in the protein sequence. Figure 7 illustrates the threshold value and predicted phosphorylation sites of the protein along with their prediction score.
The human IL10RA gene is located on chromosome 11q23.3 region with 7 exons. A 3,695 bp length m-RNA encodes the functional IL10RA protein with 578 aa (63.0 kDa, pI 5.38). Nonetheless, alternative splicing of its mRNA yields polynucleotides chains of smaller lengths i.e., 2,275 and 1,179 bps with 558 and 429 amino acids, respectively. Hitherto approximately 600 variants located in coding, non-coding and regulatory regions of human IL10RA gene are described in Ensembl database. With the advent of high throughput sequencing practices, the number of genetic mutations is growing with time in an exponential pattern. Therefore, delineating those mutations which imposes specific functional and structural alterations is an important yet un-met area of research concerning IL10RA. The present study gains significance by predicting the potentially deleterious IL10RA mutations and their corresponding phenotypes. This aids in narrowing down the number of mutants analyzed in association with diseases that are most likely to alter gene function.
As a matter of fact, among all the known IL10RA mutants' causative to different health disorders, non-synonymous mutations, which are located in protein-coding region have the potential to alter the protein molecule. However, genetic mutations in non-coding regions may also have an impact on gene splicing, non-coding RNA or transcription factor binding. The interaction site of IL10RA is comprised of 23 residues displaying five receptors segments designated as L2-L6; Residues 62–69 (L2), 94–97 (L3), 113–122 (L4), 163–164 (L5), and 208–214 (L6) (Josephson et al., 2001) which makes contact with three discontinuous peptide segments of IL10 corresponding to helix A, loop AB, and helix F (Josephson et al. referred 62nd residue as 41st and so on, this difference was basically due to incomplete protein transcript crystallized by them).
To identify the deleterious potential of IL10RA mutations we applied diverse approaches, i.e., empirical rule-based algorithm screening and support vector based protein stability predictions. Multiple tools namely SIFT, PolyPhen 2.0, CADD, FATHMM, LRT, MetaLR, MetaSVM, PROVEAN and Condel, are employed to assess the reliability of prediction scores. The computational studies conducted by Doss and Rajith (2012), Shaik et al. (2014), and Banaganapalli et al. (2016) have successfully applied an analogous computational prediction programs for prioritizing the variants in ATM, IDE, and MED12 genes, respectively. The concordant prediction of all the three tools have identified the deleterious potential of 10 missense mutations of IL10RA (p.W45G, p.Y57C, p.W69R, p.T84I, p.Y91C, p.R101W, p.R117C, and p.R117H). The performance of the scores was measured using receiver operating characteristic (ROC) curve and area under the curve (AUC) (S1 Figure). An AUC close to 1 will be a perfect test and an AUC close to 0.5 will be considered as poor tests. We found that, the four prediction tools, CADD, Provean, PolyPhen, and Condel, achieved excellent prediction accuracy (AUC > 0.7). LRT was observed to be with lowest accuracy (AUC = 0.5). Pair-wise combinations revealed that CADD being highly accurate tool, uplifts the performance of other averagely performing tools resulting in five best pairs i.e., CADD + Provean, CADD + PolyPhen, CADD + FATHMM, CADD + SVM, and CADD + Condel.
The functional impact of genetic mutations can be predicted based on the changes it brings in the structural conformation of protein molecule it encodes. IL10 receptors are trans-membrane proteins, therefore, it is difficult to obtain a complete tertiary structural conformation of its mutant and native models through the conventional NMR spectroscopy or X-ray crystallography methods. For this reason, in the present study both mutant and native protein 3D models of IL10 receptors were built by homology modeling and integrative ab-initio methods for analyzing secondary structure & the change of protein stability affected by a variation.
Our analysis revealed significant structural deviations between the wild-type and all 12 mutated models of IL10RA, at both whole protein and amino-acid residue levels. Specifically p.R117C and p.R117H that are located on the interaction site “L4” within the D1 region of the extracellular domain of IL10RA. Prior to L4 (residues 113-122) IL10RA protein consists of a “WSXWS-like” sequence motif represented as “HSNWT” amino acids at positions 108-112 dividing β strand G into two equivalent segments (G1 and G2). The side chain oxygen of Ser-109 hydrogen bonds to the amide nitrogen on β strand F (Ala-102). This hydrogen bonding pattern results in two modified wide β bulges (B1 and B2) that contain one (Asn-110) and three (Asn-115, Thr-116, Arg-117) bulged residues, respectively. B2 positions the Cα-Cβ bond of Arg-117 approximately parallel, rather than perpendicular, to β strand G2 so that it can participate in extensive interactions with IL10 in the site I interface. Thus, Arg-117 forms an extensive hydrogen bond/salt bridge with Asp-162, Gln-56, and the main chain carbonyl oxygen of Ser-159 of IL10. Any mutation at this point distorts the expected complementary nature of the ligand and receptor. In mutant p.T84I hydrogen bond of wild type IL10RA between Thr84 and His41 was missing similarly p.R101W disrupted two putative hydrogen bonds between Glu58 and Arg101. These structural changes were reported to abrogate IL10RA phosphorylation induced by IL10, consequently impairs STAT3 activation and suppresses the inflammatory responses (Mao et al., 2012).
The RMSD value for the identical proteins will always be “zero” and its increase reflects the structural deviation between two protein structures which may be due to the disruption of hydrogen bonds, hydrophilic or hydrophobic properties and electro static charges. Altered spatial arrangements of the active sites often negatively influence the binding efficacy with its ligand resulting in immobilized pathway. Owing to the lack of negative-feedback signaling mediated by IL10 perturbs homeostasis of the intestinal immune system (Glocker et al., 2009).
ConSurf analysis has identified that genetic mutations encoding p.Y57C, p.T84I, p.Y91C, p.R101W, p.R117C, p.R117H, and p.G141R are potentially deleterious, as they are located in evolutionarily highly conserved regions of IL10RA protein sequence. The surface accessibility of IL10RA mutants showed a huge drift in the Z-score for p.T84I, p.Y91C, and p.R101W models indicating that they can induce conformational changes in the IL10RA structures. Docking studies explored that the two amino acid residues found in the extracellular domain of IL10RA viz., Thr84 and Arg101 play vital role in the formation of different type of ionic interaction inIL10RA protein for maintaining the stability of its structure. Hydrogen bond existing in between Thr84 and His41 residues in the native IL10RA was disrupted by the p.T84I substitution. The p.R101W variant not only abolished two putative hydrogen bonds between Glu58 and Arg101 present in the native IL10RA, but also caused steric-clashes between Val103 and Trp111, Trp101, and Glu58 (Mao et al., 2012). These structural conformations might interfere with either binding affinity, stability of IL10 or signal transduction.
In conclusion, using comprehensive in-silico investigations, our current study confirmed that p.W45G, p.Y57C, p.W69R, p.T84I, p.Y91C, p.R101W, p.R117C, p.R117H, p.L125R, and p.G141R mutations are deleterious to the structure and function of IL10RA protein. Molecular modeling analysis has revealed that p.T84I, p.Y91C, and p.R101W variants may induce changes in the stability of IL10RA protein by altering the solvent accessibility. Altered IL10RA function due to genetic variation and protein expression may also play a critical role in determining the individuals' susceptibility to IBD. Our findings may help to narrow down the number of IL10RA variants to be screened for genetic association studies and to have a better insight about the structural biology of mutated IL10RA proteins deciphering IL10RA dysfunction.
FA-A: Designing the protein analysis work methodology, results explanation, and funding acquisition. KM: Extracted genetic data, performed experiments, analysis of results, and manuscript preparation. SS: Performed experiments, generated and analyzed data. BB: Software testing, data extraction, figures, and tables generation. KN: Analyzed the results and critically revised the manuscript. NS: Study design, results analysis, explanation, and manuscript preparation.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This project was funded by the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, under grant no. (D1435-482-130). The Author, therefore, acknowledge with thanks DSR for the technical and financial support.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2018.00146/full#supplementary-material
Ashkenazy, H., Erez, E., Martz, E., Pupko, T., and Ben-Tal, N. (2010). ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids. Nucleic Acids Res. 38, W529–W533. doi: 10.1093/nar/gkq399
Banaganapalli, B., Mohammed, K., Khan, I. A., Al-Aama, J. Y., Elango, R., and Shaik, N. A. (2016). A computational protein phenotype prediction approach to analyze the deleterious mutations of human med12 gene. J. Cell. Biochem. 117, 2023–2035. doi: 10.1002/jcb.25499
Banaganapalli, B., Rashidi, O., Saadah, O. I., Wang, J., Khan, I. A., Elango, R., et al. (2017). Comprehensive computational analysis of gwas loci identifies ccr2 as a candidate gene for celiac disease pathogenesis. J. Cell. Biochem. 118, 2193–2207. doi: 10.1002/jcb.25864
Begue, B., Verdier, J., Rieux-Laucat, F., Goulet, O., Morali, A., Canioni, D., et al. (2011). Defective IL10 signaling defining a subgroup of patients with inflammatory bowel disease. Am. J. Gastroenterol. 106, 1544–1555. doi: 10.1038/ajg.2011.112
Beser, O. F., Conde, C. D., Serwas, N. K., Cokugras, F. C., Kutlu, T., Erkan, T., et al. (2015). Clinical features of interleukin 10 receptor gene mutations in children with very early-onset inflammatory bowel disease. J. Pediatr. Gastroenterol. Nutr. 60, 332–338. doi: 10.1097/MPG.0000000000000621
Blom, N., Sicheritz-Pontén, T., Gupta, R., Gammeltoft, S., and Brunak, S. (2004). Prediction of post-translational glycosylation and phosphorylation of proteins from the amino acid sequence. Proteomics 4, 1633–1649. doi: 10.1002/pmic.200300771
DeLong, E. R., DeLong, D. M., and Clarke-Pearson, D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44, 837–845.
Engelhardt, K. R., Shah, N., Faizura-Yeop, I., Kocacik Uygun, D. F., Frede, N., Glocker, E. O., et al. (2013). Clinical outcome in IL-10- and IL-10 receptor-deficient patients with or without hematopoietic stem cell transplantation. J. Allergy Clin. Immunol. 131, 825–830. doi: 10.1016/j.jaci.2012.09.025
Finbloom, D. S., and Winestock, K. D. (1995). IL-10 induces the tyrosine phosphorylation of tyk2 and Jak1 and the differential assembly of STAT1 alpha and STAT3 complexes in human T cells and monocytes. J. Immunol. 155, 1079–1090.
Glocker, E. O., Kotlarz, D., Boztug, K., Gertz, E. M., Schäffer, A. A., Klein, C., et al. (2009). Inflammatory bowel disease and mutations affecting the interleukin-10 receptor. N. Engl. J. Med. 361, 2033–2045. doi: 10.1056/NEJMoa0907206
Ho, A., Liu, Y., Khan, T. A., Hsu, D. H., Bazan, J. F., and Moore, K. W. (1993). A receptor for interleukin 10 is related to interferon receptors. Proc. Natl. Acad. Sci. U.S.A. 90, 11267–11271. doi: 10.1073/pnas.90.23.11267
Huang, H., Fang, M., Jostins, L., Umićević Mirkov, M., Boucher, G., Anderson, C., et al. (2017). Fine-mapping inflammatory bowel disease loci to single-variant resolution. Nature 547, 173–178. doi: 10.1038/nature22969
Kelsen, J. R., Dawany, N., Moran, C. J., Petersen, B. S., Sarmady, M., Devoto, M., et al. (2015). Exome sequencing analysis reveals variants in primary immunodeficiency genes in patients with very early onset inflammatory bowel disease. Gastroenterology 149, 1415–1424. doi: 10.1053/j.gastro.2015.07.006
Kotlarz, D., Beier, R., Murugan, D., Diestelhorst, J., Jensen, O., Boztug, K., et al. (2012). Loss of interleukin-10 signaling and infantile inflammatory bowel disease: implications for diagnosis and therapy. Gastroenterology 143, 347–355. doi: 10.1053/j.gastro.2012.04.045
Laskowski, R., MacArthur, M., and Thornton, J. (2001). “PROCHECK: validation of protein structure coordinates,” in International Tables of Crystallography, Vol. F. Crystallography of Biological Macromolecules (London: Kluwer Academic Publishers), 722–725.
Lee, C. H., Hsu, P., Nanan, B., Nanan, R., Wong, M., Gaskin, K., et al. (2014). Novel de novo mutations of the interleukin-10 receptor gene lead to infantile onset inflammatory bowel disease. J. Crohn Colitis 8, 1551–1556. doi: 10.1016/j.crohns.2014.04.004
Mao, H., Yang, W., Lee, P. P., Ho, M. H., Yang, J., Lau, Y. L., et al. (2012). Exome sequencing identifies novel compound heterozygous mutations of IL-10 receptor 1 in neonatal-onset Crohn's disease. Genes Immun. 13, 437–442. doi: 10.1038/gene.2012.8
Mesbah-Uddin, M., Elango, R., Banaganapalli, B., Shaik, N. A., and Al-Abbasi, F. A. (2015). In-silico analysis of inflammatory bowel disease (IBD) GWAS loci to novel connections. PLoS ONE 10:e0119420. doi: 10.1371/journal.pone.0119420
Mohammed, K., Shaik, N. A., and Al-Abbasi, F. A. (2017). Novel de novo mutations of the interleukin-10 receptor gene lead to infantile onset inflammatory bowel disease: a Correction. J. Crohns Colitis 11, 1398–1399. doi: 10.1093/ecco-jcc/jjx038
Moran, C. J., Walters, T. D., Guo, C. H., Kugathasan, S., Klein, C., Muise, A. M., et al. (2013). IL-10R polymorphisms are associated with very-early-onset ulcerative colitis. Inflamm. Bowel Dis. 19, 115–123. doi: 10.1002/ibd.22974
Pires, D. E., Ascher, D. B., and Blundell, T. L. (2014). DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach. Nucleic Acids Res. 42, W314–W319. doi: 10.1093/nar/gku411
Schoonjans, F., Zalata, A., Depuydt, C. E., and Comhaire, F. H. (1995). MedCalc: a new computer program for medical statistics. Comput. Methods Progr. Biomed. 48, 257–262. doi: 10.1016/0169-2607(95)01703-8
Shaik, N. A., Kaleemuddin, M., Banaganapalli, B., Khan, F., Shaik, N. S., Elango, R., et al. (2014). Structural and functional characterization of pathogenic non- synonymous genetic mutations of human insulin-degrading enzyme by in silico methods. CNS Neurol. Disord. Drug Targets 13, 517–532. doi: 10.2174/18715273113126660161
Shim, J. O., Hwang, S., Yang, H. R., Moon, J. S., Chang, J. Y., Seo, J. K., et al. (2013). Interleukin-10 receptor mutations in children with neonatal-onset Crohn's disease and intractable ulcerating enterocolitis. Eur. J. Gastroenterol. Hepatol. 25, 1235–1240. doi: 10.1097/MEG.0b013e328361a4f9
Shim, J. O., and Seo, J. K. (2014). Very early-onset inflammatory bowel disease (IBD) in infancy is a different disease entity from adult-onset IBD; one form of interleukin-10 receptor mutations. J. Human Gen. 59, 337–341. doi: 10.1038/jhg.2014.32
Shouval, D. S., Ouahed, J., Biswas, A., Goettel, J. A., Horwitz, B. H., Snapper, S. B., et al. (2014). Interleukin 10 receptor signaling: master regulator of intestinal mucosal homeostasis in mice and humans. Adv. Immunol. 122, 177–210. doi: 10.1016/B978-0-12-800267-4.00005-5
Spencer, S. D., Di Marco, F., Hooley, J., Pitts-Meek, S., Bauer, M., Ryan, A., et al. (1998). The orphan receptor CRF2-4 is an essential subunit of the interleukin 10 receptor. J. Exp. Med. 187, 571–578. doi: 10.1084/jem.187.4.571
Stahl, N., Farruggella, T. J., Boulton, T. G., and Zhong, Z. (1995). Choice of STATs and other substrates specified by modular tyrosine-based motifs in cytokine receptors. Science 267:1349. doi: 10.1126/science.7871433
Yanagi, T., Mizuochi, T., Takaki, Y., Eda, K., Mitsuyama, K., Ishimura, M., et al. (2016). Novel exonic mutation inducing aberrant splicing in the IL10RA gene and resulting in infantile-onset inflammatory bowel disease: a case report. BMC Gastroenterol. 16:10. doi: 10.1186/s12876-016-0424-5
Keywords: In-silico analysis, IL10RA gene, pathogenic mutations, inflammatory bowel disease (IBD), molecular docking
Citation: Al-Abbasi FA, Mohammed K, Sadath S, Banaganapalli B, Nasser K and Shaik NA (2018) Computational Protein Phenotype Characterization of IL10RA Mutations Causative to Early Onset Inflammatory Bowel Disease (IBD). Front. Genet. 9:146. doi: 10.3389/fgene.2018.00146
Received: 24 December 2017; Accepted: 09 April 2018;
Published: 27 April 2018.
Edited by:Abjal Pasha Shaik, King Saud University, Saudi Arabia
Reviewed by:Shaik Abdul Nabi, University of Hyderabad, India
Gangireddygari Venkata Subba Reddy, University of South Africa, South Africa
Copyright © 2018 Al-Abbasi, Mohammed, Sadath, Banaganapalli, Nasser and Shaik. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
†These authors have contributed equally to this work.