Abstract
Studies on host-pathogen interaction have identified human ACE2 as a host cell receptor responsible for mediating infection by coronavirus (COVID-19). Subsequent studies have shown striking difference of allele frequency among Europeans and Asians for a polymorphism rs2285666, present in ACE2. It has been revealed that the alternate allele (TT-plus strand or AA-minus strand) of rs2285666 elevate the expression level of this gene upto 50%, hence may play a significant role in SARS-CoV-2 susceptibility. Therefore, we have first looked the phylogenetic structure of rs2285666 derived haplotypes in worldwide populations and compared the spatial frequency of this particular allele with respect to the COVID-19 infection as well as case-fatality rate in India. For the first time, we ascertained a significant positive correlation for alternate allele (T or A) of rs2285666, with the lower infection as well as case-fatality rate among Indian populations. We trust that this information will be useful to understand the role of ACE2 in COVID-19 susceptibility.
Introduction
The progressive and rapid spread of the novel coronavirus SARS-CoV-2, has created a worldwide wave of crisis by profoundly affecting the human health, and global economic stability. Recent researches have shown that the ACE2 (encoding Angiotensin-Converting Enzyme 2) is the main host cell receptor of spike glycoprotein responsible for the infection (; ; Zhou et al., 2020). It plays a crucial role for the entry of the virus into the cell to cause the final infection (). ACE2 is a type I transmembrane metallocarboxypeptidase with homology to ACE, an enzyme long-known to be a key player in the Renin-Angiotensin system (RAS), and a target for the treatment of hypertension (; ). The secreted protein catalyzes the cleavage of the C-terminal dipeptide of Angiotensin I to produce Angiotensin 1–9 and Angiotensin II to Angiotensin 1–7.
The ACE2 is mainly expressed in vascular endothelial cells, the renal tubular epithelium, and in Leydig cells of the testes (; ). PCR based analyses have revealed that ACE2 is also expressed in the lung, kidney, gastrointestinal tract, and blood vessels, tissues (; ; ). More recent reports have suggested ACE2 expression in the mucosa of the oral cavity may grant easy access to the virus for a new susceptible host (). This explains the high incidence of pneumonia and bronchitis in those with a severe SARS-CoV-2 infection (Zhou et al., 2020). The ACE2 regulates blood volume, systemic vascular resistance, and thus cardiovascular homeostasis. ACE2 has previously been found to be associated with hypertension, stroke, dyslipidemia, cardiovascular diseases, and kidney diseases (; ; ; Zhang et al., 2018). People on ACE inhibitors and ARBs (angiotensin II type I receptor blockers) produce more numbers of receptors, raising the question of increased susceptibility to the infection, as the SARS-CoV-2 attaches itself to the ACE receptors on heart and lung tissues (). The expression of ACE2 is also substantially increased in patients with type 1 or type 2 diabetes, who are treated with ACE inhibitors and ARBs ().
Contemporary studies on the large number of samples have analyzed the genomic variations present among populations worldwide (; ). It was unanimously shown that a polymorphism rs2285666 present in ACE2, varied significantly among European and Asians (; ). The expression experiments have suggested that the alternate allele (TT-plus strand or AA-minus strand) of rs2285666 elevated the expression of this gene upto 50%, hence may play a role in SARS-CoV-2 susceptibility (; ). Therefore, we studied the haplotype structure as well as association of this allele for COVID-19 susceptibility in India. In particular, we have analyzed the haplotypes downstream to rs2285666 among worldwide populations as well as compared the frequencies of this allele and number of cases and case-fatality rate in India for any existing association.
Materials and Methods
The worldwide data for haplotype analysis was extracted from recent complete genome analysis (). NETWORK 5 () was used to construct the median joining network of haplotypes derived from rs2285666 polymorphism. All the SNV’s observed downstream to rs2285666 have been listed in Supplementary Table 1. Illumina HumanHap 730K genotype chip has listed rs2285666 SNV in their panel, therefore we looked the genotype data generated on this platform. The frequency data of SNV rs2285666, of various populations were extracted by using Plink 1.9 (), from 1,000 genome project data phase 3 (), data published by Estonian Biocentre (; ; ; ) and our newly genotyped samples for various Indian states and the Bangladesh (Supplementary Table 2). The frequency maps were generated by https://www.datawrapper.de/. The regression plots were constructed by https://www.graphpad.com/quickcalcs/linear1/ and verified by the Microsoft excel regression calculations. We have also used Pearson’s correlation coefficient test () to validate our results. SPSS (ver 25) was used to estimate the Person correlation for two tailed significance test at 95% CI and 1,000 bootstrapping (2,000,000 seeds). The joint plots for all parameters were obtained from customized script of program R (ver 4) ().
Results and Discussion
Studies have shown that angiotensin-converting enzyme 2 (ACE2) acts as an entry receptor for coronavirus (). The interaction between SARS-S and ACE2 has already been expounded at molecular level in detail suggesting ACE2 as key determinant of SARS-CoV transmission (; ). It has been revealed that human recombinant soluble ACE2 (hrsACE2) inhibits growth of SARS-CoV-2 and interrupts early stages of infections (). The variable susceptibility to the SARS-CoV-2 infection may be associated with the certain genomic variants within ACE2, that modulate its function or expression.
Among all the common exonic variants, some very recent studies done on ACE2 variants reported population-based frequency differences for a single nucleotide variant (SNV) rs2285666 (also called G8790A) (; ; ). This variant of ACE2 was significantly different for Europeans (0.2), than the East Asians (0.55) (; ). Our analysis on Indian population has revealed mean frequency ∼0.6 of this allele (). Moreover, our haplotype analysis for this gene revealed excessive sharing of the frequent South Asian haplotypes with East Eurasian populations, rather than West Eurasian populations (). Similar to the East Asians, we have also noted that the frequency of this allele is significantly higher (two tailed p < 0.0001) among Indian populations in comparison with either of European, American, or African.
Since rs2285666 has already been proven to be a potential risk factor for hypertension, type 2 diabetes, and coronary artery disease (; ), therefore, may possibly be a predisposing factor associated with the comorbidities observed in COVID-19 patients. Variant rs2285666 is located at the beginning of the intron 3, theoretically affecting gene expression with alternative splicing mechanisms (; ). A study also reported for the association of three rs2285666 genotypes with ACE2 protein level measured in serum by ELISA, with the A/A genotype having an expression level almost 50% higher than the G/G genotype (). More recently, it is shown that the substitution of G with A is predicted to increase the strength of the splice site of about 9.2%, resulting higher expression of ACE2 protein (). It has been also noted that patients characterized by higher ACE1 activity (a protein similar to ACE2) in conjunction with reduced ACE2 activity (i.e., CC/GG females or hemizygous C/G-males for rs2285666) account for increased susceptibility to hypertension, mainly in association with classical cardiovascular risk factors such as old age, dyslipidemia, and diabetes (; ). Thus, it is clear that decreased ACE2 level contributes to severe consequences of SARS-CoV-2 infection (; ).
There has not been any study so far on this SNV among Indian Populations. Therefore, we first looked the haplotype sharing, derived after variant rs2285666 among worldwide populations (Figure 1). Most of the haplotypes downstream to variant rs2285666 were belonging to South Asian, Central Asian, and East Eurasian populations. The starlike structure of rs2285666 derived haplotype indicate a case of positive selection among Asian populations, which needs further exploration (Figure 1 and Supplementary Table 1). Subsequently, our spatial analysis showed that in India, frequency of alternate allele of this SNV (rs2285666) varied between 33% and 100% (Figure 2A and Supplementary Table 2). The frequency gradient (lower to higher) is observed from Northwestern and Western region to Northeastern part of the subcontinent. In order to understand the correlation of allelic frequency with respect to the frequency of cases among Indian populations (Figure 2B), we performed linear regression and Pearson’s correlation coefficient analyses for variant rs2285666 and frequency of cases as well as case-fatality rate (CFR) (Figures 2C,D and Table 1). The regression analysis showed a significant correlation between allele frequency and number of cases (p < 0.05) (Figure 2C and Table 1). More number of cases are observed where frequency of this allele is lower and vice versa. The goodness of fit (R2) explained 34.6% of the variation. This suggests that the effect size of this allele for Indian populations is large. Since this is an ongoing pandemic and the number of infected people changes with time, we tested this result by adding the latest number of cases (August 2020) as well as the CFR (Figure 2 and Table 1). The latest data is also consistent with the older observation. We didn’t find any significant difference between both of the results. Moreover, the CFR data showed stronger association with the allele frequency of rs2285666 (Table 1). Further, in order to confirm our findings, we have also performed the Pearson’s correlation coefficient test. The genetic variation (frequency of rs2285666) and number of cases are negatively correlated with r = −0.571, p = 0.05 (August 2020), as well as frequency of rs2285666 and CFR are also negatively correlated with r = −0.699, p = 0.005 (Figures 2C,D and Table 1). The Pearson correlation analyses supported previous observations by showing a strong negative correlation of rs2285666 (allele T/A) with the number of cases and case-fatality ratio.
FIGURE 1
FIGURE 2
TABLE 1
| Observation | Linear regression | Pearson’s correlation | ||
| R square | p-value | r | p-value | |
| May 2020_Cases | 0.346 | 0.0165 | −0.588 | 0.0167 |
| August 2020_Cases | 0.325 | 0.021 | −0.571 | 0.021 |
| August 2020_CFR | 0.488 | 0.002 | −0.699 | 0.002 |
Outcome of the various tests performed for statistical significance.
Thus, for the first time, we showed a strong correlation of alternate allele (allele T on plus strand or allele A on minus strand) of variant rs2285666 with the lower infection rate as well as lower CFR among Indian populations. Although whole genome sequencing of a considerably large sample of cases and control individuals in India need to be performed to secure a robust genetic information on susceptibility for the disease, we here establish a possibility of the SNV (rs2285666) being associated with a protective role against COVID-19.
We caution that this is just one of the factors affecting the transmission, however there are several other elements (e.g., variation of rs2285666 among diverse ethnic groups of a state, sex of a person, comorbidity, virus strain, temperature, humidity, population density, social distancing, lockdown, etc.), which can perturb the infection rate and CFR substantially. If more of genetic factors or polymorphisms are recognized that may have played a significant impact on the variability of SARS-CoV-2 course, it would be worthwhile to design a cheap and accurate DNA based test for coronavirus susceptibility.
Statements
Data availability statement
All datasets generated for this study are included in the article/Supplementary Material.
Author contributions
GC concived and designed this study. AS, AB, DD, RP, VS, NK, NS, PS, PD, AP, PG, NR, and GS collected the data for allele and COVID-19. AS, AB, PS, PD, AP, and GC analyzed the data. AS, AB, DD, PS, and GC wrote the manuscript from the inputs of other co-authors. All authors contributed to the article and approved the submitted version.
Conflict of interest
PG was employed by the company Scangene Labs Private Limited. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.564741/full#supplementary-material
Supplementary Table 1The details of SNV’s downstream to the rs2285666 polymorphism. This data is obtained from the Network analysis.
Supplementary Table 2The summary of data used in present analyses.
References
1
1000 Genomes Project Consortium, DurbinR. M.AbecasisG. R.AltshulerD. L.AutonA.BrooksL. D.et al (2010). A map of human genome variation from population-scale sequencing.Nature4671061–1073. 10.1038/nature09534
2
AsseltaR.ParaboschiE. M.MantovaniA.DugaS. (2020). ACE2 and TMPRSS2 variants and expression as candidates to sex and country differences in COVID-19 severity in Italy.medRxiv[Preprint]. 10.18632/aging.103415
3
BandeltH. J.ForsterP.RohlA. (1999). Median-joining networks for inferring intraspecific phylogenies.Mol. Biol. Evol.16:a026036. 10.1093/oxfordjournals.molbev.a026036
4
BenestyJ.ChenJ.HuangY.CohenI. (2009). “Pearson correlation coefficient,” in Noise Reduction in Speech Processing, Springer Topics in Signal Processing, vol. 2, Berlin, Heidelberg: Springer. 10.1007/978-3-642-00296-0_5
5
CaoY.LiL.FengZ.WanS.HuangP.SunX.et al (2020). Comparative genetic analysis of the novel coronavirus (2019-nCoV/SARS-CoV-2) receptor ACE2 in different populations.Cell Discov.61–4. 10.1038/s41421-020-0147-141
6
ChangC. C.ChowC. C.TellierL. C.VattikutiS.PurcellS. M.LeeJ. J. (2015). Second-generation PLINK: rising to the challenge of larger and richer data sets.BMC Biol.4:7. 10.1186/s13742-015-0047-8
7
ChaoxinJ.DailiS.YanxinH.RuweiG.ChenlongW.YaobinT. (2013). The influence of angiotensin-converting enzyme 2 gene polymorphisms on type 2 diabetes mellitus and coronary heart disease.Eur. Rev. Med. Pharmacol. Sci.172654–2659.
8
ChaubeyG.AyubQ.RaiN.PrakashS.Mushrif-TripathyV.MezzavillaM.et al (2017). “Like sugar in milk”: reconstructing the genetic history of the Parsi population.Genome Biol.18110. 10.1186/s13059-017-1244-1249
9
Estonian Biocentre Public _Data (2020). Available online at: https://evolbio.ut.ee/(accessed April 25, 2020).
10
FangL.KarakiulakisG.RothM. (2020). Are patients with hypertension and diabetes mellitus at increased risk for COVID-19 infection?Lancet Respir. Med.8:e21. 10.1016/s2213-2600(20)30116-8
11
Ghafouri-FardS.NorooziR.OmraniM. D.BranickiW.PośpiechE.SayadA.et al (2020). Angiotensin converting enzyme: a review on expression profile and its association with human disorders with special focus on SARS-CoV-2 infection.Vascul. Pharmacol.2020:106680. 10.1016/j.vph.2020.106680
12
HarmerD.GilbertM.BormanR.ClarkK. L. (2002). Quantitative mRNA expression profiling of ACE 2, a novel homologue of angiotensin converting enzyme.FEBS Lett.532107–110. 10.1016/s0014-5793(02)03640-2
13
HoffmannM.Kleine-WeberH.SchroederS.KrügerN.HerrlerT.ErichsenS.et al (2020). SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor.Cell181271–280. 10.1016/j.cell.2020.02.052
14
JiangF.YangJ.ZhangY.DongM.WangS.ZhangQ.et al (2014). Angiotensin-converting enzyme 2 and angiotensin 1-7: novel therapeutic targets.Nat. Rev. Cardiol.11:413.
15
KsiazekT. G.ErdmanD.GoldsmithC. S.ZakiS. R.PeretT.EmeryS.et al (2003). A novel coronavirus associated with severe acute respiratory syndrome.N. Engl. J. Med.3481953–1966.
16
KubaK.ImaiY.Ohto-NakanishiT.PenningerJ. M. (2010). Trilogy of ACE2: a peptidase in the renin-angiotensin system, a SARS receptor, and a partner for amino acid transporters.Pharmacol. Ther.128119–128. 10.1016/j.pharmthera.2010.06.003
17
LiF.LiW.FarzanM.HarrisonS. C. (2005). Structure of SARS coronavirus spike receptor-binding domain complexed with receptor.Science3091864–1868. 10.1126/science.1116480
18
LiW.ZhangC.SuiJ.KuhnJ. H.MooreM. J.LuoS.et al (2005). Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2.EMBO J.241634–1643. 10.1038/sj.emboj.7600640
19
LiW.MooreM. J.VasilievaN.SuiJ.WongS. K.BerneM. A.et al (2003). Angiotensin-converting enzyme 2 is a functional receptor for the SARS coronavirus.Nature426450–454.
20
LiY. (2012). Lack of association of ACE2 G8790A gene mutation with essential hypertension in the Chinese population: a meta-analysis involving 5260 subjects.Front. Physiol.3:364. 10.3389/fphys.2012.00364
21
LuR.ZhaoX.LiJ.NiuP.YangB.WuH.et al (2020). Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding.Lancet395565–574. 10.1016/s0140-6736(20)30251-8
22
MehraM. R.DesaiS. S.KuyS.HenryT. D.PatelA. N. (2020). Cardiovascular disease, drug therapy, and mortalityin Covid-19.N. Engl. J. Med.382:NEJMoa2007621.
23
MonteilV.KwonH.PradoP.HagelkrüysA.WimmerR. A.StahlM.et al (2020). Inhibition of SARS-CoV-2 infections in engineered human tissues using clinical-grade soluble human ACE2.Cell181905–913. 10.1016/j.cell.2020.04.004
24
PaganiL.LawsonD. J.JagodaE.MörseburgA.ErikssonA.MittM. (2016). Genomic analyses inform on migration events during the peopling of Eurasia.Nature538238–242. 10.1038/nature19792
25
PanY.WangT.LiY.GuanT.LaiY.ShenY.et al (2018). Association of ACE2 polymorphisms with susceptibility to essential hypertension and dyslipidemia in Xinjiang, China.Lipids Health Dis.17:241.
26
PathakA. K.KadianA.KushniarevichA.MontinaroF.MondalM.OngaroL.et al (2018). The genetic ancestry of modern indus valley populations from Northwest India.Am. J. Hum. Genet.103918–929. 10.1016/j.ajhg.2018.10.022
27
PinheiroD. S.SantosR. S.JardimP. C. V.SilvaE. G.ReisA. A.PedrinoG. R.et al (2019). The combination of ACE I/D and ACE2 G8790A polymorphisms revels susceptibility to hypertension: a genetic association study in Brazilian patients.PLoS One14:e0221248. 10.1371/journal.pone.0221248
28
R Core Team (2012). R: A Language and Environment for Statistical Computing.Vienna: R Foundation for Statistical Computing.
29
RiordanJ. F. (2003). Angiotensin-I-converting enzyme and its relatives.Genome Biol.4:225.
30
SamavatiL.UhalB. D. (2020). ACE2, much more than just a receptor for SARS-COV-2.Front. Cell. Infect. Microbiol.10:317. 10.3389/fphys.2012.00317
31
ShiC.-S.QiH.-Y.BoularanC.HuangN.-N.Abu-AsabM.ShelhamerJ. H.et al (2014). SARS-coronavirus open reading frame-9b suppresses innate immunity by targeting mitochondria and the MAVS/TRAF3/TRAF6 signalosome.J. Immunol.1933080–3089. 10.4049/jimmunol.1303196
32
SrivastavaA.PandeyR. K.SinghP. P.KumarP.RasalkarA. A.TamangR.et al (2020). Most frequent South Asian haplotypes of ACE2 share identity by descent with East Eurasian populations.PLoS One15:e0238255. 10.1371/journal.pone.0238255
33
StawiskiE. W.DiwanjiD.SuryamohanK.GuptaR.FellouseF. A.SathirapongsasutiF.et al (2020). Human ACE2 receptor polymorphisms predict SARS-CoV-2 susceptibility.bioRxiv[Preprint]. 10.1101/2020.04.07.024752
34
StrafellaC.CaputoV.TermineA.BaratiS.GambardellaS.BorgianiP.et al (2020). Analysis of ACE2 genetic variability among populations highlights A possible link with COVID19-related neurological complications.Genes11:741. 10.3390/genes11070741
35
TätteK.PaganiL.PathakA. K.KõksS.DuyB. H.HoX. D.et al (2019). The genetic legacy of continental scale admixture in Indian austroasiatic speakers.Sci. Rep.9:3818. 10.1038/s41598-019-40399-40398
36
VerdecchiaP.CavalliniC.SpanevelloA.AngeliF. (2020). The pivotal link between ACE2 deficiency and SARS-CoV-2 infection.Eur. J. Intern. Med.7614–20. 10.1016/j.ejim.2020.04.037
37
WangM.ZhangW.ZhouY.ZhouX. (2014). Association between serum angiotensin-converting enzyme 2 levels and postoperative myocardial infarction following coronary artery bypass grafting.Exp. Ther. Med.71721–1727. 10.3892/etm.2014.1640
38
WuX.ZhuB.ZouS.ShiJ. (2018). The association between ACE2 gene polymorphism and the stroke recurrence in Chinese population.J. Stroke Cerebrovasc. Dis.272770–2780. 10.1016/j.jstrokecerebrovasdis.2018.06.001
39
WuY.LiJ.WangC.ZhangL.QiaoH. (2017). The ACE 2 G8790A polymorphism: involvement in Type 2 diabetes mellitus combined with cerebral stroke.J. Clin. Lab. Anal.31:e22033. 10.1002/jcla.22033
40
XuH.ZhongL.DengJ.PengJ.DanH.ZengX.et al (2020). High expression of ACE2 receptor of 2019-nCoV on the epithelial cells of oral mucosa.Int. J. Oral Sci.121–5.
41
YangM.ZhaoJ.XingL.ShiL. (2015). The association between angiotensin-converting enzyme 2 polymorphisms and essential hypertension risk: a meta-analysis involving 14,122 patients.J. Renin. Angiotensin Aldosterone Syst.161240–1244.
42
ZhangQ.CongM.WangN.LiX.ZhangH.ZhangK.et al (2018). Association of angiotensin-converting enzyme 2 gene polymorphism and enzymatic activity with essential hypertension in different gender: a case-control study.Medicine97:e12917. 10.1097/MD.0000000000012917
43
ZhouP.YangX.-L.WangX.-G.HuB.ZhangL.ZhangW.et al (2020). A pneumonia outbreak associated with a new coronavirus of probable bat origin.Nature579270–273. 10.1038/s41586-020-2012-2017
Summary
Keywords
ACE2, India, rs2285666, coronavirus, COVID-19, SARS-CoV-2
Citation
Srivastava A, Bandopadhyay A, Das D, Pandey RK, Singh V, Khanam N, Srivastava N, Singh PP, Dubey PK, Pathak A, Gupta P, Rai N, Sultana GNN and Chaubey G (2020) Genetic Association of ACE2 rs2285666 Polymorphism With COVID-19 Spatial Distribution in India. Front. Genet. 11:564741. doi: 10.3389/fgene.2020.564741
Received
22 May 2020
Accepted
04 September 2020
Published
25 September 2020
Volume
11 - 2020
Edited by
Olivia Wesula Lwande, Umeå University, Sweden
Reviewed by
Eaaswarkhanth Muthukrishnan, Dasman Diabetes Institute, Kuwait; Pankaj Shrivastava, Forensic Science Laboratory, Sagar, India; Li Shi, Chinese Academy of Medical Sciences and Peking Union Medical College, China
Updates
Copyright
© 2020 Srivastava, Bandopadhyay, Das, Pandey, Singh, Khanam, Srivastava, Singh, Dubey, Pathak, Gupta, Rai, Sultana and Chaubey.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Gyaneshwer Chaubey, gyaneshwer.chaubey@bhu.ac.in
This article was submitted to Evolutionary and Population Genetics, a section of the journal Frontiers in Genetics
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.